token-optimization

Token Optimization Best Practices

Deep Knowledge: Use mcp__documentation__fetch_docs with technology: token-optimization for comprehensive documentation.

Guidelines for minimizing token consumption in MCP server and external tool interactions.

When NOT to Use This Skill

This skill focuses on API/tool call optimization. Do NOT use for:

Runtime performance - Use the performance skill for speed optimization
Code minification - Use build tools (Vite, Webpack, etc.)
Database query optimization - Use database-specific skills
Algorithm efficiency - Use computer science fundamentals
Prompt engineering - This is about tool usage, not prompt design

General Principles

| Principle | Description |
|---|---|
| Lazy Loading | Load information only when strictly necessary |
| Minimal Output | Request only needed data, use limit and compact parameters |
| Progressive Detail | Start with overview/summary, drill down only if needed |
| Cache First | Check if information is already in context before external calls |
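The Cache First principle can be sketched as a small wrapper that remembers earlier tool results. This is a minimal illustration, not part of any MCP SDK: `ToolCallCache` is a hypothetical helper, and `get_schema` here is a local stand-in that returns canned text rather than a real tool.

```python
import json
from typing import Any, Callable, Dict


class ToolCallCache:
    """Remembers tool results so an identical call is not repeated."""

    def __init__(self) -> None:
        self._results: Dict[str, Any] = {}
        self.external_calls = 0  # how many times a tool was actually invoked

    def call(self, tool: Callable[..., Any], **params: Any) -> Any:
        # Key on the tool name plus its parameters in a stable order.
        key = tool.__name__ + ":" + json.dumps(params, sort_keys=True)
        if key not in self._results:
            self.external_calls += 1
            self._results[key] = tool(**params)
        return self._results[key]


def get_schema(compact: bool = True) -> str:
    """Hypothetical stand-in for an MCP tool; returns canned text."""
    return "users(id, name, email)" if compact else "...full schema..."


cache = ToolCallCache()
first = cache.call(get_schema, compact=True)
second = cache.call(get_schema, compact=True)  # answered from the cache
```

Only the first call reaches the tool; the second is answered from memory, which mirrors checking the conversation context before going external.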

Anti-Patterns

| Anti-Pattern | Why It's Bad | Token-Efficient Solution |
|---|---|---|
| SELECT * | Returns unnecessary columns | Specify exact columns needed |
| No LIMIT clause | Returns entire dataset | Always add LIMIT (e.g., 100) |
| Full schema requests | Returns massive specs | Use compact=true or format="summary" |
| Recursive documentation fetch | Fetches entire doc tree | Use search_docs with specific query |
| Fetching full logs | Returns thousands of lines | Use tail_logs or find_errors with limit |
| Copy-paste documentation | Duplicates content | Summarize and reference, don't quote verbatim |
| No pagination | Returns all results at once | Use offset/limit for large datasets |
| Full API schema exploration | Multi-MB specifications | Get endpoint list first, details on-demand |
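The offset/limit remedy for the "No pagination" anti-pattern might look like the sketch below. `fetch_page` is a hypothetical stand-in for any tool that accepts `offset` and `limit`; here it simply slices an in-memory list.

```python
from typing import Iterator, List, Sequence


def fetch_page(rows: Sequence[dict], offset: int, limit: int) -> List[dict]:
    """Hypothetical stand-in for a tool that accepts offset and limit."""
    return list(rows[offset:offset + limit])


def paginate(rows: Sequence[dict], limit: int) -> Iterator[List[dict]]:
    """Yield one page at a time; the caller stops as soon as it has enough."""
    offset = 0
    while True:
        page = fetch_page(rows, offset, limit)
        if not page:
            return
        yield page
        offset += limit


dataset = [{"id": i} for i in range(250)]
pages = paginate(dataset, limit=100)
first_page = next(pages)  # only 100 rows enter the context
```

Because `paginate` is a generator, later pages are fetched only if the caller asks for them.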

Quick Troubleshooting

| Issue | Check | Solution |
|---|---|---|
| Large MCP response | Output size > 2000 tokens | Add limit parameter, use compact format |
| Repeated API calls | Calling same tool multiple times | Cache results in conversation context |
| Slow context buildup | Too many tool calls | Batch related queries, use more specific tools |
| Unnecessary documentation fetch | Info already known | Check skill files first, fetch docs as last resort |
| Full table scan results | Database query returns too much | Add WHERE clause and LIMIT |
| Verbose error logs | Full stack traces repeated | Summarize errors, reference line numbers |

MCP Server Patterns

database-query

-- BAD: Query without limits
SELECT * FROM users

-- GOOD: Query with filters and limits
SELECT id, name, email FROM users WHERE active = true LIMIT 100

Tool usage:

execute_query: ALWAYS use limit parameter (default: 1000)
get_schema(compact=true): For DB structure overview
describe_table: Before exploratory queries
explain_query: Before complex queries on large tables
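One way to make sure a query never goes out without a LIMIT is a small guard applied before execution. The sketch below is an assumption-laden illustration: `with_limit` is a hypothetical helper, it only inspects the end of a single SELECT statement, and it is demonstrated against an in-memory SQLite database rather than the database-query server itself.

```python
import re
import sqlite3


def with_limit(sql: str, limit: int = 100) -> str:
    """Append a LIMIT clause unless the statement already ends with one.

    Naive by design: it only looks at the tail of one SELECT statement.
    """
    stripped = sql.strip().rstrip(";")
    if re.search(r"\blimit\s+\d+(\s+offset\s+\d+)?\s*$", stripped, re.IGNORECASE):
        return stripped
    return f"{stripped} LIMIT {limit}"


# Demo data: 500 active users in a throwaway in-memory database.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (id INTEGER, name TEXT, email TEXT, active INTEGER)")
conn.executemany(
    "INSERT INTO users VALUES (?, ?, ?, 1)",
    [(i, f"user{i}", f"user{i}@example.com") for i in range(500)],
)

query = with_limit("SELECT id, name, email FROM users WHERE active = 1", limit=100)
rows = conn.execute(query).fetchall()
```

A query that already carries its own LIMIT passes through unchanged, so the guard can be applied unconditionally.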

api-explorer

-- BAD: Full schema
get_api_schema(format="full")

-- GOOD: Summary only for overview
get_api_schema(format="summary")

-- GOOD: Path list with limit
list_api_paths(limit=50)

-- GOOD: Single endpoint details
get_api_endpoint_details(path="/users/{id}", method="GET")

Tool usage:

get_api_schema(format="summary"): For API overview
list_api_paths(limit=50): For endpoint list
get_api_models(compact=true): For model list without full schema
search_api(limit=10): For targeted searches
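To show what a summary-first view buys, the sketch below reduces an OpenAPI-style document to a count of operations per tag. It assumes the usual OpenAPI layout (`paths`, then HTTP methods, then a `tags` list per operation); `summarize_by_tag` is a hypothetical helper, not a tool exposed by the api-explorer server.

```python
from collections import Counter
from typing import Dict

HTTP_METHODS = {"get", "put", "post", "delete", "patch", "head", "options", "trace"}


def summarize_by_tag(spec: Dict) -> Dict[str, int]:
    """Count operations per tag in an OpenAPI-style document."""
    counts: Counter = Counter()
    for path_item in spec.get("paths", {}).values():
        for method, operation in path_item.items():
            if method.lower() not in HTTP_METHODS:
                continue  # skip non-operation keys such as "parameters"
            for tag in operation.get("tags", ["untagged"]):
                counts[tag] += 1
    return dict(counts)


# Tiny illustrative spec; a real one may run to megabytes.
spec = {
    "paths": {
        "/users": {"get": {"tags": ["users"]}, "post": {"tags": ["users"]}},
        "/users/{id}": {"get": {"tags": ["users"]}},
        "/login": {"post": {"tags": ["auth"]}},
    }
}
summary = summarize_by_tag(spec)
```

The summary is a handful of integers regardless of how large the specification is; endpoint details can then be requested one at a time.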

documentation

-- BAD: Entire document
fetch_docs(topic="react")

-- GOOD: Targeted search
search_docs(query="useEffect cleanup", maxResults=3)

Tool usage:

search_docs(maxResults=3): For specific information search
fetch_docs: Only for very specific topics
Check skill files FIRST before fetching documentation

log-analyzer

-- BAD: All logs
parse_logs(file="/var/log/app.log")

-- GOOD: Recent errors only
find_errors(file="/var/log/app.log", limit=50)

-- GOOD: Tail for live debugging
tail_logs(file="/var/log/app.log", lines=50)

Tool usage:

tail_logs(lines=50): For recent logs
find_errors(limit=50): For error debugging
parse_logs(limit=200): Only if full analysis needed
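The effect of tailing and of limiting error results can be approximated locally with a bounded deque. This is only a sketch of the idea: `tail` and `recent_errors` are hypothetical helpers, and matching on the literal string "ERROR" is an assumption about the log format.

```python
from collections import deque
from typing import Iterable, List


def tail(lines: Iterable[str], n: int = 50) -> List[str]:
    """Keep only the last n lines; older lines are discarded as we go."""
    return list(deque(lines, maxlen=n))


def recent_errors(lines: Iterable[str], limit: int = 50) -> List[str]:
    """Return the most recent `limit` lines that contain ERROR."""
    return list(deque((line for line in lines if "ERROR" in line), maxlen=limit))


# Synthetic log: every tenth line is an error.
log = [f"{'ERROR' if i % 10 == 0 else 'INFO'} event {i}" for i in range(1000)]
recent = tail(log, n=50)
errors = recent_errors(log, limit=5)
```

Both helpers return a bounded number of lines no matter how long the log is, which is the property that keeps the response small.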

security-scanner

Tool usage:

scan_dependencies: Prefer over scan_all
scan_secrets: Faster than full scan
scan_all: Only for complete audits

code-quality

Tool usage:

analyze_complexity(path="src/specific/file.ts"): Target specific files
find_duplicates(minLines=10): Filter significant duplicates only
code_metrics: Compact output for overview

Pre-Call MCP Checklist

Before calling an MCP tool, verify:

Do I already have this information in context?
Can I use a more specific tool instead of a generic one?
Have I set an appropriate limit?
Have I used compact=true if available?
Is the expected output reasonable (< 2000 tokens)?
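The checklist can also be expressed as a small pre-flight function. Everything in this sketch is illustrative: `precall_warnings` and `estimate_tokens` are hypothetical helpers, and the four-characters-per-token ratio is a rough heuristic assumed here, not a real tokenizer.

```python
from typing import Any, Dict, List

TOKEN_BUDGET = 2000  # threshold taken from the checklist
CHARS_PER_TOKEN = 4  # rough heuristic; an assumption, not a real tokenizer


def estimate_tokens(text: str) -> int:
    """Approximate token count by rounding up characters / CHARS_PER_TOKEN."""
    return (len(text) + CHARS_PER_TOKEN - 1) // CHARS_PER_TOKEN


def precall_warnings(
    params: Dict[str, Any],
    supports_compact: bool,
    already_in_context: bool,
    expected_output: str = "",
) -> List[str]:
    """Return the checklist items that a planned tool call fails."""
    warnings: List[str] = []
    if already_in_context:
        warnings.append("information already in context; skip the call")
    if "limit" not in params:
        warnings.append("no limit set")
    if supports_compact and not params.get("compact"):
        warnings.append("compact=true available but not used")
    if estimate_tokens(expected_output) >= TOKEN_BUDGET:
        warnings.append("expected output exceeds token budget")
    return warnings


issues = precall_warnings({}, supports_compact=True, already_in_context=False)
```

An empty list means the call passes every check that can be evaluated mechanically; choosing a more specific tool remains a judgment call.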

Output Format Standards

For code analysis

Max 5 issues per category
Snippets max 10 lines
Use tables for lists

For database queries

Max 20 rows in direct output
For results > 20: "Found N rows. First 20: ..."
Compact tabular format
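A formatter that follows these row limits might look like the sketch below. `format_rows` is a hypothetical helper; it assumes the total row count is known separately (for example from a COUNT query) and that every row is a dict with the same keys.

```python
from typing import Dict, List, Sequence


def format_rows(rows: Sequence[Dict[str, object]], total: int, max_rows: int = 20) -> str:
    """Render at most max_rows rows as tab-separated text with a count header."""
    shown = list(rows[:max_rows])
    if not shown:
        return f"Found {total} rows."
    columns = list(shown[0].keys())
    truncated = total > len(shown)
    if truncated:
        header = f"Found {total} rows. First {len(shown)}:"
    else:
        header = f"Found {total} rows:"
    lines: List[str] = [header, "\t".join(columns)]
    lines += ["\t".join(str(row[c]) for c in columns) for row in shown]
    if truncated:
        lines.append(f"Use offset={len(shown)} for next page.")
    return "\n".join(lines)


sample = [{"id": i, "status": "active"} for i in range(25)]
report = format_rows(sample, total=1523)
```

Tab-separated output keeps the table compact while staying readable in plain text.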

For documentation

Quote only relevant parts (max 500 characters)
Link to complete docs instead of copying content
Summarize instead of quoting verbatim
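The 500-character quoting limit can be enforced mechanically. In this sketch `quote_excerpt` is a hypothetical helper and `docs/react.md` is a placeholder source, not a real path.

```python
def quote_excerpt(text: str, source: str, max_chars: int = 500) -> str:
    """Quote at most max_chars of text and point to the source for the rest."""
    excerpt = " ".join(text.split())  # collapse whitespace and newlines
    if len(excerpt) > max_chars:
        # Cut at the last word boundary inside the limit, then mark the elision.
        excerpt = excerpt[:max_chars].rsplit(" ", 1)[0] + " [...]"
    return f'"{excerpt}" (full text: {source})'


long_text = "word " * 300  # about 1500 characters
quoted = quote_excerpt(long_text, "docs/react.md")
```

Short passages pass through untouched; longer ones are cut at a word boundary and followed by a pointer to the complete document.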

Efficient Response Examples

Database Query - Compact Output

Found 1523 rows. First 20:

id	name	status
1	...	active
...
Use offset=20 for next page.

API Exploration - Progressive Detail

API has 45 endpoints. Summary by tag:

users: 8 endpoints
auth: 5 endpoints
products: 12 endpoints
...
Use get_api_endpoint_details for specifics.

Log Analysis - Focused Output

Found 234 errors in last hour. Top 5 by frequency:

ConnectionTimeout: 89 occurrences
ValidationError: 45 occurrences
...
Use tail_logs or parse_logs with filters for details.

Reference Documentation

Deep Knowledge: Use mcp__documentation__fetch_docs with technology: token-optimization for advanced optimization techniques.
