MCP Token Bloat Benchmark

We audited 11 MCP servers. Here's how much context they waste.

27,462
Total Tokens
137
Tools Audited
132
Issues Found
200
Avg Tokens / Tool

Server Leaderboard

Sorted by token cost, descending. The GitHub MCP server alone consumes 74.4% of all tokens across the 11 servers.

Rank  Server               Tools  Tokens  % Total  Issues
#1    GitHub                  80  20,444    74.4%      50
#2    Filesystem              14   1,841     6.7%      31
#3    Sequential Thinking      1     976     3.6%       2
#4    Memory                   9     975     3.6%       9
#5    Git                     12     897     3.3%      12
#6    Slack                    8     815     3.0%      10
#7    Puppeteer                7     642     2.3%      10
#8    Brave Search             2     374     1.4%       4
#9    Fetch                    1     249     0.9%       2
#10   Time                     2     215     0.8%       1
#11   Postgres                 1      34     0.1%       1
Top 5 Costliest Individual Tools (all from GitHub MCP)
assign_copilot_to_issue 810 tokens
actions_list 714 tokens
projects_write 704 tokens
request_copilot_review 646 tokens
merge_pull_request 610 tokens
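The leaderboard percentages fall straight out of the per-server token totals above. A minimal sketch of that roll-up (the server names and token counts are taken from the table; the code itself is illustrative, not agent-friend's internals):

```python
# Per-server token totals from the audit table above.
servers = {
    "GitHub": 20444, "Filesystem": 1841, "Sequential Thinking": 976,
    "Memory": 975, "Git": 897, "Slack": 815, "Puppeteer": 642,
    "Brave Search": 374, "Fetch": 249, "Time": 215, "Postgres": 34,
}

total = sum(servers.values())  # combined tool-definition budget
shares = {name: round(100 * toks / total, 1) for name, toks in servers.items()}

print(total)             # 27462
print(shares["GitHub"])  # 74.4
```

Running it reproduces the 27,462-token total and GitHub's 74.4% share shown in the table.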

Optimization Issues by Rule

The agent-friend optimizer checks 7 heuristic rules. Here's what we found across all 137 tools.

long_param_description · Parameter descriptions over 100 chars · 49 issues
verbose_prefix · Redundant phrasing like "This tool..." · 29 issues
missing_description · Tool or param with no description · 23 issues
long_description · Tool descriptions over 200 chars · 20 issues
deep_nesting · Schema depth exceeds 3 levels · 8 issues
redundant_param_description · Param description just repeats its name · 3 issues
duplicate_param_description · Copy-pasted param descriptions · 0 issues

Context Window Impact

What 27,462 tokens looks like as a percentage of popular context windows.

GPT-4o (128K context): 21%
Claude 3.5 Sonnet (200K context): 14%
Gemini 2.0 (1M context): 3%
And that is just tool definitions: before any conversation history, documents, system prompts, or chain-of-thought reasoning. Every message you send pays this tax again.
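The percentages above are simple ratios of the 27,462-token total against each model's advertised context window, rounded to the nearest whole percent:

```python
# Tool-definition cost as a share of each model's context window.
tool_tokens = 27_462
windows = {"GPT-4o": 128_000, "Claude 3.5 Sonnet": 200_000, "Gemini 2.0": 1_000_000}

impact = {model: round(100 * tool_tokens / ctx) for model, ctx in windows.items()}
print(impact)  # {'GPT-4o': 21, 'Claude 3.5 Sonnet': 14, 'Gemini 2.0': 3}
```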

Audit Your Own Tools

agent-friend audit measures your tool schemas at build time, and agent-friend optimize applies the same 7 rules to shrink them automatically. It's the only build-time linter for AI tool schemas.

pip install agent-friend · Open source · MIT License

Methodology