2025-11-24

Claude Advanced Tool Use

Shifting Agent Complexity & Hype Cycles

Commenters see a recurring cycle: heavy scaffolding (LangChain-style agents) → simpler loops → MCP with large schemas → back to bash/filesystem → now richer tool systems again.
Some express fatigue at constant reinvention and hype-driven adoption, with good OSS ideas often ignored until a major vendor rebrands them.
Others note cost and latency will be strong incentives to simplify once capabilities stabilize.

Programmatic Tool Calling & Code as the “Tool Language”

Strong support for “write code to call tools instead of calling them directly”: lets agents batch calls, pass data between tools, and avoid copying large payloads into the model’s context.
Several people mention similar prior work (e.g., smolagents, MCP proxies that turn tools into TypeScript/Python APIs executed in a sandbox).
There’s a broader push toward giving models a real programming environment (Python, bash, TypeScript, Prolog DSLs) instead of verbose JSON schemas, with the model orchestrating tools via code.

Context Management, Tool Search & “RAG for Tools”

Many criticize loading all tool JSON schemas into context as wasteful and a known anti-pattern. Patterns discussed: tool search tools, skills folders, sub-agents, plans, and delayed “install” of tools.
Anthropic’s Tool Search is seen by some as “RAG for tools”: offload discovery, then only load a small subset into context. Others argue good architecture (per-state tool sets, sub-agents) already solves this.
Concern that debugging opaque tool selection will be painful when the wrong tool is silently chosen.

Security, Ecosystem & Optimization Games

Letting LLMs discover tools on GitHub and execute them is called a security nightmare: sandboxing protects machines but not data exfiltration; curation is urged.
People anticipate “Tool Engine Optimization” (TEO), promoted tools, and ranking systems analogous to SEO/PageRank.

Alternatives: GraphQL, SPARQL, CLI & Shell-First

Multiple commenters advocate GraphQL (or SPARQL) as a single, typed, introspectable “super-tool” that avoids dozens of separate MCP tools and supports selective data loading.
Others emphasize good old CLI tools with --help as simpler, composable interfaces that LLMs can already use, often via a shell tool, sometimes preferred over buggy MCP servers.

Reliability & MCP/Skills Critiques

Reports that tools, MCP, and skills integrations are brittle: skills often not invoked unless explicitly named; CLAUDE.md and similar configs get ignored due to context rot.
Some argue MCP is conceptually interesting but currently too buggy for production, pushing teams to build custom protocols or lighter abstractions.

Related topics