Claude Advanced Tool Use
Shifting Agent Complexity & Hype Cycles
- Commenters see a recurring cycle: heavy scaffolding (LangChain-style agents) → simpler loops → MCP with large schemas → back to bash/filesystem → now richer tool systems again.
- Some express fatigue at constant reinvention and hype-driven adoption, with good OSS ideas often ignored until a major vendor rebrands them.
- Others note cost and latency will be strong incentives to simplify once capabilities stabilize.
Programmatic Tool Calling & Code as the “Tool Language”
- Strong support for “write code to call tools instead of calling them directly”: lets agents batch calls, pass data between tools, and avoid copying large payloads into the model’s context.
- Several people mention similar prior work (e.g., smolagents, MCP proxies that turn tools into TypeScript/Python APIs executed in a sandbox).
- There’s a broader push toward giving models a real programming environment (Python, bash, TypeScript, Prolog DSLs) instead of verbose JSON schemas, with the model orchestrating tools via code.
Context Management, Tool Search & “RAG for Tools”
- Many criticize loading all tool JSON schemas into context as wasteful and a known anti-pattern. Patterns discussed: tool search tools, skills folders, sub-agents, plans, and delayed “install” of tools.
- Anthropic’s Tool Search is seen by some as “RAG for tools”: offload discovery, then only load a small subset into context. Others argue good architecture (per-state tool sets, sub-agents) already solves this.
- Concern that debugging opaque tool selection will be painful when the wrong tool is silently chosen.
Security, Ecosystem & Optimization Games
- Letting LLMs discover tools on GitHub and execute them is called a security nightmare: sandboxing protects machines but not data exfiltration; curation is urged.
- People anticipate “Tool Engine Optimization” (TEO), promoted tools, and ranking systems analogous to SEO/PageRank.
Alternatives: GraphQL, SPARQL, CLI & Shell-First
- Multiple commenters advocate GraphQL (or SPARQL) as a single, typed, introspectable “super-tool” that avoids dozens of separate MCP tools and supports selective data loading.
- Others emphasize good old CLI tools with
--helpas simpler, composable interfaces that LLMs can already use, often via a shell tool, sometimes preferred over buggy MCP servers.
Reliability & MCP/Skills Critiques
- Reports that tools, MCP, and skills integrations are brittle: skills often not invoked unless explicitly named; CLAUDE.md and similar configs get ignored due to context rot.
- Some argue MCP is conceptually interesting but currently too buggy for production, pushing teams to build custom protocols or lighter abstractions.