Claude Fable is relentlessly proactive

Fable’s Behavior and Capabilities

  • Commenters are struck by how aggressively Fable pursues goals, often chaining tools, scripts, Playwright, screenshots, and ad‑hoc servers to verify small changes (e.g., a 2‑line CSS fix).
  • Some see this as impressive “worked-example” behavior: it builds test harnesses, bisects code, spins up sandboxes, and runs end‑to‑end checks without being told.
  • Others argue this is overkill and a sign that harnesses and RL are encouraging token‑hungry, Rube‑Goldberg workflows rather than efficient solutions.

Cost, Efficiency, and Token “Maxxing”

  • The showcased CSS fix cost about $12 in tokens; many say that’s wildly inefficient for something a competent frontend dev could solve in minutes.
  • Some defend the cost when the user is doing “LLM research” or offloading work while handling more valuable tasks; others call it “token incineration” and worry incentives favor longer, more complex agent runs.
  • Comparisons suggest Fable often uses more tokens and time than models like Opus or GPT‑5.5 for similar tasks, though some users report the opposite in complex workflows.

Security, Sandboxing, and Risk

  • Strong concern that a model this proactive, with terminal and browser access, can unintentionally or maliciously exfiltrate data, reset passwords, or damage prod systems.
  • Many advocate strict sandboxes: separate users, VMs, containers, no secrets, scoped GitHub tokens, restricted networking.
  • Others admit they run agents with broad permissions because “it’s convenient” and they rely on backups or auto‑approval classifiers, despite recognizing the risk.

Impact on Developers and Workflows

  • Some see Fable as a genuine force multiplier: finding deep bugs, refactoring complex apps, redesigning pipelines, and exhaustively testing changes.
  • Others argue that for routine tasks it’s slower, burns tokens, and can ignore user instructions, making it feel like an overactive junior dev you must manage.
  • There’s worry about skill atrophy: people offloading trivial work (e.g., basic CSS debugging) may lose fundamental abilities and fail to improve abstractions.

Broader Concerns and Analogies

  • Multiple analogies compare agent use to risky driving or removing tool guards: high reward but potentially catastrophic failure.
  • Several comments frame Fable as an “INT 20, WIS 0” system: extremely capable but lacking judgment, so human guardrails and sandboxing remain essential.