Sakana Fugu

What Fugu Is and How It Works

  • Described as an “orchestrator” or “coordinator” LLM that sits in front of multiple models.
  • Regular Fugu appears to route a request to the most suitable model; Fugu Ultra can create multi-step workflows (e.g., one model for math, another for security checks, another for synthesis).
  • It is exposed as a single OpenAI-compatible endpoint, enabling “agent of agents” setups inside existing tools.

Comparisons to Other Multi-Model Systems

  • Frequently compared to OpenRouter Fusion:
    • Fusion = call several models in parallel + a synthesizer step.
    • Fugu = use a dedicated routing model to pick models and sometimes plan sequences.
  • Also compared to Perplexity, Databricks Omnigent, various open-source orchestrators, and simple ensemble methods from classical ML.
  • Some users note similar open-source fusions reportedly match or beat top models at lower cost.

Performance, Quality, and Speed

  • Mixed reports:
    • Some say Fugu-level performance is near frontier models for certain code-review / reasoning tasks.
    • Others find it weaker than leading frontier models for implementation tasks and prone to mistakes.
  • Latency is a recurring complaint; orchestration and multi-step workflows make it feel slow.
  • Technical report is criticized for showing only modest gains over underlying models.

Pricing, Economics, and Alternatives

  • Many find the $200/month tier expensive, especially given reported ~5 hours of usable time and high token burn.
  • Several commenters prefer:
    • Direct access to frontier models at similar or lower effective cost.
    • Very cheap models (e.g., DeepSeek) or local/open-source setups, sometimes orchestrated themselves.
  • Others note subscription fatigue from stacking many paid AI tools.

Research, Architecture, and Future Potential

  • Some see routing/orchestration as the logical next step: combining specialized models and agents can outperform any single model.
  • There’s interest in applying similar ideas to boost smaller, locally hostable models.
  • Concern that frontier labs could integrate equivalent meta-reasoning and make such services obsolete.

Criticism, Risks, and Ethics

  • Skepticism about “frontier-level” marketing and about just replacing one vendor lock-in with another.
  • Frustration over lack of EU availability.
  • Ethical objections to the company’s involvement in defense/military work.