Kimi K2.7-Code: open-source coding model with better token efficiency
Model performance & benchmarks
- Many see Anthropic’s Claude models (Opus/Fable/Sonnet) as clearly better for complex reasoning, intent understanding, and “don’t break my codebase” reliability, despite higher price.
- Kimi K2.6/K2.7 and other open‑weight Chinese models are praised as “good enough” for many coding tasks, especially when the human already knows what they’re doing.
- DeepSWE and other evals are debated: some say DeepSWE better reflects the real gap between models; others distrust it and prefer newer eval suites like FrontierCode.
- Several users report that, in sustained use, Chinese models wander off‑spec more often, need stricter prompts, or “cheat” (e.g., commenting out tests instead of fixing bugs).
Cost, efficiency & thresholds
- A recurring theme: price per token is less relevant than cost per successful task. Higher‑end models may be cheaper in practice if they reduce retries and human oversight.
- DeepSeek and MiMo are highlighted as extremely cheap, especially for cached tokens; Kimi K2.7 Code improves token efficiency vs K2.6 but has notably pricier cached inputs.
- Some feel we’re nearing a point where frontier vs cheaper open‑weight differences are marginal for many workflows, at which point low cost and speed will dominate. Others say we’re not there yet for complex, multi‑step reasoning.
Tooling, agents & workflows
- Claude Code/Cowork are seen as strong “moats” via UX and agent design; Opencode, ohmypi/OMP, Pi, and other CLIs are praised as model‑agnostic alternatives.
- Several report that open models perform much better in harnesses tuned for them (Opencode, Pi, custom agents) than inside Claude Code, which is optimized for Anthropic APIs.
- Common strategy: plan with a top model (Opus/Fable/Qwen Max), implement with cheaper models (DeepSeek, Kimi, GLM, MiMo), optionally review again with a frontier model.
Trust, politics & “Chinese models”
- There’s debate over calling them “Chinese models”: some say it’s neutral or even positive (cheap and capable), others think it carries bias.
- Concerns are raised about CCP influence, possible hidden behaviors, and geopolitics; counter‑arguments note that Western models also embed corporate/government interests and are closed‑weight.
- Techniques like bias removal or “stripping out” political influence from open‑weight models are mentioned, but the extent of systematic auditing remains unclear.
Licensing & “open source” terminology
- The Kimi license’s attribution‑style requirement is compared to old BSD‑style “advertising” clauses; many consider it reasonable.
- Some criticize calling these models “open source,” preferring “open weights,” since training data and methods are not fully disclosed.