Cursor Introduces Composer 2.5

Model & Technical Approach

  • Composer 2.5 is built on Moonshot’s open Kimi K2.5 checkpoint, with extra RL and coding-focused fine-tuning.
  • Several comments note this isn’t a “from scratch” model; the “new from scratch” training is said to be a separate, larger model on SpaceX/xAI’s Colossus 2 cluster.
  • Some argue much of coding capability comes from RL and harness design, not just base model quality.

Performance, Benchmarks & Real-World Use

  • Benchmarks claim near-Opus / GPT frontier performance at ~1/10–1/16 the cost; many are skeptical, citing past Composer 2 claims that didn’t match real-world usage.
  • Multiple developers report Composer 2.5 (especially the fast variant) feels weaker than Opus / Claude Code / GPT for planning, code quality, bug avoidance, and session-level behavior.
  • Others say Composer 2.x is “good enough” and very fast for many day-to-day coding tasks, especially as a sub-agent or for autocomplete.

Pricing, Limits & Economics

  • Confusion around Cursor’s pricing tiers; some on $20 personal plans never hit limits, others report hitting caps or large bills on team plans.
  • Several teams report costs “skyrocketing” after switching to team plans or after fast mode became default; some companies are moving to Claude Code or Codex for cost reasons.
  • Debate over whether cheaper high-quality tokens will compress revenue or instead expand usage (Jevons-like effects).

Product, UX & Harness Quality

  • Strong praise for Cursor’s tab completion and coding harness when it works; some say it remains best-in-class for integrated coding workflows.
  • Many complaints about constant UI changes, regressions, bugs, lag, memory use, and weaker integration with GitHub and agents than competitors.
  • CLI and alternative harnesses (e.g., via Zed) exist but are described as immature or less capable.

Moat, Data & Strategy

  • Ongoing argument whether Cursor has a moat:
    • Skeptics: “still a VS Code fork,” open models are commoditized, big tech has more data.
    • Supporters: the IDE+harness+RL on rich coding/edit data could be defensible, especially with early large-scale usage.
  • Concerns that user code and interaction data are likely used for fine-tuning; some hope users “wake up” to this.

xAI / SpaceX Context & Future

  • Training on Colossus 2 and rumored acquisition by xAI are seen as giving massive compute and cash, but also raising questions about sustainability and strategy.
  • Some view Cursor’s ambition as impressive; others see it as a necessary move to escape low-margin dependence on external APIs.