2026-05-18

Cursor Introduces Composer 2.5

Model & Technical Approach

Composer 2.5 is built on Moonshot’s open Kimi K2.5 checkpoint, with extra RL and coding-focused fine-tuning.
Several commenters note that this is not a “from scratch” model; the from-scratch training run is reportedly for a separate, larger model on SpaceX/xAI’s Colossus 2 cluster.
Some argue that much of the coding capability comes from RL and harness design, not just base-model quality.

Performance, Benchmarks & Real-World Use

The benchmarks claim performance near frontier Opus and GPT models at roughly 1/10 to 1/16 of the cost; many commenters are skeptical, citing earlier Composer 2 claims that did not match real-world usage.
Multiple developers report Composer 2.5 (especially the fast variant) feels weaker than Opus / Claude Code / GPT for planning, code quality, bug avoidance, and session-level behavior.
Others say Composer 2.x is “good enough” and very fast for many day-to-day coding tasks, especially as a sub-agent or for autocomplete.

Pricing, Limits & Economics

Confusion around Cursor’s pricing tiers: some users on $20 personal plans never hit limits, while others report hitting caps or receiving large bills on team plans.
Several teams report costs “skyrocketing” after switching to team plans or after fast mode became default; some companies are moving to Claude Code or Codex for cost reasons.
Debate over whether cheaper high-quality tokens will compress revenue or instead expand usage (Jevons-like effects).
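
The revenue side of this debate can be made concrete with a small sketch. It assumes a constant-elasticity demand curve, which is an illustrative simplification and not something the discussion states; `revenue_ratio`, `price_factor`, and `elasticity` are hypothetical names, and the elasticity values are made up for illustration.

```python
def revenue_ratio(price_factor: float, elasticity: float) -> float:
    """New revenue divided by old revenue after a price change.

    price_factor: new price / old price (0.1 means a 10x price cut).
    elasticity: magnitude of the price elasticity of demand
                (hypothetical input, not a measured value).
    """
    # Constant-elasticity assumption: usage scales as price ** (-elasticity).
    usage_factor = price_factor ** (-elasticity)
    # Revenue is price times usage, so the ratio is the product of both factors.
    return price_factor * usage_factor


# Compare a 10x price cut under three hypothetical elasticities.
for e in (0.5, 1.0, 1.5):
    print(f"elasticity={e}: revenue ratio = {revenue_ratio(0.1, e):.3f}")
```

Under this assumption, a price cut shrinks revenue when the elasticity is below 1 and grows it when the elasticity is above 1; the Jevons-like position amounts to betting that demand for coding tokens falls in the second case.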

Product, UX & Harness Quality

Strong praise for Cursor’s tab completion and coding harness when it works; some say it remains best-in-class for integrated coding workflows.
Many complaints about constant UI changes, regressions, bugs, lag, memory use, and weaker integration with GitHub and agents than competitors.
CLI and alternative harnesses (e.g., via Zed) exist but are described as immature or less capable.

Moat, Data & Strategy

Ongoing argument over whether Cursor has a moat:
- Skeptics: “still a VS Code fork,” open models are commoditized, big tech has more data.
- Supporters: the IDE+harness+RL on rich coding/edit data could be defensible, especially with early large-scale usage.
Concerns that user code and interaction data are likely used for fine-tuning; some hope users “wake up” to this.

xAI / SpaceX Context & Future

Training on Colossus 2 and a rumored acquisition by xAI are seen as providing massive compute and cash, but also as raising questions about sustainability and strategy.
Some view Cursor’s ambition as impressive; others see it as a necessary move to escape low-margin dependence on external APIs.
