2026-05-29

The mysterious Hy3 LLM is topping OpenRouter Model Rankings by a large margin

Pricing, Caching, and OpenRouter Economics

Hy3 is popular partly because it’s very cheap on OpenRouter and “good enough” in quality for many workloads.
DeepSeek V4 Flash can be even cheaper if used via DeepSeek’s own API due to stronger caching discounts; using it through OpenRouter can lose significant savings.
Within OpenRouter, effective price varies by provider due to different input/output/cache pricing, so “cheapest” depends on workload mix.
There’s confusion over why some providers are cheaper than DeepSeek itself; one explanation is that cheaper tiers use lower-precision quantization (fp4/fp8) while DeepSeek’s own endpoint may be unquantized.
OpenRouter also offers a “Response Caching” feature for identical requests at no cost, though some question how often identical requests occur in practice.

Usage Rankings and Market Signal Quality

OpenRouter leaderboards reflect only traffic through OpenRouter, so they underrepresent models where many users go direct (e.g., Anthropic).
Some argue the rankings are still a valid market signal; others emphasize that a single large app (“whale”) can dominate tokens, making it hard to infer broad user preference.
Requests for better metrics include unique-user indicators, spend-based charts, and token volume over time.
It’s noted that a single SaaS product can burn billions of tokens per day, reinforcing that token volume ≠ number of users.

Hy3 Popularity, Performance, and “Mystery”

Hy3 is from Tencent, so several commenters question why it’s labeled “mysterious”; others suggest the mystery is more about its sudden leaderboard dominance and deployment details.
One commenter reports multi-benchmark testing where Hy3 ranks mid-pack among ~25–80 models, suggesting it’s not top-tier but competitive.
Another user found it poor for coding/agentic tasks compared to newer frontier models and even strong local models, citing thought loops and undesired edits.
Others find it attractive as a post-training base: good factual knowledge, with issues more on agent behavior than core capabilities.
A link notes Hy3 started as a >400B parameter model later reduced to ~295B as an “optimal zone.”

Free Access and Usage Spikes

Debate over whether Hy3’s ranking is inflated by free access.
Some insist usage stayed high after the free period; others point out it’s still free in various apps and via a “free” OpenRouter endpoint, so the true paid-only demand is unclear.

Data Privacy, Trust, and Jurisdiction Concerns

Some are wary of sending sensitive business data to an LLM whose developers or jurisdiction they don’t fully trust, especially with state-linked entities.
Others argue what matters is the inference provider’s data-handling policies (e.g., zero data retention, opt-out of training), which OpenRouter claims to control via contracts.
Skeptics question how such contracts can be verified or enforced and note that both model developers and hosts are potential threat vectors.
There is broader unease about major US and Chinese AI companies’ relationships with their respective governments and the opacity around data use.

Related topics