The mysterious Hy3 LLM is topping OpenRouter Model Rankings by a large margin
Pricing, Caching, and OpenRouter Economics
- Hy3 is popular partly because it’s very cheap on OpenRouter and “good enough” in quality for many workloads.
- DeepSeek V4 Flash can be even cheaper if used via DeepSeek’s own API due to stronger caching discounts; using it through OpenRouter can lose significant savings.
- Within OpenRouter, effective price varies by provider due to different input/output/cache pricing, so “cheapest” depends on workload mix.
- There’s confusion over why some providers are cheaper than DeepSeek itself; one explanation is that cheaper tiers use lower-precision quantization (fp4/fp8) while DeepSeek’s own endpoint may be unquantized.
- OpenRouter also offers a “Response Caching” feature for identical requests at no cost, though some question how often identical requests occur in practice.
Usage Rankings and Market Signal Quality
- OpenRouter leaderboards reflect only traffic through OpenRouter, so they underrepresent models where many users go direct (e.g., Anthropic).
- Some argue the rankings are still a valid market signal; others emphasize that a single large app (“whale”) can dominate tokens, making it hard to infer broad user preference.
- Requests for better metrics include unique-user indicators, spend-based charts, and token volume over time.
- It’s noted that a single SaaS product can burn billions of tokens per day, reinforcing that token volume ≠ number of users.
Hy3 Popularity, Performance, and “Mystery”
- Hy3 is from Tencent, so several commenters question why it’s labeled “mysterious”; others suggest the mystery is more about its sudden leaderboard dominance and deployment details.
- One commenter reports multi-benchmark testing where Hy3 ranks mid-pack among ~25–80 models, suggesting it’s not top-tier but competitive.
- Another user found it poor for coding/agentic tasks compared to newer frontier models and even strong local models, citing thought loops and undesired edits.
- Others find it attractive as a post-training base: good factual knowledge, with issues more on agent behavior than core capabilities.
- A link notes Hy3 started as a >400B parameter model later reduced to ~295B as an “optimal zone.”
Free Access and Usage Spikes
- Debate over whether Hy3’s ranking is inflated by free access.
- Some insist usage stayed high after the free period; others point out it’s still free in various apps and via a “free” OpenRouter endpoint, so the true paid-only demand is unclear.
Data Privacy, Trust, and Jurisdiction Concerns
- Some are wary of sending sensitive business data to an LLM whose developers or jurisdiction they don’t fully trust, especially with state-linked entities.
- Others argue what matters is the inference provider’s data-handling policies (e.g., zero data retention, opt-out of training), which OpenRouter claims to control via contracts.
- Skeptics question how such contracts can be verified or enforced and note that both model developers and hosts are potential threat vectors.
- There is broader unease about major US and Chinese AI companies’ relationships with their respective governments and the opacity around data use.