2025-07-25

Show HN: Price Per Token – LLM API Pricing Data

Existing tools & discoverability

Several commenters point out prior LLM price comparison tools (OpenRouter, llm-prices.com, Helicone, models.dev, llmprices.dev, etc.) and are surprised the author didn’t find them.
Some say they now just use OpenRouter or similar services to check prices instead of vendor pages.

Scope, completeness, and “low effort” debate

Strong criticism: the site initially covers only about 26 models from 3 big providers and omits many popular ones (Mistral, Llama, Gemma, DeepSeek, Qwen, Groq, etc.) as well as prompt-cache pricing, leading some to call it “low effort” or “a mockup.”
Others strongly defend the project: they value the simplicity, clear UI/graph, and see it as a useful starting point that can be iterated on.
The author says they intentionally started small to gauge interest and plans to add many more models and cache pricing.

Token pricing complexity

Multiple comments argue that “price per token” alone is misleading:
- Tokenizers differ between models; images and structured output can be billed differently.
- Providers have batch pricing, off-peak pricing, context-window-based pricing, “thinking” vs non-thinking token prices, tiering, and implicit/explicit caching.
- The same model served by different providers can have very different prices; open models in particular vary widely in cost across hosts.
Some suggest the right unit is “cost of a standardized task run” rather than per-token price.
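
To illustrate why commenters prefer "cost of a task run" over a bare per-token price, here is a minimal sketch. Every name and number in it is hypothetical: the `Pricing` and `TaskUsage` types, the prices, and the token counts are invented for illustration and do not describe any real model. It assumes "thinking" tokens are billed at the output rate and that cached input tokens, when discounted, have their own rate.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Pricing:
    """Hypothetical prices in USD per million tokens."""
    input_per_mtok: float
    output_per_mtok: float
    cached_input_per_mtok: Optional[float] = None  # None means no cache discount


@dataclass(frozen=True)
class TaskUsage:
    """Token counts one model reported for the same standardized task."""
    input_tokens: int
    output_tokens: int  # assumed to include any "thinking" tokens
    cached_input_tokens: int = 0


def task_cost(p: Pricing, u: TaskUsage) -> float:
    """Return the USD cost of one task run under the assumptions above."""
    cached_rate = (
        p.cached_input_per_mtok
        if p.cached_input_per_mtok is not None
        else p.input_per_mtok
    )
    fresh_input = u.input_tokens - u.cached_input_tokens
    total = (
        fresh_input * p.input_per_mtok
        + u.cached_input_tokens * cached_rate
        + u.output_tokens * p.output_per_mtok
    )
    return total / 1_000_000


if __name__ == "__main__":
    # Made-up example: model A is cheaper per token but emits far more tokens.
    cost_a = task_cost(Pricing(1.0, 4.0), TaskUsage(10_000, 8_000))
    cost_b = task_cost(Pricing(2.0, 6.0), TaskUsage(9_000, 2_000))
    print(f"A: ${cost_a:.4f}  B: ${cost_b:.4f}")
```

With these invented numbers, model B costs about $0.030 per run against about $0.042 for model A, even though B's listed per-token prices are higher.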

Requested features & enhancements

Cost calculator for custom input/output token counts and blended input/output metrics.
Benchmarks or leaderboards joined with pricing to show “bang for buck,” possibly per endpoint / API shape.
Periodic standardized tasks (summarization, coding) to estimate real query cost, with timestamps and historical trend tracking.
Additional metadata: context length, modalities, cache pricing, provider, tiered pricing, etc.
Monitoring/alerting on pricing changes as a potential paid service.
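
The "blended input/output metric" request can be sketched as a weighted average of the two listed prices. This is only one possible definition, not the site's method: the 3:1 input-to-output default, the function names, and the sample prices are all assumptions made for illustration.

```python
from typing import Dict, List, Tuple


def blended_price(
    input_per_mtok: float,
    output_per_mtok: float,
    input_share: float = 3.0,
    output_share: float = 1.0,
) -> float:
    """Weighted average price (USD per million tokens) for a given token mix."""
    total = input_share + output_share
    if total <= 0:
        raise ValueError("input_share + output_share must be positive")
    return (input_share * input_per_mtok + output_share * output_per_mtok) / total


def rank_by_blended(
    prices: Dict[str, Tuple[float, float]],
    input_share: float = 3.0,
    output_share: float = 1.0,
) -> List[Tuple[str, float]]:
    """Sort models from cheapest to most expensive by blended price."""
    ranked = [
        (name, blended_price(inp, out, input_share, output_share))
        for name, (inp, out) in prices.items()
    ]
    return sorted(ranked, key=lambda item: item[1])


if __name__ == "__main__":
    # Hypothetical (input, output) prices in USD per million tokens.
    sample = {"model-a": (3.0, 15.0), "model-b": (5.0, 8.0)}
    print(rank_by_blended(sample))            # 3:1 input-heavy mix
    print(rank_by_blended(sample, 1.0, 0.0))  # input-only mix
```

In this made-up sample the ranking flips between the two mixes, which is why a calculator that accepts custom token counts is more useful than a single blended number.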

Data accuracy & maintenance

One pricing error (Gemini 2.5 Flash Lite) is called out; the initial defensive response and later correction spur discussion about tone and trust.
Several people discuss pulling prices from existing sources (e.g., OpenRouter, LiteLLM) and using agents or scrapers to keep a price database continuously up to date.
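
Whatever the data source, keeping a price database current and alerting on changes comes down to comparing snapshots. The sketch below assumes a snapshot is simply a mapping from a model identifier to an (input, output) price pair; that shape, the `diff_prices` name, and the sample entries are hypothetical and do not reflect the schema of OpenRouter, LiteLLM, or any other service.

```python
from typing import Dict, List, Optional, Tuple

Price = Tuple[float, float]   # (input, output) in USD per million tokens
Snapshot = Dict[str, Price]   # hypothetical shape: model id -> prices
Change = Tuple[str, str, Optional[Price], Optional[Price]]


def diff_prices(old: Snapshot, new: Snapshot) -> List[Change]:
    """Return (kind, model, before, after) for every difference between snapshots."""
    changes: List[Change] = []
    for model in sorted(old.keys() | new.keys()):
        before, after = old.get(model), new.get(model)
        if before is None:
            changes.append(("added", model, None, after))
        elif after is None:
            changes.append(("removed", model, before, None))
        elif before != after:
            changes.append(("changed", model, before, after))
    return changes


if __name__ == "__main__":
    # Invented snapshots taken on two different days.
    yesterday = {"model-a": (3.0, 15.0), "model-b": (5.0, 8.0)}
    today = {"model-a": (2.5, 15.0), "model-c": (0.1, 0.4)}
    for change in diff_prices(yesterday, today):
        print(change)
```

Storing each snapshot with a timestamp would also give the historical trend data requested elsewhere in the thread.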

Related topics