Show HN: Price Per Token – LLM API Pricing Data
Existing tools & discoverability
- Several commenters point out prior LLM price comparison tools (OpenRouter, llm-prices.com, Helicone, models.dev, llmprices.dev, etc.) and are surprised the author didn’t find them.
- Some say they now just use OpenRouter or similar services to check prices instead of vendor pages.
Scope, completeness, and “low effort” debate
- Strong criticism: site initially covers ~26 models from 3 big providers, omitting many popular ones (Mistral, Llama, Gemma, DeepSeek, Qwen, Groq, etc.) and prompt-cache pricing, leading some to call it “low effort” or “a mockup.”
- Others strongly defend the project: they value the simplicity, clear UI/graph, and see it as a useful starting point that can be iterated on.
- The author says they intentionally started small to gauge interest and plans to add many more models and cache pricing.
Token pricing complexity
- Multiple comments argue that “price per token” alone is misleading:
- Tokenizers differ between models; images and structured output can be billed differently.
- Providers have batch pricing, off-peak pricing, context-window-based pricing, “thinking” vs non-thinking token prices, tiering, and implicit/explicit caching.
- Same model via different providers can have very different prices; open models often vary widely in cost across hosts.
- Some suggest the right unit is “cost of a standardized task run” rather than per-token price.
Requested features & enhancements
- Cost calculator for custom input/output token counts and blended input/output metrics.
- Benchmarks or leaderboards joined with pricing to show “bang for buck,” possibly per endpoint / API shape.
- Periodic standardized tasks (summarization, coding) to estimate real query cost, with timestamps and historical trend tracking.
- Additional metadata: context length, modalities, cache pricing, provider, tiered pricing, etc.
- Monitoring/alerting on pricing changes as a potential paid service.
Data accuracy & maintenance
- One pricing error (Gemini 2.5 Flash Lite) is called out; the initial defensive response and later correction spur discussion about tone and trust.
- Several people discuss scraping APIs (e.g., OpenRouter, LiteLLM) and using agents/scrapers to keep a prices database continuously up to date.