2025-05-18

Show HN: Chat with 19 years of HN

Overall Reaction to the Tool

Many commenters find the HN chat interface technically impressive, fun to play with, and surprisingly insightful about users, topics, and trends.
People enjoyed examples like: “best database according to HN”, “best time to post Show HN”, language popularity stats, retirement-number analysis, and user-behavior summaries.
Some hit the free usage limit quickly and wanted more to explore, such as pre-generated/browsable analyses or blog-style writeups of interesting queries.

UX, Access, and Pricing

Several complaints about friction: mandatory email/login, confusing redirects, wrong URL in the submission, and difficulty seeing the HN dataset at first.
Multiple users say they won’t give their email “just to try it”; suggestions include captchas instead of logins and showing value before signup.
Common request: let users plug in their own OpenAI/Claude keys or some “Login with ChatGPT”–style billing to avoid another subscription.
The creator notes that LLM costs and the need to avoid abuse drive the login wall and pricing, and that the app is roughly break-even.

Use of HN Data, Copyright, and Rights

Ongoing debate about whether it’s appropriate to monetize analyses of HN comments:
- One side: HN is public; the data is in a public BigQuery dataset and via API; anything public can be analyzed.
- Other side: commenters retain copyright; HN only has a license; third parties don’t automatically gain commercial rights just because there’s an API or dataset.
The BigQuery listing that appears “official” is clarified (via linked prior discussion) as a third-party project, not something HN/Y Combinator publishes directly.
Some find it especially distasteful that their own contributions are turned into a paid product “sold back” to them.

Privacy, Anonymity, and Doxing Concerns

Strong unease about prompts like “What do you think about user X?” and how easily the tool (or other LLMs) can:
- Aggregate a user’s entire history,
- Infer real-world identity or other accounts,
- “Dox” people or link throwaways via writing style.
Several say this makes them reconsider posting at all; others argue that the damage is already done because of past scraping and datasets.
Distinction is made between public records and impersonation: commenters broadly see AI (or humans) role-playing as real individuals as ethically unacceptable.
Some propose a convention like “NoAI/NoIndex” in profiles as a soft opt-out signal, while acknowledging it wouldn’t be enforceable.

Technical Aspects and Safety

People praise the multi-tool setup (SQL runner, Python transform, charting, search) and how well it orchestrates queries and visualizations.
There’s curiosity about safeguards that prevent destructive SQL (e.g., DELETE), with speculation that the database is read-only plus prompt- or tool-level restrictions.

Language Popularity & HN Bias

The tool’s outputs suggest Rust and Go dominate by story count and karma, while Lua/Erlang have high per-story scores.
Follow-up queries on Show HN titles show Python, JavaScript, Go, and Rust leading by project count.
Commenters note:
- Title-bias (Rust/Go often mentioned in titles),
- Possible undercounting (e.g., TypeScript, Lisp) due to regex-based detection,
- That HN “attention” doesn’t necessarily reflect real-world usage.
Some perceive a systemic Rust bias on HN and speculate that YC/startup culture amplifies it.

Ethical Discomfort with AI Over Social Data

Multiple users express a general “gross” or “icky” feeling about:
- AI systems mining social conversations for fine-grained judgment of individuals,
- Normalizing surveillance-like analysis of casual, in-the-moment discussion.
Others counter that public forums are inherently public, but even they acknowledge the emotional shock of seeing an LLM instantly surface and summarize one’s entire online persona.

Related topics