Ask HN: Why does no one seem to care that AI gives wrong answers?

Perceived Problem: Wrong Answers & “Hallucinations”

  • Many commenters say they do care; they’ve been “burned” enough to stop trusting LLMs for factual Q&A.
  • “Hallucination” is viewed by some as PR spin for faulty output; others treat it as an inherent property of current LLMs.
  • For some users, frequent, confident errors make LLMs effectively useless as answer bots, especially when even simple explanations are wrong.
  • Others argue hallucination isn’t a showstopper in all contexts; it’s a risk to be managed with tests, checks, and product design.

Why Some Still Use It

  • Strong adoption for low‑stakes or creative tasks: drafting emails, rewriting, summarizing, translations, basic code scaffolding, text-to-speech, word transformations.
  • Many accept “90% right” if it saves time and they plan to verify or edit outputs.
  • For inherently probabilistic tasks (e.g., sentiment analysis), higher accuracy than older methods is considered good enough.
  • Some prefer LLMs over ad‑ridden, SEO‑polluted search, even if both can be wrong.

Limits of LLMs and Difficulty of Fixing

  • Multiple comments stress that LLMs are language models, not factual databases; they produce likely next tokens, not guaranteed truths.
  • Some see current architectures as intrinsically prone to overfitting, hidden failure modes, and irreducible hallucinations; they expect an asymptote, not explosive improvement.
  • Others claim that with retrieval (RAG), few-shot prompting, and domain constraints, wrong answers can be treated as normal software bugs and pushed very low for narrow tasks.
  • There’s disagreement whether better training or more complexity will ever make them reliably factual.

Comparison to Humans and Expectations

  • Analogies to junior engineers: useful but must be supervised; critics respond that juniors learn and stop repeating the same mistakes, LLMs do not.
  • Humans can say “I don’t know” or convey uncertainty; LLMs typically answer confidently regardless of reliability.
  • Many note users anthropomorphize models and are misled by fluent, confident language, similar to how some fraudsters operate.

Business Incentives, Hype, and Regulation

  • Commenters highlight incentives: investors and large vendors profit from shipping “good enough” AI now, betting that the “next model” will fix accuracy.
  • Hype cycles (VR, blockchain, NFTs, now AI) are seen as driving deployment even where correctness is critical.
  • Some predict a coming reckoning as enterprises discover AI agents and automation can’t deliver promised reliability.
  • Concerns include lack of regulation, massive energy use, and a broader culture that tolerates broken, ad‑driven tech as long as “line go up.”