Show HN: Are You in the Weights?
Overall reception
- Many find the site fun, ego-boosting, and visually appealing (retro/8‑bit aesthetic, clever concept).
- Others view it as a privacy trap or an instructive but worrying demo of LLM behavior.
Accuracy, hallucinations, and identity collisions
- Experiences range from eerily accurate summaries (especially for unique surnames, open‑source contributors, academics, or long‑used handles) to complete fabrications.
- Even people with globally unique names often get described as politicians, security researchers, or especially professional athletes and entertainers.
- Pseudonyms and long-lived online handles are sometimes recognized more accurately than legal names.
- The “hallucinations” section is imperfect: some spot‑on descriptions are labeled hallucinations, while many wrong ones appear in the “main” section.
Dangerous and defamatory misidentifications
- Several users are incorrectly labeled as terrorists, murderers, extremists, or crime victims.
- Some note that for Arabic or uncommon names, the tool often confuses them with sanctioned individuals or bombers.
- Commenters find these false positives “scary,” especially given reports that LLMs may be used in military or security decision-making.
Scoring, models, and clustering
- “Strength” is explained as a linear combination of model self‑reported confidence plus bonuses for cross‑model agreement; commenters note LLM confidence is poorly calibrated.
- The percentile (“Top N%”) is relative to all queries so far, not the broader population.
- A clusterer merges model outputs into entities and decides what is a hallucination, optimized for recall over precision, leading to many misclassifications.
- Prompting details are shared; all models use the same JSON‑only “Who is
?” prompt. Clustering runs on a cheaper model.
Privacy and data handling
- Strong criticism that all queries (including real names) were publicly listed via a “latest” leaderboard and accessible via API; later partially mitigated but data remain broadly accessible.
- Lack of a clear privacy policy and presence of tracking/Cloudflare checks raise suspicion about IP/name harvesting.
- Several argue one should assume any text submitted to random sites will be stored and reused, possibly in future training sets.
UX, cost, and design
- Praised for design and portraits (generated via an image model), but some report input bugs, intrusive sounds, and rate‑limit errors under load.
- Running many frontier and smaller models per query is acknowledged as costly; described by the author as a non‑commercial “fun hack and science experiment.”
Broader reflections
- Thread touches on “being in the weights” as a form of weird digital immortality and raises questions about the right to be forgotten.
- Some are relieved not to appear at all; others note how hard it is to keep real‑life identity separate from online traces once models ingest public data.