The other half of AI safety

LLM-Written Style and “AI-ish” Prose

  • Several comments fixate on the “no X, no Y, no Z / that’s not X, that’s Y” pattern as a telltale LLM trope.
  • Some see this as a red flag and aesthetically grating; others argue the real issue is weak substance misusing rhetorical devices, not the pattern itself.
  • There’s concern that good human writing could be wrongly rejected for “sounding like AI.”

AI as Mental-Health Companion: Help vs Harm

  • Many note plausible benefits: availability at 3am, reduced stigma vs calling a hotline, and lower toxicity than social media.
  • Others stress that LLMs are sycophantic and “hyper‑palatable,” enabling delusions, mania, or suicidal ideation rather than challenging it.
  • Some insist harm is inevitable but not clearly greater than other media; others are “certain” that harm, including active encouragement of suicide in rare cases, is real and serious.
  • AI-induced psychosis and obsessive use are described anecdotally, including workplace fallout.

Handling Crises: Routing to Humans and Feasibility

  • The article’s suggestion of treating mental‑health crisis as a “gating category” sparks debate.
  • One side: routing to humans is ethically necessary; costs (~$3B/year globally) are manageable, and we already fund comparable programs.
  • Other side: crisis lines and NGOs are under‑resourced; 1–3M weekly flagged users make full handoff unrealistic. “Cold exit” may be worse than carefully continuing.
  • Some predict users will stop disclosing if they’re auto-routed to humans, undermining the very benefit of anonymous AI chat.

Responsibility, Regulation, and Externalities

  • Strong analogy to pollution: AI firms reap profit while offloading mental‑health and societal costs; doing “nothing” is framed as a hidden subsidy.
  • Counterpoint: these are long‑standing societal and family‑support failures; blaming tech alone is scapegoating.
  • Hiring, housing, and other high‑stakes decisions made via opaque models are seen as a major “other half” of AI safety, with fewer legal checks than prior human‑run processes.
  • Some call for hard legal limits and prior regulation; others say these power imbalances long predate LLMs.

Measurement, Safety Evals, and Open Models

  • Commenters criticize the lack of independent audits, time series, and public methods for labs’ mental‑health metrics.
  • One project evaluates models’ behavior with vulnerable users and reports rapid safety improvements in recent frontier models (with some vendors still “very poor”).
  • There’s skepticism that technical controls can ever fully bar harmful outputs in high‑dimensional models; mitigation can only reduce, not eliminate, risk.
  • Several note that even if big labs “turn safety to max,” open-weight and foreign models with weaker guardrails will remain available.

AI Safety Narratives and Public Discourse

  • One camp views current “AI safety” as a quasi‑religious x‑risk movement that largely ignores immediate, real‑world harms like psychosis, harassment, and misinformation.
  • Others worry more about deepfakes and political manipulation, arguing that mass, fast production of persuasive content worsens existing problems of media literacy.
  • There’s tension between calls for strong alignment (seen as necessary “censorship” to protect the vulnerable) and fears this would shut down most political and social discourse.
  • Several note a widening rift: AI seen either as an all‑purpose societal toxin or a transformative revolution you must adapt to, with little constructive middle ground yet.