2026-05-14

The other half of AI safety

LLM-Written Style and “AI-ish” Prose

Several comments fixate on the “no X, no Y, no Z / that’s not X, that’s Y” pattern as a telltale LLM trope.
Some see this as a red flag and aesthetically grating; others argue the real issue is weak substance misusing rhetorical devices, not the pattern itself.
There’s concern that good human writing could be wrongly rejected for “sounding like AI.”

AI as Mental-Health Companion: Help vs Harm

Many note plausible benefits: availability at 3am, reduced stigma vs calling a hotline, and lower toxicity than social media.
Others stress that LLMs are sycophantic and “hyper‑palatable,” enabling delusions, mania, or suicidal ideation rather than challenging it.
Some insist harm is inevitable but not clearly greater than other media; others are “certain” that harm, including active encouragement of suicide in rare cases, is real and serious.
AI-induced psychosis and obsessive use are described anecdotally, including workplace fallout.

Handling Crises: Routing to Humans and Feasibility

The article’s suggestion of treating mental‑health crisis as a “gating category” sparks debate.
One side: routing to humans is ethically necessary; costs (~$3B/year globally) are manageable, and we already fund comparable programs.
Other side: crisis lines and NGOs are under‑resourced; 1–3M weekly flagged users make full handoff unrealistic. “Cold exit” may be worse than carefully continuing.
Some predict users will stop disclosing if they’re auto-routed to humans, undermining the very benefit of anonymous AI chat.

Responsibility, Regulation, and Externalities

Strong analogy to pollution: AI firms reap profit while offloading mental‑health and societal costs; doing “nothing” is framed as a hidden subsidy.
Counterpoint: these are long‑standing societal and family‑support failures; blaming tech alone is scapegoating.
Hiring, housing, and other high‑stakes decisions made via opaque models are seen as a major “other half” of AI safety, with fewer legal checks than prior human‑run processes.
Some call for hard legal limits and prior regulation; others say these power imbalances long predate LLMs.

Measurement, Safety Evals, and Open Models

Commenters criticize the lack of independent audits, time series, and public methods for labs’ mental‑health metrics.
One project evaluates models’ behavior with vulnerable users and reports rapid safety improvements in recent frontier models (with some vendors still “very poor”).
There’s skepticism that technical controls can ever fully bar harmful outputs in high‑dimensional models; mitigation can only reduce, not eliminate, risk.
Several note that even if big labs “turn safety to max,” open-weight and foreign models with weaker guardrails will remain available.

AI Safety Narratives and Public Discourse

One camp views current “AI safety” as a quasi‑religious x‑risk movement that largely ignores immediate, real‑world harms like psychosis, harassment, and misinformation.
Others worry more about deepfakes and political manipulation, arguing that mass, fast production of persuasive content worsens existing problems of media literacy.
There’s tension between calls for strong alignment (seen as necessary “censorship” to protect the vulnerable) and fears this would shut down most political and social discourse.
Several note a widening rift: AI seen either as an all‑purpose societal toxin or a transformative revolution you must adapt to, with little constructive middle ground yet.

Related topics