Creativity has left the chat: The price of debiasing language models
Meme Titles and Research Professionalism
- Several comments debate “meme-ified” titles (“left the chat”, etc.).
- Some argue this has existed for decades and is harmless or even adds personality.
- Others see it as a decline in professionalism and a sign of attention‑seeking or “clickbaitizing” research.
Debiasing vs Biasing and the Possibility of Neutrality
- Strong disagreement on whether “debiasing” is real or just imposing a different bias.
- One camp: all models and corpora are inherently biased; “debiasing” simply aligns to someone’s preferred values or legal definitions.
- Another camp: you can reduce measurable biases (e.g., disparate treatment across protected groups) even if full neutrality is impossible.
- Some suggest more precise terminology like “bias-aligned models” instead of “unbiased.”
Unbiased / Raw Models and Creativity
- Interest in “raw” next‑token models (Wikipedia, HN, Common Crawl) without RLHF.
- Base models are described as more diverse, less refusal‑prone, often better for naming, prose, or technical depth but harder to steer.
- Multiple users report that instruction‑tuned / RLHF’d models feel blander, more generic, and sometimes “lazy” or evasive.
- Discussion notes that RLHF intentionally reduces entropy and mode diversity; some call this “creativity loss,” others say it’s the cost of usability and safety.
Alignment, Censorship, and Morality
- Big thread on whether alignment is analogous to authoritarian control.
- One side: over‑alignment makes models propagandistic, avoids uncomfortable truths, and trains users into self‑censorship.
- The other: models are not moral patients; adjusting outputs is ethically about users and society, not about “torturing” AIs.
- Complaints that current major models show asymmetric political and racial treatment (e.g., reluctance to praise some groups) and heavy filtering in politics‑related queries.
Creativity, Diversity, and Measurement
- Skepticism about equating syntactic/semantic diversity with true creativity.
- Higher variance can also mean hallucinations or noise; lowering variance can improve reliability but flatten style.
- Temperature and sampling tweaks reportedly help less on heavily aligned models due to “flattened logits,” limiting recovery of base‑model diversity.