2024-06-17

Creativity has left the chat: The price of debiasing language models

Meme Titles and Research Professionalism

Several comments debate “meme-ified” titles (“left the chat”, etc.).
Some argue this has existed for decades and is harmless or even adds personality.
Others see it as a decline in professionalism and a sign of attention‑seeking or “clickbaitizing” research.

Debiasing vs Biasing and the Possibility of Neutrality

Strong disagreement on whether “debiasing” is real or just imposing a different bias.
One camp: all models and corpora are inherently biased; “debiasing” simply aligns to someone’s preferred values or legal definitions.
Another camp: you can reduce measurable biases (e.g., disparate treatment across protected groups) even if full neutrality is impossible.
Some suggest more precise terminology like “bias-aligned models” instead of “unbiased.”

Unbiased / Raw Models and Creativity

Interest in “raw” next‑token models (Wikipedia, HN, Common Crawl) without RLHF.
Base models are described as more diverse, less refusal‑prone, often better for naming, prose, or technical depth but harder to steer.
Multiple users report that instruction‑tuned / RLHF’d models feel blander, more generic, and sometimes “lazy” or evasive.
Discussion notes that RLHF intentionally reduces entropy and mode diversity; some call this “creativity loss,” others say it’s the cost of usability and safety.

Alignment, Censorship, and Morality

Big thread on whether alignment is analogous to authoritarian control.
One side: over‑alignment makes models propagandistic, avoids uncomfortable truths, and trains users into self‑censorship.
The other: models are not moral patients; adjusting outputs is ethically about users and society, not about “torturing” AIs.
Complaints that current major models show asymmetric political and racial treatment (e.g., reluctance to praise some groups) and heavy filtering in politics‑related queries.

Creativity, Diversity, and Measurement

Skepticism about equating syntactic/semantic diversity with true creativity.
Higher variance can also mean hallucinations or noise; lowering variance can improve reliability but flatten style.
Temperature and sampling tweaks reportedly help less on heavily aligned models due to “flattened logits,” limiting recovery of base‑model diversity.

Related topics