2024-08-18

Markov chains are funnier than LLMs

Perceived Humor: Markov Chains vs LLMs

Many report Markov bots (on IRC, Twitter, Reddit, Discord, etc.) as consistently funnier because they produce semi-coherent nonsense and abrupt context shifts.
Humor often comes from “unserious surprise” and the brain trying to make sense of wrong-but-plausible sentences or style mashups (e.g., Bible + programming, AWS blogs, police reports, personals, Nietzsche, Dota patch notes).
Several say modern LLMs feel too coherent and “clean,” so their mistakes are tedious (like a confident but ignorant uncle) rather than delightfully absurd.
Others argue Markov comedy is heavily cherry‑picked; most outputs are dull gibberish, whereas LLMs are broadly more usable and only less funny because of expectations.

Role of Alignment, Temperature, and Model Choice

Multiple commenters blame RLHF/guardrails for “sanding off” the weirdness and risk-taking that made early GPT-2/3 generations funny.
Suggestions: use base models, crank up temperature, disable safety prompts, or fine-tune specifically on humor to recover absurdity.
There’s interest in “locked” or reproducible models for research; people turn to open-weight models (Llama, WizardLM, etc.) and local runners.

Debates on Reasoning, Originality, and “Understanding”

One camp claims LLMs only interpolate within training data, can’t robustly reason or form new concepts, and mostly do advanced retrieval; they cite papers on failures in planning, counterfactuals, and abstraction.
Another camp counters that LLMs demonstrably solve many unseen problems, generalize over huge corpora, and that human reasoning is also error-prone; they argue tests often conflate “imperfect” with “no reasoning.”
There is disagreement over definitions of “real reasoning” and “understanding,” with some calling attempts to reserve these terms for humans circular and unfalsifiable.

Relationship Between Markov Models and LLMs

Several note that both are next-token predictors; LLMs can be viewed as very high-order, factored Markov models. Others push back, emphasizing transformer specifics.
One detailed comment argues LLMs contain the information to emulate Markov-style low-level “feel” but architecture and training bias them toward higher-level coherence, suppressing that particular flavor of absurdity.

Related topics