Markov chains are funnier than LLMs

Perceived Humor: Markov Chains vs LLMs

  • Many report Markov bots (on IRC, Twitter, Reddit, Discord, etc.) as consistently funnier because they produce semi-coherent nonsense and abrupt context shifts.
  • Humor often comes from “unserious surprise” and the brain trying to make sense of wrong-but-plausible sentences or style mashups (e.g., Bible + programming, AWS blogs, police reports, personals, Nietzsche, Dota patch notes).
  • Several say modern LLMs feel too coherent and “clean,” so their mistakes are tedious (like a confident but ignorant uncle) rather than delightfully absurd.
  • Others argue Markov comedy is heavily cherry‑picked; most outputs are dull gibberish, whereas LLMs are broadly more usable and only less funny because of expectations.

Role of Alignment, Temperature, and Model Choice

  • Multiple commenters blame RLHF/guardrails for “sanding off” the weirdness and risk-taking that made early GPT-2/3 generations funny.
  • Suggestions: use base models, crank up temperature, disable safety prompts, or fine-tune specifically on humor to recover absurdity.
  • There’s interest in “locked” or reproducible models for research; people turn to open-weight models (Llama, WizardLM, etc.) and local runners.

Debates on Reasoning, Originality, and “Understanding”

  • One camp claims LLMs only interpolate within training data, can’t robustly reason or form new concepts, and mostly do advanced retrieval; they cite papers on failures in planning, counterfactuals, and abstraction.
  • Another camp counters that LLMs demonstrably solve many unseen problems, generalize over huge corpora, and that human reasoning is also error-prone; they argue tests often conflate “imperfect” with “no reasoning.”
  • There is disagreement over definitions of “real reasoning” and “understanding,” with some calling attempts to reserve these terms for humans circular and unfalsifiable.

Relationship Between Markov Models and LLMs

  • Several note that both are next-token predictors; LLMs can be viewed as very high-order, factored Markov models. Others push back, emphasizing transformer specifics.
  • One detailed comment argues LLMs contain the information to emulate Markov-style low-level “feel” but architecture and training bias them toward higher-level coherence, suppressing that particular flavor of absurdity.