Where the goblins came from

Overall reaction to the goblin post

  • Many found the write-up genuinely funny and appreciated the transparency and level of detail.
  • Others saw it as evidence that frontier labs are “vibe-tuning” products with crude hacks rather than principled engineering.
  • Some suspected it was partly a marketing stunt to humanize the company and its models.

Personality training, RLHF, and “culture” in models

  • Commenters note that the goblin quirk arose from RLHF on a “Nerdy” persona and then leaked into other modes.
  • Several liken this to the emergence of a proto‑culture: rewards for small stylistic quirks can spread and stabilize across generations of models.
  • There’s debate over whether this is an amusing example of “AI anthropology” or a sign of poor control over training dynamics.

Bias, safety, and hidden quirks

  • The visible goblin tic prompts concern about subtler biases (e.g., trust or political judgments) that would be much harder to detect.
  • Some see it as confirmation that models can be “poisoned” by small reward signals or data artifacts.
  • Others argue this is analogous to human bias and culture—concerning but not surprising.

System prompts, style tics, and prompt engineering

  • The explicit “never talk about goblins” line in Codex’s system prompt is widely mocked as emblematic of ad‑hoc prompt engineering.
  • Many dislike the highly anthropomorphic system prompts (“you have a vivid inner life”, “epistemically curious collaborator”) and find them cringe or manipulative.
  • Users report strong global effects from seemingly small instructions (e.g., “don’t use exclamation points” killing all enthusiasm; “follow this structure” suppressing refactors).
  • People catalog recurring LLM “tells”: words like “seam”, “shape”, “smoking gun”, “wired”, “load‑bearing”, “quietly”, em‑dash overuse, specific idioms, and favored numbers.

Debates about what LLMs are

  • One camp insists LLMs are just sophisticated autocomplete without selfhood; another argues they implement a genuine, if alien, form of intelligence.
  • There is disagreement over how much we “understand” LLMs: low‑level math is clear, but emergent behavior and internal representations are seen as poorly understood and an active research area.
  • Some doubt LLMs are a path to AGI; others think they’re at least key components.

Data, privacy, and control

  • The quantitative analysis of goblin frequency leads some to infer that a large fraction of user chats are stored and mined, raising privacy worries.
  • Skepticism is expressed about opt‑outs and “no‑training” guarantees, and about how much unseen censorship or steering might already be present.