Political bias in AI: Where the AI models stand

Methodology and Limits of Measuring LLM Political Bias

  • Several comments note that any left/right score depends heavily on how questions are written and how answers are coded.
  • The site’s author (in-thread) describes using political science datasets (e.g., party and population surveys) and separate models to code stances, but concedes it is imperfect and anchored to a particular Overton window.
  • Some propose higher‑dimensional or embedding-based approaches instead of fixed axes, but note these are harder to interpret for consumers.

Problems with Left/Right and the Political Compass

  • Many see the political compass as oversimplified or “garbage,” arguing real politics spans many dimensions and that people often hold cross‑quadrant, internally inconsistent views.
  • Others argue left/right and authoritarian/libertarian are culture- and time-dependent, so there is no objective “center” or “neutral” position.
  • Attempts to locate global politicians (e.g., Macron, Xi, Putin, Albanese, Obama) are widely criticized as implausible or inconsistent with other compass-style analyses.

Model-Specific Observations

  • Grok is perceived as having explicit right‑leaning tuning, with references to public statements about “fixing” it when it contradicts MAGA narratives.
  • Gemini is seen as having swung from visibly overcorrected DEI-style behavior to appearing surprisingly “neutral” on this chart.
  • DeepSeek is plotted as centrist, but multiple commenters say it censors or distorts sensitive Chinese topics (especially Tiananmen), so “center” is questioned.
  • Some note all tested models map closest to the US Democratic Party, which itself is contested as either center-right or left depending on the comparator.

Prompting, Guardrails, and Apparent “Opinions”

  • Several point out that models don’t hold stable ideologies; responses can be primed strongly by prompt framing.
  • Others insist prompt engineering cannot override deeper training-set and guardrail biases, disputing how easily bias can be “created” via prompting.

Visualization and Presentation Critiques

  • The main chart is accused of “chart crime”: low‑contrast dots and label offsets can visually exaggerate Grok’s position and obscure others.
  • Inconsistencies between headline compass positions and more detailed breakdowns lower trust in the study.

Why Bias Matters (or Not)

  • Some argue it’s important because LLMs are already influencing policy, resource allocation, and public understanding.
  • Others say individuals shouldn’t—and mostly won’t—let an LLM decide their politics, so bias measurement is more academic than practical.