2024-06-10

AI Hype is completely out of control – especially since ChatGPT-4o [video]

What “human-level” AI means

Strong disagreement over claims that there’s “no evidence” we’re approaching human-level AI.
Some argue machines already surpass humans on many tasks and the frontier is moving fast; others say this isn’t “intelligence,” just narrow tools.
Definitions vary: human-level as “mediocre human on routine tasks,” “general adaptive intelligence,” or “intelligence = knowledge + reasoning.”
Debate over whether intelligence requires autonomy, ability to learn from few examples, or a drive to seek new knowledge.

Capabilities vs. Limitations of Current Models

LLMs praised for language tasks, translation, standardized boilerplate code, summarization, and as new user interfaces.
Critics emphasize brittleness, hallucinations, poor multi-step reasoning, and difficulty with larger, real codebases or complex back-and-forth.
Some compare current systems to “dog-level” intelligence at best; others insist they’re “stupid as hell” and far from general intelligence.
Evidence cited that GPT‑4-class models are clearly better than 3.5, but still unreliable for “real work” without human checking.

Economic and Labor Impacts

Consensus that AI doesn’t need to beat top humans to be disruptive; matching mediocre humans on rote work is enough.
Parallels to ATMs and bank tellers: automation shrinks some roles but shifts humans “up the value chain.”
Anxiety about large populations who can only do rote work becoming economically useless; debates over capitalism’s need for consumers, UBI, or a future with a small AI-owning elite.

Hype, Hype Cycles, and Investment

Many see a mix of genuine disruption and extreme hype, with comparisons to dot‑com and crypto bubbles.
Some think we’re at or past peak hype and heading toward a “trough of disillusionment” before a productivity plateau.
Others argue progress (better models, lower costs) in the last year is substantial, but fundamental limitations remain.
Concerns about “AI-washing,” dark patterns (cute branding, anthropomorphic voices), and a broader “culture of lying” in tech marketing.

Developer and Everyday User Experiences

Programmers report LLMs are great for scaffolding, small scripts, and boilerplate, but weak at non-trivial, multi-file changes without very careful prompting.
Some non-technical users have deeply integrated AI into daily work (marketing emails, planning) and healthcare settings (auto-generating clinical notes), finding big time savings.
Others feel newer models (e.g., GPT‑4o) are worse or “dumbed down” and are canceling paid subscriptions.

Related topics