AI Hype is completely out of control – especially since ChatGPT-4o [video]
What “human-level” AI means
- Strong disagreement over claims that there’s “no evidence” we’re approaching human-level AI.
- Some argue machines already surpass humans on many tasks and the frontier is moving fast; others say this isn’t “intelligence,” just narrow tools.
- Definitions vary: human-level as “mediocre human on routine tasks,” “general adaptive intelligence,” or “intelligence = knowledge + reasoning.”
- Debate over whether intelligence requires autonomy, ability to learn from few examples, or a drive to seek new knowledge.
Capabilities vs. Limitations of Current Models
- LLMs praised for language tasks, translation, standardized boilerplate code, summarization, and as new user interfaces.
- Critics emphasize brittleness, hallucinations, poor multi-step reasoning, and difficulty with larger, real codebases or complex back-and-forth.
- Some compare current systems to “dog-level” intelligence at best; others insist they’re “stupid as hell” and far from general intelligence.
- Evidence cited that GPT‑4-class models are clearly better than 3.5, but still unreliable for “real work” without human checking.
Economic and Labor Impacts
- Consensus that AI doesn’t need to beat top humans to be disruptive; matching mediocre humans on rote work is enough.
- Parallels to ATMs and bank tellers: automation shrinks some roles but shifts humans “up the value chain.”
- Anxiety about large populations who can only do rote work becoming economically useless; debates over capitalism’s need for consumers, UBI, or a future with a small AI-owning elite.
Hype, Hype Cycles, and Investment
- Many see a mix of genuine disruption and extreme hype, with comparisons to dot‑com and crypto bubbles.
- Some think we’re at or past peak hype and heading toward a “trough of disillusionment” before a productivity plateau.
- Others argue progress (better models, lower costs) in the last year is substantial, but fundamental limitations remain.
- Concerns about “AI-washing,” dark patterns (cute branding, anthropomorphic voices), and a broader “culture of lying” in tech marketing.
Developer and Everyday User Experiences
- Programmers report LLMs are great for scaffolding, small scripts, and boilerplate, but weak at non-trivial, multi-file changes without very careful prompting.
- Some non-technical users have deeply integrated AI into daily work (marketing emails, planning) and healthcare settings (auto-generating clinical notes), finding big time savings.
- Others feel newer models (e.g., GPT‑4o) are worse or “dumbed down” and are canceling paid subscriptions.