2024-08-25

Looming Liability Machines (LLMs)

LLMs in Critical and High‑Risk Contexts

Several commenters warn about LLMs in nuclear, military, and diplomatic settings, e.g., mistranslations in crisis communications potentially triggering war.
Others mock the idea of delegating all decisions to LLMs, referencing sci‑fi (Skynet) and joking about crypto-style “NukeCoin” deterrence.
General sentiment: using opaque, non-deterministic models for life‑or‑death systems is reckless.

Suitability for Root Cause Analysis (RCA) and Incident Response

Strong skepticism: LLMs are seen as text generators that produce plausible stories, not causal explanations of complex failures.
Concerns include hallucinations, management over-trust, and the risk RCA becomes “conman simulation.”
Advocates argue LLMs can be useful assistants: interpreting confusing logs, suggesting hypotheses, triaging incidents faster, or summarizing detailed postmortems.
Some propose fine-tuning on internal incident data or combining LLMs with RAG and tools; others reply that many incidents are novel and correlation-based models may fail.

“Next-Token Prediction” vs Reasoning Debate

One camp: LLMs are just next-token predictors lacking genuine understanding or world models; unsuited for deep causal reasoning.
Counter-arguments:
- Next-token prediction can implicitly require complex internal models (e.g., math-like tasks), so dismissing LLMs on this basis is oversimplified.
- Humans also operate with patterns and partial understanding; distinction between human and LLM “stochastic parrots” may be smaller than critics claim.
Technical subthread on token-level vs sequence-level probability, planning, beam search, and how humans write with broader structure than single-word steps.

Verification, Error Rates, and Safety

Key concern: model output must be human-verified, and LLMs can produce more text than humans can safely check, especially in “agentic” multi-step systems.
Suggestions:
- Automate verification with deterministic systems where possible (e.g., use LLM to write code, then verify with a precise tool).
- Domain-specific verifiers or constraints may eventually bound errors, but using LLMs to verify LLMs is seen as circular.
Some liken this to software correctness and formal methods: general guarantees are impossible, but specific systems can be verified.

Organizational and Process Issues

Many incidents are low-stakes business outages; LLM RCA may be a way to cope with bureaucratic, time-consuming postmortem processes.
Others worry that automating postmortems removes a key feedback channel where engineers highlight structural and quality problems.
Anecdotes describe management chasing AI for promotion/PR, underestimating review costs, and causing subtle, hard-to-debug failures.

Java 17 Migration and Productivity Claims

Commenters question claims of “4,500 developer-years” saved on Java 17 upgrades via LLMs.
Some argue Java upgrades (especially from recent LTS versions) should be relatively easy; others note real compatibility pain from older baselines like Java 8.
Overall tone: skepticism that headline numbers reflect reality rather than executive storytelling.

Related topics