Oracle engineers caused five days software outage at U.S. hospitals

Why organizations still buy Oracle / Cerner

  • Several comments argue Oracle wins through aggressive enterprise sales: courting executives/CTOs with “we’ll handle everything” pitches, not developer preference.
  • Others note in this case “Oracle wasn’t bought, Cerner was” and that Cerner’s core products historically sat on Oracle backends.
  • Oracle is seen as offering a full-stack menu (DB, ERP, identity, CRM, cloud, etc.) that appeals to large bureaucracies wanting one vendor and “one throat to choke.”
  • Some mention educational seeding: universities teaching Java/Oracle stacks create a pipeline of Oracle-literate juniors.

Legacy lock‑in, mainframes, and COBOL

  • Many see Oracle use as legacy lock‑in: systems built decades ago when alternatives were weaker, now too risky/expensive to replace.
  • Comparisons to COBOL/mainframes: systems run for 40–60 years, deeply embedded in business processes; migration is huge and rarely justified if the old system “still works.”
  • Discussion on COBOL careers: some say it’s a high-pay, long-term niche; others call it “zombieware” that young devs avoid.
  • A few suggest AI-assisted codebase translation as an unexplored opportunity, but others note the complexity and risk.

Technical views on Oracle vs Postgres/SQL Server

  • Strong split: many insist Postgres is better for 99% of use cases; others say if money is no object, OracleDB still wins for extreme scale and fine-grained control.
  • Examples cited: Oracle’s partitioned global unique indexes, ability to pin/prioritize execution plans, Exadata storage-level optimizations.
  • Postgres’s refusal to fully honor query hints is highlighted as a pain point in some mission-critical scenarios.
  • Some note OracleDB is technically impressive but surrounded by awful licensing, audits, tooling, and operational complexity.
  • Others argue Oracle quality is generally low across its vast product line, even if the core database engine is strong.

Cause of the outage: human error vs process failure

  • The reported root cause (“engineers deleted critical storage”) leads many to blame poor change management rather than individual engineers.
  • Several describe what good process should look like in healthcare: strict procedures, staged disablement, read‑only aging, delayed physical deletion, clear rollback and recovery plans.
  • The multi-day recovery time is read as evidence that procedures, safeguards, and tested backups were inadequate.

Culture, blame, and management pressure

  • Debate around whether such failures stem from unrealistic deadlines and executive pressure vs plain incompetence or bad ops hygiene.
  • Some argue it’s a pattern: decisions and budgets arrive late, but delivery dates don’t move, forcing compressed, risky work.
  • Others push back on reflexively blaming management, emphasizing that individuals and organizations both share responsibility.

LLMs, “vibe coding,” and reliability

  • Thread digresses into AI-assisted “vibe coding”: rapid progress initially but poor architecture, weak understanding, and fragile prototypes.
  • Experienced developers report that heavy LLM reliance can degrade learning and insight; LLMs are seen as powerful search/idea tools, not mentors or safety nets.
  • Concern that novices using LLMs may ship systems they don’t understand deeply enough to debug safely in critical environments.

Healthcare / EHR specifics and Cerner design

  • Cerner’s architecture is criticized: shared “multitenant” database setups for multiple hospitals and high access privileges (e.g., widespread SSH and production DB write access).
  • Multiple EHRs (Cerner, Epic, others) are described as dreadful from clinician and operational perspectives, even when technically “up.”
  • Some note regional regulatory constraints (e.g., strict privacy rules) making generic EHR products hard to adapt.
  • Skepticism that Oracle’s future AI-based EHR will prioritize real quality over marketing.