GPUHammer: Rowhammer attacks on GPU memories are practical
Why Rowhammer-like Issues Persist
- Several comments argue manufacturers knowingly traded integrity for density, speed, and cost: “fast, dense, cheap now” beat “provably correct, larger, slower.”
- Rowhammer-like “pattern sensitivity” in DRAM was reportedly known for decades and once treated as a blocking defect, but later tolerated as process shrinks made it harder to avoid.
- Some suggest vendors assumed such attacks were impractical from userland until public proofs made them real.
- Others frame this as an economic externality: consumers can’t evaluate memory integrity, vendors compete on price/GB, and there’s little liability or regulatory pressure.
Inherent DRAM/GPU Vulnerabilities
- Rowhammer is described as inherent to modern high-density DRAM and expected to worsen with further scaling.
- GPUs historically got away with occasional VRAM bitflips because they were “just” for graphics; now they host critical compute (e.g., DNNs), so integrity matters more.
- One paper-highlighted PoC flips a single bit to destroy a model’s accuracy (80% → 0.1%).
ECC and Performance Trade-offs
- Disagreement on ECC cost:
- Some note ECC DIMMs often ship at lower rated speeds/latency and that GPU ECC (especially Nvidia’s GDDR-based “soft ECC”) can reduce bandwidth.
- Others counter that proper ECC adds extra chips and bus width so bandwidth is preserved; the extra check cycle is usually hidden by caches.
- Consensus that ECC is valuable, but many devices still ship without it; some call mass non‑ECC systems unethical.
Multi-tenant GPUs and Practical Exploitability
- Discussion centers on whether GPUs are realistically shared across tenants:
- Major clouds generally expose dedicated GPUs to customers, though they internally time-slice or partition (MIG, Kubernetes time-sharing).
- Some smaller services and on-prem HPC setups do share GPUs across users or containers.
- Concern that browser APIs (WebGL/WebGPU) might become vectors, but current attacks are “blind” corruption, not straightforward data exfiltration.
Meta/Philosophical Threads
- Several comments riff on the appeal of “hammering” as exploiting analog physics beneath digital abstractions, extending this into simulation and cosmology analogies.