Calculating the cost of a Google DeepMind paper
Scale of a $10M Experiment
- Many note that $10M is huge at individual or small-company scale but tiny for Alphabet, whose revenue and profit make such losses almost unnoticeable.
- Others argue that the real risk is not a single $10M miss but failing to learn from repeated mistakes, which could signal structural issues.
Internal vs External Compute Costs
- Multiple comments stress the article estimates replication cost at public cloud prices, not Google’s internal cost.
- Google likely used TPUs and internal quota/priority systems, making marginal cost closer to electricity and depreciation than list GPU prices.
- Some argue that if you can buy H100s outright, full ownership can be far cheaper than $3/GPU-hour cloud rates.
Electricity, Infrastructure, and Opportunity Cost
- Disagreement over whether electricity is “negligible”: for hyperscalers it’s small relative to total cost but still in the multi‑million range for heavily utilized clusters.
- Some insist opportunity cost matters: internal cycles displace revenue-generating customer workloads; others counter that utilization is not 100%, so idle capacity makes internal use very cheap.
Best-Effort / Spot Compute and Utilization
- Description of internal “best effort” or low-priority tiers: jobs run on spare capacity and get preempted by higher-priority workloads.
- Others note GPUs/TPUs are harder to preempt efficiently; frequent checkpointing and reloads can make pure “spare cycles” training unrealistic at this scale.
Reproducibility and the “GPU Poor”
- Concern that multi‑million‑dollar experiments are effectively irreproducible for most academics and small labs.
- Some compare this to high-energy physics or space experiments: expensive but still part of science.
- Worry that big labs’ compute-heavy standards crowd out “GPU poor” research and raise reviewer expectations beyond what smaller groups can afford.
Why Big Labs Publish
- Suggested motives: recruiting and retaining researchers, marketing and brand building, impressing investors, and occasionally blocking patents by placing ideas in the public domain.
- Several note that top AI papers function as high-value advertising and career currency, which can distort incentives toward hype and opaque presentation.
Comparisons and Value
- Parallels drawn to other fields: mouse studies and high-throughput drug screens routinely cost hundreds of thousands of dollars or more.
- Some argue similar or greater money is burned across industry on failed projects; research of this scale is not unusual globally.
Critiques of the Cost Estimate
- Technical commenters question assumptions on model-parallelism, hardware choice, utilization (MFU), and pricing, suggesting the blog’s dollar figure could be off by a factor (likely too high).
- Others defend the order of magnitude and emphasize that even with optimization, replication would still be extremely expensive.