2024-07-30

Calculating the cost of a Google DeepMind paper

Scale of a $10M Experiment

Many note that $10M is huge at individual or small-company scale but tiny for Alphabet, whose revenue and profit make such losses almost unnoticeable.
Others argue that the real risk is not a single $10M miss but failing to learn from repeated mistakes, which could signal structural issues.

Internal vs External Compute Costs

Multiple comments stress the article estimates replication cost at public cloud prices, not Google’s internal cost.
Google likely used TPUs and internal quota/priority systems, making marginal cost closer to electricity and depreciation than list GPU prices.
Some argue that if you can buy H100s outright, full ownership can be far cheaper than $3/GPU-hour cloud rates.

Electricity, Infrastructure, and Opportunity Cost

Disagreement over whether electricity is “negligible”: for hyperscalers it’s small relative to total cost but still in the multi‑million range for heavily utilized clusters.
Some insist opportunity cost matters: internal cycles displace revenue-generating customer workloads; others counter that utilization is not 100%, so idle capacity makes internal use very cheap.

Best-Effort / Spot Compute and Utilization

Description of internal “best effort” or low-priority tiers: jobs run on spare capacity and get preempted by higher-priority workloads.
Others note GPUs/TPUs are harder to preempt efficiently; frequent checkpointing and reloads can make pure “spare cycles” training unrealistic at this scale.

Reproducibility and the “GPU Poor”

Concern that multi‑million‑dollar experiments are effectively irreproducible for most academics and small labs.
Some compare this to high-energy physics or space experiments: expensive but still part of science.
Worry that big labs’ compute-heavy standards crowd out “GPU poor” research and raise reviewer expectations beyond what smaller groups can afford.

Why Big Labs Publish

Suggested motives: recruiting and retaining researchers, marketing and brand building, impressing investors, and occasionally blocking patents by placing ideas in the public domain.
Several note that top AI papers function as high-value advertising and career currency, which can distort incentives toward hype and opaque presentation.

Comparisons and Value

Parallels drawn to other fields: mouse studies and high-throughput drug screens routinely cost hundreds of thousands of dollars or more.
Some argue similar or greater money is burned across industry on failed projects; research of this scale is not unusual globally.

Critiques of the Cost Estimate

Technical commenters question assumptions on model-parallelism, hardware choice, utilization (MFU), and pricing, suggesting the blog’s dollar figure could be off by a factor (likely too high).
Others defend the order of magnitude and emphasize that even with optimization, replication would still be extremely expensive.

Related topics