AlphaGenome: AI for better understanding the genome

Perceptions of Google/DeepMind and Tech Leadership

  • Several comments pivot to leadership and strategy: some argue Google’s CEO is uninspiring and has enshittified products but grown profits massively; others credit him for early, heavy AI infra investment and backing DeepMind.
  • Comparisons are made with other big tech CEOs and eras (notably cloud under one major competitor), with debate over how much success is “set up by predecessors” versus real strategic vision.
  • DeepMind is seen as “punching above its weight” in high‑impact AI for science, though commenters note many strong but less‑visible efforts in pharma, biotech, and newer institutes.

Model Capabilities and Scientific Novelty

  • AlphaGenome is viewed as a strong, well‑engineered demonstration of sequence‑to‑function modeling, in the lineage of Enformer/AlphaFold, using U‑nets/transformers and conformer‑like ideas.
  • Some biologists emphasize that similar approaches already exist; this is seen as a scale and integration advance rather than something conceptually revolutionary.

Causality, Fine-Mapping, and Limits

  • A key criticism: the work largely sidesteps fine‑mapping—distinguishing causal from correlated variants in linkage-disequilibrium blocks, which is central for drug target discovery.
  • Commenters discuss current statistical fine‑mapping (polyfun, SuSiE, etc.) and note that functional prediction scores can be integrated as priors, but prediction ≠ causation, especially in highly correlated genomic regions.
  • There is debate over whether sequence‑to‑function models inherently encode a kind of causal direction (DNA → molecular phenotype).

Non-Coding Genome and Function

  • Excitement centers on improved predictions for “non‑coding” regulatory variants and regulatory RNAs.
  • Others caution that much non‑coding activity may be noisy or effectively neutral, and there is a long‑running, unresolved argument over what “functional” really means in these regions.

Access, Openness, and Commercial Positioning

  • Strong debate over Google’s choice to initially expose AlphaGenome only via a non‑commercial API:
    • Critics say this blocks reproducibility, prevents use on confidential pharma data, and feels like a thinly veiled product pitch.
    • Defenders note this fits DeepMind’s historical pattern and argue API access enables usage monitoring and safety controls.
  • Multiple people highlight a line in the preprint stating that model code and weights will be released upon final publication, which softens earlier criticism.
  • There is concern that non‑commercial or restricted licenses, now common, hinder serious scientific and translational work.

Simulation, Scale, and Broader Bio-AI Goals

  • Some dream of whole‑cell simulations analogous to molecular dynamics, but others argue full MD at cellular scale is intractable and biologically misguided; coarse models and data‑driven perturbation models (like recent “virtual cell” efforts) may be more useful.
  • Discussion touches on genome context length (megabase‑scale windows vs entire chromosomes or genome), 3D genome organization, and long‑range enhancer interactions as future modeling frontiers.

Miscellaneous Notes

  • A side thread critiques the blog’s DNA hero image for mis‑rendering major/minor grooves and uses this to explain basic DNA geometry.
  • Commenters highlight the importance of curated ontologies (e.g., anatomy/metadata standards) in making large functional‑genomics datasets usable for models like AlphaGenome.