Open source AI must win
Definitions & scope
- Frequent distinction between:
- Open source AI (code, data, training recipes open).
- Open weights (downloadable models but opaque/undocumented data and training).
- Some argue open weights are “freeware, not open”; others say weights are the key artifact and already allow modification and fine‑tuning.
- Terminology confusion (open source vs open weights) seen as strategically exploited by big labs’ marketing.
Can open models match frontier labs?
- Many think capital, expert data, and hyperscale datacenters give closed labs a lasting lead; open models will be like GIMP vs Photoshop or Linux vs Windows.
- Counterview: local models (e.g., 30–70B+) are already “Sonnet-ish” for coding and other tasks; for most users, “good enough” open models are sufficient.
- Several note diminishing returns at the bleeding edge: open models a year behind frontier may still capture most practical value.
Compute, hardware, and decentralization
- Strong emphasis that the real moat is GPUs, RAM, and fast interconnects, not weights.
- Debate on whether consumer/prosumer hardware plus time can ever rival hyperscale clusters; some invoke tech deflation, used enterprise GPUs, and future ASICs.
- Many proposals for distributed/federated training and inference (BOINC‑style, MoE, gradient compression), but:
- Latency, bandwidth, and verification/poisoning are major unsolved issues.
- Some see decentralized training as mainly symbolic or as a deterrent, not a path to Fable‑class models.
Funding & business models
- Training frontier models seen as “unfathomably expensive”; skepticism that donations or volunteers can sustain true open competitors.
- Ideas floated: public datacenters, inter‑university or multi‑nation consortia, state‑funded “public LLMs,” corporate consortia that commoditize closed labs’ complements.
- Mistrust that any “open lab” won’t later pivot to for‑profit; OpenAI cited as precedent.
Control, safety, and regulation
- Deep concern about dependence on a few US megacorps for cognition, with attendant risks to speech, work, and “thinking itself.”
- Some argue democratizing powerful models is necessary to build defenses against malicious AI; others insist open access to frontier‑level systems is incompatible with bio/cybersecurity.
- Expectation that a serious AI‑linked incident could trigger harsh regulation of open models and possibly consumer compute.
Geopolitics and “winning”
- Chinese open‑weight labs are praised for preventing a US monopoly, but also viewed as state‑aligned and not altruistically “open.”
- “Winning” is variously defined as:
- Frontier‑class open models.
- Ubiquitous, locally runnable “good enough” open systems.
- Collective or public ownership/governance of models and infrastructure.