2024-07-23

Open source AI is the path forward

Licensing and “open source” debate

Many argue Llama is not “open source” in the OSI/FOSS sense: license restricts commercial use above 700M users, bans some use-cases (e.g., training other models in older licenses, certain industries, governments), and imposes an acceptable-use policy.
Strong push to distinguish “open weights” (freely downloadable models) from true open source (no usage restrictions, OSI-style freedoms).
Some worry Meta is “open‑washing” to gain goodwill while keeping strategic control. Others see the license as generous in practice for 99% of users.

Training data, weights, and reproducibility

Big disagreement on what counts as “source” for AI:
- One camp: training data + training code + curation pipeline = source; weights are like binaries.
- Another camp: for practical modification, weights are the “preferred form” and thus close enough to source.
Many note that major models don’t release training data, often because it likely includes copyrighted or private material. A few projects (e.g., OLMo, Dolma, some Apple/AI2 work) are cited as closer to fully open.
Non‑determinism in training means even with full data and code you can’t perfectly reproduce the same weights.

Government, hardware, and infrastructure

Long subthread on whether governments should fund public GPU clusters or fabs, vs. just research grants or cloud credits.
Arguments for: democratize access, reduce dependence on hyperscalers, analogy to Cold War “Heavy Press Program.”
Arguments against: hardware obsolescence, huge capital costs, risk of distorting markets, IP conflicts (esp. NVIDIA/CUDA dominance).

Meta’s motives and competitive dynamics

Widely seen as “commoditize your complement”: Meta doesn’t sell API access as its core business, so undercutting closed providers (OpenAI, Anthropic, Google) with strong free models helps it.
Some praise Meta’s track record of releasing impactful tools (React, PyTorch) and view this as aligned. Others see it as strategic, not altruistic, and potentially anti‑competitive but still beneficial to developers.

Safety, misuse, and geopolitics

Debate over whether open weights increase or decrease risk:
- Concerns: easier for bad actors and states to build weapons, propaganda, or uncensored tools; guardrails can be stripped.
- Counterpoints: powerful actors can likely steal closed models anyway; open models aid research, robustness, and avoid concentration of power.

Related topics