2024-06-23

Llama.ttf: A font which is also an LLM

Core idea and implementation

A TrueType font embeds a tiny LLM plus inference engine, using Harfbuzz’s experimental WebAssembly (WASM) shaper.
Specific character sequences (e.g., runs of !) are “shaped” into model output at render time; the underlying text data stays unchanged.
This does not use classic TrueType hinting bytecode but a separate WASM shaper path in Harfbuzz.
The demo model is ~15M parameters (≈60–90 MB font file); a 70B-parameter variant would be ~280–290 GB and is shown only conceptually.

Determinism, UX, and copying text

Identical generations across apps come from fixed seeding and temperature 0, making inference deterministic given the same input.
Some suggest input-controlled seeds, regeneration symbols, and letting typed letters override suggestions to act like a custom autocomplete.
Because only the visual shaping changes, copy/paste returns the original punctuation, not the rendered prose.
Users note this feels like a built‑in “DRM”: readable but not trivially copiable; OCR (including OS‑level OCR APIs and tools) is proposed as the workaround.

Security, sandboxing, and complexity

Several posters are alarmed that fonts can execute code at all, citing:
- Long history of font-based exploits and that Windows once parsed TTF in the kernel.
- Increased attack surface: now a WASM runtime is needed just to render text.
- Risks of misleading displays (text says one thing, glyphs show another) for phishing and content scanning.
Others argue:
- WASM is strongly sandboxed, with limited APIs, and comparable in risk to web JS/WASM.
- Harfbuzz’s WASM shaper is experimental and not enabled in mainstream browsers.
- WASM is preferable to ad‑hoc VMs (TrueType, Graphite2) that already exist in font stacks.
Concerns remain about DoS, side channels, quotas, and the general “compute-mad” trend of putting Turing-complete layers everywhere.

Typographic and broader implications

Complex scripts (e.g., Urdu with tens of thousands of ligatures) are cited as justification for powerful shaping logic.
Some find the project “terrifying” yet “awesome”; others see it as a proof‑of‑concept hack rather than something that should become common.
Ideas raised include animated or game fonts, Doom-in-a-font, text-adventure interpreters, and model‑per‑font “personalities” for richer UIs.

Related topics