2026-03-28

CERN uses ultra-compact AI models on FPGAs for real-time LHC data filtering

Scope and Terminology Confusion

Early versions of the article incorrectly described the system as using “LLMs” and models “burned into silicon”; this was later edited to “AI” and then clarified further.
Commenters emphasize that these are not large language models but small, purpose‑built neural networks for anomaly detection.
Several see “AI” and “LLM” here as marketing language, noting that the underlying techniques would previously just be called machine learning or even just statistics.

What CERN Is Actually Doing

The deployed models are described as VAE‑based architectures (AXOL1TL, CICADA), with variants using VICReg‑trained feature extractors.
They are implemented on FPGAs with aggressive quantization and “distributed arithmetic” (shift‑add instead of full multipliers), achieving ~sub‑microsecond latency at 40 MHz.
Weights are hard‑wired into FPGA fabric for inference, but the chips remain reprogrammable; not literally fixed in ASIC silicon for this specific project.
Related work includes tools like hls4ml and flows such as hls4ml‑da4ml for mapping quantized networks to hardware.

FPGAs, ASICs, and Tooling

There is debate over whether CERN is using only FPGAs or also ASICs; for this system it appears FPGA‑based, while other CERN detector electronics do use custom ASICs.
Toolchain limitations (Vivado/Vitis HLS being slow, buggy, and hard to debug) are identified as major practical challenges.
Alternatives like direct RTL generation and open/tool‑agnostic flows are being explored to reduce dependence on commercial HLS.

AI vs. “Traditional” Methods and History

Several note that CERN and others have used neural networks and complex triggers for decades; this work continues that trend rather than starting something fundamentally new.
There is broad discussion about the overbroad use of “AI” today, including cases where linear regression or simple rules are marketed as AI.
Some welcome the sophisticated on‑detector inference; others are wary of potential bias from aggressive prefiltering and of the difficulty of updating models when they are tightly coupled to hardware.

Related topics