2026-05-21

Indexing a year of video locally on a 2021 MacBook with Gemma4-31B (50GB swap)

Overall reaction & use case

Many commenters find the project impressive: indexing ~1TB / a year of video locally with a big vision-capable LLM on a 2021 MacBook is seen as a strong proof of what consumer hardware can do.
Several readers say they now have “a weekend project” and have similar piles of unorganized photos/videos they’d love to index.
There’s interest in extending this to still photos and to assist with editing in DaVinci Resolve or similar tools.

Hardware, performance & swap

Multiple reports of good local LLM performance on Apple Silicon (M1–M5) and even older x86 laptops, though with loud fans and heavy CPU usage.
Some question why so much swap is used, noting SSD wear and that a 4-bit 31B model should fit in far less RAM if the system is cleaned up.
Detailed benchmarks are shared for Qwen 3.6 models on M5 Pro (generation and prefill speeds), plus notes that MLX/oMLX are currently faster than llama.cpp on Apple Silicon.
Discussion of prefill vs generation speeds and how Apple Silicon currently lags GPUs for prefill-heavy workloads.

Implementation details & tooling

The main pipeline uses:
- Sidecar .description.md files per clip for a plain-text “index” that travels with the media.
- Face embeddings via insightface/ArcFace into a local SQLite DB.
- EXIF GPS + Nominatim/OSM for location, keeping faces/locations out of the LLM’s remit.
Others describe similar setups using Whisper, ffmpeg, semantic search, vector DBs, and external LLMs for tagging and video chat.
Scene detection is flagged as a key next step. One implementation uses per-frame histograms to detect changes and select representative frames; ffmpeg’s built-in scene detection is also mentioned.

Local vs cloud, privacy & cost

Local models are praised for:
- Cost control when processing large archives.
- Privacy of personal/family media.
- Freedom from API limits, uptime issues, and safety-policy false positives.
Some argue equivalent setups could work in the cloud if data already lived there; others emphasize the psychological benefit of “no limits” with local models.

AI-assisted writing & HN norms

Many readers find the article’s style “AI-sloppy”: staccato, trope-heavy, and distracting despite solid technical content.
Others say they enjoyed it and see it as one of the better LLM-assisted posts, with real substance underneath.
Several suggest tools and strategies to “de-AI” prose, remove common tropes, and demand higher editorial standards.
A moderator reiterates that AI-generated comments are against HN rules; AI-assisted articles are described as a gray area under active consideration.

Related topics