Indexing a year of video locally on a 2021 MacBook with Gemma4-31B (50GB swap)
Overall reaction & use case
- Many commenters find the project impressive: indexing ~1TB / a year of video locally with a big vision-capable LLM on a 2021 MacBook is seen as a strong proof of what consumer hardware can do.
- Several readers say they now have “a weekend project” and have similar piles of unorganized photos/videos they’d love to index.
- There’s interest in extending this to still photos and to assist with editing in DaVinci Resolve or similar tools.
Hardware, performance & swap
- Multiple reports of good local LLM performance on Apple Silicon (M1–M5) and even older x86 laptops, though with loud fans and heavy CPU usage.
- Some question why so much swap is used, noting SSD wear and that a 4-bit 31B model should fit in far less RAM if the system is cleaned up.
- Detailed benchmarks are shared for Qwen 3.6 models on M5 Pro (generation and prefill speeds), plus notes that MLX/oMLX are currently faster than llama.cpp on Apple Silicon.
- Discussion of prefill vs generation speeds and how Apple Silicon currently lags GPUs for prefill-heavy workloads.
Implementation details & tooling
- The main pipeline uses:
- Sidecar
.description.mdfiles per clip for a plain-text “index” that travels with the media. - Face embeddings via insightface/ArcFace into a local SQLite DB.
- EXIF GPS + Nominatim/OSM for location, keeping faces/locations out of the LLM’s remit.
- Sidecar
- Others describe similar setups using Whisper, ffmpeg, semantic search, vector DBs, and external LLMs for tagging and video chat.
- Scene detection is flagged as a key next step. One implementation uses per-frame histograms to detect changes and select representative frames; ffmpeg’s built-in scene detection is also mentioned.
Local vs cloud, privacy & cost
- Local models are praised for:
- Cost control when processing large archives.
- Privacy of personal/family media.
- Freedom from API limits, uptime issues, and safety-policy false positives.
- Some argue equivalent setups could work in the cloud if data already lived there; others emphasize the psychological benefit of “no limits” with local models.
AI-assisted writing & HN norms
- Many readers find the article’s style “AI-sloppy”: staccato, trope-heavy, and distracting despite solid technical content.
- Others say they enjoyed it and see it as one of the better LLM-assisted posts, with real substance underneath.
- Several suggest tools and strategies to “de-AI” prose, remove common tropes, and demand higher editorial standards.
- A moderator reiterates that AI-generated comments are against HN rules; AI-assisted articles are described as a gray area under active consideration.