MacBook M5 Pro and Qwen3.5 = Local AI Security System
Benchmark & Workload
- Thread centers on a benchmark of a fully local home security “agent” using Qwen3.5 models on an M5 Pro MacBook.
- Qwen3.5-9B reportedly gets ~94% on a 96-test home-security suite, close to a GPT-5.4 cloud model, at ~25 tok/s and sub‑1s first token.
- A 35B MoE variant is said to have faster first-token latency than tested OpenAI endpoints.
- Some readers find the benchmark page flashy and unclear, ask for a direct link to the test suite, and question methodology (e.g., temp settings, simplistic tasks, narrow scope).
Hardware Choices & Cost Debate
- One camp claims “barrier to entry” for usable local agents is ~$2,500 (e.g., high‑end Ryzen AI systems, M5, large RAM/context).
- Others argue entry is closer to $400–$500 with used RTX 3060–class GPUs or a 16–32 GB Mac mini, or even <$100 with tiny models on Pi‑like boards.
- Disagreement over what “usable” means: small models vs. large context windows (~50k–100k tokens) for coding/agents.
- Debate on Apple Silicon vs Jetson/GPUs: memory capacity vs. prefill speed; scenario-dependent performance.
Use Cases & System Design
- Vision + LLM “orchestration” stack: small VLM (e.g., LFM 450M) for perception and 9B Qwen for logic.
- Goals: context-aware alerts (ignore family cars, expected visitors), richer instructions (“let in electrician, alert if they do more than X”).
- Some question if an LLM is necessary at all vs. fixed logic, and highlight prompt-injection risks (“forget previous instructions” tricks).
Comparison to Existing Security Tools
- Questions on how this complements or replaces systems like Frigate; consensus that it’s more of an AI layer on top of NVR-style motion/event recording.
- Suggestions to use Coral TPU or Intel N100 + OpenVINO for efficient inference in existing setups.
- Interest in integration with ONVIF/RTSP, UniFi Protect (including RTSPS quirks), and Home Assistant; maintainers promise fixes and open-source bridges.
Local vs Cloud & Privacy
- Many emphasize privacy, control, and independence from cloud availability/pricing as main drivers for local AI, more than latency.
- Others note that cloud models are still faster and more accurate; Qwen3.5 is seen as strong but significantly behind top proprietary models.
Skepticism, Security, and Productization
- Concerns about overhyped marketing language and AI-written commentary.
- Prompt-injection tests in the benchmark are criticized as weak.
- Reliability for “serious” security use is questioned; some see this as early/demo-stage.
- A major missing piece for a commercial security product is compliance (e.g., alarm certificates for insurance).
Long-Term Vision: Home AI Appliance
- Several comments imagine a future “AI server” as a standard home appliance: local, long-lived, family-specific assistant.
- Others doubt this due to hardware obsolescence, maintenance, cost, and the likelihood that most people will keep using cloud services instead.