2026-03-20

MacBook M5 Pro and Qwen3.5 = Local AI Security System

Benchmark & Workload

Thread centers on a benchmark of a fully local home security “agent” using Qwen3.5 models on an M5 Pro MacBook.
Qwen3.5-9B reportedly gets ~94% on a 96-test home-security suite, close to a GPT-5.4 cloud model, at ~25 tok/s and sub‑1s first token.
A 35B MoE variant is said to have faster first-token latency than tested OpenAI endpoints.
Some readers find the benchmark page flashy and unclear, ask for a direct link to the test suite, and question methodology (e.g., temp settings, simplistic tasks, narrow scope).

Hardware Choices & Cost Debate

One camp claims “barrier to entry” for usable local agents is ~$2,500 (e.g., high‑end Ryzen AI systems, M5, large RAM/context).
Others argue entry is closer to $400–$500 with used RTX 3060–class GPUs or a 16–32 GB Mac mini, or even <$100 with tiny models on Pi‑like boards.
Disagreement over what “usable” means: small models vs. large context windows (~50k–100k tokens) for coding/agents.
Debate on Apple Silicon vs Jetson/GPUs: memory capacity vs. prefill speed; scenario-dependent performance.

Use Cases & System Design

Vision + LLM “orchestration” stack: small VLM (e.g., LFM 450M) for perception and 9B Qwen for logic.
Goals: context-aware alerts (ignore family cars, expected visitors), richer instructions (“let in electrician, alert if they do more than X”).
Some question if an LLM is necessary at all vs. fixed logic, and highlight prompt-injection risks (“forget previous instructions” tricks).

Comparison to Existing Security Tools

Questions on how this complements or replaces systems like Frigate; consensus that it’s more of an AI layer on top of NVR-style motion/event recording.
Suggestions to use Coral TPU or Intel N100 + OpenVINO for efficient inference in existing setups.
Interest in integration with ONVIF/RTSP, UniFi Protect (including RTSPS quirks), and Home Assistant; maintainers promise fixes and open-source bridges.

Local vs Cloud & Privacy

Many emphasize privacy, control, and independence from cloud availability/pricing as main drivers for local AI, more than latency.
Others note that cloud models are still faster and more accurate; Qwen3.5 is seen as strong but significantly behind top proprietary models.

Skepticism, Security, and Productization

Concerns about overhyped marketing language and AI-written commentary.
Prompt-injection tests in the benchmark are criticized as weak.
Reliability for “serious” security use is questioned; some see this as early/demo-stage.
A major missing piece for a commercial security product is compliance (e.g., alarm certificates for insurance).

Long-Term Vision: Home AI Appliance

Several comments imagine a future “AI server” as a standard home appliance: local, long-lived, family-specific assistant.
Others doubt this due to hardware obsolescence, maintenance, cost, and the likelihood that most people will keep using cloud services instead.

Related topics