Show HN: Open source alternative to Perplexity Comet

Architecture and Form Factor

  • Built as a Chromium fork / “wrapper” with C++ patches rather than just an Electron/Playwright-style controller.
  • Key change is enriched DOM and accessibility trees exposed at the C++ level for fast agent actions (click, input, element lookup), claimed to be 20–40x faster than injected JS.
  • Team plans to expose these low-level APIs to developers and potentially implement MCP so the browser can act as a sub-agent for external orchestrators (e.g., automated testing).
  • Several users would prefer a browser extension; maintainers argue Chromium’s extension APIs don’t expose what they need (especially the accessibility tree) and are intentionally constrained.

Security, Maintenance, and Engine Choice

  • Strong concern that a tiny team cannot realistically track Chromium’s rapid, security-critical release cycle and associated CVEs.
  • Critics emphasize the RCE risk of lagging even a few weeks behind upstream; they ask whether every release branch will be rebuilt and shipped.
  • Maintainers say they currently build on the same Chromium release as Chrome, have an auto-updater, and plan to scale the team; Brave is cited as proof a fork can be viable.
  • Some question using Chromium for a “privacy-first” product instead of Firefox; maintainers respond that non-Chromium engines require huge effort to achieve site compatibility and extension support.

Privacy, Models, and Local vs Cloud

  • Pitch is privacy-first: agent runs locally, users can bring their own API keys, and local LLMs via Ollama are supported.
  • Default today is Gemini via a liteLLM proxy (key hidden server-side); a smaller, fine-tuned local model is promised as the future default.
  • Plans include a built-in adblocker plus LLM-based detection of more ad formats; skeptics question using LLMs per-request.

Use Cases, UX, and Current Limitations

  • Vision: automate repetitive workflows (e.g., ecommerce repricing, complex trip planning).
  • Some users say existing tools (Playwright scripts, traditional sites like Orbitz) often solve these better today.
  • Demo (buying toothpaste) is criticized as underspecified and not clearly time-saving; calls for more challenging, realistic demos and better failure recovery.
  • One tester reports a comment-summarization task taking >20 minutes, requiring manual guidance, with many small scroll actions—currently too slow and fragile for their use.
  • Requests for clearer UI affordances (visible cursor movement, typed keys) to make agent behavior understandable.

Business Model and Ecosystem

  • Revenue plan: free consumer browser, paid enterprise edition; AGPL licensing is seen as consistent with that.
  • Linux support is repeatedly requested; maintainers say it’s imminent.
  • Some see this as the “real” open-source version of Perplexity Comet and welcome transparent, local-first agents; others dismiss it as minimal patches over a huge upstream codebase and a “solution looking for a problem.”