2025-02-25

Launch HN: Browser Use (YC W25) – open-source web agents

Positioning vs Other Tools

Compared to Stagehand, Browser Use is described as higher‑level and “end‑to‑end” rather than step‑by‑step, and compatible with many LLMs (including local ones via Ollama).
Versus Skyvern, commenters see Browser Use as an open‑source framework with a simple cloud “one prompt and go” layer, while Skyvern feels more like a closed web app.
Some ask how it compares to TaxyAI’s DOM‑based extension; no detailed answer appears in the thread.

Security & DevTools Concerns

A substantial subthread criticizes the guidance to attach to a user’s real Chrome profile via remote debugging (CDP) as “inherently insecure,” referencing known Chromium issues.
Attack vector: devtools port has no auth and can be exploited (e.g., via XSS) to compromise the browser and potentially the host machine.
Alternatives discussed: ChromeDriver, minimal‑permission extensions, WebViews; disagreement over which approach is safer.
Several commenters say local use in this mode is “unusable” until addressed; cloud use shifts risk to trusting the provider with passwords instead.

Integrations & MCP Debate

Some users want MCP (Model Context Protocol) support to plug Browser Use into tools like Cursor and Claude Desktop.
Maintainers initially view MCP as unnecessary “gimmick” vs direct HTTP APIs, but are swayed by repeated user requests and existing community MCP wrappers.
Pro‑MCP arguments: standardized tooling, easier integrations, explicit user opt‑in, and better security boundaries than arbitrary HTTP/curl.

Scraping Ethics & Website Impact

Concerns about agents driving huge traffic spikes, ignoring robots.txt and rate limits, and increasing hosting bills 100–1000%.
Browser Use team mentions possible mitigations: only fetching interactive elements, skipping media, caching via eTags, and a long‑term vision where agents pay for data.
Critics highlight marketing claims like “extract data behind login walls” as adversarial to sites and liken it to “breaking into someone’s house.”
There’s a request for an easy way for site admins to detect and block Browser Use traffic.

Reliability, Determinism & Speed

Mixed reports: quickstart Reddit example sometimes fails, while more complex internal workflows can work surprisingly well.
Issues cited: loops while loading a tab, hallucinated form inputs (e.g., fake addresses), lack of a verifiable feedback loop that what was requested actually happened.
Ideas: record agent histories, extract xPaths/DOM selectors from successful runs, then re‑run deterministically and use LLMs as a repair fallback.
Users want faster execution; maintainers say they’re already significantly faster than some vision‑heavy competitors and discuss specialized inference (e.g., Groq) and custom actions for common flows like WhatsApp.

Use Cases, RPA & Testing

Strong excitement from people doing:
- Legacy web/CRM integration and internal tools
- RPA‑style workflows, with Browser Use turning agent runs into repeatable scripts
- Automated web testing and accessibility, especially if xPaths/DOM or accessibility trees can be captured once and reused.
There’s interest in extending these ideas to native Windows apps (with pointers to OmniParser, Pig, AskUI) and eventually to mobile device automation.

Licensing & Open‑Source Strategy

MIT license draws praise but also warnings: cloud vendors could repackage and undercut them, as allegedly happened with Elasticsearch, MongoDB, Redis.
Some predict a future shift to AGPL or a business license; maintainers don’t commit either way in the thread.

Anti‑bot / Platform‑Specific Issues

LinkedIn is highlighted as particularly hostile to automation: CAPTCHAs, nested scrollable UIs, detection. Suggested mitigations include using a separate browser profile.
A user reports being blocked within seconds by a real‑estate site and asks about stealth measures; no concrete anti‑detection roadmap is given.
One commenter distinguishes “agents” (acting as user clones) from traditional scraping bots but acknowledges sites that truly don’t want agents will escalate defenses.

Community Reception

Overall sentiment is highly positive: many see Browser Use as a leading open‑source building block for browser agents and RPA.
Users share creative demos (e.g., ordering food from smart glasses) and plan to integrate it into products, while recognizing that security hardening, speed, determinism, and site‑friendliness remain open challenges.

Related topics