Browser Use
π Description
browser-use/browser-use β open-source Python framework that lets an LLM drive a real browser. Instead of feeding screenshots, it surfaces a structured DOM the agent can read and act on. Pair with any LLM (Browser Use Cloud, Anthropic, Google) and either run open source locally or use the fully-hosted cloud agent for stealth, proxy rotation, and 1000+ integrations.
The umbrella project for the Browser Harness (thin CDP harness, self-healing) and Video Use (same βgive the LLM a structured surface, not raw framesβ pattern, applied to video).
π§© Features
- Python>=3.11,
uv add browser-use, one-callAgent(task=..., llm=..., browser=...).run() - Multi-LLM:
ChatBrowserUse,ChatGoogle('gemini-3-flash-preview'),ChatAnthropic('claude-sonnet-4-6') - CLI:
browser-use open <url>,state,click <idx> - Templates:
uvx browser-use init --template default|advanced|tools - Cloud free tier: 3 concurrent browsers, captcha solving, proxies, no card
- LLM Quickstart: point any coding agent (Cursor, Claude Code) at Agents.md β no manual onboarding
- Benchmark suite open at browser-use/benchmark β 100 real-world tasks
π¨ Why it matters
The βDOM as contextβ insight maps directly to how Video Use reads transcripts instead of frames. Same author, same philosophy: structured surface > pixel dump. Aligns with Spec-driven SEO and GEO thinking β give the model the schema, not the rendered noise.
Reasoning for
For tasks where a coding agent needs to actually click around: form filling, scraping with login, account onboarding flows, e-commerce purchases. The hosted Cloud Agent is the easy path for production; OSS for custom-tool integration. See related Hermes Agent and Paperclip for orchestration around such agents.
Alternatives considered
- Playwright/Puppeteer directly β lower-level, no agent reasoning layer
- Browser Harness β same org, but a thinner CDP layer where the agent writes its own helpers
- Hosted Cloud Agent β strongest stealth + scale, paid
π Links
- Repo: https://github.com/browser-use/browser-use
- Docs: https://docs.browser-use.com/
- Cloud: https://cloud.browser-use.com/
π Further reading
Template: tool