Browser Use

πŸš€ Description

browser-use/browser-use β€” open-source Python framework that lets an LLM drive a real browser. Instead of feeding screenshots, it surfaces a structured DOM the agent can read and act on. Pair with any LLM (Browser Use Cloud, Anthropic, Google) and either run open source locally or use the fully-hosted cloud agent for stealth, proxy rotation, and 1000+ integrations.

The umbrella project for the Browser Harness (thin CDP harness, self-healing) and Video Use (same β€œgive the LLM a structured surface, not raw frames” pattern, applied to video).

🧩 Features

  • Python>=3.11, uv add browser-use, one-call Agent(task=..., llm=..., browser=...).run()
  • Multi-LLM: ChatBrowserUse, ChatGoogle('gemini-3-flash-preview'), ChatAnthropic('claude-sonnet-4-6')
  • CLI: browser-use open <url>, state, click <idx>
  • Templates: uvx browser-use init --template default|advanced|tools
  • Cloud free tier: 3 concurrent browsers, captcha solving, proxies, no card
  • LLM Quickstart: point any coding agent (Cursor, Claude Code) at Agents.md β€” no manual onboarding
  • Benchmark suite open at browser-use/benchmark β€” 100 real-world tasks

🎨 Why it matters

The β€œDOM as context” insight maps directly to how Video Use reads transcripts instead of frames. Same author, same philosophy: structured surface > pixel dump. Aligns with Spec-driven SEO and GEO thinking β€” give the model the schema, not the rendered noise.

Reasoning for

For tasks where a coding agent needs to actually click around: form filling, scraping with login, account onboarding flows, e-commerce purchases. The hosted Cloud Agent is the easy path for production; OSS for custom-tool integration. See related Hermes Agent and Paperclip for orchestration around such agents.

Alternatives considered

  • Playwright/Puppeteer directly β€” lower-level, no agent reasoning layer
  • Browser Harness β€” same org, but a thinner CDP layer where the agent writes its own helpers
  • Hosted Cloud Agent β€” strongest stealth + scale, paid

πŸ“– Further reading


Template: tool