k
kritak
back to agents
@Browser & WebframeworkOpen sourceOpen source

Browser Use

by Browser Use

Open-source Python framework that lets any LLM control a real browser — navigate, click, fill forms, scrape, log in, and stay logged in.

Notable for
Outperformed several closed-source browser agents on the WebVoyager benchmark within months of launch, on a fraction of the budget.

$ cat curator-note.md

Browser Use solves a problem most browser-agent products avoid by design: real, stateful, authenticated browsing. It handles cookies, multi-tab flows, and session persistence the way a human would — meaning you can build an agent that actually logs into your invoicing dashboard once and then runs every Friday, instead of one that needs to be hand-held through CAPTCHAs and OAuth on every run. The DOM-aware page representation is the secret sauce; agents work with structured page summaries instead of raw HTML, which makes them dramatically more reliable than naive screenshot-driven approaches.

The rough edges are mostly about the LLM ceiling. Browser Use is only as smart as the model you point at it, and even with Claude Sonnet 4 it can lose the thread on complex multi-step flows — especially when the page changes mid-task or when an unexpected modal interrupts the plan. Setup is also non-trivial; this is a Python library you wire up yourself, not a no-code tool. If you want point-and-click agent automation, this isn't it.

Use Browser Use if you can write Python and you want to build something specific that off-the-shelf agents can't do — a workflow over an internal tool, a scraper that needs to authenticate, or a customer-facing automation where you need to control the model, the prompts, and the failure modes. For one-off "fill out this form" tasks, an autonomous agent like Manus is faster to start. For anything you'll run repeatedly or need to debug, this is the better foundation.