OpenBrowser - The General-Purpose Agentic Browser

The General-Purpose Agentic Browser

Give your AI agent a real browser. 2-6x fewer tokens than every competitor, tested across MCP and CLI benchmarks with 100% accuracy.

Read the Docs

openbrowser.me

Benchmarks

Competitors

100%

Accuracy

2-6x

Fewer Tokens

Why OpenBrowser

Built different from the ground up.

Fewest Tokens in Every Benchmark

Tested against 6 competitors across 2 independent benchmarks. Text-first architecture means 2-6x fewer tokens, every time.

tokens (fewer is better)

Chrome DevTools MCP

299,486

Playwright MCP

158,787

OpenBrowser MCP

50,1956x fewer

Claude Sonnet 4.6 | Feb 2026Methodology

Any LLM Provider

Works with 15 LLM providers out of the box, plus any OpenAI-compatible endpoint.

Gemini

OpenAI

Claude

DeepSeek

Groq

Ollama

Qwen

Azure

Bedrock

OpenRouter

Cerebras

OCI

Azure AI

Novita

LiteLLM

One Tool, Full Control

One execute_code tool, persistent Python namespace. Navigate, click, type, extract.

await navigate("https://news.ycombinator.com")

Live Browser View

Watch your agent browse in real-time via VNC streaming. See every click, scroll, and navigation live.

openbrowser.me

Live VNC stream

Production Ready

Docker, Kubernetes, cloud deployment. Battle-tested infrastructure for any scale.

Open Source

MIT licensed. Community-driven. Fully extensible. Build on top of OpenBrowser.

Star on GitHub

Get Started in 3 Steps

From zero to autonomous browsing in minutes.

Install

pip install openbrowser-ai

Write a Task

from openbrowser import CodeAgent, ChatGoogle

from dotenv import load_dotenv

import asyncio

load_dotenv()

async def main():

agent = CodeAgent(

task="Find the #1 post on Show HN",

llm=ChatGoogle(),

)

await agent.run()

asyncio.run(main())

Watch It Browse

The agent navigates, clicks, types, and extracts data autonomously. Watch it work in real-time through the live browser view.

See OpenBrowser in Action

From ad-hoc browsing to hands-off scheduled automation.

Agent Demo

Watch a real agent browse the web autonomously.

Scheduled Workflows

Schedule recurring browser tasks that reuse saved login sessions.

Works With Your Favorite Tools

OpenBrowser integrates with any MCP-compatible client.

Claude Code

Cursor

Windsurf

VS Code

n8n

Any MCP Client

Cline

Roo Code

Claude Code

Cursor

Windsurf

VS Code

n8n

Any MCP Client

Cline

Roo Code

Claude Code

Cursor

Windsurf

VS Code

n8n

Any MCP Client

Cline

Roo Code

Claude Code

Cursor

Windsurf

VS Code

n8n

Any MCP Client

Cline

Roo Code

Published Research

Two research projects developing custom fine-tuned models, with open-source weights on HuggingFace.

+9.1%reward over SFT (p=0.007)

Browser-in-the-Loop: Reinforcement Fine-Tuning LLM Agents for Web Form Filling

SFT + GRPO on Qwen3-8B with browser execution rewards across 1,250 tasks spanning 8 domains including healthcare, finance, and legal.

Reinforcement LearningSFTGRPOQwen3-8B

Read Paper Qwen3-8B SFT Qwen3-8B GRPO

95.2%task completion (ReFusion + MDPO)

Concentrate or Collapse: When RL Meets Diffusion Language Models for Web Planning

First cross-paradigm study of diffusion LMs for web planning. MDPO achieves near-parity with autoregressive models on form-filling benchmarks.

Diffusion LMsFlow MatchingMDPOESPO

Read Paper 10 Checkpoints

Open-Source Models

Trained weights available on HuggingFace, ready for inference or further fine-tuning.

SFT Baseline

Qwen3-8B SFT

Qwen3-8B-FormFactory-SFT-LoRA

View on HuggingFace RL Improved

Qwen3-8B GRPO

Qwen3-8B-FormFactory-GRPO-LoRA

View on HuggingFace Diffusion

ReFusion 8B

ReFusion-8B-MDPO

View on HuggingFace Compact 1.3B

FS-DFM 1.3B

FS-DFM-1.3B-ESPO

View on HuggingFace

View all models on HuggingFace

Get Early Access

Be the first to try the hosted version of OpenBrowser.