Mcp Server Browser Use Ollama

1 MIT

FreeCommunity

AI Systems

MCP Server for browser automation with Ollama AI models, enabling efficient web interactions.

What is Mcp Server Browser Use Ollama

mcp-server-browser-use-ollama is a powerful browser automation and control system that allows AI agents to interact with web browsers using the Model Context Protocol (MCP). It is specifically designed for use with Ollama local models, ensuring secure and efficient browser automation.

Use cases

Use cases include automating form submissions, navigating websites, extracting data from web pages, and performing repetitive tasks in browsers, all facilitated by AI agents.

How to use

To use mcp-server-browser-use-ollama, first ensure that Ollama is installed and running locally. Then, clone the repository, install the required dependencies, and configure your preferred Ollama model. You can initialize the MCP server and execute browser actions through the provided Python code.

Key features

Key features include full MCP integration for structured communication, support for Ollama models, direct browser control with screenshot capabilities, advanced DOM management, an AI agent system for message management, built-in telemetry for monitoring, and an extensible architecture for custom actions.

Where to use

mcp-server-browser-use-ollama can be used in various fields such as web scraping, automated testing, data collection, and any application requiring automated interactions with web browsers using AI.

Clients Supporting MCP

The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.

Claude Desktop: Official desktop application from Anthropic, natively supports MCP protocol. claude.ai

Cherry Studio: Cross-platform desktop client supporting multiple LLM providers, built-in MCP server support. cherry-ai.com

LobeChat: Modern open-source ChatGPT/LLMs UI, supports MCP protocol integration. lobehub.com

DeepChat: Cross-platform desktop AI assistant, compatible with MCP protocol, focusing on privacy and efficiency. deepchat.thinkinai.xyz

5ire: Cross-platform open-source desktop intelligent assistant MCP client, supports local knowledge base and MCP server. 5ire.app

View More MCP Clients

Overview

What is Mcp Server Browser Use Ollama

Use cases

Use cases include automating form submissions, navigating websites, extracting data from web pages, and performing repetitive tasks in browsers, all facilitated by AI agents.

How to use

Key features

Where to use

mcp-server-browser-use-ollama can be used in various fields such as web scraping, automated testing, data collection, and any application requiring automated interactions with web browsers using AI.

Clients Supporting MCP

The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.

Claude Desktop: Official desktop application from Anthropic, natively supports MCP protocol. claude.ai

Cherry Studio: Cross-platform desktop client supporting multiple LLM providers, built-in MCP server support. cherry-ai.com

LobeChat: Modern open-source ChatGPT/LLMs UI, supports MCP protocol integration. lobehub.com

DeepChat: Cross-platform desktop AI assistant, compatible with MCP protocol, focusing on privacy and efficiency. deepchat.thinkinai.xyz

5ire: Cross-platform open-source desktop intelligent assistant MCP client, supports local knowledge base and MCP server. 5ire.app

View More MCP Clients

Content

Browser Use MCP

A powerful browser automation and control system that enables AI agents to interact with web browsers through the Model Context Protocol (MCP). This implementation is specifically designed to work with Ollama local models, providing a secure and efficient way to automate browser interactions using locally-hosted AI models.

Features

MCP Integration: Full support for Model Context Protocol, enabling structured communication between AI models and browser automation
Ollama Model Support: Optimized for local AI models running through Ollama
Browser Control: Direct browser manipulation and automation with screenshot capabilities
DOM Management: Advanced DOM tree building and processing
AI Agent System: Sophisticated message management and service orchestration
Telemetry: Built-in system monitoring and performance tracking
Extensible Architecture: Modular design supporting custom actions and features

Prerequisites

Ollama installed and running locally
Python 3.8 or higher
pip package manager

Installation

# Clone the repository
git clone https://github.com/yourusername/browser-use-mcp.git
cd browser-use-mcp

# Install dependencies
pip install -r requirements.txt

# Configure Ollama (ensure Ollama is running)
ollama pull qwen2.5-coder:7b # or your preferred model

Quick Start

from browser_use.agent import Agent
from browser_use.browser import Browser
from browser_use.mcp import MCPServer

# Initialize MCP server and Ollama model
mcp_server = MCPServer(model="qwen2.5-coder:7b")

# Initialize browser and agent
browser = Browser()
agent = Agent(browser, mcp_server)

# Execute browser actions through MCP
agent.execute("Navigate to https://example.com and click the first button")

Project Structure

browser_use/
├── agent/              # AI agent coordination
├── browser/           # Browser control and automation
├── dom/               # DOM tree management
├── controller/        # System coordination
└── telemetry/        # System monitoring

Documentation

Comprehensive documentation is available in the .context directory:

Project overview and goals in .context/index.md
System architecture in .context/docs/architecture.md
System flow diagrams in .context/diagrams/system-flow.md

Using with MCP Clients

Claude Desktop Integration

To use browser-use-mcp with Claude Desktop:

Add the MCP server configuration to Claude Desktop’s settings (claude_desktop_config.json):

{
  "mcpServers": {
    "browser-use": {
      "command": "/path/to/.venv/bin/python",
      "args": [
        "/path/to/server.py"
      ]
    }
  }
}

Restart Claude Desktop to load the new MCP server
The browser control tools will now be available to Claude through the MCP protocol:
- browser_action: Control browser interactions
- read_dom: Access page DOM information
- get_screenshot: Capture browser state

Other MCP Clients

For other MCP-compatible clients, configure the server using these parameters:

Command: python
Arguments: ["-m", "browser_use.mcp_server"]
Environment Variables:
- OLLAMA_HOST: Ollama API host (default: http://localhost:11434)
- BROWSER_HEADLESS: Run browser in headless mode (default: false)
- SCREENSHOT_DIR: Directory for saving screenshots (default: ./screenshots)

Examples

Check out the examples/ directory for various use cases:

Simple browser automation
Custom function integration
Multi-tab handling
Parallel agent operations
MCP client integration examples
And more!

Testing

# Run all tests
pytest

# Run specific test file
pytest tests/test_browser.py

Contributing

Fork the repository
Create your feature branch (git checkout -b feature/amazing-feature)
Commit your changes (git commit -m 'Add amazing feature')
Push to the branch (git push origin feature/amazing-feature)
Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Dev Tools Supporting MCP

The following are the main code editors that support the Model Context Protocol. Click the link to visit the official website for more information.

Zed: High-performance collaborative code editor, supports MCP protocol, providing a smooth programming experience. zed.dev

Cursor: AI code editor built on VS Code, supports MCP protocol for context-aware programming. cursor.com

Windsurf: AI code editor from Codeium, integrates MCP protocol to provide intelligent code assistance. windsurf.com

Continue: Open-source AI programming assistant plugin, supports VS Code and JetBrains, compatible with MCP protocol. continue.dev

Trae: AI-driven code editor, supports MCP protocol, focusing on enhancing developer programming experience. trae.ai

View More MCP Dev Tools

Tools

No tools

Comments

Recommend MCP Servers

Tavily MCP Server The Tavily MCP server provides: search, extract, map, crawl tools Real-time web search capabilities through the tavily-search tool Intelligent data extraction from web pages via the tavily-extract tool Powerful web mapping tool that creates a structured map of website Web crawler that systematically explores websites.

MCP Server Chart This is a TypeScript-based MCP server that provides chart generation capabilities. It allows you to create various types of charts through MCP tools. You can also use it in Dify.

GitHub MCP Server MCP Server for the GitHub API, enabling file operations, repository management, search functionality, and more.

Brave Search MCP Server Web and local search using Brave's Search API

Firecrawl MCP Server Advanced web scraping with JavaScript rendering, PDF support, and smart rate limiting

Context7 MCP LLMs rely on outdated or generic information about the libraries you use. You get:

Slack MCP server Channel management and messaging capabilities

Sequential Thinking MCP Server Dynamic and reflective problem-solving through thought sequences

Fetch MCP Server A Model Context Protocol server that provides web content fetching capabilities.

Playwright MCP A Model Context Protocol (MCP) server that provides browser automation capabilities using [Playwright](https://playwright.dev). This server enables LLMs to interact with web pages through structured accessibility snapshots, bypassing the need for screenshots or visually-tuned models.

View All MCP Servers