MCP ExplorerExplorer

Hyperbrowser MCP Server

@hyperbrowseraion 11 days ago
291 MIT
FreeOfficial
Web Services
#browser#web#scraping#crawling#automation
This is Hyperbrowser's Model Context Protocol (MCP) Server. It provides various tools to scrape, extract structured data, and crawl webpages. It also provides easy access to general purpose browser agents like OpenAI's CUA, Anthropic's Claude Computer Use, and Browser Use.

Overview

What is Hyperbrowser MCP Server

The Hyperbrowser MCP Server is a tool designed to facilitate web scraping, data extraction, and webpage crawling. It offers users the ability to access various general-purpose browser agents in a seamless manner, enabling automated interactions with web content.

Use cases

Common use cases include scraping web pages for specific data formats, crawling linked websites to gather structured content, searching the internet via Bing, and utilizing AI agents for complex automation tasks. This server is particularly beneficial for applications requiring structured data extraction from unstructured HTML.

How to use

To use the Hyperbrowser MCP Server, users must first install it using the command ‘npx hyperbrowser-mcp ’. Alternatively, it can be configured in applications like Cursor and Windsurf by specifying command parameters in the respective configuration files. Developers can also run the server from the source after cloning the repository and installing dependencies.

Key features

The server boasts several key features, including tools for scraping webpages, extracting structured data, searching the web with Bing, and general-purpose automation through different AI agents like OpenAI’s CUA and Anthropic’s Claude. Additionally, it offers profile management capabilities such as creating and deleting persistent profiles.

Where to use

The Hyperbrowser MCP Server can be utilized in various programming environments and applications that require web automation and data extraction. It’s particularly suited for developers and data analysts who need to interact with web content programmatically, making it useful in data science, research, and application development.

Content

Hyperbrowser MCP Server

smithery badge

Frame 5

This is Hyperbrowser’s Model Context Protocol (MCP) Server. It provides various tools to scrape, extract structured data, and crawl webpages. It also provides easy access to general purpose browser agents like OpenAI’s CUA, Anthropic’s Claude Computer Use, and Browser Use.

More information about the Hyperbrowser can be found here. The hyperbrowser API supports a superset of features present in the mcp server.

More information about the Model Context Protocol can be found here.

Table of Contents

Installation

Manual Installation

To install the server, run:

npx hyperbrowser-mcp <YOUR-HYPERBROWSER-API-KEY>

Running on Cursor

Add to ~/.cursor/mcp.json like this:

{
  "mcpServers": {
    "hyperbrowser": {
      "command": "npx",
      "args": [
        "-y",
        "hyperbrowser-mcp"
      ],
      "env": {
        "HYPERBROWSER_API_KEY": "YOUR-API-KEY"
      }
    }
  }
}

Running on Windsurf

Add to your ./codeium/windsurf/model_config.json like this:

{
  "mcpServers": {
    "hyperbrowser": {
      "command": "npx",
      "args": [
        "-y",
        "hyperbrowser-mcp"
      ],
      "env": {
        "HYPERBROWSER_API_KEY": "YOUR-API-KEY"
      }
    }
  }
}

Development

For development purposes, you can run the server directly from the source code.

  1. Clone the repository:

    git clone [email protected]:hyperbrowserai/mcp.git hyperbrowser-mcp
    cd hyperbrowser-mcp
    
  2. Install dependencies:

    npm install # or yarn install
    npm run build
    
  3. Run the server:

    node dist/server.js
    

Claude Desktop app

This is an example config for the Hyperbrowser MCP server for the Claude Desktop client.

{
  "mcpServers": {
    "hyperbrowser": {
      "command": "npx",
      "args": [
        "--yes",
        "hyperbrowser-mcp"
      ],
      "env": {
        "HYPERBROWSER_API_KEY": "your-api-key"
      }
    }
  }
}

Tools

  • scrape_webpage - Extract formatted (markdown, screenshot etc) content from any webpage
  • crawl_webpages - Navigate through multiple linked pages and extract LLM-friendly formatted content
  • extract_structured_data - Convert messy HTML into structured JSON
  • search_with_bing - Query the web and get results with Bing search
  • browser_use_agent - Fast, lightweight browser automation with the Browser Use agent
  • openai_computer_use_agent - General-purpose automation using OpenAI’s CUA model
  • claude_computer_use_agent - Complex browser tasks using Claude computer use
  • create_profile - Creates a new persistent Hyperbrowser profile.
  • delete_profile - Deletes an existing persistent Hyperbrowser profile.
  • list_profiles - Lists existing persistent Hyperbrowser profiles.

Installing via Smithery

To install Hyperbrowser MCP Server for Claude Desktop automatically via Smithery:

npx -y @smithery/cli install @hyperbrowserai/mcp --client claude

Resources

The server provides the documentation about hyperbrowser through the resources methods. Any client which can do discovery over resources has access to it.

License

This project is licensed under the MIT License.

Tools

scrape_webpage
Scrape a webpage and extract its content in various formats. This tool allows fetching content from a single URL with configurable browser behavior options. Use this for extracting text content, HTML structure, collecting links, or capturing screenshots of webpages.
crawl_webpages
Crawl a website starting from a URL and explore linked pages. This tool allows systematic collection of content from multiple pages within a domain. Use this for larger data collection tasks, content indexing, or site mapping.
extract_structured_data
Extract structured data from a webpage. This tool allows you to extract structured data from a webpage using a schema.
browser_use_agent
This tool employs an open-source browser automation agent optimized specifically for fast, efficient, and cost-effective browser tasks using a cloud browser. It requires explicit, detailed instructions to perform highly specific interactions quickly. Optimal for tasks requiring: - Precise, explicitly defined interactions and actions - Speed and efficiency with clear, unambiguous instructions - Cost-effective automation at scale with straightforward workflows Best suited use cases include: - Explicitly defined registration and login processes - Clearly guided navigation through web apps - Structured, step-by-step web scraping with detailed guidance - Extracting data via explicitly specified browser interactions You must provide extremely detailed step-by-step instructions, including exact elements, actions, and explicit context. Clearly define the desired outcome for optimal results. Returns the completed result or an error message if issues arise. Note: This agent trades off flexibility for significantly faster performance and lower costs compared to Claude and OpenAI agents.
openai_computer_use_agent
This tool utilizes OpenAI's model to autonomously execute general-purpose browser-based tasks with balanced performance and reliability using a cloud browser. It handles complex interactions effectively with practical reasoning and clear execution. Optimal for tasks requiring: - Reliable, general-purpose browser automation - Clear, structured interactions with moderate complexity - Efficient handling of common web tasks and workflows Best suited use cases include: - Standard multi-step registration or form submissions - Navigating typical web applications requiring multiple interactions - Conducting structured web research tasks - Extracting data through interactive web processes Provide a clear step-by-step description, necessary context, and expected outcomes. Returns the completed result or an error message if issues arise.
claude_computer_use_agent
This tool leverages Anthropic's Claude model to autonomously execute complex browser tasks with sophisticated reasoning capabilities using a cloud browser. It specializes in handling intricate, nuanced, or highly context-sensitive web interactions. Optimal for tasks requiring: - Complex reasoning over multiple web pages - Nuanced interpretation and flexible decision-making - Human-like interaction with detailed context awareness Best suited use cases include: - Multi-step processes requiring reasoning (e.g., detailed registrations or onboarding) - Interacting intelligently with advanced web apps - Conducting in-depth research with complex conditions - Extracting information from dynamic or interactive websites Provide detailed task instructions, relevant context, and clearly specify the desired outcome for best results. Returns the completed result or an error message if issues arise.
search_with_bing
Search the web using Bing. This tool allows you to search the web using bing.com

Comments