Selenium MCP Server

License: MIT


A server enabling AI agents to control web browsers via Selenium WebDriver.

What is Selenium MCP Server?

selenium-mcp-server is a Model Context Protocol (MCP) server that uses Selenium to drive a WebDriver instance, letting AI agents control web browser sessions for automated tasks.

Use cases

Use cases include automated testing of web applications, scraping data from websites, filling out forms automatically, and performing repetitive tasks in a web browser.

How to use

To use selenium-mcp-server, clone the repository, install the necessary dependencies using npm, configure the WebDriver, build the server, and then run it. Optionally, it can be integrated with MCP hosts like Cursor or Claude Desktop.

Key features

Key features include exposing Selenium WebDriver actions as MCP tools, such as navigating to URLs, finding elements, clicking elements, sending keystrokes, and retrieving page source.

Where to use

Selenium-mcp-server can be used in fields such as web scraping, automated testing, and any scenario where web browser automation is required.



Selenium MCP Server


An MCP server that uses Selenium to interact with a WebDriver instance. Built using the MCP-Server-Starter template.

Overview

This server allows AI agents to control a web browser session via Selenium WebDriver, enabling tasks like web scraping, automated testing, and form filling through the Model Context Protocol.

Core Components

MCP Server: Exposes Selenium WebDriver actions as MCP tools.
Selenium WebDriver: Interacts with the browser.
MCP Clients: AI hosts (like Cursor, Claude Desktop) that can utilize the exposed tools.
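
To make the division of labor between these components concrete, the sketch below builds the JSON-RPC 2.0 message that an MCP client sends when it invokes one of the server's tools. The `tools/call` method and the `name`/`arguments` fields follow the MCP specification; the helper name `buildToolCall` and the example URL are illustrative assumptions and do not come from this repository.

```typescript
// Shape of an MCP "tools/call" request (JSON-RPC 2.0).
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: string; arguments: Record<string, unknown> };
}

// Hypothetical helper: builds the request a host would send to the server.
function buildToolCall(
  id: number,
  name: string,
  args: Record<string, unknown>,
): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: { name, arguments: args },
  };
}

const request = buildToolCall(1, "selenium_navigate", {
  url: "https://example.com", // placeholder URL
});
console.log(JSON.stringify(request));
```

In practice the host (Cursor, Claude Desktop) constructs this message itself; the server only needs to register a handler under the matching tool name.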

Prerequisites

Node.js (v18 or later)
npm (v7 or later)
A WebDriver executable (e.g., ChromeDriver, GeckoDriver) installed and available in your system’s PATH.
A compatible web browser (e.g., Chrome, Firefox).
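
If you want to check the version requirements programmatically, a small helper along these lines may be enough. It is only a sketch: `meetsMinimumMajor` is a hypothetical name, and it compares major versions only. In a Node.js script you could pass it `process.versions.node`, or the output of `npm --version`.

```typescript
// Hypothetical helper: true when the major component of a dotted version
// string (with or without a leading "v") is at least minMajor.
function meetsMinimumMajor(version: string, minMajor: number): boolean {
  // parseInt stops at the first non-digit, so "18.17.0" yields 18.
  const major = Number.parseInt(version.replace(/^v/, ""), 10);
  return Number.isInteger(major) && major >= minMajor;
}

console.log(meetsMinimumMajor("v18.17.0", 18)); // true
console.log(meetsMinimumMajor("6.14.18", 7)); // false
```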

Getting Started

Clone the repository:

```
git clone <your-repo-url> selenium-mcp-server
cd selenium-mcp-server
```

Install dependencies:
```
npm install
```
Configure WebDriver:
- Ensure your WebDriver (e.g., chromedriver) is installed and in your PATH.
- If you need custom browser options or an explicit WebDriver path, set them in src/seleniumService.ts (create this file if it does not exist yet).
Build the server:
```
npm run build
```
Run the server:
```
npm start
```
Alternatively, integrate it with an MCP host like Cursor or Claude Desktop (see Integration sections below).

Tools

This server will provide tools such as:

selenium_navigate: Navigates the browser to a specific URL.
selenium_findElement: Finds an element on the page using a CSS selector.
selenium_click: Clicks an element.
selenium_sendKeys: Sends keystrokes to an element.
selenium_getPageSource: Retrieves the current page source HTML.
(Add more tools as needed)
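
One way to picture how the tool names above could map onto browser operations is a small dispatch table. The sketch below is an assumption about structure, not the repository's code: `BrowserLike` is a hypothetical interface covering only what a few of the tools need, the argument names `url`, `selector`, and `text` are guesses, and `FakeBrowser` is an in-memory stand-in so the example runs without a real browser. In the actual server each handler would call selenium-webdriver instead.

```typescript
// Hypothetical minimal surface the tools need from a browser session.
interface BrowserLike {
  navigate(url: string): Promise<void>;
  click(selector: string): Promise<void>;
  sendKeys(selector: string, text: string): Promise<void>;
  pageSource(): Promise<string>;
}

type ToolArgs = Record<string, string | undefined>;
type ToolHandler = (browser: BrowserLike, args: ToolArgs) => Promise<string>;

// Reject calls that omit a required argument before touching the browser.
function requireArg(args: ToolArgs, key: string): string {
  const value = args[key];
  if (value === undefined) throw new Error(`Missing argument: ${key}`);
  return value;
}

// Dispatch table keyed by tool name (argument names are assumed).
const tools: Record<string, ToolHandler | undefined> = {
  selenium_navigate: async (browser, args) => {
    const url = requireArg(args, "url");
    await browser.navigate(url);
    return `Navigated to ${url}`;
  },
  selenium_click: async (browser, args) => {
    await browser.click(requireArg(args, "selector"));
    return "Clicked element";
  },
  selenium_sendKeys: async (browser, args) => {
    await browser.sendKeys(requireArg(args, "selector"), requireArg(args, "text"));
    return "Sent keys";
  },
  selenium_getPageSource: (browser) => browser.pageSource(),
};

async function callTool(browser: BrowserLike, name: string, args: ToolArgs): Promise<string> {
  const handler = tools[name];
  if (!handler) throw new Error(`Unknown tool: ${name}`);
  return handler(browser, args);
}

// In-memory stand-in so the sketch runs without a real browser.
class FakeBrowser implements BrowserLike {
  calls: string[] = [];
  async navigate(url: string) { this.calls.push(`navigate ${url}`); }
  async click(selector: string) { this.calls.push(`click ${selector}`); }
  async sendKeys(selector: string, text: string) { this.calls.push(`sendKeys ${selector} ${text}`); }
  async pageSource() { return "<html></html>"; }
}

const fake = new FakeBrowser();
callTool(fake, "selenium_navigate", { url: "https://example.com" })
  .then((message) => console.log(message, fake.calls));
```

Keeping handlers behind a narrow interface like this also makes them testable without launching a browser, which fits the "one file per tool" layout suggested under Best Practices.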

TypeScript Implementation

The server uses the @modelcontextprotocol/sdk and selenium-webdriver libraries.

```
import { McpServer } from "@modelcontextprotocol/sdk/server/mcp.js";
import { StdioServerTransport } from "@modelcontextprotocol/sdk/server/stdio.js";
import { Builder, WebDriver } from "selenium-webdriver";
import { z } from "zod";

// Basic server setup (details in src/index.ts).
// McpServer is the SDK's high-level class; it provides registerTool()
// in recent SDK versions, which the low-level Server class does not.
const server = new McpServer(
  { name: "selenium-mcp-server", version: "0.1.0" },
  { capabilities: { tools: {} } } // Enable tools capability
);

// Selenium WebDriver setup (details in src/seleniumService.ts)
let driver: WebDriver;

async function initializeWebDriver() {
  driver = await new Builder().forBrowser("chrome").build(); // Or 'firefox', etc.
}

// Example tool implementation (details in src/tools/)
server.registerTool(
  "selenium_navigate",
  {
    description: "Navigates the browser to a specific URL.",
    // Schemas inferred from the handler below; adjust to the real tool contract.
    inputSchema: { url: z.string().url() },
    outputSchema: { success: z.boolean() },
  },
  async ({ url }) => {
    await driver.get(url);
    return {
      content: [{ type: "text", text: `Navigated to ${url}` }],
      structuredContent: { success: true },
    };
  }
);

// Connect transport
async function startServer() {
  await initializeWebDriver();
  const transport = new StdioServerTransport();
  await server.connect(transport);
  // Log to stderr: with the stdio transport, stdout carries protocol messages.
  console.error("Selenium MCP Server connected via stdio.");

  // Graceful shutdown
  process.on("SIGINT", async () => {
    console.error("Shutting down WebDriver...");
    if (driver) {
      await driver.quit();
    }
    process.exit(0);
  });
}

startServer().catch((err) => {
  console.error("Failed to start server:", err);
  process.exit(1);
});
```

Development

Build: npm run build
Run: npm start (executes node build/index.js)
Lint: npm run lint
Format: npm run format

Debugging

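Use the MCP Inspector (for example, npx @modelcontextprotocol/inspector node build/index.js) or standard Node.js debugging techniques. Because the server talks to its host over stdio, write debug output to stderr rather than stdout.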

Integration with MCP Hosts


Cursor Integration

Build your server: npm run build
In Cursor: Settings > Features > MCP: Add a new MCP server.
Register your server:
- Select stdio as the transport type.
- Name: Selenium Server (or similar).
- Command: node /path/to/selenium-mcp-server/build/index.js.
Save.

Claude Desktop Integration

Build your server: npm run build

Modify claude_desktop_config.json:

```
{
  "mcpServers": {
    "selenium-mcp-server": {
      "command": "node",
      "args": [
        "/path/to/selenium-mcp-server/build/index.js"
      ]
    }
  }
}
```

Restart Claude Desktop.
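
If claude_desktop_config.json already lists other servers, the new entry has to be merged in rather than replacing the whole `mcpServers` object. The following is a minimal sketch of that merge as a pure function. The names `addServer`, `McpServerEntry`, and `DesktopConfig` are hypothetical, the `other-server` entry is invented for the demonstration, and the paths are the same placeholders used above. Reading and writing the file itself is left out.

```typescript
// Entry format used under "mcpServers" in the config shown above.
interface McpServerEntry {
  command: string;
  args: string[];
}

// Only the part of the config this sketch cares about; other keys pass through.
interface DesktopConfig {
  mcpServers?: Record<string, McpServerEntry>;
  [key: string]: unknown;
}

// Returns a new config with the entry added; the input is not mutated.
function addServer(
  config: DesktopConfig,
  name: string,
  entry: McpServerEntry,
): DesktopConfig {
  return {
    ...config,
    mcpServers: { ...(config.mcpServers ?? {}), [name]: entry },
  };
}

const existing: DesktopConfig = {
  mcpServers: {
    "other-server": { command: "node", args: ["/path/to/other-server/index.js"] },
  },
};

const updated = addServer(existing, "selenium-mcp-server", {
  command: "node",
  args: ["/path/to/selenium-mcp-server/build/index.js"],
});
console.log(JSON.stringify(updated, null, 2));
```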

Best Practices

Use TypeScript and Zod for type safety and validation.
Keep tools modular (e.g., one file per tool in src/tools/).
Handle WebDriver errors gracefully (e.g., element not found, navigation issues).
Ensure proper WebDriver shutdown (e.g., driver.quit() on server exit).
Follow MCP best practices for schemas, error handling, and content types.
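
As an illustration of graceful error handling, the sketch below wraps a tool action so that a thrown error (for example, an element that cannot be found) becomes an MCP tool result flagged with `isError` instead of crashing the server. The `content` and `isError` fields follow the MCP tool result format; the wrapper name `runTool` and the simulated failure are assumptions made for this example.

```typescript
// Minimal MCP tool result shape: text content plus an optional error flag.
interface ToolResult {
  content: { type: "text"; text: string }[];
  isError?: boolean;
}

// Hypothetical wrapper: runs an action and converts any thrown error
// into an error result the calling agent can read and react to.
async function runTool(action: () => Promise<string>): Promise<ToolResult> {
  try {
    const text = await action();
    return { content: [{ type: "text", text }] };
  } catch (err) {
    const message = err instanceof Error ? err.message : String(err);
    return {
      content: [{ type: "text", text: `Tool failed: ${message}` }],
      isError: true,
    };
  }
}

// Simulated failure standing in for a WebDriver error.
runTool(async () => {
  throw new Error("no such element: #missing");
}).then((result) => console.log(JSON.stringify(result)));
```

Returning the failure as a result, rather than letting the exception propagate, gives the agent a chance to retry with a different selector or URL.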

Credits

Based on the template created by Seth Rose:

Website: https://www.sethrose.dev
𝕏 (Twitter): https://x.com/TheSethRose
🦋 (Bluesky): https://bsky.app/profile/sethrose.dev
