- Explore MCP Servers
- wayland-mcp
Wayland Mcp
What is Wayland Mcp
Wayland MCP is a specialized MCP Server designed for Wayland environments, providing tools for screenshot capture, analysis, and input simulation on modern Linux desktops.
Use cases
Use cases for Wayland MCP include automated testing of applications, creating tutorials or documentation that require screenshots, and developing applications that require precise input control.
How to use
To use Wayland MCP, set up input control by running ‘./setup.sh’ to configure permissions. Then, add the necessary configuration to your MCP server’s config file (.roo/mcp.json) to integrate Wayland MCP.
Key features
Key features include screenshot and image analysis capabilities, custom VLM integration, mouse and keyboard input simulation, and action chaining for complex input sequences.
Where to use
Wayland MCP is ideal for use in modern Linux desktop environments, particularly where reliable screenshot and input simulation tools are needed.
Clients Supporting MCP
The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.
Overview
What is Wayland Mcp
Wayland MCP is a specialized MCP Server designed for Wayland environments, providing tools for screenshot capture, analysis, and input simulation on modern Linux desktops.
Use cases
Use cases for Wayland MCP include automated testing of applications, creating tutorials or documentation that require screenshots, and developing applications that require precise input control.
How to use
To use Wayland MCP, set up input control by running ‘./setup.sh’ to configure permissions. Then, add the necessary configuration to your MCP server’s config file (.roo/mcp.json) to integrate Wayland MCP.
Key features
Key features include screenshot and image analysis capabilities, custom VLM integration, mouse and keyboard input simulation, and action chaining for complex input sequences.
Where to use
Wayland MCP is ideal for use in modern Linux desktop environments, particularly where reliable screenshot and input simulation tools are needed.
Clients Supporting MCP
The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.
Content
🚀 Wayland MCP Server
Note: This package was created because existing screenshot solutions didn’t work reliably on my Wayland setup.
Wayland MCP provides screenshot, analysis, mouse and keyboard control tools for modern Linux desktops.
✨ Features
-
📸 Screenshot & Analysis
- Custom VLM integration for image analysis
- Image comparison capabilities
-
🖱️ Input Simulation
- Mouse control (move, click, drag, scroll)
- Keyboard input (typing, key presses)
- Action chaining for complex sequences
⚠️ Security Warning
WARNING: Enabling input control gives the MCP server full access to your mouse and keyboard.
Only use with trusted MCP servers and models.
🚀 Quick Start
🔧 Input Control Setup
./setup.sh
Configures permissions for evemu-event to control input devices.
⚙️ MCP Server Configuration
Add to your MCP server config (.roo/mcp.json):
{
"mcpServers": {
"wayland-screenshot": {
"command": "uvx",
"args": [
"wayland-mcp"
],
"env": {
"OPENROUTER_API_KEY": "your-api-key",
"VLM_MODEL": "qwen/qwen2.5-vl-72b-instruct:free",
"XDG_RUNTIME_DIR": "/run/user/1000",
"WAYLAND_MCP_PORT": "4999",
"DISPLAY": ":0",
"WAYLAND_DISPLAY": "wayland-0",
"XDG_SESSION_TYPE": "wayland"
}
}
}
}
🛠️ Development
git clone https://github.com/someaka/wayland-mcp
cd wayland-mcp
pip install -e .
📜 License
GPL 3
Dev Tools Supporting MCP
The following are the main code editors that support the Model Context Protocol. Click the link to visit the official website for more information.










