- Explore MCP Servers
- omniparser-autogui-mcp
Omniparser Autogui Mcp
What is Omniparser Autogui Mcp
omniparser-autogui-mcp is an MCP server that utilizes OmniParser to analyze the screen and automatically operate the graphical user interface (GUI). It is confirmed to work on Windows.
Use cases
Use cases include automating repetitive tasks in software applications, testing user interfaces, and enhancing accessibility for users with disabilities.
How to use
To use omniparser-autogui-mcp, clone the repository, set up the environment variables, and configure it in the ‘claude_desktop_config.json’ file. Run the server using the command provided in the documentation.
Key features
Key features include automatic GUI operation, screen analysis using OmniParser, support for multiple languages, and customizable configurations for window targeting and server settings.
Where to use
omniparser-autogui-mcp can be used in various fields such as software testing, automated user interactions, and any application requiring GUI automation.
Clients Supporting MCP
The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.
Overview
What is Omniparser Autogui Mcp
omniparser-autogui-mcp is an MCP server that utilizes OmniParser to analyze the screen and automatically operate the graphical user interface (GUI). It is confirmed to work on Windows.
Use cases
Use cases include automating repetitive tasks in software applications, testing user interfaces, and enhancing accessibility for users with disabilities.
How to use
To use omniparser-autogui-mcp, clone the repository, set up the environment variables, and configure it in the ‘claude_desktop_config.json’ file. Run the server using the command provided in the documentation.
Key features
Key features include automatic GUI operation, screen analysis using OmniParser, support for multiple languages, and customizable configurations for window targeting and server settings.
Where to use
omniparser-autogui-mcp can be used in various fields such as software testing, automated user interactions, and any application requiring GUI automation.
Clients Supporting MCP
The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.
Content
omniparser-autogui-mcp
(日本語版はこちら)
This is an MCP server that analyzes the screen with OmniParser and automatically operates the GUI.
Confirmed on Windows.
License notes
This is MIT license, but Excluding submodules and sub packages.
OmniParser’s repository is CC-BY-4.0.
Each OmniParser model has a different license (reference).
Installation
- Please do the following:
git clone --recursive https://github.com/NON906/omniparser-autogui-mcp.git cd omniparser-autogui-mcp uv sync set OCR_LANG=en uv run download_models.py
(Other than Windows, use export instead of set.)
(If you want langchain_example.py to work, uv sync --extra langchain instead.)
- Add this to your
claude_desktop_config.json:
{ "mcpServers": { "omniparser_autogui_mcp": { "command": "uv", "args": [ "--directory", "D:\\CLONED_PATH\\omniparser-autogui-mcp", "run", "omniparser-autogui-mcp" ], "env": { "PYTHONIOENCODING": "utf-8", "OCR_LANG": "en" } } } }
(Replace D:\\CLONED_PATH\\omniparser-autogui-mcp with the directory you cloned.)
env allows for the following additional configurations:
-
OMNI_PARSER_BACKEND_LOAD
If it does not work with other clients (such as LibreChat), specify1. -
TARGET_WINDOW_NAME
If you want to specify the window to operate, please specify the window name.
If not specified, operates on the entire screen. -
OMNI_PARSER_SERVER
If you want OmniParser processing to be done on another device, specify the server’s address and port, such as127.0.0.1:8000.
The server can be started withuv run omniparserserver. -
SSE_HOST,SSE_PORT
If specified, communication will be done via SSE instead of stdio. -
SOM_MODEL_PATH,CAPTION_MODEL_NAME,CAPTION_MODEL_PATH,OMNI_PARSER_DEVICE,BOX_TRESHOLD
These are for OmniParser configuration.
Usually, they are not necessary.
Usage Examples
- Search for “MCP server” in the on-screen browser.
etc.
Dev Tools Supporting MCP
The following are the main code editors that support the Model Context Protocol. Click the link to visit the official website for more information.










