Mcp Server Cvdlt

3 GPL-3.0

FreeCommunity

AI Systems

This repo is built on the Python SDK for the Model Context Protocol (MCP). It bundles deep learning (DL) models for computer vision (CV) and exposes their capabilities to an LLM or vLLM model.

What is Mcp Server Cvdlt

mcp-server-cvdlt is a Python-based server implementing the Model Context Protocol (MCP) for various computer vision tasks, including object detection, segmentation, and pose estimation using deep learning models.

Use cases

Use cases include real-time object detection in surveillance systems, image segmentation for medical imaging analysis, human pose estimation in fitness applications, and automated quality inspection in manufacturing.

How to use

To use mcp-server-cvdlt, install the dependencies with ‘uv sync’ and ‘uv pip install -r requirements.txt’. Start the server in stdio mode with ‘python server.py’ or in SSE mode with ‘python server.py sse [port]’. Make sure the required model weights are downloaded into the ./checkpoints directory.

Key features

Key features include object detection using YOLOv10, image segmentation with YOLOv8, segmentation of entire images using Ultralytics SAM, human pose estimation with YOLOv8, and support for both local and network image inputs.

Where to use

mcp-server-cvdlt can be used in various fields such as robotics, autonomous vehicles, security surveillance, healthcare imaging, and any application requiring advanced image analysis.

Clients Supporting MCP

The following are the main clients that support the Model Context Protocol. Visit each official website for more information.

Claude Desktop: Official desktop application from Anthropic, natively supports MCP protocol. claude.ai

Cherry Studio: Cross-platform desktop client supporting multiple LLM providers, built-in MCP server support. cherry-ai.com

LobeChat: Modern open-source ChatGPT/LLMs UI, supports MCP protocol integration. lobehub.com

DeepChat: Cross-platform desktop AI assistant, compatible with MCP protocol, focusing on privacy and efficiency. deepchat.thinkinai.xyz

5ire: Cross-platform open-source desktop intelligent assistant MCP client, supports local knowledge base and MCP server. 5ire.app

Content

MCP Server for CVDLT (Computer Vision & Deep Learning Tools)

This repo is based on Ultralytics and the Python SDK for the Model Context Protocol (MCP).
Related links:

MCP Playground(client) - https://github.com/MRonaldo-gif/mcp-playground-local

Ultralytics - https://github.com/ultralytics/ultralytics

MCP of Python - https://github.com/modelcontextprotocol/python-sdk

Python server implementing Model Context Protocol (MCP) for image object detection, segmentation, and pose estimation operations.

Sample image

Detection sample image

Features

Detect objects in images using YOLOv10
Segment objects in images using YOLOv8
Segment entire images using Ultralytics SAM
Estimate human poses in images using YOLOv8
Support for local and network image inputs
MCP tool integration for client interactions
Stdio and SSE transport protocols

Note: The server requires valid image paths or URLs and access to the following model files: yolov10b.pt (YOLOv10 detection), yolov8n-seg.pt (YOLOv8 segmentation), yolov8n-pose.pt (YOLOv8 pose estimation), and sam_b.pt (Ultralytics SAM).
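The input forms accepted by the tools (local paths, file:// URLs, and http:// or https:// URLs) can be told apart before a request is sent. The following is a minimal sketch using only the standard library. The function name classify_image_source is hypothetical and is not part of this repo; it only illustrates one way a client might pre-check an image_url value.

```python
from urllib.parse import urlparse


def classify_image_source(image_url: str) -> str:
    """Return 'network', 'local', or 'unsupported' for an image_url value.

    Hypothetical helper, not taken from server.py.
    """
    if not image_url.strip():
        return "unsupported"
    scheme = urlparse(image_url).scheme.lower()
    if scheme in ("http", "https"):
        return "network"
    if scheme in ("", "file"):
        # No scheme is treated as a plain (possibly relative) filesystem path.
        return "local"
    # Anything else (ftp://, data:, ...) is outside the documented input forms.
    # Note: a Windows path such as C:\img.jpg parses with scheme "c" and
    # would need separate handling.
    return "unsupported"
```

For example, classify_image_source("images/cat.jpg") returns "local", while classify_image_source("https://example.com/cat.jpg") returns "network".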

TODO

3D Detection
AIGC(GAN, Diffusion)
Dense Estimation
Deploy DL(Deep Learning) Models

Quickstart

Install Dependencies

uv sync
# If you need the Tsinghua mirror:
uv sync --index https://pypi.tuna.tsinghua.edu.cn/simple --extra-index-url https://pypi.org/simple

uv pip install -r requirements.txt -i https://pypi.tuna.tsinghua.edu.cn/simple

Start Server

stdio mode:

python server.py

Output:

Starting MCP server (YOLO) with stdio transport

SSE mode:

python server.py sse [port]

Example:

python server.py sse 8080

Output:

Starting MCP server (YOLO) on port 8080 with SSE transport
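The two launch forms above (no arguments for stdio, `sse [port]` for SSE) suggest a small argument-parsing step. The sketch below shows one way such dispatch could look. It is not taken from server.py: the function name parse_transport and the default port are assumptions made for illustration only.

```python
from typing import List, Optional, Tuple

# Assumption: the README only shows 8080 as an example port; the real
# default used by server.py (if any) is not documented here.
DEFAULT_SSE_PORT = 8080


def parse_transport(argv: List[str]) -> Tuple[str, Optional[int]]:
    """Map command-line arguments (without the script name) to (transport, port)."""
    if not argv:
        # `python server.py` -> stdio transport, no port needed.
        return "stdio", None
    if argv[0] != "sse":
        raise ValueError(f"unknown transport: {argv[0]!r}")
    # `python server.py sse [port]` -> SSE transport on the given port.
    port = int(argv[1]) if len(argv) > 1 else DEFAULT_SSE_PORT
    if not 1 <= port <= 65535:
        raise ValueError(f"port out of range: {port}")
    return "sse", port
```

With this sketch, parse_transport([]) yields ("stdio", None) and parse_transport(["sse", "8080"]) yields ("sse", 8080).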

Users also need to download the model weights into the ./checkpoints directory.
Download links 🔗: https://docs.ultralytics.com/models/yolov10/, https://docs.ultralytics.com/models/yolov8/, https://docs.ultralytics.com/models/sam-2/

├── checkpoints
│ ├── sam_b.pt
│ ├── yolov10b.pt
│ ├── yolov8n-pose.pt
│ └── yolov8n-seg.pt
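Before starting the server, it can help to confirm that all four weight files from the tree above are in place. The following is a minimal sketch using only the standard library. The helper name missing_weights is hypothetical and is not part of this repo.

```python
from pathlib import Path
from typing import List

# File names as listed in the checkpoints tree above.
REQUIRED_WEIGHTS = ("sam_b.pt", "yolov10b.pt", "yolov8n-pose.pt", "yolov8n-seg.pt")


def missing_weights(checkpoint_dir: str = "./checkpoints") -> List[str]:
    """Return the required weight files that are not present in checkpoint_dir."""
    root = Path(checkpoint_dir)
    return [name for name in REQUIRED_WEIGHTS if not (root / name).is_file()]
```

An empty list from missing_weights() means every expected file was found; otherwise the returned names are the files still to download.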

API

Resources

image://system: Image processing operations interface

Tools

detect_objects
- Detect objects in an image using YOLOv10
- Input: image_url (string)
- Supports local paths (file:// or relative) and network URLs (http:// or https://)
- Returns JSON array of detected objects with bounding boxes, confidence scores, and class labels
- Example output: [{"box": [x, y, w, h], "confidence": 0.9, "class": "person"}, ...]
segment_objects
- Segment objects in an image using YOLOv8
- Input: image_url (string)
- Supports local paths (file:// or relative) and network URLs (http:// or https://)
- Returns JSON array of segmented objects with bounding boxes, confidence scores, and class labels
- Example output: [{"box": [x, y, w, h], "confidence": 0.85, "class": "car"}, ...]
segment_image
- Segment entire image using Ultralytics SAM
- Input: image_url (string)
- Supports local paths (file:// or relative) and network URLs (http:// or https://)
- Returns JSON array of segmented regions with bounding boxes, areas, and confidence scores
- Example output: [{"bbox": [x, y, w, h], "area": 2500, "confidence": 0.95}, ...]
estimate_pose
- Estimate human poses in an image using YOLOv8
- Input: image_url (string)
- Supports local paths (file:// or relative) and network URLs (http:// or https://)
- Returns JSON array of detected poses with keypoint coordinates and confidence scores
- Example output: [{"keypoints": [[x1, y1], [x2, y2], ...], "confidence": [0.9, 0.8, ...]}, ...]
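Because each tool returns a JSON array, a client can post-process results with ordinary JSON handling. The sketch below filters detect_objects-style output by confidence. It assumes the result arrives as a JSON string with the "box", "confidence", and "class" keys shown in the example output above. The function name filter_detections, the threshold, and the sample values are made up for illustration.

```python
import json
from typing import Any, Dict, List


def filter_detections(raw_json: str, min_confidence: float = 0.5) -> List[Dict[str, Any]]:
    """Keep only detections whose confidence is at least min_confidence."""
    detections = json.loads(raw_json)
    return [d for d in detections if d["confidence"] >= min_confidence]


# Illustrative payload in the documented shape; the numbers are not real results.
sample = json.dumps([
    {"box": [10, 20, 50, 80], "confidence": 0.9, "class": "person"},
    {"box": [0, 0, 5, 5], "confidence": 0.3, "class": "dog"},
])

kept = filter_detections(sample, min_confidence=0.5)
print([d["class"] for d in kept])
```

The same pattern would apply to segment_objects output, which uses the same keys; segment_image and estimate_pose use different keys ("bbox"/"area" and "keypoints") and would need their own accessors.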

Usage with Claude Desktop

Add this to your claude_desktop_config.json:

Note: You can provide sandboxed directories to the server by mounting them to /projects. Adding the ro flag makes the directory read-only for the server.

SSE

{
  "mcpServers": {
    "server-with-yolo": {
      "url": "http://localhost:8080/sse"
    }
  }
}
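If the server runs on a port other than 8080, the URL in the entry above has to change with it. The following is a minimal sketch that builds the same JSON structure for an arbitrary port. The function name sse_server_config and its host parameter are hypothetical; only the "mcpServers", "server-with-yolo", and "url" keys come from the example above.

```python
import json


def sse_server_config(port: int, name: str = "server-with-yolo", host: str = "localhost") -> str:
    """Return a claude_desktop_config.json fragment for an SSE server on the given port."""
    config = {"mcpServers": {name: {"url": f"http://{host}:{port}/sse"}}}
    return json.dumps(config, indent=2)


print(sse_server_config(8080))
```

Called with port 8080 and the default name, this reproduces the structure shown above.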

Dev Tools Supporting MCP

The following are the main code editors that support the Model Context Protocol. Visit each official website for more information.

Zed: High-performance collaborative code editor, supports MCP protocol, providing a smooth programming experience. zed.dev

Cursor: AI code editor built on VS Code, supports MCP protocol for context-aware programming. cursor.com

Windsurf: AI code editor from Codeium, integrates MCP protocol to provide intelligent code assistance. windsurf.com

Continue: Open-source AI programming assistant plugin, supports VS Code and JetBrains, compatible with MCP protocol. continue.dev

Trae: AI-driven code editor, supports MCP protocol, focusing on enhancing developer programming experience. trae.ai
