MCP ExplorerExplorer

Mcp Florence2

@jkawamotoon 10 months ago
1 MIT
FreeCommunity
AI Systems
#florence-2#mcp-server#python
An MCP server for processing images using Florence-2

Overview

What is Mcp Florence2

mcp-florence2 is an MCP server designed for processing images using the Florence-2 model, enabling functionalities such as Optical Character Recognition (OCR) and image caption generation.

Use cases

Use cases for mcp-florence2 include extracting text from scanned documents, generating captions for images in media applications, and enhancing accessibility by providing text descriptions for visual content.

How to use

To use mcp-florence2, configure it in your application (Claude Desktop or Goose CLI) by editing the respective configuration files to include the server details. After configuration, restart the application to start processing images or PDF files.

Key features

Key features of mcp-florence2 include OCR capabilities for text extraction from images and PDF files, and the ability to generate descriptive captions for images.

Where to use

mcp-florence2 can be used in various fields including document processing, content generation, accessibility services, and any application requiring image analysis.

Content

Florence-2 MCP Server

Python Application
GitHub License
pre-commit
Ruff
smithery badge

An MCP server for processing images using Florence-2.

You can process images or PDF files stored on a local or web server to extract text using OCR (Optical Character
Recognition) or generate descriptive captions summarizing the content of the images.

Installation

For Claude Desktop

To configure this server for Claude Desktop, edit the claude_desktop_config.json file with the following entry under
mcpServers:

{
  "mcpServers": {
    "florence-2": {
      "command": "uvx",
      "args": [
        "--from",
        "git+https://github.com/jkawamoto/mcp-florence2",
        "mcp-florence2"
      ]
    }
  }
}

After editing, restart the application.
For more information,
see: For Claude Desktop Users - Model Context Protocol.

For Goose CLI

To enable the Bear extension in Goose CLI,
edit the configuration file ~/.config/goose/config.yaml to include the following entry:

extensions:
  bear:
    name: Florence-2
    cmd: uvx
    args: [ --from, git+https://github.com/jkawamoto/mcp-florence2, mcp-florence2 ]
    enabled: true
    type: stdio

For Goose Desktop

Add a new extension with the following settings:

  • Type: Standard IO
  • ID: florence-2
  • Name: Florence-2
  • Description: An MCP server for processing images using Florence-2
  • Command: uvx --from git+https://github.com/jkawamoto/mcp-florence2 mcp-florence2

For more details on configuring MCP servers in Goose Desktop,
refer to the documentation:
Using Extensions - MCP Servers.

Tools

ocr

Process an image file or URL using OCR to extract text.

Arguments:

  • src: A file path or URL to the image file that needs to be processed.

caption

Processes an image file and generates captions for the image.

Arguments:

  • src: A file path or URL to the image file that needs to be processed.

License

This application is licensed under the MIT License. See the LICENSE file for more details.

Tools

No tools

Comments

Recommend MCP Servers

View All MCP Servers