Textin Mcp

@intsig-textin, 9 months ago
13 MIT
Free, Community
AI Systems
Textin MCP Server for OCR: recognize text, convert to Markdown, and extract key info.

Overview

What is Textin Mcp

textin-mcp is an MCP server for TextIn OCR. It provides text recognition, document-to-Markdown conversion, and key information extraction.

Use cases

Use cases include automating the extraction of text from scanned documents, converting reports to Markdown for easier editing, and extracting key information from contracts or legal documents.

How to use

To use textin-mcp, obtain your APP_ID and APP_SECRET from TextIn and set them as environment variables for the server. Then launch the server through npx using the command and arguments given in the Setup section.

Key features

Key features include text recognition from various document formats (images, Word, PDF), conversion of documents to Markdown, and intelligent extraction of key information from documents.

Where to use

textin-mcp can be used in various fields such as document management, data extraction, and digital archiving, where text recognition and conversion are required.

Content

TextIn OCR MCP

TextIn OCR MCP Server

TextIn MCP Server is a tool for extracting text and performing OCR on documents, including document text recognition, ID recognition, and invoice recognition. It also supports converting documents into Markdown format.

Tools

  • recognition_text

    • Text recognition from images, Word documents, and PDF files.
    • Inputs:
      • path (string, required): file path or a URL (HTTP/HTTPS) pointing to a document
    • Return: Text of the document.
    • Supported input formats:
      • PDF
      • Image (JPEG, JPG, PNG, BMP)
  • doc_to_markdown

    • Convert images, PDFs, and Word documents to Markdown.
    • Inputs:
      • path (string, required): file path or a URL (HTTP/HTTPS) pointing to a document
    • Return: Markdown of the document.
    • Supports conversion for:
      • PDF
      • Microsoft Office Documents (Word, Excel)
      • Image (JPEG, JPG, PNG, BMP)
  • general_information_extration

    • Automatically identify and extract information from documents, or identify and extract user-specified information.
    • Inputs:
      • path (string, required): file path or a URL (HTTP/HTTPS) pointing to a document
      • key (string[], optional): the non-tabular fields to extract, given as an array of strings.
      • table_header (string[], optional): the table column headers to extract, given as an array of strings.
    • Return: JSON containing the extracted key information.
    • Supported input formats:
      • PDF
      • Microsoft Office Documents (Word, Excel)
      • Image (JPEG, JPG, PNG, BMP)
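
As a rough illustration of how a client might invoke these tools, the sketch below builds the JSON-RPC 2.0 `tools/call` request that the MCP protocol uses for tool invocation. The helper `build_tool_call`, the example URL, and the field names passed as `key` and `table_header` are hypothetical; only the tool names and argument names come from the list above. Sending the request over a transport is left out.

```python
import json
from itertools import count

# JSON-RPC ids only need to be unique within one session.
_request_ids = count(1)


def build_tool_call(name, path, **extra):
    """Build a JSON-RPC 2.0 `tools/call` request (hypothetical helper)."""
    arguments = {"path": path}
    # Include optional arguments only when the caller supplied them.
    arguments.update({k: v for k, v in extra.items() if v is not None})
    return {
        "jsonrpc": "2.0",
        "id": next(_request_ids),
        "method": "tools/call",
        "params": {"name": name, "arguments": arguments},
    }


request = build_tool_call(
    "general_information_extration",  # spelled exactly as the server lists it
    "https://example.com/contract.pdf",  # hypothetical, publicly reachable URL
    key=["Party A", "Signing date"],  # hypothetical field names
    table_header=["Item", "Amount"],  # hypothetical column headers
)
print(json.dumps(request, indent=2))
```

In practice an MCP client library would normally construct and send this message for you; the sketch only shows the shape of the arguments each tool expects.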

URL inputs must be publicly accessible: the server cannot fetch resources that require authentication or other access controls.
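
A client could screen the `path` argument before calling a tool. The sketch below is one possible pre-check, not part of the server: `check_path` is a hypothetical helper, and the Office extensions (`.doc`, `.docx`, `.xls`, `.xlsx`) are an assumption about what "Word, Excel" covers. It cannot tell whether a URL is access-protected, and it rejects URLs whose path carries no file extension.

```python
import os
from urllib.parse import urlparse

IMAGE = {".jpeg", ".jpg", ".png", ".bmp"}
OFFICE = {".doc", ".docx", ".xls", ".xlsx"}  # assumed extensions for Word, Excel

# Formats per tool, following the lists in the Tools section.
SUPPORTED = {
    "recognition_text": {".pdf"} | IMAGE,
    "doc_to_markdown": {".pdf"} | IMAGE | OFFICE,
    "general_information_extration": {".pdf"} | IMAGE | OFFICE,
}


def check_path(tool, path):
    """Return 'url' or 'file' if `path` looks acceptable for `tool`.

    Raises ValueError for a non-HTTP(S) URL or an unlisted file extension.
    """
    if "://" in path:
        parsed = urlparse(path)
        if parsed.scheme not in ("http", "https"):
            raise ValueError(f"unsupported URL scheme: {parsed.scheme}")
        kind, name = "url", parsed.path
    else:
        kind, name = "file", path
    suffix = os.path.splitext(name)[1].lower()
    if suffix not in SUPPORTED[tool]:
        raise ValueError(f"{tool} does not list {suffix or 'this'} files as supported")
    return kind
```

For example, `check_path("doc_to_markdown", "https://example.com/report.docx")` classifies the input as a URL, while passing a `.docx` file to `recognition_text` raises `ValueError` because that tool lists only PDF and image formats.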

Setup

APP_ID and APP_SECRET

Register for a TextIn account, then obtain your APP_ID and APP_SECRET by following TextIn's instructions.

NPX

{
  "mcpServers": {
    "textin-ocr": {
      "command": "npx",
      "args": [
        "-y",
        "@intsig/server-textin"
      ],
      "env": {
        "APP_ID": "<YOUR_APP_ID>",
        "APP_SECRET": "<YOUR_APP_SECRET>",
        "MCP_SERVER_REQUEST_TIMEOUT": "600000"
      },
      "timeout": 600
    }
  }
}
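
If you prefer to generate this configuration rather than edit it by hand, a minimal sketch follows. `build_config` is a hypothetical helper; it assumes, based only on the values shown above (600000 and 600), that `MCP_SERVER_REQUEST_TIMEOUT` is expressed in milliseconds and `timeout` in seconds. Credentials are read from the environment so they stay out of source control.

```python
import json
import os


def build_config(app_id, app_secret, timeout_seconds=600):
    """Return the MCP client configuration as a dict (hypothetical helper)."""
    if not app_id or not app_secret:
        raise ValueError("APP_ID and APP_SECRET must both be set")
    return {
        "mcpServers": {
            "textin-ocr": {
                "command": "npx",
                "args": ["-y", "@intsig/server-textin"],
                "env": {
                    "APP_ID": app_id,
                    "APP_SECRET": app_secret,
                    # Assumption: this variable is in milliseconds.
                    "MCP_SERVER_REQUEST_TIMEOUT": str(timeout_seconds * 1000),
                },
                # Assumption: this field is in seconds.
                "timeout": timeout_seconds,
            }
        }
    }


# Fall back to the placeholders from the configuration above when unset.
config = build_config(
    os.environ.get("APP_ID", "<YOUR_APP_ID>"),
    os.environ.get("APP_SECRET", "<YOUR_APP_SECRET>"),
)
print(json.dumps(config, indent=2))
```

Deriving both timeout values from one number keeps them consistent if you raise the limit for large documents.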

License

This MCP server is licensed under the MIT License. This means you are free to use, modify, and distribute the software, subject to the terms and conditions of the MIT License. For more details, please see the LICENSE file in the project repository.
