Transcriber Mcp

8 MIT

FreeCommunity

AI Systems

Transcriber MCP is a lightweight server for converting audio/video files to text using MCP protocol.

What is Transcriber Mcp

Transcriber MCP is a server that converts audio and video files into text, compliant with the Model Context Protocol (MCP). It utilizes faster-whisper and operates in a lightweight and practical manner on CPU environments.

Use cases

Use cases include transcribing interviews, generating subtitles for videos, creating text records of meetings, and converting podcasts into written format.

How to use

To use Transcriber MCP, clone the repository, set up a virtual environment, install dependencies, and run the server. You can then send requests to the server using an MCP-compliant client to transcribe audio or video files.

Key features

Key features include compliance with MCP protocol, support for various audio and video file formats (mp3, mp4, wav, mov, avi), text output of transcription results, and a communication interface for MCP clients.

Where to use

Transcriber MCP can be used in fields such as media production, content creation, accessibility services, and any domain requiring transcription of audio or video content.

Clients Supporting MCP

The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.

Claude Desktop: Official desktop application from Anthropic, natively supports MCP protocol. claude.ai

Cherry Studio: Cross-platform desktop client supporting multiple LLM providers, built-in MCP server support. cherry-ai.com

LobeChat: Modern open-source ChatGPT/LLMs UI, supports MCP protocol integration. lobehub.com

DeepChat: Cross-platform desktop AI assistant, compatible with MCP protocol, focusing on privacy and efficiency. deepchat.thinkinai.xyz

5ire: Cross-platform open-source desktop intelligent assistant MCP client, supports local knowledge base and MCP server. 5ire.app

View More MCP Clients

Overview

What is Transcriber Mcp

Use cases

Use cases include transcribing interviews, generating subtitles for videos, creating text records of meetings, and converting podcasts into written format.

How to use

Key features

Where to use

Transcriber MCP can be used in fields such as media production, content creation, accessibility services, and any domain requiring transcription of audio or video content.

Clients Supporting MCP

The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.

Claude Desktop: Official desktop application from Anthropic, natively supports MCP protocol. claude.ai

Cherry Studio: Cross-platform desktop client supporting multiple LLM providers, built-in MCP server support. cherry-ai.com

LobeChat: Modern open-source ChatGPT/LLMs UI, supports MCP protocol integration. lobehub.com

DeepChat: Cross-platform desktop AI assistant, compatible with MCP protocol, focusing on privacy and efficiency. deepchat.thinkinai.xyz

5ire: Cross-platform open-source desktop intelligent assistant MCP client, supports local knowledge base and MCP server. 5ire.app

View More MCP Clients

Content

Transcriber MCP

音声・動画ファイルをテキストに変換するMCP（Model Context Protocol）準拠のサーバー「Transcriber MCP」です。faster-whisperを利用し、CPU環境で動作する軽量かつ実用的な文字起こしサーバーを提供します。

機能

MCPプロトコルに準拠したサーバーの実装
音声・動画ファイル（mp3, mp4, wav, mov, avi）を受け取り、文字起こしを実施
結果をテキストファイル形式で出力
MCPクライアントとの通信インターフェースを提供

インストール方法

必要条件

Python 3.8以上
faster-whisper
ffmpeg（音声・動画ファイル処理用）

インストール手順

リポジトリをクローン

git clone https://github.com/yourusername/transcriber-mcp.git
cd transcriber-mcp

仮想環境の作成と依存パッケージのインストール

# uvを使用して仮想環境を作成
uv venv

# 依存パッケージをインストール
uv pip install -r requirements.txt

使用方法

サーバーの起動

# uvを使用してサーバーを起動
uv run -m src.main

クライアント例を使用した動作確認

# テスト用の音声ファイルを作成
uv pip install gtts
uv run -c "from gtts import gTTS; tts = gTTS('これはテスト用の音声ファイルです。文字起こしが正しく機能するかを確認します。', lang='ja'); tts.save('test_audio.mp3')"

# 文字起こしを実行
uv run -m src.client_example test_audio.mp3

MCPクライアントからの利用

MCPプロトコルに準拠したクライアントから以下のようにリクエストを送信します：

{
  "jsonrpc": "2.0",
  "id": 1,
  "method": "transcribe",
  "params": {
    "file_path": "/path/to/your/audio_or_video_file.mp3"
  }
}

レスポンス例

{
  "jsonrpc": "2.0",
  "id": 1,
  "result": {
    "result": "/path/to/output/audio_or_video_file_transcribed.txt"
  }
}

Clineでの設定方法

Clineと連携して使用することで、LLMとの対話を通じて文字起こしを実行できます。

Clineの設定

Clineの設定ファイル（通常は~/.config/cline/settings/mcp_settings.json）に以下の設定を追加します：

{
  "transcribe": {
    "command": "uv",
    "args": [
      "run",
      "--directory",
      "/path/to/transcriber-mcp",
      "python",
      "-m",
      "src.main",
      "--model-size=base"
    ]
  }
}

※ --directoryのパスは実際の環境に合わせて変更してください。

※ --model-sizeは "tiny, “base”, “small”, “medium”, “large” から選択できます。

サポートするファイル形式

音声ファイル: mp3, wav
動画ファイル: mp4, mov, avi

モデルサイズの変更

文字起こしの精度を向上させるために、モデルサイズを変更することができます。

# src/transcriber.py を編集
self.model_size = "medium"  # tiny, base, small, medium, large から選択

より大きなモデルを使用すると精度が向上しますが、メモリ使用量とロード時間が増加します。

将来的な拡張予定

タイムスタンプ付き文字起こし
多言語対応
モデル切り替え機能

ライセンス

このプロジェクトはMITライセンスの下で公開されています。詳細はLICENSEファイルを参照してください。

Dev Tools Supporting MCP

The following are the main code editors that support the Model Context Protocol. Click the link to visit the official website for more information.

Zed: High-performance collaborative code editor, supports MCP protocol, providing a smooth programming experience. zed.dev

Cursor: AI code editor built on VS Code, supports MCP protocol for context-aware programming. cursor.com

Windsurf: AI code editor from Codeium, integrates MCP protocol to provide intelligent code assistance. windsurf.com

Continue: Open-source AI programming assistant plugin, supports VS Code and JetBrains, compatible with MCP protocol. continue.dev

Trae: AI-driven code editor, supports MCP protocol, focusing on enhancing developer programming experience. trae.ai

View More MCP Dev Tools

Tools

No tools

Comments

Recommend MCP Servers

Tavily MCP Server The Tavily MCP server provides: search, extract, map, crawl tools Real-time web search capabilities through the tavily-search tool Intelligent data extraction from web pages via the tavily-extract tool Powerful web mapping tool that creates a structured map of website Web crawler that systematically explores websites.

MCP Server Chart This is a TypeScript-based MCP server that provides chart generation capabilities. It allows you to create various types of charts through MCP tools. You can also use it in Dify.

GitHub MCP Server MCP Server for the GitHub API, enabling file operations, repository management, search functionality, and more.

Brave Search MCP Server Web and local search using Brave's Search API

Firecrawl MCP Server Advanced web scraping with JavaScript rendering, PDF support, and smart rate limiting

Context7 MCP LLMs rely on outdated or generic information about the libraries you use. You get:

Slack MCP server Channel management and messaging capabilities

Sequential Thinking MCP Server Dynamic and reflective problem-solving through thought sequences

Fetch MCP Server A Model Context Protocol server that provides web content fetching capabilities.

Playwright MCP A Model Context Protocol (MCP) server that provides browser automation capabilities using [Playwright](https://playwright.dev). This server enables LLMs to interact with web pages through structured accessibility snapshots, bypassing the need for screenshots or visually-tuned models.

View All MCP Servers