- Explore MCP Servers
- mcp_transcribe_online_vids
Mcp Transcribe Online Vids
What is Mcp Transcribe Online Vids
mcp_transcribe_online_vids is an MCP server designed to transcribe online videos from platforms like YouTube and Bilibili by utilizing advanced models and temporary file hosting services.
Use cases
Use cases include generating subtitles for educational videos, creating transcripts for video content analysis, enabling accessibility for hearing-impaired users, and facilitating research by providing text data from video sources.
How to use
To use mcp_transcribe_online_vids, set up the environment by installing Python 3.12 and required packages, configure the .env file with your credentials, and start the server using ‘python main.py’. You can then call the transcription functions for YouTube or Bilibili videos.
Key features
Key features include timestamped transcriptions, automatic audio extraction and format conversion, cloud-based transcription using WhisperX models, and temporary file hosting for large files.
Where to use
mcp_transcribe_online_vids can be used in various fields such as education, content creation, research, and accessibility services where video content needs to be transcribed for analysis or accessibility.
Clients Supporting MCP
The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.
Overview
What is Mcp Transcribe Online Vids
mcp_transcribe_online_vids is an MCP server designed to transcribe online videos from platforms like YouTube and Bilibili by utilizing advanced models and temporary file hosting services.
Use cases
Use cases include generating subtitles for educational videos, creating transcripts for video content analysis, enabling accessibility for hearing-impaired users, and facilitating research by providing text data from video sources.
How to use
To use mcp_transcribe_online_vids, set up the environment by installing Python 3.12 and required packages, configure the .env file with your credentials, and start the server using ‘python main.py’. You can then call the transcription functions for YouTube or Bilibili videos.
Key features
Key features include timestamped transcriptions, automatic audio extraction and format conversion, cloud-based transcription using WhisperX models, and temporary file hosting for large files.
Where to use
mcp_transcribe_online_vids can be used in various fields such as education, content creation, research, and accessibility services where video content needs to be transcribed for analysis or accessibility.
Clients Supporting MCP
The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.
Content
MCP Transcribe Online Videos
A FastMCP server that allows LLMs to access and transcribe online videos from YouTube and Bilibili using Replicate and 0x0.st for temporary file hosting.
Features
- Transcribe YouTube/ Bilibili videos with timestamped output
- Automatic audio extraction and format conversion
- Cloud-based transcription via WhisperX models
- Temporary file hosting for large files
Installation
Prerequisites
- Python 3.12+
- Conda (recommended for environment management)
Setting Up the Environment
# Create and activate a conda environment
conda create --name mcp_transcribe_online_vids python=3.12
conda activate mcp_transcribe_online_vids
# Install required system packages
conda install conda-forge::uv conda-forge::ffmpeg conda-forge::sqlite
# Install Python dependencies
uv pip install -r requirements.txt
Configuration
-
Copy the
.env.templatefile to.env:cp .env.template .env -
Edit the
.envfile with your credentials:REPLICATE_API_TOKEN: Get from Replicate’s API key pageZERO_X_URL: URL for the 0x0.st instance (default: public instance)TEMP_FILE_PATH: Directory for temporary filesLOCAL_FILE_SIZE_LIMIT: Maximum file size in MB for direct API uploads
Usage
Starting the Server
python main.py
Available Tools
get_youtube_transcript(url): Transcribe a YouTube videoget_bilibili_transcript(url): Transcribe a Bilibili video
Example
from fastmcp import MCPClient
import asyncio
async def get_transcript():
client = MCPClient("http://localhost:8000")
async with client:
transcript = await client.call_tool("get_youtube_transcript",
{"url": "https://www.youtube.com/watch?v=dQw********"})
print(transcript)
# Run the async function
asyncio.run(get_transcript())
Deployment Options
For deploying the server in production environments, see FastMCP transport options.
To customize the transport, modify the mcp.run() call in main.py.
Self-hosting File Storage
It’s highly recommended to host your own instance of 0x0.st for file storage. Follow the hosting instructions to set up your own instance.
Roadmap
- [ ] Add tools to retrieve video metadata (title, date, description)
- [ ] Add YAML configuration for deployment settings
- [ ] Add more file hosting options (Google Cloud, S3)
- [ ] Complete local transcription option using WhisperX
- [ ] Support for other media sources e.g. Spotify podcasts
Contributing
- Install development dependencies:
pip install pre-commit pre-commit install - Run pre-commit checks before submitting code:
pre-commit run --all-files
License
Licensed under the GPL-3.0 License
Dev Tools Supporting MCP
The following are the main code editors that support the Model Context Protocol. Click the link to visit the official website for more information.










