- Explore MCP Servers
- mcp-server-speech
Mcp Server Speech
What is Mcp Server Speech
mcp-server-speech is an implementation of a Speech Model Context Protocol Server for VolcEngine, designed to handle automatic speech recognition (ASR) and text-to-speech (TTS) functionalities.
Use cases
Use cases include converting audio recordings into text for documentation, generating audio from written content for accessibility, and enabling real-time speech recognition in interactive applications.
How to use
To use mcp-server-speech, set the required environment variables for VolcEngine, configure the MCP settings file to include the server, and then utilize the ASR and TTS tools by providing the necessary arguments.
Key features
Key features include Automatic Speech Recognition (ASR) that converts audio files to text, and Text-to-Speech (TTS) that synthesizes text into audio with customizable speed and encoding options.
Where to use
mcp-server-speech can be used in various fields such as voice assistants, automated customer service, transcription services, and any application requiring speech processing.
Clients Supporting MCP
The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.
Overview
What is Mcp Server Speech
mcp-server-speech is an implementation of a Speech Model Context Protocol Server for VolcEngine, designed to handle automatic speech recognition (ASR) and text-to-speech (TTS) functionalities.
Use cases
Use cases include converting audio recordings into text for documentation, generating audio from written content for accessibility, and enabling real-time speech recognition in interactive applications.
How to use
To use mcp-server-speech, set the required environment variables for VolcEngine, configure the MCP settings file to include the server, and then utilize the ASR and TTS tools by providing the necessary arguments.
Key features
Key features include Automatic Speech Recognition (ASR) that converts audio files to text, and Text-to-Speech (TTS) that synthesizes text into audio with customizable speed and encoding options.
Where to use
mcp-server-speech can be used in various fields such as voice assistants, automated customer service, transcription services, and any application requiring speech processing.
Clients Supporting MCP
The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.
Content
Speech Model Context Protocol Server
An MCP server implementation for speech of volcengine
Features
Tools
- asr
Automatic Speech Recognition: Converts audio to text.- Args:
- content: url or absolute path of the audio file to transcribe.
- Returns:
- Asr text
- Args:
- tts
Text-to-Speech: Synthesizes text into audio.- Args:
- text: The text to synthesize into speech.
- speed: Speech speed (e.g., 1.0 for normal). default: 1.0.
- encoding: Desired audio output format (e.g., ‘mp3’, ‘wav’). default: ‘mp3’.
- Returns:
- Return the path of audio file.
- Args:
Configuration
The server requires the following environment variables to be set:
VOLC_APPID
: Required, The APP ID for the VolcEngine.VOLC_TOKEN
: Required, The Access Token for the VolcEngine.VOLC_VOICE_TYPE
: Optional, Large speech synthesis model service voice_type, default is ‘zh_female_meilinvyou_moon_bigtts’VOLC_CLUSTER
: Required, Large speech synthesis model service cluster ID
The services that need to be activated on Volcengine are: Large speech synthesis model、Streaming speech recognition large model、Large model for audio file recognition
You can set these environment variables in your shell.
MCP Settings Configuration
To add this server to your MCP configuration, add the following to your MCP settings file:
{
"mcpServers": {
"speech-mcp-server": {
"command": "uv",
"args": [
"--directory",
"/ABSOLUTE/PATH/TO/PARENT/FOLDER/src/mcp_server_speech",
"run",
"main.py"
]
}
}
}
or
Usage
Running the Server
# Run the server with stdio transport (default)
python -m mcp_server_speech [--transport/-t {sse,stdio}]
License
This library is licensed under the MIT-0 License. See the LICENSE file.
Dev Tools Supporting MCP
The following are the main code editors that support the Model Context Protocol. Click the link to visit the official website for more information.