Edge Tts Mcp

@Hwenyi, 9 months ago
2 MIT
Free · Community
AI Systems
An MCP server built on the Edge browser's TTS service. Quickly add text-to-speech reading to your AI client and have a nice chat with LLMs!

Overview

What is Edge Tts Mcp

edge-tts-mcp is a Model Context Protocol (MCP) server that utilizes Microsoft Edge’s Text-to-Speech service, enabling AI assistants to read text aloud with natural-sounding voices.

Use cases

Use cases for edge-tts-mcp include creating interactive voice responses in customer service applications, enhancing learning experiences in educational software, providing voice output for visually impaired users, and integrating with chatbots for more engaging conversations.

How to use

To use edge-tts-mcp, clone the repository from GitHub, install the dependencies with Bun, and build the project. Then, optionally, set the environment variables for voice, rate, volume, pitch, and audio saving before starting the server.

Key features

Key features include generating lifelike speech from text input, support for multiple voice options, customizable speech parameters (rate, volume, pitch), optional audio saving capability, and easy integration with clients compatible with MCP.

Where to use

edge-tts-mcp can be used in virtual assistants, educational tools, accessibility applications, and any other AI-driven platform that needs text-to-speech functionality.

Content

Edge TTS MCP

A Model Context Protocol (MCP) server for Microsoft Edge Text-to-Speech service that allows AI assistants to read text aloud with natural-sounding voices.

Features

  • Generate lifelike speech from text input
  • Support for multiple voice options
  • Customizable speech parameters (rate, volume, pitch)
  • Optional audio saving capability
  • Easy integration with Cline and other MCP-compatible clients

Installation

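Prerequisites

  • Git, to clone the repository
  • Bun, to install dependencies and build the project (Node.js can be used instead of Bun to run the built server)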

Setup

  1. Clone the repository:
git clone https://github.com/Hwenyi/edge-tts-mcp.git
cd edge-tts-mcp
  2. Install dependencies:
bun install
  3. Build the project:
bun run build

Configuration

Environment Variables

The Edge TTS MCP server supports the following environment variables:

| Variable | Description | Default Value | Example Values |
|---|---|---|---|
| VOICE | The voice to use for speech generation | zh-CN-XiaoxiaoNeural | en-US-AriaNeural, ja-JP-NanamiNeural |
| RATE | The speech rate | 0% | -10%, +20% |
| VOLUME | The speech volume | 0% | -50%, +50% |
| PITCH | The speech pitch | 0Hz | -10Hz, +5Hz |
| SAVE_AUDIO | Whether to save audio files (true/false) | false | true |

You can set these environment variables before starting the server.
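
As an illustration of how these variables could be read and checked, here is a minimal TypeScript sketch. It is not the project's actual code: the names TtsConfig, checkUnit, and loadConfig are hypothetical, and the validation rule (an optionally signed integer followed by % or Hz) is an assumption based only on the default and example values listed above.

```typescript
// Hypothetical config loader: names and validation rules are assumptions,
// not taken from the edge-tts-mcp source.
interface TtsConfig {
  voice: string;
  rate: string;
  volume: string;
  pitch: string;
  saveAudio: boolean;
}

type Env = Record<string, string | undefined>;

// Defaults as documented in the environment variable list.
const DEFAULTS: TtsConfig = {
  voice: "zh-CN-XiaoxiaoNeural",
  rate: "0%",
  volume: "0%",
  pitch: "0Hz",
  saveAudio: false,
};

// Accept an optionally signed integer followed by the unit, e.g. "+10%" or "-10Hz".
function checkUnit(name: string, value: string, unit: "%" | "Hz"): string {
  const pattern = new RegExp(`^[+-]?\\d+${unit}$`);
  if (!pattern.test(value)) {
    throw new Error(`${name} must look like "+10${unit}", got "${value}"`);
  }
  return value;
}

// Merge the given environment with the documented defaults.
function loadConfig(env: Env): TtsConfig {
  return {
    voice: env.VOICE ?? DEFAULTS.voice,
    rate: checkUnit("RATE", env.RATE ?? DEFAULTS.rate, "%"),
    volume: checkUnit("VOLUME", env.VOLUME ?? DEFAULTS.volume, "%"),
    pitch: checkUnit("PITCH", env.PITCH ?? DEFAULTS.pitch, "Hz"),
    saveAudio: (env.SAVE_AUDIO ?? "").toLowerCase() === "true",
  };
}
```

In a Bun or Node.js process this sketch would be called as loadConfig(process.env). Rejecting malformed values at startup is a design choice of the sketch; the real server may simply pass the strings through to the TTS service.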

Usage

Starting the Server

# Using default settings
bun run start

# Or with custom configuration
VOICE=en-US-AriaNeural RATE="+10%" SAVE_AUDIO=true bun run start

Integrating with Cline

To use this MCP server with Cline, add the following configuration to your Cline config:

{
  "mcpServers": {
    "edge-tts-mcp": {
      "command": "bun",
      "args": [
        "/path/to/edge-tts-mcp/dist/index.js"
      ],
      "disabled": false,
      "autoApprove": []
    }
  }
}

Replace /path/to/edge-tts-mcp with the actual path to your installation.

MCP Tool Parameters

The MCP server exposes the following tool:

Tool Name: speech_text_aloud

Parameters:

  • input (string): The text to be converted to speech and read aloud
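
For readers who want to see what a call to this tool looks like on the wire, the sketch below builds the JSON-RPC 2.0 request that an MCP client sends for a tool invocation (method tools/call, with the tool name and its arguments under params). The helper name buildSpeechRequest and the request id are hypothetical; in practice an MCP client such as Cline builds and sends this message for you.

```typescript
// Shape of an MCP tool invocation request (JSON-RPC 2.0).
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: {
    name: string;
    arguments: Record<string, unknown>;
  };
}

// Hypothetical helper: wraps the text in a call to the speech_text_aloud tool.
function buildSpeechRequest(id: number, text: string): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id,
    method: "tools/call",
    params: {
      name: "speech_text_aloud",
      arguments: { input: text },
    },
  };
}

// Print the request as it would be serialized for the server.
console.log(JSON.stringify(buildSpeechRequest(1, "Hello world"), null, 2));
```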

Using with Node.js

You can also run the server using Node.js instead of Bun:

# Run with Node.js
node dist/index.js

# Or with custom environment variables
VOICE=en-US-AriaNeural RATE="+10%" SAVE_AUDIO=true node dist/index.js

For Cline integration with Node.js, update your configuration:

{
  "mcpServers": {
    "edge-tts-mcp": {
      "command": "node",
      "args": [
        "/path/to/edge-tts-mcp/dist/index.js"
      ],
      "disabled": false,
      "autoApprove": []
    }
  }
}

Audio File Storage

When the SAVE_AUDIO environment variable is set to true, audio files will be saved in the dist directory by default. Each file is named with a random UUID to prevent overwriting.
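
The naming scheme can be sketched as follows. This is an illustration only, not the project's code: the helper name audioFilePath and the .mp3 extension are assumptions, since the README does not state the file format.

```typescript
import { randomUUID } from "node:crypto";
import { join } from "node:path";

// Hypothetical helper: builds a unique file path inside the output directory.
// The ".mp3" extension is an assumption; the actual format is not documented here.
function audioFilePath(outputDir: string, extension: string = ".mp3"): string {
  // A random (version 4) UUID makes a name collision practically impossible,
  // so a new file does not overwrite an earlier one.
  return join(outputDir, `${randomUUID()}${extension}`);
}

console.log(audioFilePath("dist"));
```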

Configuration for Other Clients

5ire or Claude

You can also configure this MCP server in other clients like 5ire or Claude. Here’s an example configuration:

{
  "name": "edge-tts-mcp",
  "key": "EdgeTTSMCP",
  "description": "Read text aloud using Edge TTS",
  "command": "bun",
  "args": [
    "/path/to/edge-tts-mcp/dist/index.js"
  ]
}

⚠️ Path Format Warning: Pay attention to the path format in your configuration:

  • Windows: Uses backslashes (\), which must either be escaped in JSON as \\ or replaced with forward slashes (/)
  • macOS/Linux: Uses forward slashes (/)

Examples:

  • Windows path: C:\\Users\\username\\edge-tts-mcp\\dist\\index.js or C:/Users/username/edge-tts-mcp/dist/index.js
  • macOS/Linux path: /Users/username/edge-tts-mcp/dist/index.js

Incorrect path formatting is a common cause of setup issues across different operating systems.

Make sure to adjust the file path according to your actual installation directory.
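
The two accepted forms of a Windows path can be produced programmatically, which avoids escaping mistakes when writing the config by hand. The sketch below is a small illustration; toForwardSlashes is a hypothetical helper name and the path is the example path from above.

```typescript
// Hypothetical helper: replace every backslash with a forward slash.
function toForwardSlashes(p: string): string {
  return p.replace(/\\/g, "/");
}

// In source code each separator is written as "\\", which is a single
// backslash in the actual string value.
const windowsPath = "C:\\Users\\username\\edge-tts-mcp\\dist\\index.js";

// JSON.stringify escapes each backslash, producing the \\ form that JSON requires.
console.log(JSON.stringify({ args: [windowsPath] }));
// prints {"args":["C:\\Users\\username\\edge-tts-mcp\\dist\\index.js"]}

// Alternatively, convert to forward slashes so no escaping is needed.
console.log(JSON.stringify({ args: [toForwardSlashes(windowsPath)] }));
// prints {"args":["C:/Users/username/edge-tts-mcp/dist/index.js"]}
```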

⚠️ Important Notice: Currently, there are known issues with MCP integration in Cherry-Studio. The configuration above may not work properly in Cherry-Studio. We recommend using Cline or other well-tested MCP clients until these issues are resolved.

Example Usage in an AI Assistant

When you want your AI assistant to read text aloud, you can give it a prompt like:

I need to read this text aloud: "Hello world, this is a test of the Edge TTS system."

The assistant will call the speech_text_aloud tool with the appropriate input text.

Voice Options

Microsoft Edge TTS provides many voices across different languages. Some popular options include:

  • en-US-AriaNeural (English, US, Female)
  • en-US-GuyNeural (English, US, Male)
  • zh-CN-XiaoxiaoNeural (Chinese, Female)
  • ja-JP-NanamiNeural (Japanese, Female)
  • de-DE-KatjaNeural (German, Female)
  • fr-FR-DeniseNeural (French, Female)

For a complete list of available voices, refer to the Microsoft Edge TTS documentation.
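
The voice identifiers listed above share a visible pattern: a language code, a region code, and a voice name, joined by hyphens. As a rough sketch, assuming that pattern holds for the voice you pick, an identifier can be split into its locale and name, for example to group voices by language in a settings screen. The names VoiceInfo and parseVoice are hypothetical.

```typescript
interface VoiceInfo {
  locale: string; // e.g. "en-US"
  name: string;   // e.g. "AriaNeural"
}

// Hypothetical helper: assumes the "<language>-<region>-<name>" pattern seen
// in the voices listed in this README. Anything after the region is kept as the name.
function parseVoice(voice: string): VoiceInfo {
  const parts = voice.split("-");
  if (parts.length < 3) {
    throw new Error(`Unexpected voice identifier: "${voice}"`);
  }
  return {
    locale: parts.slice(0, 2).join("-"),
    name: parts.slice(2).join("-"),
  };
}

console.log(parseVoice("en-US-AriaNeural"));
```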

License

MIT
