Edge Tts Mcp

@Hwenyion 9 months ago

2 MIT

FreeCommunity

AI Systems

A MCP created using the TTS of Edge browser, quickly add text-to-speech reading function to your AI client, and have a nice chat with LLMs!

What is Edge Tts Mcp

edge-tts-mcp is a Model Context Protocol (MCP) server that utilizes Microsoft Edge’s Text-to-Speech service, enabling AI assistants to read text aloud with natural-sounding voices.

Use cases

Use cases for edge-tts-mcp include creating interactive voice responses in customer service applications, enhancing learning experiences in educational software, providing voice output for visually impaired users, and integrating with chatbots for more engaging conversations.

How to use

To use edge-tts-mcp, clone the repository from GitHub, install the necessary dependencies using Bun, and configure the environment variables for voice, rate, volume, pitch, and audio saving options before starting the server.

Key features

Key features include generating lifelike speech from text input, support for multiple voice options, customizable speech parameters (rate, volume, pitch), optional audio saving capability, and easy integration with clients compatible with MCP.

Where to use

edge-tts-mcp can be used in various fields such as virtual assistants, educational tools, accessibility applications, and any AI-driven platforms requiring text-to-speech functionality.

Clients Supporting MCP

The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.

Claude Desktop: Official desktop application from Anthropic, natively supports MCP protocol. claude.ai

Cherry Studio: Cross-platform desktop client supporting multiple LLM providers, built-in MCP server support. cherry-ai.com

LobeChat: Modern open-source ChatGPT/LLMs UI, supports MCP protocol integration. lobehub.com

DeepChat: Cross-platform desktop AI assistant, compatible with MCP protocol, focusing on privacy and efficiency. deepchat.thinkinai.xyz

5ire: Cross-platform open-source desktop intelligent assistant MCP client, supports local knowledge base and MCP server. 5ire.app

View More MCP Clients

Overview

What is Edge Tts Mcp

edge-tts-mcp is a Model Context Protocol (MCP) server that utilizes Microsoft Edge’s Text-to-Speech service, enabling AI assistants to read text aloud with natural-sounding voices.

Use cases

How to use

Key features

Where to use

edge-tts-mcp can be used in various fields such as virtual assistants, educational tools, accessibility applications, and any AI-driven platforms requiring text-to-speech functionality.

Clients Supporting MCP

The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.

Claude Desktop: Official desktop application from Anthropic, natively supports MCP protocol. claude.ai

Cherry Studio: Cross-platform desktop client supporting multiple LLM providers, built-in MCP server support. cherry-ai.com

LobeChat: Modern open-source ChatGPT/LLMs UI, supports MCP protocol integration. lobehub.com

DeepChat: Cross-platform desktop AI assistant, compatible with MCP protocol, focusing on privacy and efficiency. deepchat.thinkinai.xyz

5ire: Cross-platform open-source desktop intelligent assistant MCP client, supports local knowledge base and MCP server. 5ire.app

View More MCP Clients

Content

Edge TTS MCP

A Model Context Protocol (MCP) server for Microsoft Edge Text-to-Speech service that allows AI assistants to read text aloud with natural-sounding voices.

Language

Features

Generate lifelike speech from text input
Support for multiple voice options
Customizable speech parameters (rate, volume, pitch)
Optional audio saving capability
Easy integration with Cline and other MCP-compatible clients

Installation

Prerequisites

Node.js (v16 or later)
Bun (v1.0.0 or later)

Setup

Clone the repository:

git clone https://github.com/Hwenyi/edge-tts-mcp.git
cd edge-tts-mcp

Install dependencies:

bun install

Build the project:

bun run build

Configuration

Environment Variables

The Edge TTS MCP server supports the following environment variables:

Variable	Description	Default Value	Example Values
`VOICE`	The voice to use for speech generation	`zh-CN-XiaoxiaoNeural`	`en-US-AriaNeural`, `ja-JP-NanamiNeural`
`RATE`	The speech rate	`0%`	`-10%`, `+20%`
`VOLUME`	The speech volume	`0%`	`-50%`, `+50%`
`PITCH`	The speech pitch	`0Hz`	`-10Hz`, `+5Hz`
`SAVE_AUDIO`	Whether to save audio files (true/false)	`false`	`true`

You can set these environment variables before starting the server.

Usage

Starting the Server

# Using default settings
bun run start

# Or with custom configuration
VOICE=en-US-AriaNeural RATE="+10%" SAVE_AUDIO=true bun run start

Integrating with Cline

To use this MCP server with Cline, add the following configuration to your Cline config:

{
  "mcpServers": {
    "edge-tts-mcp": {
      "command": "bun",
      "args": [
        "/path/to/edge-tts-mcp/dist/index.js"
      ],
      "disabled": false,
      "autoApprove": []
    }
  }
}

Replace /path/to/edge-tts-mcp with the actual path to your installation.

MCP Tool Parameters

The MCP server exposes the following tool:

Tool Name: speech_text_aloud

Parameters:

input (string): The text to be converted to speech and read aloud

Using with Node.js

You can also run the server using Node.js instead of Bun:

# Run with Node.js
node dist/index.js

# Or with custom environment variables
VOICE=en-US-AriaNeural RATE="+10%" SAVE_AUDIO=true node dist/index.js

For Cline integration with Node.js, update your configuration:

{
  "mcpServers": {
    "edge-tts-mcp": {
      "command": "node",
      "args": [
        "/path/to/edge-tts-mcp/dist/index.js"
      ],
      "disabled": false,
      "autoApprove": []
    }
  }
}

Audio File Storage

When the SAVE_AUDIO environment variable is set to true, audio files will be saved in the dist directory by default. Each file is named with a random UUID to prevent overwriting.

Configuration for Other Clients

5ire or Claude

You can also configure this MCP server in other clients like 5ire or Claude. Here’s an example configuration:

{
  "name": "edge-tts-mcp",
  "key": "EdgeTTSMCP",
  "description": "Read text aloud using Edge TTS",
  "command": "bun",
  "args": [
    "/path/to/edge-tts-mcp/dist/index.js"
  ]
}

⚠️ Path Format Warning: Pay attention to the path format in your configuration:

Windows: Uses backslashes (\) and needs to be escaped in JSON as \\ or converted to forward slashes (/)

macOS/Linux: Uses forward slashes (/)

Examples:

Windows path: C:\\Users\\username\\edge-tts-mcp\\dist\\index.js or C:/Users/username/edge-tts-mcp/dist/index.js

macOS/Linux path: /Users/username/edge-tts-mcp/dist/index.js

Incorrect path formatting is a common cause of setup issues across different operating systems.

Make sure to adjust the file path according to your actual installation directory.

⚠️ Important Notice: Currently, there are known issues with MCP integration in Cherry-Studio. The configuration above may not work properly in Cherry-Studio. We recommend using Cline or other well-tested MCP clients until these issues are resolved.

Example Usage in an AI Assistant

When your AI assistant needs to read text aloud, it can use a prompt like:

I need to read this text aloud: "Hello world, this is a test of the Edge TTS system."

The assistant will call the speech_text_aloud tool with the appropriate input text.

Voice Options

Microsoft Edge TTS provides many voices across different languages. Some popular options include:

en-US-AriaNeural (English, US, Female)
en-US-GuyNeural (English, US, Male)
zh-CN-XiaoxiaoNeural (Chinese, Female)
ja-JP-NanamiNeural (Japanese, Female)
de-DE-KatjaNeural (German, Female)
fr-FR-DeniseNeural (French, Female)

For a complete list of available voices, refer to the Microsoft Edge TTS documentation.

License

MIT

Dev Tools Supporting MCP

The following are the main code editors that support the Model Context Protocol. Click the link to visit the official website for more information.

Zed: High-performance collaborative code editor, supports MCP protocol, providing a smooth programming experience. zed.dev

Cursor: AI code editor built on VS Code, supports MCP protocol for context-aware programming. cursor.com

Windsurf: AI code editor from Codeium, integrates MCP protocol to provide intelligent code assistance. windsurf.com

Continue: Open-source AI programming assistant plugin, supports VS Code and JetBrains, compatible with MCP protocol. continue.dev

Trae: AI-driven code editor, supports MCP protocol, focusing on enhancing developer programming experience. trae.ai

View More MCP Dev Tools

Tools

No tools

Comments

Recommend MCP Servers

Tavily MCP Server The Tavily MCP server provides: search, extract, map, crawl tools Real-time web search capabilities through the tavily-search tool Intelligent data extraction from web pages via the tavily-extract tool Powerful web mapping tool that creates a structured map of website Web crawler that systematically explores websites.

MCP Server Chart This is a TypeScript-based MCP server that provides chart generation capabilities. It allows you to create various types of charts through MCP tools. You can also use it in Dify.

GitHub MCP Server MCP Server for the GitHub API, enabling file operations, repository management, search functionality, and more.

Brave Search MCP Server Web and local search using Brave's Search API

Firecrawl MCP Server Advanced web scraping with JavaScript rendering, PDF support, and smart rate limiting

Context7 MCP LLMs rely on outdated or generic information about the libraries you use. You get:

Slack MCP server Channel management and messaging capabilities

Sequential Thinking MCP Server Dynamic and reflective problem-solving through thought sequences

Fetch MCP Server A Model Context Protocol server that provides web content fetching capabilities.

Playwright MCP A Model Context Protocol (MCP) server that provides browser automation capabilities using [Playwright](https://playwright.dev). This server enables LLMs to interact with web pages through structured accessibility snapshots, bypassing the need for screenshots or visually-tuned models.

View All MCP Servers