Audio Mcp Server

2 MIT

FreeCommunity

AI Systems

The Audio-MCP-Server is an MCP server that enables AI assistants like Claude to interact with audio systems. It features audio device listing, recording, playback, and future text-to-speech capabilities, enhancing user interaction with seamless audio management.

What is Audio Mcp Server

Audio-MCP-Server is an MCP (Model Context Protocol) server designed to provide audio input and output capabilities for AI assistants like Claude, enabling interaction with the computer’s audio system.

Use cases

Use cases include asking Claude about available audio devices, recording audio for transcription, playing back recordings for review, and playing audio files for entertainment or information.

How to use

To use Audio-MCP-Server, clone the repository, set up a virtual environment, install dependencies, and configure Claude Desktop to connect to the server. After setup, restart Claude Desktop and use voice commands to interact with audio devices.

Key features

Key features include listing audio devices, recording audio from microphones, playing back recordings, playing audio files, and a placeholder for future text-to-speech functionality.

Where to use

Audio-MCP-Server can be used in various fields such as AI development, voice recognition applications, and any scenario requiring audio interaction with AI assistants.

Clients Supporting MCP

The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.

Claude Desktop: Official desktop application from Anthropic, natively supports MCP protocol. claude.ai

Cherry Studio: Cross-platform desktop client supporting multiple LLM providers, built-in MCP server support. cherry-ai.com

LobeChat: Modern open-source ChatGPT/LLMs UI, supports MCP protocol integration. lobehub.com

DeepChat: Cross-platform desktop AI assistant, compatible with MCP protocol, focusing on privacy and efficiency. deepchat.thinkinai.xyz

5ire: Cross-platform open-source desktop intelligent assistant MCP client, supports local knowledge base and MCP server. 5ire.app

View More MCP Clients

Overview

What is Audio Mcp Server

Use cases

Use cases include asking Claude about available audio devices, recording audio for transcription, playing back recordings for review, and playing audio files for entertainment or information.

How to use

Key features

Key features include listing audio devices, recording audio from microphones, playing back recordings, playing audio files, and a placeholder for future text-to-speech functionality.

Where to use

Audio-MCP-Server can be used in various fields such as AI development, voice recognition applications, and any scenario requiring audio interaction with AI assistants.

Clients Supporting MCP

The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.

Claude Desktop: Official desktop application from Anthropic, natively supports MCP protocol. claude.ai

Cherry Studio: Cross-platform desktop client supporting multiple LLM providers, built-in MCP server support. cherry-ai.com

LobeChat: Modern open-source ChatGPT/LLMs UI, supports MCP protocol integration. lobehub.com

DeepChat: Cross-platform desktop AI assistant, compatible with MCP protocol, focusing on privacy and efficiency. deepchat.thinkinai.xyz

5ire: Cross-platform open-source desktop intelligent assistant MCP client, supports local knowledge base and MCP server. 5ire.app

View More MCP Clients

Content

Audio MCP Server

An MCP (Model Context Protocol) server that provides audio input/output capabilities for AI assistants like Claude. This server enables Claude to interact with your computer’s audio system, including recording from microphones and playing audio through speakers.

Features

List Audio Devices: View all available microphones and speakers on your system
Record Audio: Capture audio from any microphone with customizable duration and quality
Playback Recordings: Play back your most recent recording
Audio File Playback: Play audio files through your speakers
Text-to-Speech: (Placeholder for future implementation)

Requirements

Python 3.8 or higher
Audio input/output devices on your system

Installation

Installing via Smithery

To install Audio Interface Server for Claude Desktop automatically via Smithery:

npx -y @smithery/cli install @GongRzhe/Audio-MCP-Server --client claude

Manual Installation

Clone this repository or download the files to your computer:

git clone https://github.com/GongRzhe/Audio-MCP-Server.git
cd Audio-MCP-Server

Create a virtual environment and install dependencies:

# Windows
python -m venv .venv
.venv\Scripts\activate
pip install -r requirements.txt

# macOS/Linux
python -m venv .venv
source .venv/bin/activate
pip install -r requirements.txt

Or use the included setup script to automate installation:

python setup_mcp.py

Configuration

Claude Desktop Configuration

To use this server with Claude Desktop, add the following to your Claude Desktop configuration file:

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json
Windows: %APPDATA%\Claude\claude_desktop_config.json

{
  "mcpServers": {
    "audio-interface": {
      "command": "/path/to/your/.venv/bin/python",
      "args": [
        "/path/to/your/audio_server.py"
      ],
      "env": {
        "PYTHONPATH": "/path/to/your/audio-mcp-server"
      }
    }
  }
}

Replace the paths with the actual paths on your system. The setup script will generate this configuration for you.

Usage

After setting up the server, restart Claude Desktop. You should see a hammer icon in the input box, indicating that tools are available.

Try asking Claude:

“What microphones and speakers are available on my system?”
“Record 5 seconds of audio from my microphone.”
“Play back the audio recording.”
“Play an audio file from my computer.”

Available Tools

list_audio_devices

Lists all available audio input and output devices on your system.

record_audio

Records audio from your microphone.

Parameters:

duration: Recording duration in seconds (default: 5)
sample_rate: Sample rate in Hz (default: 44100)
channels: Number of audio channels (default: 1)
device_index: Specific input device index to use (default: system default)

play_latest_recording

Plays back the most recently recorded audio.

play_audio

Placeholder for text-to-speech functionality.

Parameters:

text: The text to convert to speech
voice: The voice to use (default: “default”)

play_audio_file

Plays an audio file through your speakers.

Parameters:

file_path: Path to the audio file
device_index: Specific output device index to use (default: system default)

Troubleshooting

No devices found

If no audio devices are found, check:

Your microphone and speakers are properly connected
Your operating system recognizes the devices
You have the necessary permissions to access audio devices

Playback issues

If audio playback isn’t working:

Check your volume settings
Ensure the correct output device is selected
Try restarting the Claude Desktop application

Server connectivity

If Claude can’t connect to the server:

Verify your configuration paths are correct
Ensure Python and all dependencies are installed
Check Claude’s logs for error messages

License

MIT

Acknowledgments

Built using the Model Context Protocol
Uses sounddevice and soundfile for audio processing

Note: This server provides tools that can access your microphone and speakers. Always review and approve tool actions before they execute.

Dev Tools Supporting MCP

The following are the main code editors that support the Model Context Protocol. Click the link to visit the official website for more information.

Zed: High-performance collaborative code editor, supports MCP protocol, providing a smooth programming experience. zed.dev

Cursor: AI code editor built on VS Code, supports MCP protocol for context-aware programming. cursor.com

Windsurf: AI code editor from Codeium, integrates MCP protocol to provide intelligent code assistance. windsurf.com

Continue: Open-source AI programming assistant plugin, supports VS Code and JetBrains, compatible with MCP protocol. continue.dev

Trae: AI-driven code editor, supports MCP protocol, focusing on enhancing developer programming experience. trae.ai

View More MCP Dev Tools

Tools

No tools

Comments

Recommend MCP Servers

Tavily MCP Server The Tavily MCP server provides: search, extract, map, crawl tools Real-time web search capabilities through the tavily-search tool Intelligent data extraction from web pages via the tavily-extract tool Powerful web mapping tool that creates a structured map of website Web crawler that systematically explores websites.

MCP Server Chart This is a TypeScript-based MCP server that provides chart generation capabilities. It allows you to create various types of charts through MCP tools. You can also use it in Dify.

GitHub MCP Server MCP Server for the GitHub API, enabling file operations, repository management, search functionality, and more.

Brave Search MCP Server Web and local search using Brave's Search API

Firecrawl MCP Server Advanced web scraping with JavaScript rendering, PDF support, and smart rate limiting

Context7 MCP LLMs rely on outdated or generic information about the libraries you use. You get:

Slack MCP server Channel management and messaging capabilities

Sequential Thinking MCP Server Dynamic and reflective problem-solving through thought sequences

Fetch MCP Server A Model Context Protocol server that provides web content fetching capabilities.

Playwright MCP A Model Context Protocol (MCP) server that provides browser automation capabilities using [Playwright](https://playwright.dev). This server enables LLMs to interact with web pages through structured accessibility snapshots, bypassing the need for screenshots or visually-tuned models.

View All MCP Servers