LLMO
What is LLMO
LLMO (Lightweight LLM-MCP Orchestrator) is a configurable service that orchestrates interactions between LLM APIs and MCP servers, providing dynamic tool discovery and robust process management.
Use cases
Use cases for LLMO include managing multiple LLM providers, facilitating complex interactions in conversational agents, and enabling seamless integration of tools from different MCP servers.
How to use
To use LLMO, clone the repository, install dependencies using npm or yarn, configure your API keys in the .env file, and set up your LLM providers and MCP servers in the config.yaml file. Finally, start the service to manage interactions.
Key features
Key features of LLMO include dynamic tool discovery, configuration-driven setup, robust process management, resilient communication via stdio JSON-RPC, orchestration of tool calls, streaming support for responsive UX, and structured logging for debugging.
Where to use
LLMO can be used in various fields such as AI development, chatbot implementation, and any application requiring orchestration between LLMs and MCP servers.
Lightweight LLM-MCP Orchestrator (LLMO)
A configurable service that orchestrates interactions between LLM APIs and Model Context Protocol (MCP) servers, with dynamic tool discovery and robust process management.
Overview
LLMO v1.4 is a backend service that:
- Manages local MCP server processes (starting, stopping, monitoring)
- Dynamically discovers tools from MCP servers via stdio JSON-RPC
- Routes requests to different LLM providers
- Orchestrates tool calls between LLMs and MCPs
- Supports streaming responses for fluid conversational experiences
Key Features
- Dynamic Tool Discovery: Automatically discovers available tools from each MCP on startup
- Configuration-Driven: Simple configuration of LLM providers and MCP servers
- Robust Process Management: Reliable MCP process lifecycle with graceful shutdown (sketched after this list)
- Resilient Communication: Robust stdio JSON-RPC communication with comprehensive error handling
- Tool Call Orchestration: Sequential tool call execution with detailed error reporting
- Streaming Support: Server-Sent Events (SSE) for responsive UX
- Structured Logging: Detailed context-rich logging for debugging and monitoring
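As a concrete illustration of the process-management feature, here is a minimal sketch using Node's child_process. The escalation from SIGTERM to SIGKILL after the configured timeout is an assumption about how such a manager typically behaves, not LLMO's confirmed implementation:

```typescript
import { ChildProcess, spawn } from "node:child_process";

// Spawn an MCP as a local child process, with stdio piped so the
// orchestrator can speak JSON-RPC over stdin/stdout.
function launchMcp(command: string, args: string[]): ChildProcess {
  return spawn(command, args, { stdio: ["pipe", "pipe", "inherit"] });
}

// Ask the process to exit cleanly; force-kill it if it is still
// running past the gracefulShutdown timeout (5000 ms in the example
// configuration below).
function stopMcp(proc: ChildProcess, gracefulShutdownMs: number): Promise<void> {
  return new Promise((resolve) => {
    const timer = setTimeout(() => proc.kill("SIGKILL"), gracefulShutdownMs);
    proc.once("exit", () => {
      clearTimeout(timer);
      resolve();
    });
    proc.kill("SIGTERM");
  });
}
```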
Requirements
- Node.js 18+ (LTS)
- npm or yarn
Installation
1. Clone the repository:

   git clone [repository-url]
   cd llmo

2. Install dependencies:

   npm install

3. Copy the example environment file and update it with your API keys:

   cp .env.example .env
   # Edit .env with your API keys

4. Review and update the config.yaml file to configure your LLM providers and MCP servers.
Configuration
LLMO v1.4 uses a simplified configuration format in YAML or JSON:
LLM Providers
Configure one or more LLM providers with their API endpoints, authentication, and supported models.
MCP Servers
Configure MCP servers with just their launch parameters; LLMO dynamically discovers their tools on startup.
Example Configuration
```yaml
# LLM Providers
llmProviders:
  - name: openai
    apiEndpoint: https://api.openai.com/v1/chat/completions
    authType: bearer
    authEnvVar: OPENAI_API_KEY
    models:
      - gpt-4
      - gpt-3.5-turbo

# MCP Servers (Local Processes)
mcpServers:
  - name: filesystem
    command: npx
    args:
      - -y
      - "@modelcontextprotocol/server-filesystem"
      - "/path/to/directory"
  - name: calculator
    command: node
    args:
      - ./mcp-servers/calculator.js
    env:
      DEBUG: "true"

# Timeouts (in milliseconds)
timeouts:
  mcpResponse: 30000       # 30 seconds
  gracefulShutdown: 5000   # 5 seconds
```
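The configuration maps naturally onto a small set of types. A minimal TypeScript sketch, with interface and field names assumed to mirror the YAML keys above (the real definitions live in src/types and may differ):

```typescript
// Sketch of config types mirroring the YAML keys above; the names
// are assumptions, not LLMO's actual definitions.
interface LlmProviderConfig {
  name: string;            // e.g. "openai"
  apiEndpoint: string;     // chat completions URL
  authType: "bearer";      // the only auth type shown in the example
  authEnvVar: string;      // env var that holds the API key
  models: string[];        // models this provider serves
}

interface McpServerConfig {
  name: string;                  // e.g. "filesystem"
  command: string;               // executable to spawn, e.g. "npx" or "node"
  args: string[];                // arguments passed to the process
  env?: Record<string, string>;  // optional extra environment variables
}

interface TimeoutsConfig {
  mcpResponse: number;       // ms to wait for an MCP JSON-RPC response
  gracefulShutdown: number;  // ms before force-killing an MCP process
}

interface LlmoConfig {
  llmProviders: LlmProviderConfig[];
  mcpServers: McpServerConfig[];
  timeouts: TimeoutsConfig;
}
```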
Running the Service
1. Build the project:

   npm run build

2. Start the server:

   npm start

For development with auto-reload:

   npm run dev
How It Works
1. Startup Process:
   - LLMO loads the configuration and validates it
   - Launches all configured MCP processes
   - Sends tools/list requests to each MCP to discover available tools
   - Caches tool definitions in memory
   - Starts the HTTP server

2. Tool Discovery:
   - On startup, LLMO sends a tools/list JSON-RPC request to each MCP (example exchange below)
   - MCPs respond with their available tools (name, description, parameterSchema)
   - LLMO caches these definitions and maps tool names to their providing MCP

3. Chat Request Flow:
   - Client sends a request to the /chat endpoint
   - LLMO routes to the appropriate LLM based on the requested model
   - LLMO includes all cached tool definitions in the LLM request
   - When the LLM returns tool_calls, LLMO (see the loop sketched below):
     - Maps the tool name to its MCP
     - Sends a tools/call request to the appropriate MCP
     - Returns the MCP’s response (or error) back to the LLM
   - Streamed responses are forwarded to the client in real time
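To make the discovery step concrete, here is an illustrative tools/list exchange over stdio. The envelope is standard JSON-RPC 2.0; the read_file tool and its schema are hypothetical examples, with fields named as described above.

Request, written as a single line to the MCP's stdin:

```json
{ "jsonrpc": "2.0", "id": 1, "method": "tools/list" }
```

Response, read from the MCP's stdout:

```json
{
  "jsonrpc": "2.0",
  "id": 1,
  "result": {
    "tools": [
      {
        "name": "read_file",
        "description": "Read a file from the served directory",
        "parameterSchema": {
          "type": "object",
          "properties": { "path": { "type": "string" } },
          "required": ["path"]
        }
      }
    ]
  }
}
```

And a minimal TypeScript sketch of the tool-call loop in step 3. All names here (SendJsonRpc, toolToMcp, runToolCalls) are illustrative assumptions about the flow, not LLMO's actual internals:

```typescript
// Writes a JSON-RPC request to the named MCP's stdin and resolves
// with the matching response (implementation omitted in this sketch).
type SendJsonRpc = (
  mcp: string,
  method: string,
  params: unknown
) => Promise<unknown>;

interface ToolCall {
  id: string;                         // LLM-assigned call id
  name: string;                       // e.g. "read_file"
  arguments: Record<string, unknown>; // parsed tool arguments
}

interface ToolResult {
  id: string;
  result?: unknown;
  error?: string;
}

// Execute the LLM's tool calls sequentially (as the Key Features
// section describes), producing one result or error per call to
// feed back into the next LLM request.
async function runToolCalls(
  calls: ToolCall[],
  toolToMcp: Map<string, string>, // built from tools/list at startup
  send: SendJsonRpc
): Promise<ToolResult[]> {
  const results: ToolResult[] = [];
  for (const call of calls) {
    const mcp = toolToMcp.get(call.name);
    if (!mcp) {
      results.push({ id: call.id, error: "MCP_UNAVAILABLE" });
      continue;
    }
    try {
      const result = await send(mcp, "tools/call", {
        name: call.name,
        arguments: call.arguments,
      });
      results.push({ id: call.id, result });
    } catch (err) {
      results.push({ id: call.id, error: String(err) });
    }
  }
  return results;
}
```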
API Endpoints
Health Check
```
GET /health
```
Returns server status information.
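An illustrative response body; the exact fields are assumptions, since the source only says the endpoint returns status information:

```json
{
  "status": "ok",
  "mcpServers": ["filesystem", "calculator"]
}
```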
Chat
```
POST /chat
```

Body:

```json
{
  "model": "gpt-4",
  "messages": [
    {
      "role": "user",
      "content": "What's in my Documents folder?"
    }
  ],
  "stream": true
}
```
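With "stream": true the endpoint replies with Server-Sent Events. A minimal consumer sketch in TypeScript for Node 18+, assuming the server listens on localhost:3000 and that each SSE data: line carries a text chunk (both are assumptions; the source specifies neither the port nor the event payload shape):

```typescript
// Minimal SSE consumer sketch. The port (3000) and the payload of
// each "data:" line are assumptions, not specified by the source.
async function chat(prompt: string): Promise<void> {
  const res = await fetch("http://localhost:3000/chat", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({
      model: "gpt-4",
      messages: [{ role: "user", content: prompt }],
      stream: true,
    }),
  });
  if (!res.ok || !res.body) throw new Error(`HTTP ${res.status}`);

  const reader = res.body.getReader();
  const decoder = new TextDecoder();
  let buffer = "";
  for (;;) {
    const { done, value } = await reader.read();
    if (done) break;
    buffer += decoder.decode(value, { stream: true });
    // SSE events are separated by a blank line; print each data line
    // as it arrives.
    const events = buffer.split("\n\n");
    buffer = events.pop() ?? "";
    for (const event of events) {
      for (const line of event.split("\n")) {
        if (line.startsWith("data: ")) console.log(line.slice(6));
      }
    }
  }
}

chat("What's in my Documents folder?").catch(console.error);
```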
Error Handling
LLMO implements standardized error handling:
- For non-streaming responses: JSON error objects with code and message
- For streaming responses: SSE error events
- For tool calls: specific MCP_* error types with detailed messages:
  - MCP_UNAVAILABLE: The MCP process is not available
  - MCP_COMMUNICATION_ERROR: Error in stdio communication
  - MCP_TIMEOUT: The MCP response timed out
  - MCP_INVALID_RESPONSE: Invalid JSON-RPC response from MCP
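A sketch of how these codes might surface in a non-streaming JSON error object (the field names are assumptions, not confirmed by the source):

```typescript
// Assumed error payload shape; illustrative only.
type McpErrorCode =
  | "MCP_UNAVAILABLE"          // the MCP process is not available
  | "MCP_COMMUNICATION_ERROR"  // error in stdio communication
  | "MCP_TIMEOUT"              // the MCP response timed out
  | "MCP_INVALID_RESPONSE";    // invalid JSON-RPC response from MCP

interface ErrorResponse {
  error: {
    code: McpErrorCode | string; // non-MCP failures use other codes
    message: string;             // detailed, human-readable context
  };
}
```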
Project Structure
The project is organized as follows:
- src/config: Configuration schema and loading
- src/process-manager: MCP process lifecycle management
- src/mcp-client: Stdio JSON-RPC communication
- src/llm-client: LLM API interaction
- src/routes: API endpoints
- src/types: TypeScript interfaces and types
- src/utils: Shared utilities
License
MIT