Exp Llm Mcp Rag

78 MIT

FreeCommunity

AI Systems

Python implementation based on KelvinQiu802/llm-mcp-rag, used for learning and practicing LLM (Large Language Model), MCP (Model-Conditioned Prompting), and RAG (Retrieval-Augmented Generation) technologies.

What is Exp Llm Mcp Rag

exp-llm-mcp-rag is a Python implementation based on KelvinQiu802/llm-mcp-rag, designed for learning and practicing Large Language Models (LLM), Model Context Protocol (MCP), and Retrieval-Augmented Generation (RAG) technologies.

Use cases

Use cases include building intelligent chatbots, creating automated customer support systems, developing educational tools, and enhancing search engines with contextual understanding.

How to use

To use exp-llm-mcp-rag, clone the repository from GitHub, install the necessary dependencies, and follow the instructions in the README to set up the environment. Users can interact with the AI assistant by sending queries, which will be processed through the LLM and MCP.

Key features

Key features include OpenAI API integration for LLM calls, interaction between LLM and external tools via MCP, implementation of a vector retrieval-based RAG system, and support for file system operations and web content retrieval.

Where to use

exp-llm-mcp-rag can be used in various fields such as AI research, natural language processing, chatbot development, and any application requiring enhanced information retrieval and generation capabilities.

Clients Supporting MCP

The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.

Claude Desktop: Official desktop application from Anthropic, natively supports MCP protocol. claude.ai

Cherry Studio: Cross-platform desktop client supporting multiple LLM providers, built-in MCP server support. cherry-ai.com

LobeChat: Modern open-source ChatGPT/LLMs UI, supports MCP protocol integration. lobehub.com

DeepChat: Cross-platform desktop AI assistant, compatible with MCP protocol, focusing on privacy and efficiency. deepchat.thinkinai.xyz

5ire: Cross-platform open-source desktop intelligent assistant MCP client, supports local knowledge base and MCP server. 5ire.app

View More MCP Clients

Overview

What is Exp Llm Mcp Rag

Use cases

Use cases include building intelligent chatbots, creating automated customer support systems, developing educational tools, and enhancing search engines with contextual understanding.

How to use

Key features

Where to use

Clients Supporting MCP

The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.

Claude Desktop: Official desktop application from Anthropic, natively supports MCP protocol. claude.ai

Cherry Studio: Cross-platform desktop client supporting multiple LLM providers, built-in MCP server support. cherry-ai.com

LobeChat: Modern open-source ChatGPT/LLMs UI, supports MCP protocol integration. lobehub.com

DeepChat: Cross-platform desktop AI assistant, compatible with MCP protocol, focusing on privacy and efficiency. deepchat.thinkinai.xyz

5ire: Cross-platform open-source desktop intelligent assistant MCP client, supports local knowledge base and MCP server. 5ire.app

View More MCP Clients

Content

LLM-MCP-RAG 实验项目

本项目是基于 KelvinQiu802/llm-mcp-rag 的 Python 实现版本，用于学习和实践 LLM、MCP 和 RAG 技术。

该项目作者有演示视频见 https://www.bilibili.com/video/BV1dcRqYuECf/

强烈建议先浏览其README, 本仓库对一些逻辑进行了微调和命名调整!

项目简介

本项目是一个基于大语言模型（LLM）、模型上下文协议（MCP）和检索增强生成（RAG）的实验性项目。它展示了如何构建一个能够与外部工具交互并利用检索增强生成技术的 AI 助手系统。

核心功能

基于 OpenAI API 的大语言模型调用
通过 MCP（Model Context Protocol）实现 LLM 与外部工具的交互
实现基于向量检索的 RAG（检索增强生成）系统
支持文件系统操作和网页内容获取

系统架构

graph TD
    A[用户] -->|提问| B[Agent]
    B -->|调用| C[LLM]
    C -->|生成回答/工具调用| B
    B -->|工具调用| D[MCP 客户端]
    D -->|执行| E[MCP 服务器]
    E -->|文件系统操作| F[文件系统]
    E -->|网页获取| G[网页内容]
    H[文档/知识库] -->|嵌入| I[向量存储-内存形式]
    B -->|查询| I
    I -->|相关上下文| B

主要组件

classDiagram
    class Agent {
        +mcp_clients: list[MCPClient]
        +model: str
        +llm: AsyncChatOpenAI
        +system_prompt: str
        +context: str
        +init()
        +cleanup()
        +invoke(prompt: str)
    }

    class MCPClient {
        +name: str
        +command: str
        +args: list[str]
        +version: str
        +init()
        +cleanup()
        +get_tools()
        +call_tool(name: str, params: dict)
    }

    class AsyncChatOpenAI {
        +model: str
        +messages: list
        +tools: list[Tool]
        +system_prompt: str
        +context: str
        +chat(prompt: str, print_llm_output: bool)
        +get_tools_definition()
        +append_tool_result(tool_call_id: str, tool_output: str)
    }

    class EembeddingRetriever {
        +embedding_model: str
        +vector_store: VectorStore
        +embed_query(query: str)
        +embed_documents(document: str)
        +retrieve(query: str, top_k: int)
    }

    class VectorStore {
        +items: list[VectorStoreItem]
        +add(item: VectorStoreItem)
        +search(query_embedding: list[float], top_k: int)
    }

    class ALogger {
        +prefix: str
        +title(text: str, rule_style: str)
    }

    Agent --> MCPClient
    Agent --> AsyncChatOpenAI
    Agent ..> EembeddingRetriever
    EembeddingRetriever --> VectorStore
    Agent ..> ALogger
    AsyncChatOpenAI ..> ALogger

快速开始

环境准备

确保已安装 Python 3.12 或更高版本
克隆本仓库
复制 .env.example 为 .env 并填写必要的配置信息：
- OPENAI_API_KEY: OpenAI API 密钥
- OPENAI_BASE_URL: OpenAI API 基础 URL, 注意要保留后面的’/v1’ (默认为 ‘https://api.openai.com/v1’)
- DEFAULT_MODEL_NAME: (可选) 默认使用的模型名称（默认为 “gpt-4o-mini”）
- EMBEDDING_KEY: (可选) 嵌入模型 API 密钥（默认为 $OPENAI_API_KEY）
- EMBEDDING_BASE_URL: (可选) 嵌入模型 API 基础 URL, 如硅基流动的API或兼容OpenAI格式的API （默认为 $OPENAI_BASE_URL）
- USE_CN_MIRROR: (可选) 是否使用中国镜像, 设置任意值(如’1’)为 true (默认为 false)
- PROXY_URL: (可选) 代理 URL (如 “http(s)://xxx”), 用于 fetch (mcp-tool) 走代理

安装依赖

# 使用 uv 安装依赖
uv sync

运行示例

本项目使用 just 命令工具来运行不同的示例：

# 查看可用命令
just help

RAG 示例流程

sequenceDiagram
    participant User as 用户
    participant Agent as Agent
    participant LLM as LLM
    participant ER as EmbeddingRetriever
    participant VS as VectorStore
    participant MCP as MCP客户端
    participant Logger as ALogger

    User->>Agent: 提供查询
    Agent->>Logger: 记录操作日志
    Agent->>ER: 检索相关文档
    ER->>VS: 查询向量存储
    VS-->>ER: 返回相关文档
    ER-->>Agent: 返回上下文
    Agent->>LLM: 发送查询和上下文
    LLM-->>Agent: 生成回答或工具调用
    Agent->>Logger: 记录工具调用
    Agent->>MCP: 执行工具调用
    MCP-->>Agent: 返回工具结果
    Agent->>LLM: 发送工具结果
    LLM-->>Agent: 生成最终回答
    Agent-->>User: 返回回答

项目结构

src/augmented/: 主要源代码目录
- agent.py: Agent 实现，负责协调 LLM 和工具
- chat_openai.py: OpenAI API 客户端封装
- mcp_client.py: MCP 客户端实现
- embedding_retriever.py: 嵌入检索器实现
- vector_store.py: 向量存储实现
- mcp_tools.py: MCP 工具定义
- utils/: 工具函数
  - info.py: 项目信息和配置
  - pretty.py: 统一日志输出系统
rag_example.py: RAG 示例程序
justfile: 任务运行配置文件

学习资源

Model Context Protocol (MCP): 了解 MCP 协议
OpenAI API 文档: OpenAI API 参考
RAG (Retrieval-Augmented Generation): RAG 技术论文

Dev Tools Supporting MCP

The following are the main code editors that support the Model Context Protocol. Click the link to visit the official website for more information.

Zed: High-performance collaborative code editor, supports MCP protocol, providing a smooth programming experience. zed.dev

Cursor: AI code editor built on VS Code, supports MCP protocol for context-aware programming. cursor.com

Windsurf: AI code editor from Codeium, integrates MCP protocol to provide intelligent code assistance. windsurf.com

Continue: Open-source AI programming assistant plugin, supports VS Code and JetBrains, compatible with MCP protocol. continue.dev

Trae: AI-driven code editor, supports MCP protocol, focusing on enhancing developer programming experience. trae.ai

View More MCP Dev Tools

Tools

No tools

Comments

Recommend MCP Servers

Tavily MCP Server The Tavily MCP server provides: search, extract, map, crawl tools Real-time web search capabilities through the tavily-search tool Intelligent data extraction from web pages via the tavily-extract tool Powerful web mapping tool that creates a structured map of website Web crawler that systematically explores websites.

MCP Server Chart This is a TypeScript-based MCP server that provides chart generation capabilities. It allows you to create various types of charts through MCP tools. You can also use it in Dify.

GitHub MCP Server MCP Server for the GitHub API, enabling file operations, repository management, search functionality, and more.

Brave Search MCP Server Web and local search using Brave's Search API

Firecrawl MCP Server Advanced web scraping with JavaScript rendering, PDF support, and smart rate limiting

Context7 MCP LLMs rely on outdated or generic information about the libraries you use. You get:

Slack MCP server Channel management and messaging capabilities

Sequential Thinking MCP Server Dynamic and reflective problem-solving through thought sequences

Fetch MCP Server A Model Context Protocol server that provides web content fetching capabilities.

Playwright MCP A Model Context Protocol (MCP) server that provides browser automation capabilities using [Playwright](https://playwright.dev). This server enables LLMs to interact with web pages through structured accessibility snapshots, bypassing the need for screenshots or visually-tuned models.

View All MCP Servers