
local_RAG_mcp

This is my attempt at creating a local RAG MCP server to chat with local .docx and .xlsx files.

Overview

What is local_RAG_mcp

local_RAG_mcp is a Python-based MCP server that lets you query local Microsoft Word (.docx) and Excel (.xlsx) documents in natural language, while keeping all data processing on your machine.

Use cases

Typical uses of local_RAG_mcp include academic researchers querying papers in .docx format, business analysts extracting data from Excel spreadsheets, and anyone who wants quick information retrieval from personal documents.

How to use

To use local_RAG_mcp, set up the server by installing the necessary dependencies, specify a local directory containing your .docx and .xlsx files, and then initiate the MCP agent to start querying your documents in natural language.

Key features

Key features include local processing for privacy, support for .docx and .xlsx files, simple indexing for document management, natural language Q&A capabilities, and integration with MCP tools for enhanced functionality.

Where to use

local_RAG_mcp can be used in various fields such as education, research, and business, where users need to extract information from local documents without compromising data privacy.

Content

local_RAG_mcp

This is my attempt at creating a local RAG MCP server to chat with local .docx and .xlsx files.

Local Document Q&A with Ollama & MCP

This project provides a Python-based MCP agent that allows you to chat with your local Word (.docx) and Excel (.xlsx) documents using Ollama for local language model inference, ensuring your private data stays on your machine.

It uses LangChain community components for document loading, text splitting, embeddings, and vector storage (ChromaDB), and mcp-sdk to expose this functionality as a set of tools.
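
A rough sketch of how those pieces fit together on the indexing side (the import paths assume the langchain-community distribution, and the file names are placeholders, not files from this repo):

    # Illustrative only: imports assume langchain-community; file names are made up.
    from langchain.text_splitter import RecursiveCharacterTextSplitter
    from langchain_community.document_loaders import Docx2txtLoader, UnstructuredExcelLoader
    from langchain_community.embeddings import OllamaEmbeddings
    from langchain_community.vectorstores import Chroma

    docs = Docx2txtLoader("docs_simple/example.docx").load()
    docs += UnstructuredExcelLoader("docs_simple/example.xlsx").load()

    # Split into overlapping chunks; the sizes here are common defaults, not the script's.
    chunks = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=200).split_documents(docs)

    # Embed each chunk locally via Ollama and persist the index to disk.
    store = Chroma.from_documents(
        chunks,
        OllamaEmbeddings(model="nomic-embed-text"),
        persist_directory="chroma_db_simple",
    )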

Features:

  • Private & Local: All processing, embedding, and language model inference happens locally via Ollama. No data leaves your machine.
  • Supported Document Types: Currently supports Microsoft Word (.docx) and Excel (.xlsx) files. (Easily extensible for PDFs, .txt, etc.)
  • Simple Indexing: A dedicated MCP tool to scan a specified directory, process documents, and build a searchable vector index.
  • Natural Language Q&A: Ask questions in natural language about the content of your documents.
  • MCP Integration: Exposes functionality through MCP tools, usable with MCP Inspector or mcp-cli.

How it Works

  1. Document Loading: The agent scans a designated local folder for supported documents.
  2. Text Chunking: Document content is split into smaller, manageable chunks.
  3. Embedding: Each chunk is converted into a numerical representation (embedding) using a local Ollama embedding model (e.g., nomic-embed-text).
  4. Vector Storage: These embeddings and their corresponding text chunks are stored in a local ChromaDB vector store.
  5. Querying (RAG, Retrieval-Augmented Generation; sketched in code after this list):
    • When you ask a question, it’s also embedded.
    • The system searches the vector store for document chunks with embeddings most similar to your question’s embedding.
    • These relevant chunks (context) are combined with your original question into a prompt.
    • This prompt is sent to a local Ollama chat model (e.g., llama3) to generate an answer.
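
A minimal sketch of the query side (step 5), assuming the same LangChain components and the persisted index from above; the real script's prompt and retriever settings may differ:

    from langchain_community.chat_models import ChatOllama
    from langchain_community.embeddings import OllamaEmbeddings
    from langchain_community.vectorstores import Chroma

    # Reopen the persisted vector store built during indexing.
    store = Chroma(
        persist_directory="chroma_db_simple",
        embedding_function=OllamaEmbeddings(model="nomic-embed-text"),
    )
    retriever = store.as_retriever(search_kwargs={"k": 3})  # top-3 most similar chunks

    question = "What is the company policy on annual leave?"
    context = "\n\n".join(d.page_content for d in retriever.invoke(question))
    prompt = f"Answer using only this context:\n\n{context}\n\nQuestion: {question}"
    print(ChatOllama(model="llama3").invoke(prompt).content)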

Prerequisites

  1. Python: Python 3.9+
  2. Ollama: You need Ollama installed and running.
    • Installation: ollama.com
    • Ensure Ollama is serving models. You can test this by running ollama list in your terminal.
  3. Required Ollama Models:
    • Pull the embedding model: ollama pull nomic-embed-text
    • Pull the chat model: ollama pull llama3

Installation

  1. Clone the Repository (or download the script):

    # If you create a Git repository:
    # git clone https://github.com/ItsMistahJi/local_RAG_mcp
    # cd local_RAG_mcp
    
  2. Create a Virtual Environment (Recommended):

    python -m venv venv
    source venv/bin/activate  # On Windows: venv\Scripts\activate
    
  3. Install Python Dependencies:

    pip install -r requirements.txt
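
The repository's requirements.txt is not reproduced in this README. Based on the components described above, it plausibly contains something like the following (unverified; the actual file may pin versions or differ):

    mcp
    langchain
    langchain-community
    chromadb
    docx2txt
    openpyxl
    unstructured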
    

Setup & Usage

  1. Prepare Your Documents:

    • Create a folder named docs_simple in the same directory as the server.py script (or configure DOC_DIR in the script).
    • Place your .docx and .xlsx files into this docs_simple folder.
  2. Run the MCP Agent Script:
    Open your terminal, navigate to the project directory, and run:

    python server.py
    

    The script will start, attempt to connect to Ollama, and then wait for MCP client connections (like MCP Inspector). You’ll see log output in this terminal.

  3. Interact using MCP Inspector (Recommended):

    • Download and install MCP Inspector.
    • Open MCP Inspector.
    • Configure the Agent:
      • If the agent isn’t automatically detected, you may need to manually add it.
      • Go to “File” > “Preferences” > “Agents” (or a similar section).
      • Click “Add” or the “+” icon.
      • Name: Give it a descriptive name (e.g., “My Local RAG Agent”).
      • Command: Enter the full command to run your script: python /full/path/to/your/server.py.
      • Transport: Select stdio.
      • Save the configuration.
    • Connect and Use:
      • Back in the main MCP Inspector window, find your configured agent in the list.
      • Click “Connect” or the play icon next to it. MCP Inspector will run your script.
      • Once connected, you’ll see the available tools:
        • initialize_and_index: Run this tool first. It takes no arguments. It will process the documents in docs_simple and create a local vector database in chroma_db_simple. Check the agent’s terminal logs for progress.
        • ask_question: After indexing is complete, use this tool. It takes one argument:
          {
            "question": "Your question about the documents here"
          }
          The agent will retrieve relevant information and generate an answer.
  4. Interact using mcp-cli (Alternative):
    Ensure your Python script (server.py) is not already running. mcp-cli will launch it for each command.

    • List available tools (optional check):

      mcp tool list CompanyDocumentQA-Ollama --command "python server.py" --transport stdio
      
    • Index Documents:

      mcp tool call CompanyDocumentQA-Ollama initialize_and_index --command "python server.py" --transport stdio
      

      (Wait for this to complete. You’ll see logs in your terminal.)

    • Ask a Question:

      mcp tool call CompanyDocumentQA-Ollama ask_question '{"question": "What is the company policy on annual leave?"}' --command "python server.py" --transport stdio
      

Script Overview (server.py)

  • Configuration: Constants at the top for document directory, ChromaDB path, Ollama models, etc.
  • Helper Functions: For loading and splitting documents, initializing Ollama components.
  • initialize_and_index (MCP Tool):
    • Loads .docx and .xlsx files.
    • Splits them into chunks.
    • Generates embeddings using Ollama.
    • Stores chunks and embeddings in a persistent ChromaDB.
  • ask_question (MCP Tool):
    • Takes a user question.
    • Embeds the question.
    • Retrieves relevant document chunks from ChromaDB.
    • Constructs a prompt with the question and context.
    • Gets an answer from the Ollama LLM.
  • Main Block: Sets up logging, initializes Ollama components, and starts the MCP agent server on stdio.
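
As a sketch of that structure (the tool names, constants, and server name are taken from this README; the FastMCP wiring is an assumption about how the script uses the MCP Python SDK, and the tool bodies are stubs):

    from mcp.server.fastmcp import FastMCP

    DOC_DIR = "docs_simple"            # folder scanned for .docx/.xlsx files
    CHROMA_DIR = "chroma_db_simple"    # persistent ChromaDB location
    EMBEDDING_MODEL_NAME = "nomic-embed-text"
    LLM_MODEL_NAME = "llama3"

    mcp = FastMCP("CompanyDocumentQA-Ollama")

    @mcp.tool()
    def initialize_and_index() -> str:
        """Scan DOC_DIR, chunk and embed the documents, persist to ChromaDB."""
        # load -> split -> embed -> store (see the earlier snippets)
        return "Indexing complete."

    @mcp.tool()
    def ask_question(question: str) -> str:
        """Embed the question, retrieve matching chunks, and answer via Ollama."""
        # retrieve -> build prompt -> generate (RAG)
        return "..."

    if __name__ == "__main__":
        mcp.run()  # stdio transport by default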

Customization & Future Enhancements

  • More Document Types: Add loaders for PDFs (PyPDFLoader), text files (TextLoader), etc., in load_documents_from_directory; see the sketch after this list. Remember to install the necessary packages (e.g., pypdf, unstructured).
  • Different Models: Change EMBEDDING_MODEL_NAME and LLM_MODEL_NAME to use other Ollama models. Ensure they are pulled locally.
  • Chunking Strategy: Experiment with chunk_size and chunk_overlap in RecursiveCharacterTextSplitter for better results.
  • Retriever Options: Modify search_kwargs={"k": 3} in ask_question to retrieve more/fewer chunks. Explore other retrieval modes if needed.
  • Prompt Engineering: Refine the prompt template in ask_question for better LLM responses.
  • Error Handling: Enhance error handling and user feedback.
  • Web UI (e.g., Streamlit/Gradio): Wrap the MCP agent or its core logic in a simple web UI for easier non-technical user access, potentially by having the UI call the MCP agent tools.
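
For the first bullet, extending document loading to PDFs might look like this (a hypothetical shape for load_documents_from_directory; the real function may be organized differently, and PyPDFLoader requires pip install pypdf):

    from pathlib import Path
    from langchain_community.document_loaders import (
        Docx2txtLoader,
        PyPDFLoader,
        UnstructuredExcelLoader,
    )

    # Map file extensions to loader classes; add entries here to support new types.
    LOADERS = {
        ".docx": Docx2txtLoader,
        ".xlsx": UnstructuredExcelLoader,
        ".pdf": PyPDFLoader,  # new: PDF support
    }

    def load_documents_from_directory(doc_dir: str) -> list:
        docs = []
        for path in Path(doc_dir).iterdir():
            loader_cls = LOADERS.get(path.suffix.lower())
            if loader_cls is not None:
                docs.extend(loader_cls(str(path)).load())
        return docs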

Troubleshooting

  • “Ollama not found” / Connection Errors:
    • Ensure Ollama is running (ollama serve or the Ollama desktop app).
    • Verify OLLAMA_BASE_URL in the script matches your Ollama setup (default is http://localhost:11434).
    • Make sure the models (nomic-embed-text, llama3) are pulled: ollama list. A programmatic check is sketched at the end of this section.
  • “No documents found” / “Vector store empty”:
    • Double-check the DOC_DIR path in the script and ensure it points to the correct folder.
    • Make sure your document files are in that folder and have supported extensions (.docx, .xlsx).
    • Run the initialize_and_index tool. Check the terminal logs for errors during indexing.
  • mcp-cli issues:
    • Ensure mcp-cli is installed correctly (pip install "mcp-cli[cli]").
    • Use the full --command "python /path/to/script.py" and --transport stdio flags.
  • MCP Inspector doesn’t see the agent:
    • Ensure you’ve configured the agent correctly in MCP Inspector preferences with the full path to the Python script and stdio transport.
    • Make sure the script is not already running when MCP Inspector tries to start it.
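
For the Ollama connection errors above, a quick self-contained check (standard library only, against Ollama's documented /api/tags endpoint):

    import json
    import urllib.request

    # Ask the local Ollama server which models it has pulled.
    with urllib.request.urlopen("http://localhost:11434/api/tags") as resp:
        names = [m["name"] for m in json.load(resp)["models"]]

    print(names)  # should include nomic-embed-text and llama3 variants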
