Myrag

1 MIT

FreeCommunity

AI Systems

# My RAG Project Involvement in MCP

What is Myrag

myRAG is a project that integrates with MCP, built using Spring Boot. It provides an AI service interface through a simple controller that supports both synchronous and asynchronous communication for generating text responses.

Use cases

Use cases for myRAG include: 1) Chatbots that provide instant responses to user queries; 2) Applications that require real-time text generation for dynamic content; 3) Systems that need to handle both short and long text generation tasks effectively.

How to use

To use myRAG, you can make HTTP requests to its API endpoints. For synchronous text generation, use the ‘/generate’ endpoint, and for asynchronous streaming responses, use the ‘/generate_stream’ endpoint. Both require specifying the model and the message as parameters.

Key features

Key features of myRAG include: 1) Synchronous and asynchronous text generation capabilities; 2) Support for short and long text generation; 3) Real-time content streaming for enhanced user experience; 4) Built on Spring Boot for easy deployment and scalability.

Where to use

myRAG can be used in various fields such as customer support, content creation, and interactive applications where AI-generated text responses are needed.

Clients Supporting MCP

The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.

Claude Desktop: Official desktop application from Anthropic, natively supports MCP protocol. claude.ai

Cherry Studio: Cross-platform desktop client supporting multiple LLM providers, built-in MCP server support. cherry-ai.com

LobeChat: Modern open-source ChatGPT/LLMs UI, supports MCP protocol integration. lobehub.com

DeepChat: Cross-platform desktop AI assistant, compatible with MCP protocol, focusing on privacy and efficiency. deepchat.thinkinai.xyz

5ire: Cross-platform open-source desktop intelligent assistant MCP client, supports local knowledge base and MCP server. 5ire.app

View More MCP Clients

Overview

What is Myrag

Use cases

How to use

Key features

Where to use

myRAG can be used in various fields such as customer support, content creation, and interactive applications where AI-generated text responses are needed.

Clients Supporting MCP

The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.

Claude Desktop: Official desktop application from Anthropic, natively supports MCP protocol. claude.ai

Cherry Studio: Cross-platform desktop client supporting multiple LLM providers, built-in MCP server support. cherry-ai.com

LobeChat: Modern open-source ChatGPT/LLMs UI, supports MCP protocol integration. lobehub.com

DeepChat: Cross-platform desktop AI assistant, compatible with MCP protocol, focusing on privacy and efficiency. deepchat.thinkinai.xyz

5ire: Cross-platform open-source desktop intelligent assistant MCP client, supports local knowledge base and MCP server. 5ire.app

View More MCP Clients

Content

v1版本

用springboot构建了一个基本的应用
一个简单的入口类

@SpringBootApplication
@Configuration
public class Application {
    public static void main(String[] args) {
        SpringApplication.run(Application.class, args);
    }
}

做了一个简单的调用ollama的Controller,这个controller实现了AI的服务接口,主要是有非流式服务和流式服务两种:
chatClient.call:

返回类型是 ChatResponse
同步执行，等待完整响应后一次性返回
适用于短文本生成场景
客户端需要等待全部内容生成完才能看到响应
chatClient.stream:
返回类型是 Flux
异步执行，使用响应式流处理
适用于长文本生成场景
客户端可以实时看到生成的内容（类似 ChatGPT 的打字效果）
需要客户端支持流式处理（如 Server-Sent Events）

@RestController
@CrossOrigin("*")
@RequestMapping("api/v1/ollama")
public class OllamaController implements IAiService {

    @Resource
    private OllamaChatClient chatClient;


    @RequestMapping(value = "generate", method = RequestMethod.GET)
    @Override
    public ChatResponse generate(@RequestParam String model,@RequestParam String message) {
        return chatClient.call(new Prompt(message, OllamaOptions.create().withModel(model)));
    }

    @RequestMapping(value = "generate_stream", method = RequestMethod.GET)
    @Override
    public Flux<ChatResponse> generateStream(String model, String message) {
        return chatClient.stream(new Prompt(message, OllamaOptions.create().withModel(model)));
    }
}

其中OllamaChatClient的注入是通过在configuration

@Configuration
public class OllamaConfig {

    @Bean
    public OllamaApi ollamaApi(@Value("${spring.ai.ollama.base-url}") String baseUrl) {
        return new OllamaApi(baseUrl);
    }
    
    @Bean
    public OllamaChatClient ollamaChatClient(OllamaApi ollamaApi) {
        return new OllamaChatClient(ollamaApi);
    }

}

可以通过http请求的方式调用ollama的服务

curl -X GET "http://127.0.0.1:8090/api/v1/ollama/generate?model=deepseek-r1:1.5b&message=hello"

流式调用

curl -N -X GET "http://127.0.0.1:8090/api/v1/ollama/generate_stream?model=deepseek-r1:1.5b&message=hello"

配置maven的profile的时候可以指定jvm参数

<profile>
<id>dev</id>
<activation>
<activeByDefault>true</activeByDefault>
</activation>
<properties>
<java_jvm>-Xms1G -Xmx1G -server  -XX:MaxPermSize=256M -Xss256K -Dspring.profiles.active=test -XX:+DisableExplicitGC -XX:+UseG1GC  -XX:LargePageSizeInBytes=128m -XX:+UseFastAccessorMethods -XX:+HeapDumpOnOutOfMemoryError -XX:HeapDumpPath=/export/Logs/xfg-frame-archetype-lite-boot -Xloggc:/export/Logs/xfg-frame-archetype-lite-boot/gc-xfg-frame-archetype-lite-boot.log -XX:+PrintGCDetails -XX:+PrintGCDateStamps</java_jvm>
<profileActive>dev</profileActive>
</properties>
</profile>

内存配置：
-Xms1G: 初始堆内存大小为 1GB
-Xmx1G: 最大堆内存大小为 1GB
-XX:MaxPermSize=256M: 永久代最大大小为 256MB（注意：JDK8 后被 Metaspace 替代）
-Xss256K: 每个线程的栈大小为 256KB

v2版本

生成前端,通过AI生成前端页面
首先整理问题:

根据以下信息编写,HTML的UI以对接服务端的的接口

我们实现了流式的GET请求接口
@RestController
@CrossOrigin(“*”)
@RequestMapping(“api/v1/ollama”)
public class OllamaController implements IAiService {
@Resource
private OllamaChatClient chatClient;
@RequestMapping(value = “generate”, method = RequestMethod.GET)
@Override
public ChatResponse generate(@RequestParam String model,@RequestParam String message) {
return chatClient.call(new Prompt(message, OllamaOptions.create().withModel(model)));
}
@RequestMapping(value = “generate_stream”, method = RequestMethod.GET)
@Override
public Flux generateStream(String model, String message) {
return chatClient.stream(new Prompt(message, OllamaOptions.create().withModel(model)));
}
}
通过 GET http://127.0.0.1:8090/api/v1/ollama/generate_stream?model=deepseek-r1:1.5b&message=hello
我们在前端获得了流式应答:
[
{
“result”: {
“output”: {
“messageType”: “ASSISTANT”,
“properties”: {
“id”: “chatcmpl-B3HPw95SsqmhoWeJ8azGLxK1Vf4At”,
“role”: “ASSISTANT”,
“finishReason”: “”
},
“content”: “1”,
“media”: []
},
“metadata”: {
“finishReason”: null,
“contentFilterMetadata”: null
}
}
},
{
“result”: {
“output”: {
“messageType”: “ASSISTANT”,
“properties”: {
“id”: “chatcmpl-B3HPw95SsqmhoWeJ8azGLxK1Vf4At”,
“role”: “ASSISTANT”,
“finishReason”: “”
},
“content”: " +",
“media”: []
},
“metadata”: {
“finishReason”: null,
“contentFilterMetadata”: null
}
}
},
{
“result”: {
“output”: {
“messageType”: “ASSISTANT”,
“properties”: {
“id”: “chatcmpl-B3HPw95SsqmhoWeJ8azGLxK1Vf4At”,
“role”: “ASSISTANT”,
“finishReason”: “”
},
“content”: " “,
“media”: []
},
“metadata”: {
“finishReason”: null,
“contentFilterMetadata”: null
}
}
},
{
“result”: {
“output”: {
“messageType”: “ASSISTANT”,
“properties”: {
“id”: “chatcmpl-B3HPw95SsqmhoWeJ8azGLxK1Vf4At”,
“role”: “ASSISTANT”,
“finishReason”: “”
},
“content”: “1”,
“media”: []
},
“metadata”: {
“finishReason”: null,
“contentFilterMetadata”: null
}
}
},
{
“result”: {
“output”: {
“messageType”: “ASSISTANT”,
“properties”: {
“id”: “chatcmpl-B3HPw95SsqmhoWeJ8azGLxK1Vf4At”,
“role”: “ASSISTANT”,
“finishReason”: “”
},
“content”: " equals”,
“media”: []
},
“metadata”: {
“finishReason”: null,
“contentFilterMetadata”: null
}
}
},
{
“result”: {
“output”: {
“messageType”: “ASSISTANT”,
“properties”: {
“id”: “chatcmpl-B3HPw95SsqmhoWeJ8azGLxK1Vf4At”,
“role”: “ASSISTANT”,
“finishReason”: “”
},
“content”: “2”,
“media”: []
},
“metadata”: {
“finishReason”: null,
“contentFilterMetadata”: null
}
}
},
{
“result”: {
“output”: {
“messageType”: “ASSISTANT”,
“properties”: {
“id”: “chatcmpl-B3HPw95SsqmhoWeJ8azGLxK1Vf4At”,
“role”: “ASSISTANT”,
“finishReason”: “STOP”
},
“content”: null,
“media”: []
},
“metadata”: {
“finishReason”: “STOP”,
“contentFilterMetadata”: null
}
}
}
]
根据上述的说明,帮我编写一款简单的AI对话页面

输入内容,点击发送按钮,调用业务流失请求,前端渲染页面
以html,js代码实现,css样式采用tailwind来编写
通过 const eventSource = new EventSource(ApiUrl),调用api接口
从result.output.content获取应答的文本展示,注意content可能为空
从result.metadata.finishReason获取应答的结束标志,如果是STOP,则停止请求
整体样式要求美观
结果效果也不好有bug直接用小福哥的了

V3版本

Dev Tools Supporting MCP

The following are the main code editors that support the Model Context Protocol. Click the link to visit the official website for more information.

Zed: High-performance collaborative code editor, supports MCP protocol, providing a smooth programming experience. zed.dev

Cursor: AI code editor built on VS Code, supports MCP protocol for context-aware programming. cursor.com

Windsurf: AI code editor from Codeium, integrates MCP protocol to provide intelligent code assistance. windsurf.com

Continue: Open-source AI programming assistant plugin, supports VS Code and JetBrains, compatible with MCP protocol. continue.dev

Trae: AI-driven code editor, supports MCP protocol, focusing on enhancing developer programming experience. trae.ai

View More MCP Dev Tools

Tools

No tools

Comments

Recommend MCP Servers

Tavily MCP Server The Tavily MCP server provides: search, extract, map, crawl tools Real-time web search capabilities through the tavily-search tool Intelligent data extraction from web pages via the tavily-extract tool Powerful web mapping tool that creates a structured map of website Web crawler that systematically explores websites.

MCP Server Chart This is a TypeScript-based MCP server that provides chart generation capabilities. It allows you to create various types of charts through MCP tools. You can also use it in Dify.

GitHub MCP Server MCP Server for the GitHub API, enabling file operations, repository management, search functionality, and more.

Brave Search MCP Server Web and local search using Brave's Search API

Firecrawl MCP Server Advanced web scraping with JavaScript rendering, PDF support, and smart rate limiting

Context7 MCP LLMs rely on outdated or generic information about the libraries you use. You get:

Slack MCP server Channel management and messaging capabilities

Sequential Thinking MCP Server Dynamic and reflective problem-solving through thought sequences

Fetch MCP Server A Model Context Protocol server that provides web content fetching capabilities.

Playwright MCP A Model Context Protocol (MCP) server that provides browser automation capabilities using [Playwright](https://playwright.dev). This server enables LLMs to interact with web pages through structured accessibility snapshots, bypassing the need for screenshots or visually-tuned models.

View All MCP Servers