MCP ExplorerExplorer

Arxiv Analyst

@devjothishon 10 months ago
1 MIT
FreeCommunity
AI Systems
#arxiv-papers#fastapi#langchain#modelcontextprotocol#streamlit
Research Assistant using MCP for ArXiv Papers

Overview

What is Arxiv Analyst

ArXiv-Analyst is a modular AI research assistant designed to assist users in finding and analyzing research papers from ArXiv using the Model Context Protocol (MCP), LangChain, FastAPI, and Streamlit.

Use cases

Use cases include finding recent papers on specific topics, conducting linguistic analysis for research purposes, summarizing large volumes of academic texts, and extracting key insights from research documents.

How to use

Users can interact with ArXiv-Analyst through a simple web interface built with Streamlit. They submit queries to search for relevant papers, and the system retrieves and analyzes the documents based on the input.

Key features

Key features include searching for research papers from ArXiv, performing linguistic analysis on documents, summarizing content, and extracting insights, all presented through an intuitive web interface.

Where to use

ArXiv-Analyst is primarily used in academic and research settings, particularly for researchers, students, and professionals who need to access and analyze scientific literature.

Content

ArXiv Analyst – A Modular AI Research Assistant using MCP & LangChain

Welcome to ArXiv Analyst, a modular research assistant built using Model Context Protocol (MCP), LangChain, FastAPI, and Streamlit. This tool can:

  • 🔍 Search for relevant research papers from ArXiv
  • 📝 Run linguistic analysis on documents
  • 📊 Summarize and extract insights
  • 🖥️ Display results through a simple web interface

🚀 Why MCP?

Model Context Protocol (MCP) is like a communication bridge that connects different AI models or tools, allowing them to collaborate without custom wiring.

In this project, MCP makes it easy to combine:

  • A paper search tool (ArXiv)
  • A document analysis tool (DocLing)

Instead of gluing everything together manually, MCP lets them work seamlessly.


DEMO SCREENSHOTS

interface-1 ss-search ss-docling

🏗 Project Overview

This system is made of 3 main parts:

1. MCP Servers

  • ArXiv Server – Retrieves papers based on search queries
  • DocLing Server – Analyzes text and extracts linguistic insights

2. FastAPI Client Server

  • Uses MultiServerMCPClient to connect to the MCP servers
  • Implements the logic for deciding which tool to use
  • Provides an API layer for the UI to talk to

3. Streamlit UI

  • Clean frontend interface
  • Lets users submit queries and view search results, summaries, and analysis interactively

⚙️ How It Works (Step by Step)

  1. User submits a query via Streamlit (e.g., “Find recent papers on GPT models”)
  2. FastAPI Client routes the query to the correct MCP server
  3. The agent invokes the right tools (search, analyze, etc.)
  4. Results are returned to the UI
  5. Streamlit displays everything neatly

🐳 Dockerized Setup

MCP services are fully containerized. No messy environments—just run with Docker.

Included Dockerfiles:

  • arxiv-server
  • docling-server
# Build the ArXiv server
docker build -t arxiv-server ./arxiv_server

# Run it
docker run -p 8080:8080 arxiv-server

Do the same for the docling-server. Once both are running, they’re accessible by the FastAPI client.

ss1 docker-ss

👨‍🔬 Why ArXiv Analyst is Useful for Researchers

If you’re a PhD student or researcher, ArXiv Analyst helps by:

  • Automating paper search and analysis
  • Reducing the time spent scanning through full papers
  • Giving fast, digestible insights for literature reviews or research planning

📂 Repo Links


🙌 Questions or Feedback?

If you try the project or want to chat about modular AI systems, feel free to reach out!

– Jothiswaran Arumugam
Jo Cloud AI Hub

Tools

No tools

Comments

Recommend MCP Servers

View All MCP Servers