Pdf Reader Mcp

1 MIT

FreeCommunity

AI Systems

A simple MCP server for reading PDFs in Claude Code

What is Pdf Reader Mcp

pdf-reader-mcp is a Model Context Protocol (MCP) server designed for reading and analyzing PDF documents using Google’s Gemini API. It facilitates AI assistants like Claude and Cursor to read, extract, and analyze PDF content directly from their interfaces.

Use cases

Use cases include extracting text from legal documents, analyzing research papers for specific data, searching for keywords in educational materials, and enabling AI assistants to provide insights from PDF reports.

How to use

To use pdf-reader-mcp, install it via Homebrew, from source, or download pre-built binaries. Once installed, integrate it with Claude Code by following the setup instructions provided in the README.

Key features

Key features include direct PDF access without manual conversion, advanced AI analysis using Gemini’s vision capabilities, text extraction from PDFs, content search within documents, seamless integration with AI assistants, and simple installation methods.

Where to use

pdf-reader-mcp can be used in various fields such as education for document analysis, legal for reviewing contracts, research for extracting data from academic papers, and any domain requiring PDF content extraction and analysis.

Clients Supporting MCP

The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.

Claude Desktop: Official desktop application from Anthropic, natively supports MCP protocol. claude.ai

Cherry Studio: Cross-platform desktop client supporting multiple LLM providers, built-in MCP server support. cherry-ai.com

LobeChat: Modern open-source ChatGPT/LLMs UI, supports MCP protocol integration. lobehub.com

DeepChat: Cross-platform desktop AI assistant, compatible with MCP protocol, focusing on privacy and efficiency. deepchat.thinkinai.xyz

5ire: Cross-platform open-source desktop intelligent assistant MCP client, supports local knowledge base and MCP server. 5ire.app

View More MCP Clients

Overview

What is Pdf Reader Mcp

Use cases

How to use

To use pdf-reader-mcp, install it via Homebrew, from source, or download pre-built binaries. Once installed, integrate it with Claude Code by following the setup instructions provided in the README.

Key features

Where to use

Clients Supporting MCP

The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.

Claude Desktop: Official desktop application from Anthropic, natively supports MCP protocol. claude.ai

Cherry Studio: Cross-platform desktop client supporting multiple LLM providers, built-in MCP server support. cherry-ai.com

LobeChat: Modern open-source ChatGPT/LLMs UI, supports MCP protocol integration. lobehub.com

DeepChat: Cross-platform desktop AI assistant, compatible with MCP protocol, focusing on privacy and efficiency. deepchat.thinkinai.xyz

5ire: Cross-platform open-source desktop intelligent assistant MCP client, supports local knowledge base and MCP server. 5ire.app

View More MCP Clients

Content

PDF Reader MCP Server

A Model Context Protocol (MCP) server for reading and analyzing PDF documents using Google’s Gemini API, written in Go. This server enables AI assistants like Claude (Code and Desktop) and Cursor to seamlessly read, extract, and analyze PDF content directly from their interfaces.

Description

The PDF Reader MCP Server acts as a bridge between AI assistants and PDF documents, allowing them to:

Read and extract text from PDF files using the pdf_read tool
Analyze PDF content with Gemini’s powerful vision capabilities using the pdf_analyze tool
Search within PDF documents for specific information using the pdf_search tool

This integration lets AI assistants like Claude access and understand PDF content without leaving their interface, creating a seamless experience for document analysis and information extraction.

Key Benefits

Direct PDF access: Read and analyze PDF documents without manual conversion
Advanced AI analysis: Leverage Gemini’s vision and language models for deep document understanding
Text extraction: Extract structured and unstructured text from PDFs
Content search: Search for specific information within PDF documents
Seamless integration: Works natively with Claude Code, Claude Desktop, and Cursor
Simple installation: Quick setup with Homebrew, Go, or pre-built binaries

Installation

Using Homebrew (macOS and Linux)

brew tap alcova-ai/tap
brew install pdf-reader-mcp

From Source

Clone the repository and build manually:

git clone https://github.com/Alcova-AI/pdf-reader-mcp.git
cd pdf-reader-mcp
go build -o pdf-reader-mcp-server .

From Binary Releases (Other platforms)

Download pre-built binaries from the releases page.

Usage

This server supports only the stdio protocol for MCP communication.

Setup with Claude Code

Adding to Claude Code:

claude mcp add-json --scope user pdf-reader-mcp '{"type":"stdio","command":"pdf-reader-mcp","env":{"GEMINI_API_KEY":"YOUR-GEMINI-API-KEY-HERE"}}'

That’s it! You can now read and analyze PDFs in Claude Code.

Setup with Claude Desktop

Adding to Claude Desktop:

Edit the Claude Desktop MCP config:

code ~/Library/Application\ Support/Claude/claude_desktop_config.json

Add the PDF Reader MCP server:

  {
    "mcpServers": {
+        "pdf-reader-mcp": {
+            "command": "pdf-reader-mcp",
+            "args": [
+                "--model",
+                "gemini-1.5-flash"
+            ],
+            "env": {
+                "GEMINI_API_KEY": "YOUR-GEMINI-API-KEY-HERE"
+            }
+        }
    }
  }

Command Line Options

--model, -m: Specify the Gemini model to use for PDF analysis (default: “gemini-1.5-flash”)
- Can also be set with the GEMINI_MODEL environment variable
--max-pages: Maximum number of pages to process at once (default: 10)
--temp-dir: Directory for temporary file storage (default: system temp)

Example:

pdf-reader-mcp --model gemini-1.5-pro --max-pages 20

Direct Execution

If you want to run the server directly (not recommended for most users):

Set your Gemini API key as an environment variable:
```
export GEMINI_API_KEY=your-api-key-here
```
Run the server:
```
pdf-reader-mcp
```

License

MIT

Dev Tools Supporting MCP

The following are the main code editors that support the Model Context Protocol. Click the link to visit the official website for more information.

Zed: High-performance collaborative code editor, supports MCP protocol, providing a smooth programming experience. zed.dev

Cursor: AI code editor built on VS Code, supports MCP protocol for context-aware programming. cursor.com

Windsurf: AI code editor from Codeium, integrates MCP protocol to provide intelligent code assistance. windsurf.com

Continue: Open-source AI programming assistant plugin, supports VS Code and JetBrains, compatible with MCP protocol. continue.dev

Trae: AI-driven code editor, supports MCP protocol, focusing on enhancing developer programming experience. trae.ai

View More MCP Dev Tools

Tools

No tools

Comments

Recommend MCP Servers

MCP Server Chart This is a TypeScript-based MCP server that provides chart generation capabilities. It allows you to create various types of charts through MCP tools. You can also use it in Dify.

GitHub MCP Server MCP Server for the GitHub API, enabling file operations, repository management, search functionality, and more.

Brave Search MCP Server Web and local search using Brave's Search API

Firecrawl MCP Server Advanced web scraping with JavaScript rendering, PDF support, and smart rate limiting

Context7 MCP LLMs rely on outdated or generic information about the libraries you use. You get:

Slack MCP server Channel management and messaging capabilities

Sequential Thinking MCP Server Dynamic and reflective problem-solving through thought sequences

Fetch MCP Server A Model Context Protocol server that provides web content fetching capabilities.

Playwright MCP A Model Context Protocol (MCP) server that provides browser automation capabilities using [Playwright](https://playwright.dev). This server enables LLMs to interact with web pages through structured accessibility snapshots, bypassing the need for screenshots or visually-tuned models.

AMap MCP Server Amap Maps is a server that supports any MCP protocol client, allowing users to easily utilize the Amap Maps MCP server for various location-based services.

View All MCP Servers