- Explore MCP Servers
- mcp-doc-scraper
Mcp Doc Scraper
What is Mcp Doc Scraper
mcp-doc-scraper is a Model Context Protocol (MCP) server designed to scrape documentation from web URLs and convert it into markdown format using jina.ai’s conversion service.
Use cases
Use cases include generating markdown files from online API documentation, converting user manuals from HTML to markdown, and creating documentation for software projects from web sources.
How to use
To use mcp-doc-scraper, install it via Smithery or clone the repository. After setting up a virtual environment and installing dependencies, run the server using Python. Use the ‘scrape_docs’ tool to input a URL and specify an output path for the markdown file.
Key features
Key features include scraping documentation from any web URL, converting HTML documentation to markdown format, saving the converted documentation to a specified output path, and integration with the Model Context Protocol (MCP).
Where to use
mcp-doc-scraper can be used in software development, technical writing, and any field that requires converting web-based documentation into a more manageable markdown format.
Clients Supporting MCP
The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.
Overview
What is Mcp Doc Scraper
mcp-doc-scraper is a Model Context Protocol (MCP) server designed to scrape documentation from web URLs and convert it into markdown format using jina.ai’s conversion service.
Use cases
Use cases include generating markdown files from online API documentation, converting user manuals from HTML to markdown, and creating documentation for software projects from web sources.
How to use
To use mcp-doc-scraper, install it via Smithery or clone the repository. After setting up a virtual environment and installing dependencies, run the server using Python. Use the ‘scrape_docs’ tool to input a URL and specify an output path for the markdown file.
Key features
Key features include scraping documentation from any web URL, converting HTML documentation to markdown format, saving the converted documentation to a specified output path, and integration with the Model Context Protocol (MCP).
Where to use
mcp-doc-scraper can be used in software development, technical writing, and any field that requires converting web-based documentation into a more manageable markdown format.
Clients Supporting MCP
The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.
Content
Doc Scraper MCP Server
A Model Context Protocol (MCP) server that provides documentation scraping functionality. This server converts web-based documentation into markdown format using jina.ai’s conversion service.
Features
- Scrapes documentation from any web URL
- Converts HTML documentation to markdown format
- Saves the converted documentation to a specified output path
- Integrates with the Model Context Protocol (MCP)
Installation
Installing via Smithery
To install Doc Scraper for Claude Desktop automatically via Smithery:
npx -y @smithery/cli install @askjohngeorge/mcp-doc-scraper --client claude
- Clone the repository:
git clone https://github.com/askjohngeorge/mcp-doc-scraper.git
cd mcp-doc-scraper
- Create and activate a virtual environment:
python -m venv venv
source venv/bin/activate # On Windows, use: venv\Scripts\activate
- Install the dependencies:
pip install -e .
Usage
The server can be run using Python:
python -m mcp_doc_scraper
Tool Description
The server provides a single tool:
- Name:
scrape_docs - Description: Scrape documentation from a URL and save as markdown
- Input Parameters:
url: The URL of the documentation to scrapeoutput_path: The path where the markdown file should be saved
Project Structure
doc_scraper/ ├── __init__.py ├── __main__.py └── server.py
Dependencies
- aiohttp
- mcp
- pydantic
Development
To set up the development environment:
- Install development dependencies:
pip install -r requirements.txt
- The server uses the Model Context Protocol. Make sure to familiarize yourself with MCP documentation.
License
MIT License
Dev Tools Supporting MCP
The following are the main code editors that support the Model Context Protocol. Click the link to visit the official website for more information.











