Mcp Fetch Node

@tgambeton a year ago

6 MIT

FreeCommunity

AI Systems

#fetch#mcp#mcp-server

A Model Context Protocol server that provides web content fetching capabilities.

What is Mcp Fetch Node

mcp-fetch-node is a Model Context Protocol server that enables web content fetching capabilities, allowing LLMs to retrieve and process content from web pages, converting HTML to markdown for easier consumption.

Use cases

Use cases include extracting information from news articles, summarizing web pages, and enabling LLMs to access real-time data from the internet for various applications.

How to use

To use mcp-fetch-node, you can run it using Node.js with the command ‘npx -y mcp-fetch-node’ or via Docker with ‘docker run -it tgambet/mcp-fetch-node’. It exposes an SSE endpoint at ‘/sse’ on port 8080 by default.

Key features

Key features include fetching and extracting relevant content from a URL, respecting robots.txt (with an option to disable), user-agent customization, markdown conversion, and pagination.

Where to use

mcp-fetch-node can be used in various fields such as web scraping, content aggregation, and any application that requires automated retrieval and processing of web content.

Clients Supporting MCP

The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.

Claude Desktop: Official desktop application from Anthropic, natively supports MCP protocol. claude.ai

Cherry Studio: Cross-platform desktop client supporting multiple LLM providers, built-in MCP server support. cherry-ai.com

LobeChat: Modern open-source ChatGPT/LLMs UI, supports MCP protocol integration. lobehub.com

DeepChat: Cross-platform desktop AI assistant, compatible with MCP protocol, focusing on privacy and efficiency. deepchat.thinkinai.xyz

5ire: Cross-platform open-source desktop intelligent assistant MCP client, supports local knowledge base and MCP server. 5ire.app

View More MCP Clients

Overview

What is Mcp Fetch Node

Use cases

Use cases include extracting information from news articles, summarizing web pages, and enabling LLMs to access real-time data from the internet for various applications.

How to use

Key features

Key features include fetching and extracting relevant content from a URL, respecting robots.txt (with an option to disable), user-agent customization, markdown conversion, and pagination.

Where to use

mcp-fetch-node can be used in various fields such as web scraping, content aggregation, and any application that requires automated retrieval and processing of web content.

Clients Supporting MCP

The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.

Claude Desktop: Official desktop application from Anthropic, natively supports MCP protocol. claude.ai

Cherry Studio: Cross-platform desktop client supporting multiple LLM providers, built-in MCP server support. cherry-ai.com

LobeChat: Modern open-source ChatGPT/LLMs UI, supports MCP protocol integration. lobehub.com

DeepChat: Cross-platform desktop AI assistant, compatible with MCP protocol, focusing on privacy and efficiency. deepchat.thinkinai.xyz

5ire: Cross-platform open-source desktop intelligent assistant MCP client, supports local knowledge base and MCP server. 5ire.app

View More MCP Clients

Content

Fetch MCP Server

A port of the official Fetch MCP Server for Node.js. Please check the key differences with original project section for more details.

Description

A Model Context Protocol server that provides web content fetching capabilities. This server enables LLMs to retrieve and process content from web pages, converting HTML to markdown for easier consumption.

The fetch tool will truncate the response, but by using the start_index argument, you can specify where to start the content extraction. This lets models read a webpage in chunks, until they find the information they need.

Available Tools

fetch - Fetches a URL from the internet and extracts its contents as markdown.
- url (string, required): URL to fetch
- max_length (integer, optional): Maximum number of characters to return (default: 5000)
- start_index (integer, optional): Start content from this character index (default: 0)
- raw (boolean, optional): Get raw content without markdown conversion (default: false)

Available Prompts

fetch - Fetch a URL and extract its contents as markdown
- url (string, required): URL to fetch

Usage

mcp-fetch-node exposes an SSE endpoint at /sse on port 8080 by default.

Node.js:

npx -y mcp-fetch-node

Docker:

docker run -it tgambet/mcp-fetch-node

Customization - robots.txt

By default, the server will obey a websites robots.txt file if the request came from the model (via a tool), but not if the request was user initiated (via a prompt). This can be disabled by adding the argument --ignore-robots-txt to the run command.

Customization - User-agent

By default, depending on if the request came from the model (via a tool), or was user initiated (via a prompt), the server will use either the user-agent

# Tool call
ModelContextProtocol/1.0 (Autonomous; +https://github.com/tgambet/mcp-fetch-node)

# Prompt
ModelContextProtocol/1.0 (User-Specified; +https://github.com/tgambet/mcp-fetch-node)

This can be customized by adding the argument --user-agent=YourUserAgent to the run command, which will override both.

Key differences with the original project

This implementation is written in TypeScript and targets the Node.js runtime.
It is suited for situations where python is not available.
This implementation provides an SSE interface instead of stdio.
It is more suitable for deployment as a web service, increasing flexibility.
This implementation does not rely on Readability.js library for content extraction.
It uses a custom implementation that is more generic and suited for websites other that news-related ones.

The api and tool description is, however, the same as the original project so you can try mcp-fetch-node as a drop-in replacement for the original project.

Please report any issue to the issue tracker.

Features

Fetch and extract relevant content from a URL
Respect robots.txt (can be disabled)
User-Agent customization
Markdown conversion
Pagination

Development

pnpm install
pnpm dev
pnpm lint:fix
pnpm format
pnpm test
pnpm build
pnpm start
pnpm inspect

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

License

MIT

TODO

[ ] Add user logs and progress
[ ] Add documentation & examples
[ ] Performance benchmarks and improvements
[ ] Benchmarks for extraction quality: cf https://github.com/adbar/trafilatura/blob/master/tests/comparison_small.py

Dev Tools Supporting MCP

The following are the main code editors that support the Model Context Protocol. Click the link to visit the official website for more information.

Zed: High-performance collaborative code editor, supports MCP protocol, providing a smooth programming experience. zed.dev

Cursor: AI code editor built on VS Code, supports MCP protocol for context-aware programming. cursor.com

Windsurf: AI code editor from Codeium, integrates MCP protocol to provide intelligent code assistance. windsurf.com

Continue: Open-source AI programming assistant plugin, supports VS Code and JetBrains, compatible with MCP protocol. continue.dev

Trae: AI-driven code editor, supports MCP protocol, focusing on enhancing developer programming experience. trae.ai

View More MCP Dev Tools

Tools

No tools

Comments

Recommend MCP Servers

Tavily MCP Server The Tavily MCP server provides: search, extract, map, crawl tools Real-time web search capabilities through the tavily-search tool Intelligent data extraction from web pages via the tavily-extract tool Powerful web mapping tool that creates a structured map of website Web crawler that systematically explores websites.

MCP Server Chart This is a TypeScript-based MCP server that provides chart generation capabilities. It allows you to create various types of charts through MCP tools. You can also use it in Dify.

GitHub MCP Server MCP Server for the GitHub API, enabling file operations, repository management, search functionality, and more.

Brave Search MCP Server Web and local search using Brave's Search API

Firecrawl MCP Server Advanced web scraping with JavaScript rendering, PDF support, and smart rate limiting

Context7 MCP LLMs rely on outdated or generic information about the libraries you use. You get:

Slack MCP server Channel management and messaging capabilities

Sequential Thinking MCP Server Dynamic and reflective problem-solving through thought sequences

Fetch MCP Server A Model Context Protocol server that provides web content fetching capabilities.

Playwright MCP A Model Context Protocol (MCP) server that provides browser automation capabilities using [Playwright](https://playwright.dev). This server enables LLMs to interact with web pages through structured accessibility snapshots, bypassing the need for screenshots or visually-tuned models.

View All MCP Servers