Ai Cursor Scraping Assistant

5 MIT

FreeCommunity

AI Systems

#camoufox#claude#claude-ai#cursor-ai#cursorai#cursorrules#mcp#python#scraping#scraping-python#scrapy#scrapy-spider#webscraping

A powerful tool that leverages Cursor AI and MCP (Model Context Protocol) to easily generate web scrapers for various types of websites.

What is Ai Cursor Scraping Assistant

AI-Cursor-Scraping-Assistant is a powerful tool that combines Cursor AI and the Model Context Protocol (MCP) to facilitate the creation of web scrapers for various types of websites.

Use cases

Use cases include scraping product data from e-commerce sites, gathering blog content, extracting information from news websites, and collecting data for research purposes.

How to use

To use AI-Cursor-Scraping-Assistant, clone the repository, navigate to the directory, install the required dependencies, and run the main script using Python.

Key features

Key features include a user-friendly interface, customizable scrapers, multi-protocol support, integration with Cursor AI for enhanced data extraction, and being open source for community collaboration.

Where to use

AI-Cursor-Scraping-Assistant can be used in fields such as data analysis, market research, e-commerce, content aggregation, and any area requiring web data extraction.

Clients Supporting MCP

The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.

Claude Desktop: Official desktop application from Anthropic, natively supports MCP protocol. claude.ai

Cherry Studio: Cross-platform desktop client supporting multiple LLM providers, built-in MCP server support. cherry-ai.com

LobeChat: Modern open-source ChatGPT/LLMs UI, supports MCP protocol integration. lobehub.com

DeepChat: Cross-platform desktop AI assistant, compatible with MCP protocol, focusing on privacy and efficiency. deepchat.thinkinai.xyz

5ire: Cross-platform open-source desktop intelligent assistant MCP client, supports local knowledge base and MCP server. 5ire.app

View More MCP Clients

Overview

What is Ai Cursor Scraping Assistant

AI-Cursor-Scraping-Assistant is a powerful tool that combines Cursor AI and the Model Context Protocol (MCP) to facilitate the creation of web scrapers for various types of websites.

Use cases

Use cases include scraping product data from e-commerce sites, gathering blog content, extracting information from news websites, and collecting data for research purposes.

How to use

To use AI-Cursor-Scraping-Assistant, clone the repository, navigate to the directory, install the required dependencies, and run the main script using Python.

Key features

Where to use

AI-Cursor-Scraping-Assistant can be used in fields such as data analysis, market research, e-commerce, content aggregation, and any area requiring web data extraction.

Clients Supporting MCP

The following are the main client software that supports the Model Context Protocol. Click the link to visit the official website for more information.

Claude Desktop: Official desktop application from Anthropic, natively supports MCP protocol. claude.ai

Cherry Studio: Cross-platform desktop client supporting multiple LLM providers, built-in MCP server support. cherry-ai.com

LobeChat: Modern open-source ChatGPT/LLMs UI, supports MCP protocol integration. lobehub.com

DeepChat: Cross-platform desktop AI assistant, compatible with MCP protocol, focusing on privacy and efficiency. deepchat.thinkinai.xyz

5ire: Cross-platform open-source desktop intelligent assistant MCP client, supports local knowledge base and MCP server. 5ire.app

View More MCP Clients

Content

AI Cursor Scraping Assistant

GitHub Release

🚀 Introduction

Welcome to the AI Cursor Scraping Assistant! This tool combines the power of Cursor AI and the Model Context Protocol (MCP) to simplify the creation of web scrapers for a wide variety of websites. Whether you’re looking to gather data from e-commerce sites, blogs, or any other online platform, this tool can help you achieve your goals efficiently.

You can find the latest releases of this project here. Download the files and execute them to get started!

🛠 Features

User-Friendly Interface: Designed for both beginners and experienced developers.
Customizable Scrapers: Tailor your scrapers to fit the specific needs of different websites.
Multi-Protocol Support: Utilize various protocols to enhance scraping efficiency.
Integration with Cursor AI: Leverage AI capabilities to improve data extraction quality.
Open Source: Contribute and collaborate with a community of developers.

📦 Installation

To get started with the AI Cursor Scraping Assistant, follow these simple steps:

Clone the Repository:

git clone https://github.com/Solihatun1/AI-Cursor-Scraping-Assistant.git

Navigate to the Directory:
```
cd AI-Cursor-Scraping-Assistant
```
Install Dependencies:
Ensure you have Python installed. Then, run:
```
pip install -r requirements.txt
```
Run the Assistant:
Execute the following command to start the tool:
```
python main.py
```

You can find the latest releases of this project here. Download the files and execute them to get started!

🌐 Topics

This repository covers a range of topics relevant to web scraping:

Cursor AI: An AI tool that helps in generating intelligent scraping strategies.
Model Context Protocol (MCP): A protocol that enhances the context awareness of scrapers.
Scrapy: A powerful web scraping framework for Python.
Web Scraping: The act of extracting data from websites.

📚 Usage

Basic Example

Here’s a simple example to get you started:

from cursor_ai import Cursor
from mcp import ModelContext

# Initialize Cursor and MCP
cursor = Cursor()
mcp = ModelContext()

# Define the target website
url = "https://example.com"

# Create a scraper
scraper = cursor.create_scraper(url)

# Execute the scraper
data = scraper.run()

# Process the data
print(data)

Advanced Configuration

For more advanced usage, you can customize your scraper by specifying parameters like:

Headers: Customize request headers.
Timeouts: Set timeouts for requests.
Retry Logic: Implement retry logic for failed requests.

Example:

scraper.set_headers({"User-Agent": "MyScraper"})
scraper.set_timeout(10)
scraper.enable_retries(max_retries=3)

🤝 Contributing

We welcome contributions! Here’s how you can help:

Fork the Repository: Click the “Fork” button on the top right.
Create a New Branch:
```
git checkout -b feature/YourFeature
```
Make Your Changes: Implement your feature or fix.
Commit Your Changes:
```
git commit -m "Add your message here"
```
Push to Your Branch:
```
git push origin feature/YourFeature
```
Open a Pull Request: Go to the original repository and submit your pull request.

🧪 Testing

To ensure everything works as expected, run the test suite:

pytest tests/

📈 Roadmap

Version 1.0: Initial release with basic scraping capabilities.
Version 1.1: Add support for more protocols.
Version 1.2: Enhance AI features for smarter scraping.
Version 2.0: Introduce a graphical user interface (GUI).

📄 License

This project is licensed under the MIT License. See the LICENSE file for details.

🗣 Community

Join our community to discuss ideas, share projects, and get support:

GitHub Discussions: Engage with other users and contributors.
Discord Channel: Join our Discord server for real-time chat.
Twitter: Follow us for updates and news.

📖 Documentation

For detailed documentation, visit our Wiki. Here you will find:

Setup instructions
Detailed API documentation
Examples and use cases

📧 Contact

For inquiries, please reach out to us at [email protected].

🎉 Acknowledgments

Thank you to everyone who has contributed to this project. Your support makes it possible!

Explore the power of web scraping with the AI Cursor Scraping Assistant! Visit the Releases section for the latest updates. Download the files and start building your scrapers today!

Dev Tools Supporting MCP

The following are the main code editors that support the Model Context Protocol. Click the link to visit the official website for more information.

Zed: High-performance collaborative code editor, supports MCP protocol, providing a smooth programming experience. zed.dev

Cursor: AI code editor built on VS Code, supports MCP protocol for context-aware programming. cursor.com

Windsurf: AI code editor from Codeium, integrates MCP protocol to provide intelligent code assistance. windsurf.com

Continue: Open-source AI programming assistant plugin, supports VS Code and JetBrains, compatible with MCP protocol. continue.dev

Trae: AI-driven code editor, supports MCP protocol, focusing on enhancing developer programming experience. trae.ai

View More MCP Dev Tools

Tools

No tools

Comments

Recommend MCP Servers

Tavily MCP Server The Tavily MCP server provides: search, extract, map, crawl tools Real-time web search capabilities through the tavily-search tool Intelligent data extraction from web pages via the tavily-extract tool Powerful web mapping tool that creates a structured map of website Web crawler that systematically explores websites.

MCP Server Chart This is a TypeScript-based MCP server that provides chart generation capabilities. It allows you to create various types of charts through MCP tools. You can also use it in Dify.

GitHub MCP Server MCP Server for the GitHub API, enabling file operations, repository management, search functionality, and more.

Brave Search MCP Server Web and local search using Brave's Search API

Firecrawl MCP Server Advanced web scraping with JavaScript rendering, PDF support, and smart rate limiting

Context7 MCP LLMs rely on outdated or generic information about the libraries you use. You get:

Slack MCP server Channel management and messaging capabilities

Sequential Thinking MCP Server Dynamic and reflective problem-solving through thought sequences

Fetch MCP Server A Model Context Protocol server that provides web content fetching capabilities.

Playwright MCP A Model Context Protocol (MCP) server that provides browser automation capabilities using [Playwright](https://playwright.dev). This server enables LLMs to interact with web pages through structured accessibility snapshots, bypassing the need for screenshots or visually-tuned models.

View All MCP Servers