capabilities / scrape-website
Capability
Scrape a website
Fetch and extract content from web pages.
01Best MCP servers for this capability
| Server | Status | Trust | Match |
|---|---|---|---|
| playwright-mcp Playwright MCP server | partner | 76 | 50% |
| openagent ⚡️next-generation personal AI assistant powered by LLM, RAG and agent loops, supporting computer-use, browser-use and coding agent, demo: https://demo.openagentai.org | community | 61 | 50% |
| Notte Leverage Notte Web AI agents & cloud browser sessions for scalable browser automation & scraping workflows | community | 61 | 50% |
| exa-mcp-server Exa MCP for web search and web crawling! | community | 61 | 50% |
| firecrawl-mcp-server 🔥 Official Firecrawl MCP Server - Adds powerful web scraping and search to Cursor, Claude and any other LLM clients. | community | 61 | 50% |
| Firecrawl Extract web data with [Firecrawl](https://firecrawl.dev) | community | 61 | 50% |
| Scrapling 🕷️ An adaptive Web Scraping framework that handles everything from a single request to a full-scale crawl! | community | 61 | 50% |
| firecrawl-fastmcp A TypeScript framework for building MCP servers. | community | 61 | 50% |
| vessel-browser Built from the ground-up for agents, Vessel Browser is an open source AI browser for Linux/Mac/Windows that provides a durable state, MCP control, and BYOK with full autonomous browsing. Use with Hermes Agent, OpenClaw, or connect to your favorite API provider. | community | 58 | 50% |
| mcp-omnisearch 🔍 A Model Context Protocol (MCP) server providing unified access to multiple search engines (Tavily, Brave, Kagi, Exa), AI tools (Kagi FastGPT, Exa, Linkup), and content extraction services (Firecrawl, Tavily, Kagi). Includes GitHub search. All through a single interface. | community | 58 | 50% |
| GoLogin MCP server Manage your GoLogin browser profiles and automation directly through AI conversations! | community | 55 | 50% |
| olostep-mcp-server MCP server for Olostep — the web scraping, crawling, and search infrastructure used by top AI companies. Gives any MCP-compatible AI agent the ability to scrape, crawl, batch-extract, and search the web in real time. | community | 55 | 50% |
| Browserbase Automate browser interactions in the cloud (e.g. web navigation, data extraction, form filling, and more) | community | 55 | 50% |
| anansi A self-healing web scraper built for hostile sites: selectors repair themselves, browser rendering kicks in when needed, and Chrome TLS fingerprinting evades bot detection. Ships with an MCP server so any LLM can drive a full crawl through conversation. | community | 55 | 50% |
| browser-tools-mcp Monitor browser logs directly from Cursor and other MCP compatible IDEs. | community | 55 | 50% |
| browse-mcp-proxy A proxy server for MCP Inspector that enables browser-based connections to MCP servers. | community | 53 | 50% |
| XActions ⚡ The Complete X/Twitter Automation Toolkit — Scrapers, MCP server for AI agents (Claude/GPT), CLI, browser scripts. No API fees. Open source. Unfollow people who don't follow back. Monitor real-time analytics. Auto follow, like, comment, scrape, without API. | community | 52 | 50% |
| Search1API One API for Search, Crawling, and Sitemaps | community | 52 | 50% |
| @executeautomation/playwright-mcp-server Model Context Protocol servers for Playwright | community | 51 | 50% |
| tradingview-chart-mcp MCP server that captures TradingView chart images via Selenium — supports any ticker/interval with browser pooling for concurrent performance | community | 49 | 50% |
| crawl4ai-mcp-server 🕷️ A lightweight Model Context Protocol (MCP) server that exposes Crawl4AI web scraping and crawling capabilities as tools for AI agents. Similar to Firecrawl's API but self-hosted and free. Perfect for integrating web scraping into your AI workflows with OpenAI Agents SDK, Cursor, Claude Code, and other MCP-compatible tools. | community | 49 | 50% |
| mcp-typescript-sdk A TypeScript SDK for implementing Model Context Protocol (MCP) over MQTT, supporting both browser and Node.js environments. | community | 49 | 50% |
| Scrapeless Integrate real-time [Scrapeless](https://www.scrapeless.com/en) Google SERP(Google Search, Google Flight, Google Map, Google Jobs....) results into your LLM applications. This server enables dynamic context retrieval for AI workflows, chatbots, and research tools. | community | 48 | 50% |
| Hyperbrowser [Hyperbrowser](https://www.hyperbrowser.ai/) is the next-generation platform empowering AI agents and enabling effortless, scalable browser automation. | community | 48 | 50% |
| Driflyte MCP Server for [Driflyte](https://console.driflyte.com). The Driflyte MCP Server exposes tools that allow AI assistants to query and retrieve topic-specific knowledge from recursively crawled and indexed web pages. | community | 45 | 50% |
| Crawlbase MCP Enables AI agents to access real-time web data with HTML, markdown, and screenshot support. SDKs: Node.js, Python, Java, PHP, .NET. | community | 43 | 50% |
| AnyCrawl [AnyCrawl](https://anycrawl.dev) MCP Server, Powerful web scraping and crawling for Cursor, Claude, and other LLM clients via the Model Context Protocol (MCP). | community | 41 | 50% |
| web-agent-protocol 🌐Web Agent Protocol (WAP) - Record and replay user interactions in the browser with MCP support | community | 38 | 50% |
| Scrapezy Turn websites into datasets with [Scrapezy](https://scrapezy.com) | community | 35 | 50% |
| puppeteer-mcp-server Self-hosted Puppeteer MCP server with remote SSE access, API key authentication, and Docker deployment. Complete tool suite for browser automation via Model Context Protocol. | community | 33 | 50% |
| @hisma/server-puppeteer Fork and update (v0.6.5) of the original @modelcontextprotocol/server-puppeteer MCP server for browser automation using Puppeteer. | community | 33 | 50% |
| mcp-rss-crawler RSS Crawler MCP Server | community | 29 | 50% |