206,846 tools. Last updated 2026-06-17 16:48

"Tools for Extracting Content from Websites and PDFs" matching MCP tools:

drive_get_fileA
google-workspace-mcp-server
Download a file from Google Drive by providing its file ID. Returns binary content for PDFs, images, and other files.
MIT
search_webA
Jina AI Remote MCP Server
Search the web for current information, news, articles, and websites to find up-to-date content, research topics, or answer questions about recent events.
Apache 2.0
read_urlA
Jina AI Remote MCP Server
Extract web page content and convert it to clean, readable markdown format for analysis, bypassing paywalls and obtaining structured text data from websites.
Apache 2.0
octagon-scraper-agentB
mcp-octagon
Extract structured financial data from investor relations websites and online sources for investment research when APIs are unavailable.
MIT
renderB
Urlbox MCP Server
Render websites to images, PDFs, HTML, or markdown with full control over viewport, content blocking, and metadata extraction.
MIT
list_machine_manualsA
Predictive Maintenance MCP Server
View a list of all machine manuals (PDFs and text files) with file names, sizes, and modification dates to identify available documentation before extracting specifications.
MIT

Matching MCP Servers

content-core
Web Scraping Multimedia Processing
lfnovo
A
license
B
quality
B
maintenance
Extract content from URLs, documents, videos, and audio files using intelligent auto-engine selection. Supports web pages, PDFs, Word docs, YouTube transcripts, and more with structured JSON responses.
Last updated 2026-05-12
1
160
MIT
MCP from Scratch Server
Developer Tools Education & Learning Tools
pguso
A
license
-
quality
D
maintenance
A fully working MCP server built from scratch in plain Node.js, implementing tools, resources, prompts, notifications, and sampling according to the MCP specification, designed to connect to Claude Desktop or any MCP client.
Last updated 2026-05-25
17
MIT

Matching MCP Connectors

Gov Uk Content
GOV.UK Content + Search APIs (every gov.uk page + full search)
Content to Social
Transform any blog post or article URL into ready-to-post social media content for Twitter/X threads, LinkedIn posts, Instagram captions, Facebook posts, and email newsletters. Pay-per-event: $0.07 for all 5 platforms, $0.03 for single platform.

source_get_contentA
NotebookLM MCP Server
Extract raw text content from PDFs, web pages, pasted text, or YouTube transcripts for content export without AI processing.
MIT
source_get_contentA
NotebookLM MCP Server
Extract raw text content from PDFs, web pages, pasted text, or YouTube transcripts for content export without AI processing.
MIT
source_get_contentA
NotebookLM MCP Server
Retrieve raw text content from PDFs, web pages, pasted text, or YouTube transcripts for content export without AI processing.
MIT
source_get_contentA
notebooklm-mcp
Retrieve raw text content from a source. Extract original indexed text from PDFs, web pages, pasted text, or YouTube transcripts for direct export.
MIT
read_websiteA
read-website-fast
Extract web content and convert it to clean Markdown for reading documentation, analyzing content, and gathering information from websites while preserving links and structure.
MIT
read_file_contentA
Custom Google Drive MCP
Retrieves content from Google Drive files by ID, extracting text from documents, spreadsheets, PDFs, and images, including files in shared drives.
MIT
fetch_urlA
MCP Starter Kit
Retrieve text content from web URLs via HTTP/HTTPS, returning response body, status code, and content type while rejecting binary files like images and PDFs.
MIT
crw_parse_fileA
crw-mcp
Parse a PDF document uploaded as base64-encoded bytes and return its content as markdown. Handles text-based PDFs but not scanned/image-only PDFs without OCR.
web_search_exaA
Exa MCP Server
Perform real-time web searches with configurable parameters to retrieve and scrape content from relevant websites for AI assistants.
MIT
read_documentA
go-docs-mcp
Read text content from PDF, TXT, MD, DOCX, or CSV files. Supports page ranges and auto-OCR for scanned PDFs.
MIT
read_websiteA
Read-Website
Extract web content and convert it to clean Markdown format for reading documentation, analyzing information, and gathering data from websites while preserving links and structure.
Apache 2.0
read_urlA
Jina AI Remote MCP Server
Extract web page content and convert it to clean markdown format for reading articles, documentation, or analyzing text from websites.
Apache 2.0
create_crawlB
olostep-mcp
Discover and scrape entire websites by following links from a starting URL to extract content in various formats for data collection.
MIT
crawling_exaB
Exa MCP Server
Extract full text content, metadata, and structured information from specific web URLs for detailed content analysis and data retrieval.

"Tools for Extracting Content from Websites and PDFs" matching MCP tools:

drive_get_fileA

search_webA

read_urlA

octagon-scraper-agentB

renderB

list_machine_manualsA

Matching MCP Servers

content-core

MCP from Scratch Server

Matching MCP Connectors

source_get_contentA

source_get_contentA

source_get_contentA

source_get_contentA

read_websiteA

read_file_contentA

fetch_urlA

crw_parse_fileA

web_search_exaA

read_documentA

read_websiteA

read_urlA

create_crawlB

crawling_exaB