Tools and Methods for Extracting Data from PDFs and Images

Search for:

Tools and Methods for Extracting Data from PDFs and Images

View all MCP Servers

Why this server?
Enables integration with Google Drive for listing, reading, and searching over files, supporting various file types with automatic export for Google Workspace files. This is useful for accessing PDF and image data stored in Google Drive.
Google Drive MCP Server
Cloud Storage File Systems Developer Tools
felores
A
license
-
quality
F
maintenance
Enables integration with Google Drive for listing, reading, and searching over files, supporting various file types with automatic export for Google Workspace files.
Last updated 2025-11-07
5,577
72
MIT
Why this server?
Provides RAG capabilities for semantic document search using Qdrant vector database and Ollama/OpenAI embeddings, allowing users to add, search, list, and delete documentation with metadata support. This can be used to manage and extract data from PDF documentation.
RagDocs MCP Server
RAG Systems Vector Databases Web Scraping
heltonteixeira
A
license
-
quality
-
maintenance
Provides RAG capabilities for semantic document search using Qdrant vector database and Ollama/OpenAI embeddings, allowing users to add, search, list, and delete documentation with metadata support.
Last updated 2025-01-05
35
16
Why this server?
A server that provides document processing capabilities using the Model Context Protocol, allowing conversion of documents to markdown, extraction of tables, and processing of document images.
MCP Docling Server
Documentation Access Text Summarization
zanetworker
A
license
-
quality
D
maintenance
A server that provides document processing capabilities using the Model Context Protocol, allowing conversion of documents to markdown, extraction of tables, and processing of document images.
Last updated 2025-04-05
19
MIT
Why this server?
A powerful Model Context Protocol framework that extends Cursor IDE with tools for web content retrieval, PDF processing, and Word document parsing.
MCP Development Framework
Developer Tools Documentation Access Web Scraping
aigo666
A
license
C
quality
C
maintenance
A powerful Model Context Protocol framework that extends Cursor IDE with tools for web content retrieval, PDF processing, and Word document parsing.
Last updated 2025-05-15
8
17
MIT
Why this server?
Provides tools for reading and extracting text from PDF files, supporting both local files and URLs.
PDF Reader MCP Server
File Systems App Automation
trafflux
F
license
-
quality
D
maintenance
Provides tools for reading and extracting text from PDF files, supporting both local files and URLs.
Last updated 2025-02-20
46
Why this server?
Provides HTML file preview and analysis capabilities. This server enables capturing full-page screenshots of local HTML files and analyzing their structure, allowing extraction of image data.
MCP File Preview Server
File Systems
seanivore
A
license
B
quality
F
maintenance
Provides HTML file preview and analysis capabilities. This server enables capturing full-page screenshots of local HTML files and analyzing their structure.
Last updated 2025-11-29
2
24
MIT
Why this server?
A zero-configuration tool that automatically exposes FastAPI endpoints as Model Context Protocol (MCP) tools, allowing LLM systems like Claude to interact with your API without additional coding. You could build an API that takes a PDF and extracts data, and expose that to Claude.
FastAPI-MCP
Developer Tools Code Execution API Testing
tadata-org
A
license
-
quality
D
maintenance
A zero-configuration tool that automatically exposes FastAPI endpoints as Model Context Protocol (MCP) tools, allowing LLM systems like Claude to interact with your API without additional coding.
Last updated 2025-11-24
11,966
MIT
Why this server?
Enables semantic search, image search, and cross-modal search functionalities through integration with Jina AI's neural search capabilities. Could be used to search for images inside PDFs.
Jina AI MCP Server
Search Vector Databases Image & Video Processing
Sheshiyer
A
license
-
quality
D
maintenance
Enables semantic search, image search, and cross-modal search functionalities through integration with Jina AI's neural search capabilities.
Last updated 2025-01-08
5
MIT
Why this server?
The Box MCP Server facilitates searching and reading PDF and Word files in Box using Developer Token authentication.
Box MCP Server
Search File Systems Cloud Storage
hmk
A
license
-
quality
D
maintenance
The Box MCP Server facilitates searching and reading PDF and Word files in Box using Developer Token authentication.
Last updated 2025-08-22
131
11
BSD 3-Clause