How to extract text from images

Why this server?
Provides screenshot and OCR capabilities for macOS, which can be used to extract text from images.
mcp-screenshot
Image & Video Processing, OS Automation
kazuph
License: A, Quality: A, Maintenance: C
Provides screenshot and OCR capabilities for macOS.
Last updated 2026-06-12
1
9
23
MIT
Why this server?
Enables AI assistants to interact with Obsidian vaults, providing tools for reading, creating, editing, and managing notes and tags. Potentially useful if the images are embedded in notes in an Obsidian vault.
obsidian-mcp
Note Taking, Developer Tools
StevenStavrakis
License: A, Quality: -, Maintenance: F
Enables AI assistants to interact with Obsidian vaults, providing tools for reading, creating, editing, and managing notes and tags.
Last updated 2025-06-23
3,296
719
MIT
Why this server?
Model Context Protocol server for fetching web content and processing images. This allows Claude Desktop (or any MCP client) to fetch web content and handle images appropriately.
@kazuph/mcp-fetch
Browser Automation, Image & Video Processing
kazuph
License: A, Quality: A, Maintenance: A
Last updated 2026-06-12
1
1,080
40
MIT
Why this server?
Enables capturing screenshots of web pages and local HTML files through a simple MCP tool interface using Puppeteer with configurable options for dimensions and output paths.
MCP Screenshot Server
Browser Automation, Web Scraping
sethbang
License: A, Quality: A, Maintenance: A
Last updated 2026-07-27
2
99
27
Apache 2.0
Why this server?
A computer vision service that allows Claude to perform object detection, segmentation, classification, and real-time camera analysis using state-of-the-art YOLO models. Text recognition is not among the listed capabilities, but detection could help locate text regions in an image before a separate OCR step.
YOLO MCP Server
Image & Video Processing, Autonomous Agents
GongRzhe
License: A, Quality: -, Maintenance: F
A computer vision service that allows Claude to perform object detection, segmentation, classification, and real-time camera analysis using state-of-the-art YOLO models.
Last updated 2025-03-11
38
MIT
Why this server?
A server providing text-to-speech and speech-to-text capabilities using Windows' native speech services without external dependencies. It does not process images, but could serve as an auxiliary service, for example to read extracted text aloud.
MS-Lucidia-Voice-Gateway-MCP
Speech Processing, Text-to-Speech
TheLucidTech
License: F, Quality: -, Maintenance: F
A server providing text-to-speech and speech-to-text capabilities using Windows' native speech services without external dependencies.
Last updated 2025-01-17
6
Why this server?
Helps refine AI-generated content to sound more natural and human-like. Built with advanced AI detection and text enhancement capabilities. It could be used to post-process the extracted text.
AI Humanizer MCP Server
Text Summarization, App Automation
Text2Go
License: A, Quality: C, Maintenance: D
Helps refine AI-generated content to sound more natural and human-like. Built with advanced AI detection and text enhancement capabilities.
Last updated 2024-12-27
1
47
57
MIT
Why this server?
A Model Context Protocol server that provides a tool to extract text content from local PDF files, supporting both standard PDF reading and OCR capabilities with optional page selection.
MCP PDF Server
Image & Video Processing, Text Summarization, Multimedia Processing
DeepSeekMine
License: F, Quality: -, Maintenance: D
A PDF processing server that extracts text via normal parsing or OCR, and retrieves images from PDF files through the MCP protocol with a built-in web debugger.
Last updated 2025-04-28
36