The Florence-2 MCP Server processes images and PDF files by:
OCR: Extracting text from images or PDFs provided via file path or URL
Caption Generation: Creating descriptive captions for images provided via file path or URL
Hosts the repository for the Florence-2 MCP server, allows installation via git
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@Florence-2 MCP Serverextract text from this screenshot of a document"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
Florence-2 MCP Server
An MCP server for processing images using Florence-2.
You can process images or PDF files stored on a local or web server to extract text using OCR (Optical Character Recognition) or generate descriptive captions summarizing the content of the images.
Installation
Claude
Download the latest MCP bundle mcp-florence2.mcpb from
the Releases page,
then open the downloaded .mcpb file or drag it into the Claude Desktop's Settings window.
You can also manually configure this server for Claude Desktop.
Edit the claude_desktop_config.json file by adding the following entry under mcpServers:
{
"mcpServers": {
"florence-2": {
"command": "uvx",
"args": [
"--from",
"git+https://github.com/jkawamoto/mcp-florence2",
"mcp-florence2"
]
}
}
}After editing, restart the application.
For more information, see: Connect to local MCP servers - Model Context Protocol.
goose
Open this link
goose://extension?cmd=uvx&arg=--from&arg=git%2Bhttps%3A%2F%2Fgithub.com%2Fjkawamoto%2Fmcp-florence2&arg=mcp-florence2&id=florence2&name=Florence-2&description=An%20MCP%20server%20for%20processing%20images%20using%20Florence-2to launch the installer, then click "Yes" to confirm the installation.
You can also directly edit the config file (~/.config/goose/config.yaml) to include the following entry:
extensions:
florence2:
name: Florence-2
cmd: uvx
args: [ --from, git+https://github.com/jkawamoto/mcp-florence2, mcp-florence2 ]
enabled: true
type: stdioFor more details on configuring MCP servers in Goose, refer to the documentation: Using Extensions | goose.
LM Studio
To configure this server for LM Studio, click the button below.
Related MCP server: FFmpeg MCP Server
Tools
ocr
Process an image file or URL using OCR to extract text.
Arguments:
src: A file path or URL to the image file that needs to be processed.
caption
Processes an image file and generates captions for the image.
Arguments:
src: A file path or URL to the image file that needs to be processed.
License
This application is licensed under the MIT License. See the LICENSE file for more details.