Skip to main content
Glama

Florence-2 MCP Server

Python Application GitHub License pre-commit Ruff smithery badge

An MCP server for processing images using Florence-2.

You can process images or PDF files stored on a local or web server to extract text using OCR (Optical Character Recognition) or generate descriptive captions summarizing the content of the images.

Installation

For Claude Desktop

To configure this server for Claude Desktop, edit the claude_desktop_config.json file with the following entry under mcpServers:

{ "mcpServers": { "florence-2": { "command": "uvx", "args": [ "--from", "git+https://github.com/jkawamoto/mcp-florence2", "mcp-florence2" ] } } }

After editing, restart the application. For more information, see: For Claude Desktop Users - Model Context Protocol.

For Goose CLI

To enable the Bear extension in Goose CLI, edit the configuration file ~/.config/goose/config.yaml to include the following entry:

extensions: bear: name: Florence-2 cmd: uvx args: [ --from, git+https://github.com/jkawamoto/mcp-florence2, mcp-florence2 ] enabled: true type: stdio

For Goose Desktop

Add a new extension with the following settings:

  • Type: Standard IO

  • ID: florence-2

  • Name: Florence-2

  • Description: An MCP server for processing images using Florence-2

  • Command: uvx --from git+https://github.com/jkawamoto/mcp-florence2 mcp-florence2

For more details on configuring MCP servers in Goose Desktop, refer to the documentation: Using Extensions - MCP Servers.

Tools

ocr

Process an image file or URL using OCR to extract text.

Arguments:

  • src: A file path or URL to the image file that needs to be processed.

caption

Processes an image file and generates captions for the image.

Arguments:

  • src: A file path or URL to the image file that needs to be processed.

License

This application is licensed under the MIT License. See the LICENSE file for more details.

-
security - not tested
-
license - not tested
-
quality - not tested

Related MCP Servers

  • A
    security
    A
    license
    A
    quality
    This MCP server aids users in searching and analyzing their photo library by location, labels, and people, offering functionalities like photo analysis and fuzzy matching for enhanced photo management.
    Last updated -
    23
    MIT License
    • Apple
  • A
    security
    -
    license
    A
    quality
    An MCP server designed to work with FFmpeg for media processing tasks, offering enhanced performance and secure communication for handling media processing requests.
    Last updated -
    2
    19
    14
  • -
    security
    A
    license
    -
    quality
    A lightweight MCP server for image processing and cloud uploads that automates resizing, converting, optimizing, and uploading images to services like AWS S3, Cloudflare R2, and Google Cloud Storage.
    Last updated -
    3
    15
    MIT License

View all related MCP servers

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/jkawamoto/mcp-florence2'

If you have feedback or need assistance with the MCP directory API, please join our Discord server