Florence-2 MCP Server

Integrations

  • Hosts the repository for the Florence-2 MCP server, allows installation via git

Florence-2 MCP Server

An MCP server for processing images using Florence-2.

You can process images or PDF files stored on a local or web server to extract text using OCR (Optical Character Recognition) or generate descriptive captions summarizing the content of the images.

Installation

For Claude Desktop

To configure this server for Claude Desktop, edit the claude_desktop_config.json file with the following entry under mcpServers:

{ "mcpServers": { "florence-2": { "command": "uvx", "args": [ "--from", "git+https://github.com/jkawamoto/mcp-florence2", "mcp-florence2" ] } } }

After editing, restart the application. For more information, see: For Claude Desktop Users - Model Context Protocol.

For Goose CLI

To enable the Bear extension in Goose CLI, edit the configuration file ~/.config/goose/config.yaml to include the following entry:

extensions: bear: name: Florence-2 cmd: uvx args: [ --from, git+https://github.com/jkawamoto/mcp-florence2, mcp-florence2 ] enabled: true type: stdio

For Goose Desktop

Add a new extension with the following settings:

  • Type: Standard IO
  • ID: florence-2
  • Name: Florence-2
  • Description: An MCP server for processing images using Florence-2
  • Command: uvx --from git+https://github.com/jkawamoto/mcp-florence2 mcp-florence2

For more details on configuring MCP servers in Goose Desktop, refer to the documentation: Using Extensions - MCP Servers.

Tools

ocr

Process an image file or URL using OCR to extract text.

Arguments:
  • src: A file path or URL to the image file that needs to be processed.

caption

Processes an image file and generates captions for the image.

Arguments:
  • src: A file path or URL to the image file that needs to be processed.

License

This application is licensed under the MIT License. See the LICENSE file for more details.

-
security - not tested
A
license - permissive license
-
quality - not tested

hybrid server

The server is able to function both locally and remotely, depending on the configuration or use case.

An MCP server for processing images using Florence-2.

  1. Installation
    1. For Claude Desktop
    2. For Goose CLI
    3. For Goose Desktop
  2. Tools
    1. ocr
    2. caption
  3. License

    Related MCP Servers

    • A
      security
      A
      license
      A
      quality
      An intelligent MCP server with a fully automated batch pipeline for web-ready images. Features include noise reduction, auto levels/curves, JPEG artifact removal, 4K resizing, smart sharpening with shadow/highlight enhancement, and advanced WebP conversion.
      Last updated -
      1
      5
      JavaScript
      MIT License
    • -
      security
      A
      license
      -
      quality
      An MCP server for analyzing images using OpenRouter vision models, offering capabilities like automatic image resizing, model configuration, and handling custom queries about images.
      Last updated -
      5
      JavaScript
      MIT License
    • A
      security
      A
      license
      A
      quality
      An MCP Server that integrates with Stability AI's API to provide high-quality image generation, editing, and manipulation capabilities including background removal, outpainting, search-and-replace, and upscaling.
      Last updated -
      13
      35
      55
      TypeScript
      MIT License
      • Apple
    • A
      security
      A
      license
      A
      quality
      An MCP server that allows users to generate images using Replicate's Stable Diffusion model and save them to the local filesystem.
      Last updated -
      3
      Python
      MIT License
      • Apple

    View all related MCP servers

    ID: mziprujksn