Skip to main content
Glama

Perception-MCP

A lightweight Model Context Protocol (MCP) server that lets you ask any question about an image, audio, or video file and returns an answer powered by state-of-the-art multimodal models served through fal.ai.

Prerequisites

Related MCP server: Langflow Document Q&A Server

Installation

git clone --recurse-submodules https://github.com/lintyourcode/perception-mcp.git cd perception-mcp cp mcp_agent.secrets_template.yaml mcp_agent.secrets.yaml $EDITOR mcp_agent.secrets.yaml

Usage

Add Perception-MCP to Claude Desktop (v0.3.7+) by adding the following to your claude_desktop_config.json file:

{ "mcpServers": { "perception-mcp": { "command": "fastmcp", "args": ["run", "perception-mcp", "serve"] } } }

Tools

Perception-MCP provides the following tools:

  • query_image: Answer a question about an image's contents

  • query_audio: Answer a question about an audio file's contents

  • query_video: Answer a question about a video's contents

Development

Running tests

uv run pytest -q
-
security - not tested
F
license - not found
-
quality - not tested

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/lintyourcode/perception-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server