Skip to main content
Glama

Perception-MCP

by lintyourcode

Perception-MCP

A lightweight Model Context Protocol (MCP) server that lets you ask any question about an image, audio, or video file and returns an answer powered by state-of-the-art multimodal models served through fal.ai.

Prerequisites

Installation

# Clone and install dependencies $ git clone https://github.com/lintyourcode/perception-mcp.git $ cd perception-mcp $ poetry install --no-root

Usage

Add Perception-MCP to Claude Desktop (v0.3.7+) by adding the following to your claude_desktop_config.json file:

{ "mcpServers": { "perception-mcp": { "command": "fastmcp", "args": ["run", "perception-mcp", "serve"], "env": { "FAL_KEY": "<your-fal-key>" } } } }

Development

Running the test suite

The project uses pytest with pytest-asyncio to handle FastMCP’s async helpers.

# Install runtime + dev dependencies $ poetry install --with dev --no-root # Run the tests $ poetry run pytest -q

All tests should pass, verifying that the server registers its tools and that the current stubs raise NotImplementedError until implemented.

-
security - not tested
F
license - not found
-
quality - not tested

remote-capable server

The server can be hosted and run remotely because it primarily relies on remote services or has no dependency on the local environment.

Enables asking questions about image, audio, or video files using state-of-the-art multimodal models. Powered by fal.ai for advanced media analysis and understanding capabilities.

  1. Prerequisites
    1. Installation
      1. Usage
        1. Development
          1. Running the test suite

        Related MCP Servers

        • -
          security
          A
          license
          -
          quality
          A powerful server that integrates the Moondream vision model to enable advanced image analysis, including captioning, object detection, and visual question answering, through the Model Context Protocol, compatible with AI assistants like Claude and Cline.
          Last updated -
          17
          JavaScript
          Apache 2.0
        • A
          security
          A
          license
          A
          quality
          Enables querying documents through a Langflow backend using natural language questions, providing an interface to interact with Langflow document Q\&A flows.
          Last updated -
          1
          14
          JavaScript
          MIT License
          • Apple
        • -
          security
          F
          license
          -
          quality
          Interfaces with the Perplexity AI API to provide advanced question answering capabilities through the standardized Model Context Protocol, supporting multiple Perplexity models.
          Last updated -
          JavaScript
        • -
          security
          F
          license
          -
          quality
          Provides chat and image analysis capabilities through OpenRouter.ai's diverse model ecosystem, enabling both text conversations and powerful multimodal image processing with various AI models.
          Last updated -
          16
          8
          TypeScript
          • Apple
          • Linux

        View all related MCP servers

        MCP directory API

        We provide all the information about MCP servers via our MCP API.

        curl -X GET 'https://glama.ai/api/mcp/v1/servers/lintyourcode/perception-mcp'

        If you have feedback or need assistance with the MCP directory API, please join our Discord server