Provides multimodal analysis of image, audio, and video files, using Perplexity's API to answer questions about media content.

How do I use Perception-MCP?

1. Click on "Install Server".
2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@Perception-MCP what's happening in this video clip?"

That's it! The server will respond to your query, and you can continue using it as needed.

Perception-MCP

A lightweight Model Context Protocol (MCP) server that lets you ask any question about an image, audio, or video file and returns an answer powered by state-of-the-art multimodal models served through fal.ai.

Prerequisites

Python 3.11+
uv
A fal.ai account & API key
A Perplexity account & API key
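
The two locally installable prerequisites above can be checked before cloning. The following is a minimal sketch, not part of the project: the helper name `check_prerequisites` is hypothetical, and it assumes `uv` is expected on your `PATH`. It does not validate the fal.ai or Perplexity API keys.

```python
import shutil
import sys

# Minimum interpreter version, taken from the Prerequisites list above.
MIN_PYTHON = (3, 11)


def check_prerequisites() -> dict:
    """Report which locally checkable prerequisites are satisfied.

    Hypothetical helper for illustration only; API keys are not checked.
    """
    return {
        # True when the running interpreter is Python 3.11 or newer.
        "python": sys.version_info >= MIN_PYTHON,
        # True when a `uv` executable can be found on PATH.
        "uv": shutil.which("uv") is not None,
    }


if __name__ == "__main__":
    for name, ok in check_prerequisites().items():
        print(f"{name}: {'ok' if ok else 'missing'}")
```

Run it with the interpreter you intend to use for the server, since the version check applies to whichever Python executes the script.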

Installation

git clone --recurse-submodules https://github.com/lintyourcode/perception-mcp.git
cd perception-mcp
cp mcp_agent.secrets_template.yaml mcp_agent.secrets.yaml
$EDITOR mcp_agent.secrets.yaml