WAHA WhatsApp MCP Server

by Maheidem

Download WhatsApp Media

whatsapp_download_media
Read-only, Idempotent

Download media (image, audio, video, document) attached to a WhatsApp message. Use after reading messages to fetch media by contact and message IDs.

Instructions

Download media (image, audio, video, document) from a WhatsApp message.

Fetches the media attached to a specific message and returns it directly:

  • Images: returned inline so Claude can see them

  • Audio: returned inline so Claude can process it

  • Video/documents: saved to /tmp/whatsapp-media/ and path returned

Use whatsapp_read_messages first to find messages with hasMedia=true, then pass the contactId and messageId here.

Note: Old media (weeks+) may no longer be available on WhatsApp's servers.
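The two-step workflow above (read messages, keep those with hasMedia=true, then call this tool) could be sketched as follows. This is only an illustration: the description names hasMedia, contactId, and messageId, but the message shape, the "id" field, and the build_download_calls helper are assumptions made for the example.

```python
from typing import Any


def build_download_calls(contact_id: str, messages: list[dict[str, Any]]) -> list[dict[str, str]]:
    """Build argument dicts for whatsapp_download_media from read results.

    Assumes each message is a dict with a hypothetical "id" field and the
    "hasMedia" flag mentioned in the tool description.
    """
    calls = []
    for message in messages:
        # Skip text-only messages; only media messages can be downloaded.
        if not message.get("hasMedia"):
            continue
        calls.append({"contactId": contact_id, "messageId": message["id"]})
    return calls


# Example input with made-up IDs, standing in for whatsapp_read_messages output.
sample_messages = [
    {"id": "MSG_A", "hasMedia": True},
    {"id": "MSG_B", "hasMedia": False},
    {"id": "MSG_C", "hasMedia": True},
]
calls = build_download_calls("15551234567", sample_messages)
```

Each resulting dict carries exactly the two required arguments, so it can be passed as the arguments of one tool call.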

Args:

  • contactId: Phone digits or "*@g.us" — the contact whose chat contains the message

  • messageId: Message ID with media to download (from whatsapp_read_messages)
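A caller may want to sanity-check contactId before invoking the tool. The sketch below assumes that "phone digits" means a non-empty run of ASCII digits and that "*@g.us" means any identifier ending in "@g.us"; neither rule is confirmed by the server, and is_plausible_contact_id is a hypothetical helper.

```python
import re

# Assumption: "phone digits" means one or more ASCII digits, and "*@g.us"
# means any non-empty identifier followed by the "@g.us" suffix.
_PHONE_RE = re.compile(r"[0-9]+")
_GROUP_RE = re.compile(r"[^@\s]+@g\.us")


def is_plausible_contact_id(contact_id: str) -> bool:
    """Return True if contact_id matches one of the two documented forms."""
    return bool(_PHONE_RE.fullmatch(contact_id) or _GROUP_RE.fullmatch(contact_id))
```

A check like this only catches obviously malformed input; the server remains the authority on which IDs are valid.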

Returns:

  • For images: the image content directly (Claude can see it)

  • For audio: the audio content directly

  • For other files: file path where it was saved + metadata
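The return behavior listed above amounts to a two-way split by media kind. A minimal sketch, assuming the kind can be read from the top-level part of a MIME type (the server may use a different signal, and delivery_mode is a hypothetical name):

```python
def delivery_mode(mime_type: str) -> str:
    """Classify how a media type would be returned, per the Returns list.

    Assumption: the media kind is inferred from the MIME type's top-level
    part; the server may use a different signal.
    """
    top_level = mime_type.split("/", 1)[0].strip().lower()
    if top_level in ("image", "audio"):
        return "inline"
    # Video, documents, and anything else are saved under /tmp/whatsapp-media/.
    return "file"
```

An agent can use the same split to decide whether to expect inline content or a file path in the tool result.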

Input Schema

| Name | Required | Description | Default |
|---|---|---|---|
| contactId | Yes | Phone digits or "*@g.us" (the chat that contains the media message) | |
| messageId | Yes | Message ID with media to download | |
Behavior: 4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide readOnlyHint, destructiveHint, idempotentHint. The description adds valuable behavioral context: inline return for images/audio, file save path for video/documents, and potential unavailability of old media.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Description is concise (5 sentences plus structured Args/Returns sections), front-loaded with purpose, and well-organized. Every sentence adds value without redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a simple tool with 2 required parameters and no output schema, the description covers all necessary aspects: return behavior by media type, prerequisite workflow, and data persistence. Annotations cover safety constraints.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, with clear descriptions for both parameters. The Args section of the description repeats similar explanations and adds no semantic value beyond the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states that the tool downloads media (image, audio, video, document) from a WhatsApp message, naming the exact verb and resource. It distinguishes itself from sibling tools by instructing the agent to call whatsapp_read_messages first.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description explicitly instructs the agent to call whatsapp_read_messages first to find messages with hasMedia=true, then pass contactId and messageId. It also notes that old media may no longer be available, which tells the agent when a download is likely to fail.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/Maheidem/waha-mcp-server'
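The same request can be issued from Python with the standard library. This is a sketch under two assumptions: that the endpoint returns a JSON body, and that other servers follow the same owner/name path pattern as the URL above. The helper names are hypothetical.

```python
import json
import urllib.request
from urllib.parse import quote

API_BASE = "https://glama.ai/api/mcp/v1/servers"


def build_server_request(owner: str, name: str) -> urllib.request.Request:
    """Build the GET request for one server's directory entry."""
    url = f"{API_BASE}/{quote(owner, safe='')}/{quote(name, safe='')}"
    return urllib.request.Request(url, method="GET")


def fetch_server(owner: str, name: str, timeout: float = 10.0) -> dict:
    """Send the request and decode the body, assuming it is JSON."""
    with urllib.request.urlopen(build_server_request(owner, name), timeout=timeout) as resp:
        return json.load(resp)


# Building the request does not touch the network; fetch_server does.
request = build_server_request("Maheidem", "waha-mcp-server")
```

Calling fetch_server("Maheidem", "waha-mcp-server") performs the actual network request; the structure of the returned data is not described on this page.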

If you have feedback or need assistance with the MCP directory API, please join our Discord server.