Skip to main content
Glama

create_carousel

Generate carousel slides from content, URLs, or topics for LinkedIn, Instagram, and Threads by extracting key points and creating formatted PDFs or PNGs.

Instructions

Full pipeline: content → slides → PDF/PNG in one call. Supports three input modes:

Mode 1 (Content provided): Pass article text, markdown, or transcript in the content field. The tool extracts key points, generates slides, and produces output immediately.

Mode 2 (URL provided): Pass a sourceUrl. The tool fetches the page, extracts article content, and runs the full pipeline. If the page is JavaScript-rendered and extraction fails, the tool returns fallback instructions to use firecrawl_scrape or web_fetch instead.

Mode 3 (Topic only): Pass a topic without content or sourceUrl. The tool returns orchestration instructions with multiple research workflows: • Web Research: Use gemini_deep_research → pass result as content • YouTube Summary: Use supadata_transcript → summarize with gemini_chat → pass as content • Data-Driven: Use gemini_deep_research → preview_slides → generate_svg for charts → render_slides • Quick Draft: Use web_fetch to grab a page → pass as content

Common workflows the user might request:

  1. "Make a carousel about [topic]" → Call with topic param, follow returned workflow

  2. "Turn this article into a carousel: [url]" → Call with sourceUrl param

  3. "Here's my content, make a carousel" → Call with content param

  4. "Research [topic] and make a carousel with charts" → Call with topic, follow Data-Driven workflow

Platforms: linkedin (1080×1350 PDF), instagram (1080×1080 PNGs), threads (1080×1350 PNGs)

After rendering: Consider using analyze_image (Gemini MCP) to review the output quality.

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
contentNoArticle text, markdown, or transcript. Optional if sourceUrl is provided (content will be fetched). If neither content nor sourceUrl is provided, topic must be set.
topicNoTopic for the carousel. When set without content/sourceUrl, returns orchestration instructions for researching the topic first via available tools (Gemini, web fetch, Supadata).
templateNameNoTemplate name (run list_templates to see options including user brand kits)professional
brandNameNoBrand name shown in slide footers
slideCountNoTarget number of slides (4-12)
sourceUrlNoURL to fetch article content from. The server scrapes and cleans HTML automatically. Also used for CTA slide link.
outputDirNoOutput directory (default: ~/Documents/carousels/)
platformNoTarget platform: linkedin (1080x1350 PDF), instagram (1080x1080 PNGs), threads (1080x1350 PNGs)linkedin
Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries full burden and does an excellent job disclosing behavioral traits. It explains the three distinct processing modes, fallback behaviors when extraction fails, orchestration workflows for topic-only inputs, platform-specific outputs, and post-rendering suggestions. The only minor gap is not explicitly mentioning rate limits or authentication requirements.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is well-structured with clear sections (modes, workflows, platforms, post-rendering) and uses bullet points effectively. While comprehensive, it's appropriately sized for a complex 8-parameter tool with multiple operational modes. Every sentence serves a purpose in clarifying usage.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's complexity (8 parameters, 3 distinct modes, multiple workflows) and no annotations or output schema, the description provides exceptional completeness. It covers all operational scenarios, explains behavioral outcomes, provides usage examples, mentions platform specifics, and even suggests follow-up actions. This fully compensates for the lack of structured metadata.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

With 100% schema description coverage, the baseline is 3, but the description adds significant value by explaining how parameters interact in the three modes. It clarifies that content and sourceUrl are mutually exclusive alternatives, topic triggers research workflows, and platform affects output format. This contextual understanding goes beyond the schema's individual parameter descriptions.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description explicitly states the tool's purpose as a 'full pipeline' that transforms content into slides and PDF/PNG outputs. It clearly distinguishes from sibling tools like create_template, list_templates, preview_slides, and render_slides by being the comprehensive end-to-end solution rather than individual components.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides extensive guidance on when to use each input mode (content, sourceUrl, topic) and includes specific workflow examples. It explicitly mentions alternatives like firecrawl_scrape, web_fetch, gemini_deep_research, and supadata_transcript for different scenarios, giving clear context for tool selection.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Install Server

Other Tools

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/houtini-ai/carousels-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server