Skip to main content
Glama
jacob-bd

Perplexity Web MCP

by jacob-bd

Server Configuration

Describes the environment variables required to run the server.

NameRequiredDescriptionDefault

No arguments

Capabilities

Features and capabilities supported by this server

CapabilityDetails
tasks
{
  "list": {},
  "cancel": {},
  "requests": {
    "tools": {
      "call": {}
    },
    "prompts": {
      "get": {}
    },
    "resources": {
      "read": {}
    }
  }
}
tools
{
  "listChanged": true
}
prompts
{
  "listChanged": false
}
resources
{
  "subscribe": false,
  "listChanged": false
}
experimental
{}

Tools

Functions exposed to the LLM to take actions

NameDescription
pplx_queryA

Query Perplexity AI with explicit model selection. COSTS 1 PRO SEARCH QUERY per call.

Prefer pplx_smart_query for automatic quota-aware routing. Use this only when you need a specific model or thinking mode.

Args: query: The question to ask model: Model to use - auto, sonar, deep_research, gpt54, gpt55, claude_sonnet, claude_opus, gemini_pro, nemotron, glm52, kimi_k26 thinking: Enable extended thinking mode (available for gpt54, gpt55, claude_sonnet, claude_opus, kimi_k26; always on for gemini_pro, nemotron, and glm52) source_focus: Source type - none (model only, no search), web, academic, social, finance, all, or connector source ID from pplx_connectors()

pplx_askA

Quick Q&A with auto model. COSTS 1 PRO SEARCH QUERY. Prefer pplx_smart_query(intent='quick') for simple lookups (Sonar 2 first).

pplx_deep_researchB

Deep Research — in-depth reports. COSTS 1 DEEP RESEARCH QUERY (limited monthly pool, typically 5-10 total). Only use when the user explicitly requests deep research.

pplx_deep_research_startA

Start a deep research task asynchronously. COSTS 1 DEEP RESEARCH QUERY.

Use this instead of pplx_deep_research for complex queries to avoid connection timeouts. Returns a task_id immediately. Poll pplx_research_status with the task_id to get the result.

pplx_research_statusA

Check the status of an asynchronous deep research task.

Returns 'in_progress' if the task is still running, the final research report if completed, or an error message if it failed.

pplx_sonarC

Sonar 2 — Perplexity's latest in-house model. Subject to your plan and Perplexity usage counters (see pplx_usage).

pplx_gpt54D

GPT-5.4 — OpenAI's versatile model. COSTS 1 PRO SEARCH QUERY.

pplx_gpt54_thinkingC

GPT-5.4 Thinking — OpenAI's versatile model with extended thinking. COSTS 1 PRO SEARCH QUERY.

pplx_gpt55C

GPT-5.5 — OpenAI's latest model. COSTS 1 PRO SEARCH QUERY. Requires Max subscription.

pplx_gpt55_thinkingC

GPT-5.5 Thinking — OpenAI's latest model with extended thinking. COSTS 1 PRO SEARCH QUERY. Requires Max subscription.

pplx_claude_sonnetC

Claude Sonnet 5.0 — Anthropic's fast model. COSTS 1 PRO SEARCH QUERY.

pplx_claude_sonnet_thinkC

Claude Sonnet 5.0 Thinking — Anthropic's newest reasoning model. COSTS 1 PRO SEARCH QUERY.

pplx_claude_opusC

Claude Opus 4.8 — Anthropic's most advanced reasoning model. COSTS 1 PRO SEARCH QUERY. Requires Max subscription.

pplx_claude_opus_thinkC

Claude Opus 4.8 Thinking — Anthropic's most advanced reasoning model with extended thinking. COSTS 1 PRO SEARCH QUERY. Requires Max subscription.

pplx_gemini_pro_thinkC

Gemini 3.1 Pro Thinking — Google's most advanced model with extended thinking. COSTS 1 PRO SEARCH QUERY.

pplx_nemotron_thinkingC

Nemotron 3 Ultra — NVIDIA's Nemotron 3 Ultra 550B model with extended thinking. COSTS 1 PRO SEARCH QUERY.

pplx_glm52C

GLM 5.2 — Z.ai's advanced model with thinking always enabled. COSTS 1 PRO SEARCH QUERY.

pplx_kimi_k26C

Kimi K2.6 — Moonshot's advanced model. COSTS 1 PRO SEARCH QUERY.

pplx_kimi_k26_thinkingC

Kimi K2.6 Thinking — Moonshot's advanced model with extended thinking. COSTS 1 PRO SEARCH QUERY.

pplx_smart_queryA

RECOMMENDED DEFAULT TOOL. Quota-aware query — checks limits and picks the best model automatically.

USE THIS FOR EVERY QUERY unless the user explicitly requests a specific model. Default to intent='quick' for most lookups — it routes to Sonar 2 when appropriate. Only escalate intent when the question genuinely requires it.

Intent guide (choose the LOWEST sufficient level):

  • quick: Facts, definitions, simple lookups, 'what is X' → Sonar 2 (check pplx_usage)

  • standard: How-to, comparisons, explanations needing web sources → 1 Pro Search

  • detailed: Complex multi-source analysis, technical deep-dives → 1 Pro Search (premium model)

  • research: Comprehensive report → 1 Deep Research (scarce monthly quota, user must request)

Response includes a metadata block showing the model used, routing reason, and current quota snapshot.

Args: query: The question to ask intent: Query complexity — quick (default for most), standard, detailed, research source_focus: Source type — none (model only, no search), web, academic, social, finance, all, or connector source ID from pplx_connectors()

pplx_councilA

Model Council — query multiple models in parallel, get synthesized consensus.

IMPORTANT — BEFORE calling this tool, you MUST:

  1. Tell the user the available models: sonar, gpt54, gpt55, claude_sonnet, claude_opus, gemini_pro, nemotron, glm52, kimi_k26

  2. Check pplx_usage() first. If Subscription is Pro, do not include Max-only models: gpt55, claude_opus

  3. Ask the user WHICH models they want in their council and HOW MANY

  4. Inform them of the cost: each council model = 1 Pro Search query, plus synthesis (default chairman sonar = Sonar 2 pass — still counts as a normal query toward limits)

  5. Get explicit confirmation before executing

Default council: GPT-5.4, Claude Sonnet 5.0, Gemini 3.1 Pro (Pro-compatible, 3 diverse providers).

Args: query: The question to ask all council models source_focus: Source type for all models (none/web/academic/social/finance/all or connector source ID) models: Comma-separated model names to use as council members. Available: sonar, gpt54, gpt55, claude_sonnet, claude_opus, gemini_pro, nemotron, glm52, kimi_k26. Default: "gpt54,claude_sonnet,gemini_pro" (3 models + synthesis = 4 Pro Searches) Max-only: gpt55, claude_opus. Exclude these when pplx_usage shows a Pro subscription. synthesize: Whether to synthesize a consensus from all responses. Set false to get only individual responses (saves 1 Sonar 2 call). thinking: Enable extended thinking for council models (gpt54, gpt55, claude_sonnet, claude_opus, kimi_k26 support toggle; gemini_pro, nemotron, and glm52 are always thinking). chairman: Model to use for synthesis (default: "sonar" / Sonar 2). Non-sonar chairmen cost 1 extra Pro Search query.

pplx_list_threadsA

Browse your Perplexity thread library. FREE — zero quota cost.

Returns a paginated list of your past Perplexity conversations with their slugs, titles, models used, and answer previews.

PRIMARY USE CASES:

  1. "Did I already research X?" — search before spending a Pro query: pplx_list_threads(search_term="quantum computing")

  2. Find a conversation to resume — get its slug, then pass it as conversation_id to any pplx_* query tool to continue right where it left off.

  3. Retrieve full history — call pplx_get_thread(slug) with any slug from this list.

Args: limit: Max threads to return (default 20, max 100). offset: Skip this many threads — use for pagination (e.g. offset=20 for page 2). search_term: Optional keyword to filter threads by title or content.

pplx_get_threadA

Fetch the full conversation history for a Perplexity thread. FREE — zero quota cost.

Returns the complete Q&A turns, sources, and related queries for any past thread. Get the slug from pplx_list_threads, or from the [Conversation ID: ...] footer returned by any pplx_* query tool.

RESUME PATTERN — to continue a past conversation:

  1. pplx_list_threads(search_term="topic") — find the thread and its slug

  2. pplx_get_thread(slug) — read the full history for context

  3. pplx_smart_query("follow-up question", conversation_id=slug) — continue it

Args: slug: Thread UUID / slug. Obtain from pplx_list_threads or a previous [Conversation ID: ...] response footer.

pplx_usageA

Check current Perplexity usage limits and remaining quotas.

CALL THIS AT THE START OF EVERY SESSION before making any queries. Shows remaining Pro Search (weekly), Deep Research (monthly), and other quotas. Use the results to decide whether to conserve Pro quota (e.g. quick intent before premium models).

Args: refresh: Force refresh from Perplexity (ignores cache). Default False.

pplx_connectorsA

List account connector source IDs that can be passed as source_focus.

Returns source IDs from the Perplexity rate-limit API. Use these IDs as source_focus values, for example source_focus="pitchbook_mcp_cashmere".

pplx_auth_statusA

Check if Perplexity is authenticated.

Returns the current authentication status and subscription tier if authenticated. Use this to check if re-authentication is needed before making queries.

pplx_auth_request_codeA

Request a verification code for Perplexity authentication.

Sends a 6-digit verification code to the provided email address. After calling this, check the email inbox and use pplx_auth_complete with the code to finish authentication.

Args: email: Your Perplexity account email address

Returns: Status message indicating if the code was sent successfully

pplx_auth_completeA

Complete Perplexity authentication with the verification code.

Use the 6-digit code received via email after calling pplx_auth_request_code. On success, the session token is saved and all pplx_* tools will work.

Args: email: Your Perplexity account email (same as used in pplx_auth_request_code) code: The 6-digit verification code from your email

Returns: Status message with authentication result and subscription tier

Prompts

Interactive templates invoked by user choice

NameDescription

No prompts

Resources

Contextual data attached and managed by the client

NameDescription
library_resourceYour Perplexity thread library — most recent 50 threads. Returns a formatted list of your recent Perplexity conversations. Use perplexity://thread/{slug} to access a specific thread's full history.

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/jacob-bd/perplexity-web-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server