Perplexity Web MCP
Server Configuration
Describes the environment variables required to run the server.
| Name | Required | Description | Default |
|---|---|---|---|
| No arguments | | | |
Capabilities
Features and capabilities supported by this server
| Capability | Details |
|---|---|
| tasks | {"list": {}, "cancel": {}, "requests": {"tools": {"call": {}}, "prompts": {"get": {}}, "resources": {"read": {}}}} |
| tools | {"listChanged": true} |
| prompts | {"listChanged": false} |
| resources | {"subscribe": false, "listChanged": false} |
| experimental | {} |
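As a rough illustration of how a client might read the capability flags above, the following sketch reduces them to a few booleans. It assumes the usual MCP reading of these fields (`listChanged` meaning the server may notify clients when a list changes, `subscribe` meaning resource subscriptions are supported); the helper name `summarize_capabilities` and the keys of its result are hypothetical and not part of this server.

```python
import json

# Capability object copied from the table above.
CAPABILITIES = json.loads("""
{
  "tasks": {
    "list": {},
    "cancel": {},
    "requests": {
      "tools": {"call": {}},
      "prompts": {"get": {}},
      "resources": {"read": {}}
    }
  },
  "tools": {"listChanged": true},
  "prompts": {"listChanged": false},
  "resources": {"subscribe": false, "listChanged": false},
  "experimental": {}
}
""")


def summarize_capabilities(caps: dict) -> dict:
    """Hypothetical helper: reduce the capability object to simple flags."""
    tasks = caps.get("tasks", {})
    return {
        # True when the server advertises list-changed notifications for tools.
        "tools_list_may_change": bool(caps.get("tools", {}).get("listChanged")),
        "prompts_list_may_change": bool(caps.get("prompts", {}).get("listChanged")),
        "can_subscribe_to_resources": bool(caps.get("resources", {}).get("subscribe")),
        "can_list_tasks": "list" in tasks,
        "can_cancel_tasks": "cancel" in tasks,
        # Request kinds listed under tasks.requests, sorted for stable output.
        "task_request_kinds": sorted(tasks.get("requests", {})),
    }


summary = summarize_capabilities(CAPABILITIES)
```

Under these assumptions, a client would re-fetch the tool list when notified of a change but would not attempt resource subscriptions.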
Tools
Functions exposed to the LLM to take actions
| Name | Description |
|---|---|
| pplx_query | Query Perplexity AI with explicit model selection. COSTS 1 PRO SEARCH QUERY per call. Prefer pplx_smart_query for automatic quota-aware routing. Use this only when you need a specific model or thinking mode. Args: query: The question to ask. model: Model to use (auto, sonar, deep_research, gpt54, gpt55, claude_sonnet, claude_opus, gemini_pro, nemotron, glm52, kimi_k26). thinking: Enable extended thinking mode (available for gpt54, gpt55, claude_sonnet, claude_opus, kimi_k26; always on for gemini_pro, nemotron, and glm52). source_focus: Source type, one of none (model only, no search), web, academic, social, finance, all, or a connector source ID from pplx_connectors(). |
| pplx_ask | Quick Q&A with auto model. COSTS 1 PRO SEARCH QUERY. Prefer pplx_smart_query(intent='quick') for simple lookups (Sonar 2 first). |
| pplx_deep_research | Deep Research: in-depth reports. COSTS 1 DEEP RESEARCH QUERY (limited monthly pool, typically 5-10 total). Only use when the user explicitly requests deep research. |
| pplx_deep_research_start | Start a deep research task asynchronously. COSTS 1 DEEP RESEARCH QUERY. Use this instead of pplx_deep_research for complex queries to avoid connection timeouts. Returns a task_id immediately. Poll pplx_research_status with the task_id to get the result. |
| pplx_research_status | Check the status of an asynchronous deep research task. Returns 'in_progress' if the task is still running, the final research report if completed, or an error message if it failed. |
| pplx_sonar | Sonar 2: Perplexity's latest in-house model. Subject to your plan and Perplexity usage counters (see pplx_usage). |
| pplx_gpt54 | GPT-5.4: OpenAI's versatile model. COSTS 1 PRO SEARCH QUERY. |
| pplx_gpt54_thinking | GPT-5.4 Thinking: OpenAI's versatile model with extended thinking. COSTS 1 PRO SEARCH QUERY. |
| pplx_gpt55 | GPT-5.5: OpenAI's latest model. COSTS 1 PRO SEARCH QUERY. Requires Max subscription. |
| pplx_gpt55_thinking | GPT-5.5 Thinking: OpenAI's latest model with extended thinking. COSTS 1 PRO SEARCH QUERY. Requires Max subscription. |
| pplx_claude_sonnet | Claude Sonnet 5.0: Anthropic's fast model. COSTS 1 PRO SEARCH QUERY. |
| pplx_claude_sonnet_think | Claude Sonnet 5.0 Thinking: Anthropic's newest reasoning model. COSTS 1 PRO SEARCH QUERY. |
| pplx_claude_opus | Claude Opus 4.8: Anthropic's most advanced reasoning model. COSTS 1 PRO SEARCH QUERY. Requires Max subscription. |
| pplx_claude_opus_think | Claude Opus 4.8 Thinking: Anthropic's most advanced reasoning model with extended thinking. COSTS 1 PRO SEARCH QUERY. Requires Max subscription. |
| pplx_gemini_pro_think | Gemini 3.1 Pro Thinking: Google's most advanced model with extended thinking. COSTS 1 PRO SEARCH QUERY. |
| pplx_nemotron_thinking | Nemotron 3 Ultra: NVIDIA's Nemotron 3 Ultra 550B model with extended thinking. COSTS 1 PRO SEARCH QUERY. |
| pplx_glm52 | GLM 5.2: Z.ai's advanced model with thinking always enabled. COSTS 1 PRO SEARCH QUERY. |
| pplx_kimi_k26 | Kimi K2.6: Moonshot's advanced model. COSTS 1 PRO SEARCH QUERY. |
| pplx_kimi_k26_thinking | Kimi K2.6 Thinking: Moonshot's advanced model with extended thinking. COSTS 1 PRO SEARCH QUERY. |
| pplx_smart_query | RECOMMENDED DEFAULT TOOL. Quota-aware query: checks limits and picks the best model automatically. USE THIS FOR EVERY QUERY unless the user explicitly requests a specific model. Default to intent='quick' for most lookups; it routes to Sonar 2 when appropriate. Only escalate intent when the question genuinely requires it. Intent guide (choose the LOWEST sufficient level):
Response includes a metadata block showing the model used, routing reason, and current quota snapshot. Args: query: The question to ask. intent: Query complexity, one of quick (default for most), standard, detailed, research. source_focus: Source type, one of none (model only, no search), web, academic, social, finance, all, or a connector source ID from pplx_connectors(). |
| pplx_council | Model Council: query multiple models in parallel, get synthesized consensus. IMPORTANT: BEFORE calling this tool, you MUST:
Default council: GPT-5.4, Claude Sonnet 5.0, Gemini 3.1 Pro (Pro-compatible, 3 diverse providers). Args: query: The question to ask all council models. source_focus: Source type for all models (none/web/academic/social/finance/all or connector source ID). models: Comma-separated model names to use as council members. Available: sonar, gpt54, gpt55, claude_sonnet, claude_opus, gemini_pro, nemotron, glm52, kimi_k26. Default: "gpt54,claude_sonnet,gemini_pro" (3 models + synthesis = 4 Pro Searches). Max-only: gpt55, claude_opus. Exclude these when pplx_usage shows a Pro subscription. synthesize: Whether to synthesize a consensus from all responses. Set false to get only individual responses (saves 1 Sonar 2 call). thinking: Enable extended thinking for council models (gpt54, gpt55, claude_sonnet, claude_opus, kimi_k26 support toggle; gemini_pro, nemotron, and glm52 are always thinking). chairman: Model to use for synthesis (default: "sonar" / Sonar 2). Non-sonar chairmen cost 1 extra Pro Search query. |
| pplx_list_threads | Browse your Perplexity thread library. FREE: zero quota cost. Returns a paginated list of your past Perplexity conversations with their slugs, titles, models used, and answer previews. PRIMARY USE CASES:
Args: limit: Max threads to return (default 20, max 100). offset: Skip this many threads; use for pagination (e.g. offset=20 for page 2). search_term: Optional keyword to filter threads by title or content. |
| pplx_get_thread | Fetch the full conversation history for a Perplexity thread. FREE: zero quota cost. Returns the complete Q&A turns, sources, and related queries for any past thread. Get the slug from pplx_list_threads, or from the [Conversation ID: ...] footer returned by any pplx_* query tool. RESUME PATTERN (to continue a past conversation):
Args: slug: Thread UUID / slug. Obtain from pplx_list_threads or a previous [Conversation ID: ...] response footer. |
| pplx_usage | Check current Perplexity usage limits and remaining quotas. CALL THIS AT THE START OF EVERY SESSION before making any queries. Shows remaining Pro Search (weekly), Deep Research (monthly), and other quotas. Use the results to decide whether to conserve Pro quota (e.g. quick intent before premium models). Args: refresh: Force refresh from Perplexity (ignores cache). Default False. |
| pplx_connectors | List account connector source IDs that can be passed as source_focus. Returns source IDs from the Perplexity rate-limit API. Use these IDs as source_focus values, for example source_focus="pitchbook_mcp_cashmere". |
| pplx_auth_status | Check if Perplexity is authenticated. Returns the current authentication status and subscription tier if authenticated. Use this to check if re-authentication is needed before making queries. |
| pplx_auth_request_code | Request a verification code for Perplexity authentication. Sends a 6-digit verification code to the provided email address. After calling this, check the email inbox and use pplx_auth_complete with the code to finish authentication. Args: email: Your Perplexity account email address. Returns: Status message indicating if the code was sent successfully. |
| pplx_auth_complete | Complete Perplexity authentication with the verification code. Use the 6-digit code received via email after calling pplx_auth_request_code. On success, the session token is saved and all pplx_* tools will work. Args: email: Your Perplexity account email (same as used in pplx_auth_request_code). code: The 6-digit verification code from your email. Returns: Status message with authentication result and subscription tier. |
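The asynchronous deep research flow described for pplx_deep_research_start and pplx_research_status can be sketched as a start-then-poll loop. This is a minimal sketch and not the server's own client code: `call_tool` is a hypothetical callable standing in for whatever MCP client is in use, the argument names `query` and `task_id` are assumptions, and the sketch assumes the start tool returns the bare task ID as text and that a running task reports exactly `in_progress`.

```python
import time
from typing import Callable, Dict

# Hypothetical adapter: call_tool(tool_name, arguments) -> text result.
ToolCaller = Callable[[str, Dict[str, str]], str]


def run_deep_research(
    call_tool: ToolCaller,
    query: str,
    poll_seconds: float = 10.0,
    max_polls: int = 60,
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Start a deep research task, then poll until it stops reporting in_progress."""
    # Assumption: the start tool's text result is the task ID itself.
    task_id = call_tool("pplx_deep_research_start", {"query": query}).strip()

    for _ in range(max_polls):
        status = call_tool("pplx_research_status", {"task_id": task_id})
        # Assumption: a running task returns exactly "in_progress";
        # anything else is the final report or an error message.
        if status.strip() != "in_progress":
            return status
        sleep(poll_seconds)

    raise TimeoutError(f"Task {task_id} still in progress after {max_polls} polls")
```

Because each start call costs one Deep Research query, a caller would typically keep the returned task ID and resume polling after a timeout instead of starting the task again.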
Prompts
Interactive templates invoked by user choice
| Name | Description |
|---|---|
| No prompts | |
Resources
Contextual data attached and managed by the client
| Name | Description |
|---|---|
| library_resource | Your Perplexity thread library: the most recent 50 threads. Returns a formatted list of your recent Perplexity conversations. Use perplexity://thread/{slug} to access a specific thread's full history. |
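For illustration, a thread URI following the perplexity://thread/{slug} pattern above could be built as below. The helper name `thread_uri` is hypothetical, and percent-encoding the slug is a defensive assumption; slugs that are plain UUIDs pass through unchanged.

```python
from urllib.parse import quote

# URI template taken from the resource description above.
THREAD_URI_TEMPLATE = "perplexity://thread/{slug}"


def thread_uri(slug: str) -> str:
    """Hypothetical helper: build the resource URI for one thread slug."""
    cleaned = slug.strip()
    if not cleaned:
        raise ValueError("slug must not be empty")
    # Encode reserved characters so the slug stays a single path segment.
    return THREAD_URI_TEMPLATE.format(slug=quote(cleaned, safe=""))
```

The slug itself would come from pplx_list_threads or from a [Conversation ID: ...] footer, as the tool descriptions note.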
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/jacob-bd/perplexity-web-mcp'
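The same request can be made from Python with only the standard library. This is a sketch, not an official client: the function names are hypothetical, and it assumes the endpoint returns a JSON body, which the page does not state.

```python
import json
import urllib.request

# Base URL taken from the curl command above.
API_BASE = "https://glama.ai/api/mcp/v1/servers"


def build_server_request(owner: str, name: str) -> urllib.request.Request:
    """Build the GET request for one server entry without sending it."""
    return urllib.request.Request(f"{API_BASE}/{owner}/{name}", method="GET")


def fetch_server(owner: str, name: str, timeout: float = 10.0):
    """Send the request and decode the body (assumed to be JSON)."""
    request = build_server_request(owner, name)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


request = build_server_request("jacob-bd", "perplexity-web-mcp")
```

Calling `fetch_server("jacob-bd", "perplexity-web-mcp")` would perform the network request; building the request separately makes the URL easy to inspect first.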