mrmarket.ai

by ai.mrmarket

Server Details

Analytical engine for US-listed equities: screens, rankings, and event studies in plain English.

Status: Healthy
Last Tested: 2026-07-22 03:32
Transport: Streamable HTTP
Repository: mrmarket-ai/mrmarket-mcp
GitHub Stars: 3

Glama MCP Gateway

Connect through Glama MCP Gateway for full control over tool access and complete visibility into every call.

MCP client → Glama → MCP server

Full call logging

Every tool call is logged with complete inputs and outputs, so you can debug issues and audit what your agents are doing.

Tool access control

Enable or disable individual tools per connector, so you decide what your agents can and cannot do.

Managed credentials

Glama handles OAuth flows, token storage, and automatic rotation, so credentials never expire on your clients.

Usage analytics

See which tools your agents call, how often, and when, so you can understand usage patterns and catch anomalies.

100% free. Your data is private.

Tool Definition Quality

A (4.4/5.0)

Tool Descriptions: A

Average 4.5/5 across 9 of 9 tools scored. Lowest: 3.9/5.

Server Coherence: A

Disambiguation: 5/5

Each tool has a clearly distinct purpose: describe_data for catalog, query_data for analysis, fetch_page for pagination, get_symbols for symbol lookup, get_account_status for account info, getting_started for onboarding, list_tickets/report_issue for support, recent_queries for history. No overlap.

Naming Consistency: 4/5

Most tools follow a verb_noun pattern (describe_data, fetch_page, get_account_status, get_symbols, list_tickets, report_issue), but 'getting_started' uses a gerund and 'recent_queries' is adjective_noun, breaking the pattern. 'query_data' could be interpreted as a noun phrase. Minor inconsistency.

Tool Count: 5/5

With 9 tools covering data catalog, query, pagination, symbol lookup, account status, onboarding, support, and query history, the count is well-scoped for the server's purpose. Each tool earns its place without redundancy.

Completeness: 5/5

The tool surface covers the full lifecycle from discovery (describe_data), analysis (query_data, fetch_page), identification (get_symbols), account management (get_account_status), onboarding (getting_started), to support (list_tickets, report_issue, recent_queries). No obvious gaps for the intended use case.

Available Tools

9 tools

describe_data (A)

Read-only

Data catalog — zero cost. Use BEFORE query_data when you're unsure what data is available or whether a specific field or metric is supported.

The catalog lists RAW INGREDIENTS; query_data accepts RECIPES. Any field can be combined with any other in one question — ratios of two fields, growth of any metric, rolling stats, group medians, cohort-relative z-scores, benchmark-relative betas. A metric missing from the catalog by name is still answerable if it's computable from the fields (the overview's composition section lists the patterns).

Scopes:

overview → list all data categories (prices, financials, earnings, insider transactions, etc.) + coverage depth + key fields + computed metrics + composition patterns. Good first call.
category → full field list for one data category. Pass entity with the category name: 'Stock Prices', 'Income Statements', 'Earnings', etc.
computed_metrics → pre-defined computed metrics (ROIC, FCF yield, D/E, ROE, ROA, margins, growth/CAGR, volatility, drawdown, beta, z-score, median…).
search → keyword search across field names, categories, computed metrics, and composition patterns. Pass search_term: 'volatility', 'capex', etc.

Parameters (JSON Schema)

Name	Required	Description
`scope`	Yes	Use "category" (or "entity" as alias) for a specific data category.
`entity`	No	Category name for scope "category" — e.g. "Stock Prices", "Income Statements", "Earnings".
`search_term`	No
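
The scope rules above can be sketched as a small argument builder. This is a client-side illustration only: `describe_data_args` is a hypothetical helper, and treating `entity` and `search_term` as mandatory for their scopes is an assumption drawn from the description, since the schema marks both as optional.

```python
from typing import Optional

# The four scopes named in the tool description.
SCOPES = {"overview", "category", "computed_metrics", "search"}


def describe_data_args(scope: str,
                       entity: Optional[str] = None,
                       search_term: Optional[str] = None) -> dict:
    """Build the arguments object for a describe_data call (hypothetical helper)."""
    if scope == "entity":
        # The schema lists "entity" as an alias for "category".
        scope = "category"
    if scope not in SCOPES:
        raise ValueError(f"unknown scope: {scope!r}")
    args = {"scope": scope}
    if scope == "category":
        # Assumed client-side check: a category lookup needs a category name.
        if not entity:
            raise ValueError('scope "category" needs an entity, e.g. "Stock Prices"')
        args["entity"] = entity
    elif scope == "search":
        # Assumed client-side check: a search needs a keyword.
        if not search_term:
            raise ValueError('scope "search" needs a search_term, e.g. "capex"')
        args["search_term"] = search_term
    return args
```

For example, `describe_data_args("search", search_term="volatility")` yields the arguments for a keyword search, while `describe_data_args("overview")` needs nothing beyond the scope.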

Tool Definition Quality

A (4.3/5.0)

Behavior: 4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations declare readOnlyHint=true, and the description adds that it is 'zero cost' and lists raw ingredients. No behavioral surprises; the tool is purely informational. Could be enhanced by describing return format, but overall transparent.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Well-structured: front-loaded with core use case, followed by key insight (raw ingredients vs recipes), then bullet list of scopes. Slightly long but every sentence adds value. Could be trimmed slightly without losing clarity.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's complexity (multiple scopes, relationship to query_data, no output schema), the description covers purpose, usage context, and parameter details. Missing output format, but that is acceptable as the catalog is self-descriptive. Adequate for an agent to decide when to invoke.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Adds significant meaning beyond the schema: explains scope enum values in detail (overview, category, computed_metrics, search), provides examples for entity parameter, and implies search_term usage. Schema coverage is 67% (search_term lacks schema description), and the description compensates well.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly identifies the tool as a data catalog and contrasts it with query_data (raw ingredients vs recipes). It lists specific uses (list categories, fields, computed metrics) and distinguishes from siblings by stating when to use it (before query_data).

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicit guidance to use before query_data when unsure about data availability. Each scope (overview, category, computed_metrics, search) has usage context. However, no explicit 'when not to use' or alternative suggestions beyond query_data.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

fetch_page (A)

Read-only

Fetch the NEXT page of a large query_data result — FREE (zero credits, runs no new query). Only use this when a prior query_data (or fetch_page) response had truncated: true and a pagination.next_cursor.

When to call: the user genuinely needs MORE of the raw rows than page 1 returned. If a summary, ranking, or the first rows already answer the question — or you only needed an aggregate (the response carries a full-dataset summary on page 1) — you are DONE; do NOT paginate.

Pass the cursor string from pagination.next_cursor VERBATIM — do not edit or truncate it. Keep calling fetch_page with each new next_cursor until it is null. Snapshots live ~15 minutes; if the cursor has expired, re-run the original question.

Parameters (JSON Schema)

Name	Required	Description	Default
`cursor`	Yes	The opaque continuation cursor from a prior response's `pagination.next_cursor`. Paste it exactly. Do NOT pass a question here — fetch_page returns the next page of an existing result and never runs a new query.
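
The cursor-following rule above can be sketched as a short loop. This is a minimal illustration under stated assumptions: `call_tool` is a hypothetical stand-in for whatever your MCP client uses to invoke a tool, and the `rows` key is an assumed name for the page payload. Only `truncated` and `pagination.next_cursor` come from the tool description.

```python
from typing import Any, Callable, Dict, List

# Hypothetical signature: call_tool(tool_name, arguments) -> parsed response dict.
ToolCaller = Callable[[str, Dict[str, Any]], Dict[str, Any]]


def collect_all_rows(call_tool: ToolCaller, first_response: Dict[str, Any]) -> List[Any]:
    """Follow pagination.next_cursor until it is null, gathering every row."""
    rows = list(first_response.get("rows", []))  # "rows" is an assumed key
    cursor = None
    if first_response.get("truncated"):
        cursor = (first_response.get("pagination") or {}).get("next_cursor")
    while cursor is not None:
        # Pass the cursor verbatim; fetch_page runs no new query.
        page = call_tool("fetch_page", {"cursor": cursor})
        rows.extend(page.get("rows", []))
        cursor = (page.get("pagination") or {}).get("next_cursor")
    return rows
```

The sketch does not handle an expired snapshot. Per the description, the remedy in that case is to re-run the original question rather than retry the cursor.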

Tool Definition Quality

A (4.8/5.0)

Behavior: 5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already provide readOnlyHint=true, but description adds critical context: it is free, runs no new query, and snapshots live ~15 minutes. No contradiction with annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Description is well-structured and front-loaded with the primary purpose. It is slightly long but each sentence adds value. Could be slightly more concise by merging some points, but overall efficient.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Without an output schema, the description fully covers the pagination behavior, including cursor handling, lifecycle, and when to stop. It is complete enough for an agent to use correctly.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema covers the cursor parameter with description, but the description adds the instruction to pass it verbatim and to keep calling until null. This provides valuable guidance beyond the schema, but schema coverage is already high.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states the tool fetches the next page of a query_data result, with specific verb 'Fetch' and resource 'NEXT page'. It distinguishes from siblings like query_data by emphasizing it never runs a new query.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly states when to use (prior response had truncated: true and pagination.next_cursor) and when not to use (if first rows or summary answers the question). Also warns about cursor expiry and provides guidance on repeated calls.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_account_status (A)

Read-only

Account snapshot — zero LLM cost, no credits charged. Returns which mrmarket.ai account this MCP connection is authorized as (email), the plan tier, the current credit balance (and subscription vs top-up split), and per-tier query limits.

Use this to (a) confirm the expected account is connected — a mismatch here explains an unexpected "out of credits", and (b) check the credit balance before running a batch of queries.

Parameters (JSON Schema)

Name	Required	Description	Default
No parameters

Tool Definition Quality

A (4.5/5.0)

Behavior: 4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already indicate readOnlyHint=true. Description adds valuable context: zero LLM cost, no credits charged, and specifics of returned data (email, plan, credits split, limits). No contradictions.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Three sentences: first states what it is, second lists outputs, third gives usage. No fluff, every sentence contributes.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given zero parameters and no output schema, the description fully covers purpose, output details, and use cases. No gaps.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

No parameters exist, so baseline is 4. Description adds no parameter info but doesn't need to.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool returns an account snapshot including email, plan, credits, and query limits. It uses specific verbs ('Returns', 'snapshot') and distinguishes itself from siblings by being an account-level status check.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly provides two use cases: (a) confirm expected account is connected, (b) check credit balance before batch queries. While it doesn't list exclusions, the guidance is clear and context-aware.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

get_symbols (A)

Read-only

Fast company/sector/industry lookup. Sub-50ms, zero LLM cost. Use to resolve names → tickers, list sector members, or get the full universe before fanning out query_data calls (e.g., for backtests or per-symbol price pulls). Actions: search | list_sectors | list_industries | by_sector | by_industry | all.

Parameters (JSON Schema)

Name	Required	Description	Default
`query`	No
`action`	Yes
`sector`	No
`industry`	No
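
Because the schema leaves all four parameters undescribed, the sketch below shows one plausible way the six listed actions could map onto them. The action names come from the description. The pairing of each action with `query`, `sector`, or `industry` is an assumption inferred from the names, and `get_symbols_args` is a hypothetical helper.

```python
from typing import Optional

# Actions listed in the tool description. Which parameter each one needs is an
# assumption inferred from the names; the schema does not document it.
REQUIRED_PARAM = {
    "search": "query",
    "list_sectors": None,
    "list_industries": None,
    "by_sector": "sector",
    "by_industry": "industry",
    "all": None,
}


def get_symbols_args(action: str, value: Optional[str] = None) -> dict:
    """Build the arguments object for a get_symbols call (hypothetical helper)."""
    if action not in REQUIRED_PARAM:
        raise ValueError(f"unknown action: {action!r}")
    args = {"action": action}
    param = REQUIRED_PARAM[action]
    if param is not None:
        if not value:
            raise ValueError(f"action {action!r} expects a {param}")
        args[param] = value
    return args
```

Under these assumptions, `get_symbols_args("search", "Apple")` resolves a name to tickers and `get_symbols_args("all")` requests the full universe.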

Tool Definition Quality

A (3.9/5.0)

Behavior: 4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations already declare readOnlyHint=true and openWorldHint=false. The description adds behavioral context: sub-50ms speed and zero LLM cost, aiding agent decision-making. No contradiction.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Description is concise and front-loaded with purpose and performance. Action list is presented compactly. Could be slightly more structured (e.g., separating parameter hints) but overall efficient.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

With 4 parameters, no output schema, and no parameter descriptions, the description covers the basic use cases but does not specify the return format or the detailed behavior of each action, leaving some gaps for the agent.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 2/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 0%; the description only lists actions implicitly via enumeration but does not explain parameters like query, sector, industry individually. For example, it's unclear what query expects (a company name?) or how sector/industry relate to actions.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states 'Fast company/sector/industry lookup' and enumerates specific actions (search, list_sectors, etc.). It distinguishes itself from sibling tools like query_data by focusing on symbol resolution.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description advises using this tool before fanning out query_data calls for backtests or per-symbol price pulls, providing clear context. It could explicitly mention when not to use (if tickers already known).

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

getting_started (A)

Read-only

Onboarding tour for mrmarket.ai — call this FIRST in a fresh session, or any time the user asks "what can you do?" / "how does this work?". Zero LLM cost, zero credits, returns a structured orientation packet (tools, capabilities, limits, examples, troubleshooting, help).

Default scope ('overview') covers everything in a short tour. Optional topic deep-dives a single area without re-fetching the whole thing:

tools → tool-by-tool reference for query_data, describe_data, get_symbols, get_account_status, report_issue.
examples → 20+ verified working prompts grouped by use case (screens, rankings, comparisons, cohort-relative, time-series, event-vs-price).
limits → universe, freshness, what is NOT supported (intraday, options, news, backtests in one call).
cost → credit model, which tools are free, how to read credits_remaining.
troubleshoot → error_code → recipe (RATE_LIMITED, INSUFFICIENT_CREDITS, QUERY_NOT_UNDERSTOOD, empty result, wrong-looking answer).
help → links + how to reach support; preferred channel is report_issue.

Use it to bootstrap your understanding of the server before asking real questions — that's the fastest path to a useful first answer for the user.

Parameters (JSON Schema)

Name	Required	Description	Default
`topic`	No	Optional section to deep-dive. Default: "overview" — short tour of everything.

Tool Definition Quality

A (4.7/5.0)

Behavior: 5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Discloses zero LLM cost and zero credits, adds details on return value (structured orientation packet), and explains each topic scope. No contradiction with annotations (readOnlyHint).

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Well-structured with front-loaded key info and bullet list of topics. Slightly long but all content earns its place; could be more concise.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, description fully explains return value and each topic. Completely adequate for the simple parameter structure.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The schema covers the parameter fully (100% coverage), and the description adds more: it states the default behavior and explains each enum option in detail, which makes the parameter more actionable.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly states it is an onboarding tour for mrmarket.ai, with specific verb 'call this FIRST in a fresh session' and distinguishes from siblings by being the orientation tool.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Explicitly says when to call: first in fresh session or on user asking capabilities. Does not explicitly state when not to use or give alternatives, but context is clear.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

list_tickets (A)

Read-only

List the user's own report_issue tickets (newest first). Zero LLM cost, zero credits.

Use this to look up a ticket you (or the user) filed earlier — e.g. to quote the full ticket_id when following up with support, or to check what was already reported before filing a duplicate. Each entry returns:

ticket_id (full UUID — quote this in follow-ups)
category (wrong_data | bug | slow | feature_request | general)
preview (first 200 chars of the report text)
query_id (the query the ticket was filed against, when attached)
created_at_iso

Parameters (JSON Schema)

Name	Required	Description	Default
`limit`	No	How many tickets to return. Default 10, max 50.
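
The duplicate check mentioned above can be sketched as follows. The field names `ticket_id` and `query_id` and the limit bounds (default 10, max 50) come from the description. The helper names, the lower bound of 1, and the idea that the entries arrive as a list of dictionaries are assumptions for illustration.

```python
from typing import Any, Dict, List

DEFAULT_LIMIT = 10  # documented default
MAX_LIMIT = 50      # documented maximum


def list_tickets_args(limit: int = DEFAULT_LIMIT) -> Dict[str, int]:
    """Clamp limit into 1..50 (the lower bound of 1 is an assumption)."""
    return {"limit": max(1, min(limit, MAX_LIMIT))}


def tickets_for_query(tickets: List[Dict[str, Any]], query_id: str) -> List[str]:
    """Return the ticket_ids already filed against query_id, in the given order."""
    return [t["ticket_id"] for t in tickets if t.get("query_id") == query_id]
```

If `tickets_for_query` returns a non-empty list, quoting one of those IDs in a follow-up is likely preferable to filing a new report_issue ticket.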

Tool Definition Quality

A (4.2/5.0)

Behavior: 4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations declare readOnlyHint=true; description adds 'Zero LLM cost, zero credits' and details the return fields, enhancing transparency beyond annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Well-structured: purpose first, then usage guidance, then return field list. Each sentence adds value, though slightly long.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Despite no output schema, the description lists all return fields completely, making it self-contained. No gaps for a simple list tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%; the description does not add new meaning for the limit parameter beyond what the schema provides. Baseline score is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Clearly states the tool lists the user's own report_issue tickets, newest first. Distinguishes from siblings like report_issue and recent_queries.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Provides explicit use cases: looking up a previously filed ticket to quote the ID or check for duplicates. No explicit when-not, but the context is clear.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

query_data (A)

Read-only

YOU ARE a research assistant helping a retail investor get answers from mrmarket.ai. You are NOT a database engineer. Ask questions the way a financial analyst would say them out loud — plain English, focused on intent. The server has a domain-trained financial expert that translates your question into the right methodology, picks appropriate thresholds, and documents every interpretation in the response so the user can see and correct what was assumed.

Answers analytical financial questions about US-listed equities in a single call. Send the full natural-language question — not SQL. Returns structured rows + columns.

CAPABILITIES — all handled in one call:

Top-N / bottom-N rankings by any metric
Multi-criteria stock screens (combine sector, ratios, growth thresholds, insider activity)
Computed financial metrics: ROIC, FCF, D/E, margins, ROE, ROA, dividend yield, growth rates
Derived metrics composed on the fly: any ratio of two fields, growth of any metric, rolling stats on any series — the data catalog lists ingredients, you may mix them
Period-over-period changes: QoQ, YoY, multi-year
Rolling averages, trend slopes, volatility, beta vs a benchmark, correlation, median/percentiles, quartile buckets, max drawdown, statistical measures
Multi-symbol comparisons and time-series trends
Sector/industry rollups and averages
Cohort-relative analysis (vs sector average, vs universe z-score)
Forward returns after events (earnings beats, insider buys)
Price charts with event overlays (earnings dates, insider transactions)
Consecutive-quarter screening (e.g., 4 quarters of growing FCF)

EXAMPLES — notice how these read like a human asking, not a technical specification:

"Top 20 stocks by ROIC excluding financials"
"Companies with 4 consecutive quarters of growing free cash flow"
"Compare AAPL, MSFT, and GOOGL revenue over the last 5 years"
"Stocks whose ROIC is at least 1 standard deviation above their sector average"
"Average 30-day stock return after companies beat earnings by more than 10%"
"AAPL daily closes for the last 5 years with earnings dates overlaid"
"Top 20 quality compounders by 5-yr ROIC stability and margin trend"
"Find undervalued stocks with recent insider buying — low P/E, strong FCF, low debt"
"Average stock return 90 days after large CEO insider purchases"

HOW TO PHRASE YOUR QUESTION — this matters for best results:

Pass the user's question through with minimal rewording. The server's financial expert interprets casual language better than you can translate it:

"large purchase" → appropriate dollar threshold (documented in assumptions[])
"90 days" → trading-day equivalent (documented in assumptions[])
"CEO" → executive title matching
"growing" → positive AND increasing
"cheap" / "undervalued" → appropriate valuation thresholds
"Buffett screen" / "quality compounder" → recognized analytical frameworks

DO:
✓ Preserve the user's intent and language faithfully
✓ Use directional terms: "low P/E", "strong cash flow", "high margins"
✓ Add thresholds ONLY when the user stated them explicitly
✓ Ask for aggregated answers when the user wants a summary ("average return after...")
✓ Combine multi-criteria screens into ONE question, not separate calls

DON'T:
✗ Invent numeric thresholds the user didn't specify; the server picks sensible defaults and surfaces them in assumptions[] so the user can adjust
✗ Specify column lists; the server selects the most relevant columns automatically
✗ Convert calendar days to trading days; the server handles this
✗ Add metrics or time ranges the user didn't request; this adds complexity and risk
✗ Use AND/OR boolean syntax; plain English works better
✗ Prefix with jargon like "Event study:" or "Screen:"; just ask the question

GOOD: "Find undervalued stocks with recent insider buying: low P/E, strong FCF, low debt"
BAD: "Screen for companies where insiders have made open-market stock purchases in the past 3 months AND P/E ratio below 20 AND price-to-book below 3 AND positive free cash flow AND debt-to-equity below 1. Show symbol, name, sector, P/E..."

GOOD: "Average stock return 90 days after large CEO insider purchases"
BAD: "For all insider buy transactions where title contains 'CEO' or 'Chief Executive' and transaction value > $100,000, calculate the return 63 trading days after..."

Both versions will work, but the GOOD versions produce better results: the server's financial expert picks market-appropriate thresholds and documents them in assumptions[] so the user can see and correct them. Your pre-translations hide these from the user.

ONE QUESTION PER CALL: the unit of work (max 100 words, enforced). Each call carries exactly ONE analytical question: one deliverable you could present as a single table or chart. "One question" is NOT "one metric": a screen with five criteria, a three-ticker comparison, or an event study at five horizons is still one question. Questions over 100 words are rejected free of charge (QUESTION_TOO_LONG); overruns are nearly always several questions cobbled together, or an inline ticker dump that belongs in symbols. Be precise, not redundant: no column lists, no restated criteria, no boilerplate.
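
A client can check the 100-word limit before sending. The sketch below assumes words are counted by splitting on whitespace; how the server actually counts is not documented here, so treat the result as an approximation. `check_question` is a hypothetical helper.

```python
MAX_WORDS = 100  # limit stated in the tool description


def word_count(question: str) -> int:
    # Whitespace splitting is an assumption; the server's exact counting
    # rule is not documented.
    return len(question.split())


def check_question(question: str) -> str:
    """Return the question unchanged, or raise if it exceeds the word limit."""
    n = word_count(question)
    if n > MAX_WORDS:
        raise ValueError(f"question has {n} words; the limit is {MAX_WORDS}")
    return question
```

A failed check is usually a cue to split the request into separate questions or to move an inline ticker list into symbols, rather than to trim words.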

Classify the request BEFORE calling:

ATOMIC → one call. The parts share one computation or land in one table. The server joins fundamentals, prices, earnings, and insider data internally, so touching several data categories does NOT mean splitting:
- Multi-symbol comparison ("monthly returns for TSLA and SPY" — one call, not two)
- Multi-metric screens ("high ROIC, strong margins, low debt, consistent earnings")
- Cross-metric formulas ("stocks where margin > 2x sector average")
- Cohort relatives ("ROIC ≥ 1 stddev above sector mean"); sector rollups
- Forward-return event studies — and MULTIPLE HORIZONS IN ONE CALL: "returns at 1, 5, 10, 21, and 63 days after earnings" is ONE call, not five; the server computes all forward windows in parallel.
- Multi-entity retrieval — "ROIC, FCF yield, D/E, 6-month return, and earnings beat rate for every stock" is ONE call across fundamentals + prices + earnings. Fetch in one call; score/rank/normalize in code.
- Price + overlay charts; conditional labels ("classify the drawdown as earnings-driven or multiple-compression")
ORTHOGONAL → independent calls, issued in parallel. The request bundles 2+ questions whose answers don't feed each other and that you'd present as separate tables or sections. Smells: "and also…", "separately…", numbered sub-requests, two different universes, two analytical verbs ("screen for X… and chart Y"), unrelated time windows. "Top 10 by ROIC, and also TSLA's margin trend" = 2 calls. "Full analysis of AAPL" = 3: valuation vs sector / financial trends / insider activity (announce the plan first).
PIPELINE → sequential calls that YOU join, when part B needs part A's output and the join point is SMALL (a symbol list, a date range, a few values). Cut at the narrow point: run A → take its symbols → run B passing them via symbols → merge rows client-side on symbol/date. Screen-then-drill ("screen for X, then pull 5y revenue history for the matches"), backtests with rebalancing, Monte Carlo (pull returns once, simulate in code), portfolio optimization, custom multi-factor scoring with user weights (fetch ALL metrics in one call, weight in code).
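
The screen-then-drill PIPELINE pattern can be sketched as two calls joined on symbol. Several things here are assumptions: `call_tool` is a hypothetical stand-in for the MCP client call, `question` is an assumed parameter name for the natural-language input, and the response is assumed to carry a `rows` list of dictionaries, each with a `symbol` key. Only the `symbols` parameter is named in the description.

```python
from typing import Any, Callable, Dict

# Hypothetical signature: call_tool(tool_name, arguments) -> parsed response dict.
ToolCaller = Callable[[str, Dict[str, Any]], Dict[str, Any]]


def screen_then_drill(call_tool: ToolCaller,
                      screen_question: str,
                      drill_question: str) -> Dict[str, Dict[str, Any]]:
    """Run A, pass its symbols to B, then merge both row sets on symbol."""
    # Step A: run the screen and keep only the narrow join point (the symbols).
    screen_rows = call_tool("query_data", {"question": screen_question}).get("rows", [])
    symbols = sorted({row["symbol"] for row in screen_rows})
    if not symbols:
        return {}  # nothing matched, so skip the second (paid) call
    # Step B: drill into the matches, passing them via symbols.
    drill_rows = call_tool(
        "query_data", {"question": drill_question, "symbols": symbols}
    ).get("rows", [])
    # Client-side merge on symbol.
    merged = {s: {"screen": None, "history": []} for s in symbols}
    for row in screen_rows:
        merged[row["symbol"]]["screen"] = row
    for row in drill_rows:
        if row["symbol"] in merged:
            merged[row["symbol"]]["history"].append(row)
    return merged
```

Returning early on an empty screen avoids spending credits on a drill-down that has no symbols to work with.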

When unsure, try ONE call first — the server is compositional and most cross-entity questions are atomic. If the response carries meta.needs_decomposition: true, retry as parallel calls using meta.suggested_split.
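
The try-once-then-split rule can be sketched with a thread pool for the parallel retry. The `meta.needs_decomposition` and `meta.suggested_split` fields come from the description. The shape of `suggested_split` (assumed here to be a list of question strings), the `question` parameter name, and the `call_tool` callable are assumptions.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Any, Callable, Dict, List

# Hypothetical signature: call_tool(tool_name, arguments) -> parsed response dict.
ToolCaller = Callable[[str, Dict[str, Any]], Dict[str, Any]]


def ask_with_fallback(call_tool: ToolCaller, question: str) -> List[Dict[str, Any]]:
    """Try one call; if the server asks for decomposition, fan out in parallel."""
    first = call_tool("query_data", {"question": question})
    meta = first.get("meta") or {}
    # Assumed shape: suggested_split is a list of sub-question strings.
    sub_questions = meta.get("suggested_split") or []
    if not meta.get("needs_decomposition") or not sub_questions:
        return [first]
    with ThreadPoolExecutor(max_workers=len(sub_questions)) as pool:
        # pool.map preserves the order of sub_questions in its results.
        return list(pool.map(
            lambda q: call_tool("query_data", {"question": q}),
            sub_questions,
        ))
```

Threads suit this case because each call waits on the network. An async client could do the same with its own concurrency primitives.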

COMPUTE IN CODE WHEN YOU CAN. Each query_data call costs credits and can fail. If you already have data from a previous call, compute locally instead of calling again:

- Aggregations (averages, sums, medians, min/max)
- Percentage changes, ratios, growth rates
- Sorting, filtering, grouping, ranking
- Statistical measures (std dev, correlation, z-scores)
- Percentile normalization, composite scoring, factor weighting
- Pairwise correlations, covariance matrices

Only call query_data when you need NEW data you don't already have.
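As an illustration of computing locally, here is a small sketch of z-score normalization and weighted composite ranking over rows already fetched in one call. The metric names, weights, and values are placeholders, and population standard deviation is an arbitrary choice for the example.

```python
from statistics import mean, pstdev


def zscores(values):
    """Standardize a list of numbers; a constant column maps to all zeros."""
    mu = mean(values)
    sigma = pstdev(values)
    if sigma == 0:
        return [0.0 for _ in values]
    return [(v - mu) / sigma for v in values]


def composite_rank(rows, weights):
    """Add a weighted z-score 'score' to each row and sort best-first."""
    columns = {metric: zscores([row[metric] for row in rows]) for metric in weights}
    scored = []
    for i, row in enumerate(rows):
        score = sum(weights[metric] * columns[metric][i] for metric in weights)
        scored.append({**row, "score": score})
    return sorted(scored, key=lambda r: r["score"], reverse=True)


# Placeholder rows standing in for one multi-metric query_data result.
rows = [
    {"symbol": "AAA", "roic": 1.0, "fcf_yield": 1.0},
    {"symbol": "BBB", "roic": 3.0, "fcf_yield": 3.0},
]
ranked = composite_rank(rows, {"roic": 0.5, "fcf_yield": 0.5})
```

Changing the weights and re-ranking costs no credits, because the rows are already in memory.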

ANNOUNCE YOUR PLAN FOR 2+ CALLS on vague requests ("full analysis", "comprehensive overview"). For specific multi-part questions, announce at 3+ calls. Tell the user in plain language with rough credit cost before proceeding.

OUTPUT SIZE: the MCP tool-result ceiling is ~1MB. Quick math:

- 1 month ≈ 21 trading days, 1 year ≈ 252
- Practical ceilings: ~5,000 price rows or ~2,500 fundamental rows
- PREFER narrowing/summarizing first ("per-stock 6mo return" not "all stocks 6mo daily prices", or narrow by sector/time range). A focused question is almost always the better answer.
- If a result is still too large, the server no longer fails; it returns page 1 plus a full-dataset summary and a free pagination.next_cursor. Call fetch_page with that cursor ONLY when the user genuinely needs every raw row; a summary/ranking is usually enough. For very large dumps, hand the user view_url (their private link to the full dataset in the web app) instead of paginating.
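The quick math above can be run before asking. This is a rough sketch using only the figures quoted in this section (21 and 252 trading days, ~5,000 and ~2,500 row ceilings); the function names are hypothetical, and real row sizes vary.

```python
TRADING_DAYS_PER_MONTH = 21
TRADING_DAYS_PER_YEAR = 252
PRICE_ROW_CEILING = 5_000        # approximate, per the description
FUNDAMENTAL_ROW_CEILING = 2_500  # approximate, per the description


def estimate_price_rows(n_symbols, months=0, years=0):
    """Estimate daily price rows: one row per symbol per trading day."""
    days = months * TRADING_DAYS_PER_MONTH + years * TRADING_DAYS_PER_YEAR
    return n_symbols * days


def fits_in_one_result(n_rows, ceiling=PRICE_ROW_CEILING):
    """True if the estimate stays under the practical ceiling."""
    return n_rows <= ceiling


# 500 stocks x 6 months of daily prices: 500 * 126 = 63,000 rows, far too many.
too_big = estimate_price_rows(500, months=6)
# 10 stocks x 1 year of daily prices: 10 * 252 = 2,520 rows, fine.
small = estimate_price_rows(10, years=1)
```

When the estimate is over the ceiling, asking for a per-stock summary instead of raw rows is usually the better question.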

SCOPE CEILING — the engine is a transactional warehouse, not an OLAP cluster. At most ONE of these axes can be unbounded per question: universe (all ~7k stocks), history (15-20 years of daily rows), per-row computation (forward returns, rolling windows, pooled medians). Two or more unbounded axes — "pooled forward returns for every stock-day since 2011", "median daily return across all stocks for all history" — cannot finish in time; the server now stops the run BEFORE executing and returns status: "clarification_needed" with error_code: "SCOPE_TOO_LARGE" + narrowing questions (free of charge), instead of burning a 40s+ TIMEOUT. How to stay under the ceiling:

- Bound the universe: an index, a sector, a market-cap floor ("above $10 billion"), or an explicit list via symbols.
- Bound the history: a date range ("since 2022", "last 3 years"). Event studies (returns after insider buys / earnings events) with NO stated range default to the most recent 5 YEARS of events, disclosed in assumptions[]; state a range explicitly ("since 2010") to widen it.
- Sample instead of exhaustive: "from the first trading day of each month" beats "from every trading day".
- Split benchmark legs: SPY/QQQ series are single-symbol and cheap alone; ask "all stocks vs SPY" as two calls and compare in code.

To insist on full scope anyway, answer the returned questions via clarifications ("Full universe anyway"); the run is then attempted and may still time out.
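The one-unbounded-axis rule can be approximated client-side before spending a call. This is only a heuristic sketch with hypothetical names; the server's own SCOPE_TOO_LARGE check is the authority and may judge a question differently.

```python
def unbounded_axes(universe_bounded, history_bounded, heavy_per_row_computation):
    """List which of the three scope axes a planned question leaves unbounded."""
    axes = []
    if not universe_bounded:
        axes.append("universe")
    if not history_bounded:
        axes.append("history")
    if heavy_per_row_computation:
        axes.append("per-row computation")
    return axes


def likely_scope_too_large(universe_bounded, history_bounded, heavy_per_row_computation):
    """Two or more unbounded axes is the pattern the description warns about."""
    return len(unbounded_axes(universe_bounded, history_bounded, heavy_per_row_computation)) >= 2


# "Pooled forward returns for every stock-day since 2011": all stocks + forward returns.
risky = likely_scope_too_large(
    universe_bounded=False, history_bounded=True, heavy_per_row_computation=True
)
# "Forward returns after earnings for S&P 500 stocks since 2022": only computation is heavy.
ok = likely_scope_too_large(
    universe_bounded=True, history_bounded=True, heavy_per_row_computation=True
)
```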

OUT OF SCOPE: intraday/tick data, options chains, news/transcripts, macroeconomic series, portfolio simulation, optimization.

RESPONSE FORMAT — what to expect back:

data: array of row objects keyed by column name (e.g., [{symbol: "AAPL", revenue: 394328000000}, ...])
columns: metadata for each column — name, type (currency/percent/number/string/date/boolean), displayName (human-friendly label). Use type to format values for the user: currency → "$394.3B", percent → "18.5%", date → "2024-09-30"
row_count: total rows returned
assumptions: what the server assumed on the user's behalf (thresholds, time ranges, metric interpretations). ALWAYS surface these to the user so they can correct them.
caveats: data-accuracy notes worth passing on — approximation, point-in-time vs later-restated figures, survivorship, or FX-conversion. as_of_date is the point-in-time anchor the answer is computed as of; currency_converted: true flags that foreign filings were converted to USD so price-relative ratios stay consistent.
filters_applied: scope provenance — which tables were read and which filters were ACTUALLY applied (per plan: reads, filters, group_by, windows, limit). VERIFY this matches the question you asked: if you see a sector/industry/date filter you did not request, treat the result as wrong and re-ask (and report_issue it).
audit: SQL provenance (success only) — the exact queries behind the answer, written in the PUBLIC data-dictionary vocabulary (the same entity/field names describe_data uses; intermediate stages numbered step_1…step_N, params holds the $1… bind values). Read it to confirm scope, joins, and point-in-time date bounds — e.g. that there is no look-ahead past the as-of date. This is the ground truth when a number looks off.
warnings: diagnostic notes (low credit balance, applied defaults)
credits_remaining / cost_credits: balance after this call / what this call cost
view_url: a PRIVATE mrmarket.ai link to view THIS result as a chart/table in the web app — hand it to the user so they can jump from chat into the visual app and see the full dataset. Only they can open it; to share it with others they publish it from the share view ("Enable public sharing"). Present on success with rows.
truncated + pagination (oversize results only): truncated: true means this is page 1 of a larger result. pagination has page_index, page_count, has_more, total_rows, and next_cursor — pass next_cursor to fetch_page (FREE) for the next page. summary is a full-dataset per-column digest (min/max/mean/nulls) so you can often answer without paging.
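A minimal sketch of using the `columns` metadata to format rows for display, following the examples given for `type`. It assumes percent values arrive already in percentage points (18.5 means 18.5%), which the description does not state, and the helper names are hypothetical.

```python
def format_value(value, col_type):
    """Format one cell according to its column type."""
    if value is None:
        return ""
    if col_type == "currency":
        sign = "-" if value < 0 else ""
        magnitude = abs(value)
        for threshold, suffix in ((1e12, "T"), (1e9, "B"), (1e6, "M"), (1e3, "K")):
            if magnitude >= threshold:
                return f"{sign}${magnitude / threshold:.1f}{suffix}"
        return f"{sign}${magnitude:,.2f}"
    if col_type == "percent":
        # Assumption: the value is already expressed in percentage points.
        return f"{value:.1f}%"
    return str(value)


def format_row(row, columns):
    """Map a data row to {displayName: formatted value} using column metadata."""
    return {
        col["displayName"]: format_value(row.get(col["name"]), col["type"])
        for col in columns
    }


columns = [
    {"name": "symbol", "type": "string", "displayName": "Symbol"},
    {"name": "revenue", "type": "currency", "displayName": "Revenue"},
]
pretty = format_row({"symbol": "AAPL", "revenue": 394328000000}, columns)
# pretty == {"Symbol": "AAPL", "Revenue": "$394.3B"}
```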

On error: error_code + message + retryable flag. Retry once if retryable is true.
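The retry-once rule might look like this in client code. `call` stands for whatever function your MCP client uses to invoke query_data (an assumption here); only the `error_code` and `retryable` fields come from the description.

```python
def call_with_single_retry(call, arguments):
    """Invoke `call(arguments)`; retry exactly once if the error is retryable."""
    response = call(arguments)
    if response.get("error_code") and response.get("retryable"):
        response = call(arguments)
    return response


# Stand-in for a real client call: fails once, then succeeds.
attempts = []


def fake_call(arguments):
    attempts.append(arguments)
    if len(attempts) == 1:
        return {"error_code": "TIMEOUT", "message": "timed out", "retryable": True}
    return {"data": [], "row_count": 0}


result = call_with_single_retry(fake_call, {"question": "AAPL revenue last 4 quarters"})
```

A non-retryable error is returned as-is after the first attempt, so the caller can surface `message` to the user.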

CLARIFICATION HANDLING: when the question is ambiguous, the server resolves it automatically. If your client supports MCP elicitation, the user is prompted directly. Otherwise the server applies a sensible default and proceeds. Either way you get a final answer in one call. Known standing default: event studies with no stated time range cover the past 5 years. Check assumptions[] — always tell the user what was assumed. Exception: SCOPE_TOO_LARGE narrowing questions are NEVER auto-defaulted — a default universe would change which stocks the answer covers. Unless a human answers via elicitation, you get status: "clarification_needed" back (free); narrow the question or answer the questions via clarifications.

LIMITS: ONE question of max 100 words per call (longer is rejected free of charge). 60-second timeout (180s with a clarification roundtrip); plans that cannot finish in time are rejected up front as SCOPE_TOO_LARGE (free — see SCOPE CEILING). No default row cap. Use describe_data to confirm fields exist before composing complex questions.
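A small pre-flight check for the 100-word limit, building the call arguments at the same time. Counting words by whitespace is an assumption (the server's exact counting rule is not documented here), as is `symbols` being a list of ticker strings; `build_query_arguments` is a hypothetical helper.

```python
MAX_QUESTION_WORDS = 100


def build_query_arguments(question, symbols=None, max_rows=None):
    """Validate the question length locally and assemble query_data arguments."""
    n_words = len(question.split())  # assumption: words are whitespace-separated
    if n_words > MAX_QUESTION_WORDS:
        raise ValueError(
            f"question has {n_words} words; split it into separate calls"
        )
    arguments = {"question": question}
    if symbols:
        # Tickers go here, not into the question text.
        arguments["symbols"] = list(symbols)
    if max_rows is not None:
        arguments["max_rows"] = max_rows
    return arguments


args = build_query_arguments(
    "Find every golden cross event in the last 3 years",
    symbols=["AAPL", "MSFT"],
)
```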

Parameters

Name	Required	Description
`symbols`	No	Explicit ticker universe for this question (e.g., a screening batch). STRONGLY preferred over pasting long ticker lists into `question`: the list is applied mechanically as a symbol filter (exact, order-independent), keeps the question readable, and is cheaper. Phrase the question generically ("Find every golden cross event…") and put the tickers here.
`max_rows`	No	Optional caller-side row cap. There is NO default cap — by default the server returns every row the SQL produced. Only pass this when you deliberately want a preview (e.g., max_rows: 10).
`question`	Yes	ONE natural-language financial question, max 100 words (hard limit — longer is rejected as QUESTION_TOO_LONG, free of charge). Not SQL. Be precise, not redundant: metric(s) + universe/filters + time range is enough; the server picks columns and thresholds. Put ticker lists in `symbols`, never in this text. If 100 words feels tight, you are bundling multiple questions — split them into separate calls (see mrmarket://capabilities for the split test).
`visualize`	No	Opt-in charting for the shareable view_url. Default false returns a plain table link (cheapest). Set true when the user wants a chart or stat card: the link is then rendered with the chart type the engine picks from the data shape — a single-row result becomes a "stat" card, a multi-period series a line chart, etc. Does not change the JSON rows you receive, only how the view_url renders.
`clarifications`	No	Answers to a prior clarification_needed response (pass each question verbatim with your chosen answer). Use together with clarification_mode: "return" for a two-step round-trip.
`clarification_mode`	No	"auto" (default): ambiguous questions are resolved via elicitation or sensible server defaults in one call. "return": the server returns status "clarification_needed" with the questions + recommended defaults and does NOT proceed (and does not charge); answer by re-calling query_data with `clarifications`. Use "return" when silent defaults are not acceptable for your user.
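A sketch of the two-step round trip with clarification_mode: "return". The field names `questions`, `question`, and `recommended_default` in the response, and the list-of-dicts shape of `clarifications`, are assumptions made for illustration; check the JSON Schema for the real shapes.

```python
def build_clarification_retry(first_arguments, response, choose=None):
    """Return arguments for the second call, or None if no clarification is needed.

    Assumed response shape: {"status": "clarification_needed",
                             "questions": [{"question": ..., "recommended_default": ...}]}
    """
    if response.get("status") != "clarification_needed":
        return None
    answers = []
    for item in response.get("questions", []):
        # Fall back to the server's recommended default when no chooser is given.
        answer = choose(item) if choose else item.get("recommended_default")
        # The question text is passed back verbatim, as the parameter notes require.
        answers.append({"question": item["question"], "answer": answer})
    retry_arguments = dict(first_arguments)
    retry_arguments["clarifications"] = answers
    return retry_arguments


first = {"question": "Returns after insider buys", "clarification_mode": "return"}
reply = {
    "status": "clarification_needed",
    "questions": [{"question": "Which universe?", "recommended_default": "S&P 500"}],
}
second = build_clarification_retry(first, reply)
```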

Tool Definition Quality

A (4.7/5.0)

Behavior: 5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

The description provides thorough behavioral details beyond annotations, including timeout, scope ceiling, clarification handling, output format, and error handling. It does not contradict the readOnlyHint=true annotation.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness: 3/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is excessively long (over 2000 words) and could be more concise. While it front-loads core purpose and capabilities, the extensive detail on decomposition, examples, and response format makes it verbose.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness: 5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's complexity and the lack of an output schema, the description covers all necessary aspects: usage, parameters, response format, scope limits, error handling, and assumptions. It leaves no significant gaps.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters: 4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Although the input schema already has high coverage (100%), the description adds valuable context for each parameter, such as explaining the purpose of symbols vs question, max_rows as preview, and clarification_mode options.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool's purpose: 'Answers analytical financial questions about US-listed equities in a single call.' It lists specific capabilities and provides examples, distinguishing it from sibling tools like describe_data and fetch_page.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines: 5/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Extensive guidance on when to use this tool vs alternatives, including explicit sections on decomposition (ATOMIC, ORTHOGONAL, PIPELINE), when to compute in code, and out-of-scope items. It also offers best practices for phrasing questions.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

recent_queries (A)

Read-only

List the user's most recent query_data calls (newest first). Zero LLM cost, zero credits.

Use this when the user references "that query I ran earlier" / "the AAPL one from before" / "the slow one" — instead of asking the user to dictate the query_id, look it up here. Each entry returns:

query_id (the mcp_<timestamp>_<n> handle; pass directly to report_issue or quote to support)
status (success | clarification | timeout | error)
error_code (null on success; the stable error taxonomy otherwise)
row_count (rows returned on success)
prompt_preview (first 120 chars of the original question)
when (relative — "2m ago" / "1h ago" / "3d ago")
created_at_iso (raw ISO timestamp for precise ordering)
execution_time_ms

limit defaults to 10, max 50. status filters: "all" (default), "success", or "error" (includes timeouts).

Parameters

Name	Required	Description	Default
`limit`	No	How many recent queries to return. Default 10, max 50.	10
`status`	No	Filter by outcome. "error" includes timeouts. Default "all".	"all"
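Resolving a vague reference such as "the AAPL one from before" or "the slow one" against recent_queries entries could look like this. The entries are placeholders using only the fields listed above; the helper names are hypothetical.

```python
def find_by_keyword(entries, keyword):
    """Return the query_id of the newest entry whose prompt mentions the keyword."""
    for entry in entries:  # entries arrive newest first
        if keyword.lower() in entry["prompt_preview"].lower():
            return entry["query_id"]
    return None


def find_slowest(entries):
    """Return the query_id of the entry with the longest execution time."""
    if not entries:
        return None
    return max(entries, key=lambda e: e["execution_time_ms"])["query_id"]


# Placeholder entries; real query_ids follow the mcp_<timestamp>_<n> format.
entries = [
    {"query_id": "mcp_0002_1", "status": "success",
     "prompt_preview": "TSLA margin trend", "execution_time_ms": 900},
    {"query_id": "mcp_0001_1", "status": "timeout",
     "prompt_preview": "AAPL valuation vs sector", "execution_time_ms": 60000},
]
aapl_id = find_by_keyword(entries, "aapl")
slow_id = find_slowest(entries)
```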

Tool Definition Quality

A (4.5/5.0)

Behavior: 4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

Annotations provide readOnlyHint=true, confirming safe read. Description adds behavioral info: zero LLM cost, zero credits, returns specific fields, defaults, max limit. No contradictions.

Conciseness: 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Well-structured: front-loaded with purpose, then usage hints, then return field details, then parameter details. Every sentence adds value, no wasted words.

Completeness: 5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Covers return fields, parameter details, and usage context. No output schema exists, but description compensates fully. Complete for a list tool.

Parameters: 4/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, baseline 3. Description adds value: 'limit defaults to 10, max 50', 'status filters: "all" (default), "success", or "error" (includes timeouts)'. This goes beyond schema.

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

Description clearly states it lists the user's most recent query_data calls, newest first. It distinguishes from siblings by explaining it's for retrieving past queries, not executing new ones.

Usage Guidelines: 4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

Provides explicit usage scenarios: when user references a past query, 'that query I ran earlier', etc. It doesn't explicitly state when not to use, but the context is clear.

report_issue (A)

Tell support something went wrong, file a bug, request a feature, or flag a slow query. Zero LLM cost, zero credits.

Call this when:

A query_data result looks wrong or misleading (pass the query_id from that response — support uses it to investigate the issue).
You hit a confusing error or the same query keeps failing.
A query took unreasonably long (still pass query_id if available).
You wish mrmarket.ai supported something it currently doesn't.

This is the PREFERRED support channel because it auto-links to the query trace; email is slower and requires the user to copy/paste the query_id manually.

Categories:

wrong_data → result was incorrect / numbers look off
bug → something crashed or the response was malformed
slow → query took too long or timed out
feature_request → "I wish it could do X"
general → catch-all if none of the above fits

Parameters

Name	Required	Description
`category`	No	Optional. Defaults to "general" if omitted.
`query_id`	No	Optional but strongly recommended for query-specific reports. The `query_id` from the `query_data` response you want support to look at (format: `mcp_<timestamp>_<n>`). With it, the server auto-attaches the run's status, error_code, row_count, execution_time, and the original question snippet to the ticket — the user does not need to dictate any of that. If you don't have the query_id at hand, call `recent_queries` first to look it up.
`description`	Yes	Required. Short free-text description of what went wrong or what you want. One or two sentences is plenty — support reads every report. Max 4000 chars.
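A minimal sketch of assembling report_issue arguments under the constraints listed above (five categories, "general" as default, 4000-character description). `build_report` is a hypothetical helper, and truncating an over-long description rather than rejecting it is a choice made for this example.

```python
VALID_CATEGORIES = {"wrong_data", "bug", "slow", "feature_request", "general"}
MAX_DESCRIPTION_CHARS = 4000


def build_report(description, category="general", query_id=None):
    """Validate inputs and return the argument dict for report_issue."""
    if not description.strip():
        raise ValueError("description is required")
    if category not in VALID_CATEGORIES:
        raise ValueError(f"unknown category: {category}")
    arguments = {
        "category": category,
        # Truncation is this sketch's choice; the limit itself is from the table.
        "description": description[:MAX_DESCRIPTION_CHARS],
    }
    if query_id:
        # With a query_id, the server attaches the run's details to the ticket.
        arguments["query_id"] = query_id
    return arguments


report = build_report(
    "Result included a sector filter I did not ask for.",
    category="wrong_data",
    query_id="mcp_0001_1",  # placeholder in the mcp_<timestamp>_<n> format
)
```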

Tool Definition Quality

A (4.8/5.0)

Behavior: 5/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

The description discloses key behavioral traits: zero LLM cost/credits, auto-linking to query traces, and category meanings. Annotations indicate non-read-only and non-destructive, which aligns with the description. No contradictions.

Conciseness: 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is well-structured with bullet points and clear sections. Every sentence adds value, and it is concise without being overly brief.

Completeness: 5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Despite lacking an output schema, the description explains what happens with the report (support reads, attaches query info) and covers all necessary behavioral aspects. It is complete and informative.

Parameters: 5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100%, and the description adds extra context beyond the schema, such as the auto-attach behavior of `query_id`, default category, and character limit for `description`. This provides meaningful guidance for parameter usage.

Purpose: 5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool is for reporting issues, bugs, feature requests, and slow queries. It uses specific verbs and resources ("Tell support something went wrong") and distinguishes itself from sibling tools like `query_data` and `recent_queries`.

Usage Guidelines: 4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides explicit when-to-use scenarios (e.g., wrong query result, confusing error, slow query) and category examples. It mentions it's the preferred channel over email, but does not explicitly list when not to use it, though the context is clear enough.

Claim this connector by publishing a /.well-known/glama.json file on your server's domain with the following structure:

{
  "$schema": "https://glama.ai/mcp/schemas/connector.json",
  "maintainers": [{ "email": "your-email@example.com" }]
}

The email address must match the email associated with your Glama account. Once published, Glama will automatically detect and verify the file within a few minutes.
