mcp-bigquery-evals
Provides read-only access to Google BigQuery, allowing AI agents to discover datasets and tables, describe schemas, sample data, search schemas, estimate query costs, and execute SQL queries with mandatory cost guardrails.
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@mcp-bigquery-evalsshow me the top 10 stack overflow questions tagged 'python' by view count"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
mcp-bigquery-evals
The BigQuery MCP server with mandatory cost guardrails and a measurable accuracy number.
uvx mcp-bigquery-evals · works with any MCP-compatible client · v0.1.0
Why use this over the other BigQuery MCPs
Most BQ MCPs |
| |
Cost guardrails | none | mandatory dry-run before every query, refuses if over cap |
Quality signal | "trust me" | live accuracy badge, recomputed every release |
Write operations | usually enabled | disabled by design (read-only) |
Errors when things break | raw API exceptions | 7 stable error codes an agent can switch on |
Local dev without GCP | impossible | in-memory sqlite-backed fake ships in the box |
What ships in the box
7 read-only MCP tools for warehouse discovery and querying
Mandatory dry-run cost cap on every
run_query(default 100 MB scanned, about $0.0005 per query)Result-set-equivalence eval harness (Spider/BIRD methodology) with a live accuracy badge in this README
Structured BigQuery errors with 7 stable codes (
invalid_sql,table_not_found,permission_denied,unauthenticated,rate_limited,query_timeout,unknown)Two BigQueryClient implementations:
RealBigQueryClient(production, wrapsgoogle-cloud-bigquery) andFakeBigQueryClient(in-memory, sqlite-backed, for dev and CI without GCP credentials)
Quickstart (5 minutes)
1. Install
uvx mcp-bigquery-evals --helpFirst run takes about 30s while uv fetches dependencies; subsequent runs are instant from the local cache. Plain pip install mcp-bigquery-evals also works.
2. Authenticate to GCP
gcloud auth application-default login3. Wire into your MCP client
Open your MCP client's server config (developer settings) and add:
{
"mcpServers": {
"bigquery": {
"command": "uvx",
"args": ["mcp-bigquery-evals", "serve"],
"env": {
"BIGQUERY_PROJECT": "YOUR_GCP_PROJECT_ID_HERE"
}
}
}
}Restart your client. The MCP indicator should show "bigquery" with 7 tools.
4. Try it
Using the bigquery tool, find the top 5 most-viewed Stack Overflow questions tagged 'python'.
The agent chains list_datasets, list_tables, describe_table, run_query to answer. Every run_query is dry-run-cost-capped before execution.
Detailed setup, troubleshooting, and the alternative pip install path live in docs/mcp_client_setup.md.
The 7 tools
Tool | Purpose |
| List all datasets in your GCP project |
| List tables in a dataset |
| Schema, row count, size |
| Up to n sample rows |
| Fuzzy-match a term against all column names |
| Free dry-run; returns bytes_scanned and estimated USD |
| Dry-run, refuse if over cap, then execute |
All tools are read-only. There are no write operations in v1 by design. See docs/architecture.md for the design rationale.
Cost guardrails
Every run_query call dry-runs first (free) before execution. If the dry-run estimate exceeds max_bytes_scanned, the call returns a structured error rather than burning bytes:
{
"error": "cost_cap_exceeded",
"would_scan": "1.4 GB",
"cap": "100.0 MB",
"estimated_usd": 0.007,
"hint": "narrow your WHERE clause or pass max_bytes_scanned=1500000000 to override"
}The agent reads the structured error and self-corrects (narrows the WHERE clause, raises the cap explicitly, picks a different table).
Eval harness
Every release runs a result-set-equivalence eval suite against bigquery-public-data and updates the accuracy badge above. The methodology matches Spider and BIRD academic benchmarks: execute both gold and predicted SQL, compare result sets as multisets of rows (order-independent, with float tolerance, Decimal handling, NULL equality, NaN equality, ARRAY/STRUCT recursion, bool/int distinction).
Run locally:
mcp-bigquery-evals evals run --model <your-model-id>Full methodology, golden-pairs YAML format, and how to add your own pairs: docs/how_evals_work.md.
Development
git clone https://github.com/Umarfarook1/mcp-bigquery-evals
cd mcp-bigquery-evals
python -m venv .venv && source .venv/bin/activate # Windows: .venv\Scripts\activate
pip install -e ".[dev]"
pytest # unit tests (no GCP needed; ~160 tests)
pytest -m bq # real-BQ integration tests (needs GCP creds)
pytest -m live # end-to-end with real model + real BQContributing
Issues and PRs welcome. Highest-leverage contributions:
More verified golden NL-to-SQL pairs against
bigquery-public-dataPrompt improvements with before/after eval numbers showing the accuracy badge moved
Bug reports with minimum reproductions
License
MIT, see LICENSE.
This server cannot be installed
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/Umarfarook1/mcp-bigquery-evals'
If you have feedback or need assistance with the MCP directory API, please join our Discord server