soi-mcp
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@soi-mcpWhat is the income distribution for ZIP 90210?"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
soi-mcp
IRS income & tax statistics by ZIP code, inside your agent. An MCP
server that lets an LLM pull the income distribution, tax, credits, and deductions of any U.S. ZIP
straight from the IRS Statistics of Income (SOI) — built on Anthropic's official
mcp Python SDK.
All tools are read-only and the data is public domain (a U.S. government work) — no API key required. The dataset is downloaded once into a local SQLite store and served offline.
Status: published —
uvx mcpwright-soi(PyPI) and listed in the official MCP Registry asio.github.mcpwright/soi-mcp. 10 tools, working today (see below). The IRS SOI ZIP release lags ~2–3 years; the latest available year (currently Tax Year 2022) loads by default, and older years are onerefresh <year>away. See the roadmap for what's next.
Tools
Tool | What it does |
| Confirm a ZIP has SOI data → state, number of returns, number of individuals, tax year. A good first call. |
| Adjusted gross income (AGI), average AGI per return, and income components: salaries/wages, taxable interest, ordinary dividends, business net income, net capital gain. |
| The distinctive one. The ZIP's returns and AGI split across the six IRS AGI brackets (<$25k, $25–50k, $50–75k, $75–100k, $100–200k, $200k+), with each bracket's share — the income shape of a ZIP, not just an average. |
| Income tax, income tax before credits, total tax liability (broader — includes self-employment tax, etc.), total tax payments, and average total tax per return. |
| EITC take-up (overall and split by number of qualifying children: none / one / two / three or more) and the additional (refundable) child tax credit. |
| Standard vs. itemized deductions (count and amount), the taxes-paid (SALT) deduction, and the percent of returns that itemized. |
| Single / married-filing-jointly / head-of-household return counts, elderly returns (age 65+), and the count and share of electronically filed returns. |
| Rank several ZIPs by one metric (e.g. |
| A whole state's totals and AGI-bracket mix (returns, individuals, AGI, average AGI per return, income tax, total tax liability), from the IRS state rollup. Accepts |
| Escape hatch: the raw value of one SOI field code (e.g. |
All dollar amounts are returned in whole USD (the source reports thousands). Counts are numbers of returns, rounded by the IRS to the nearest 10.
Related MCP server: mcp-census
Install
Requires Python 3.12+. The zero-clone way to run it (the PyPI package is mcpwright-soi; the
command, server, and tools are all "soi"):
uvx mcpwright-soiThe first tool call downloads the latest SOI ZIP file (~200 MB) into a local SQLite store under your OS cache directory and serves everything offline thereafter. To pre-load (or to pick a specific tax year) without waiting for the first query:
uvx mcpwright-soi setup # download the latest available year
uvx mcpwright-soi refresh 2021 # re-pull a specific older year for comparisonClaude Code
claude mcp add soi -- uvx mcpwright-soiClaude Desktop
Add to claude_desktop_config.json:
{
"mcpServers": {
"soi": { "command": "uvx", "args": ["mcpwright-soi"] }
}
}OpenAI Agents SDK (Python)
It's a standard MCP server, so it works with any MCP-capable client — not just Claude. With the OpenAI Agents SDK:
from agents import Agent, Runner
from agents.mcp import MCPServerStdio
async def main():
async with MCPServerStdio(
name="soi",
params={"command": "uvx", "args": ["mcpwright-soi"]},
) as soi:
agent = Agent(
name="Analyst",
instructions="Use the SOI tools for IRS income and tax data by ZIP.",
mcp_servers=[soi],
)
result = await Runner.run(
agent, "What's the income distribution of ZIP 90210 vs 10001?"
)
print(result.final_output)Any other MCP client (Cursor, VS Code, Cline, Goose, Zed, …)
They all launch a stdio MCP server the same way — point yours at:
{
"mcpServers": {
"soi": { "command": "uvx", "args": ["mcpwright-soi"] }
}
}Hosted chat connectors (e.g. ChatGPT connectors) expect a remote MCP server over Streamable HTTP;
mcpwright-soiruns locally over stdio.
Storage: the dataset lives in a SQLite file under your OS cache dir (override with the
SOI_MCP_STOREenv var). Delete it any time;setup/refreshrebuilds it.
A note on suppression: the IRS excludes ZIPs with fewer than 100 returns (folding them into a "99999" bucket) and suppresses line items with fewer than 20 returns. Summed ZIP totals can therefore slightly understate reality and won't exactly equal the state total. All figures are aggregates of filed returns, not a population census.
Develop
git clone https://github.com/mcpwright/soi-mcp && cd soi-mcp
uv sync
uv run pytest # tests (mocked download + seeded SQLite)
uv run ruff check src/ && uv run ruff format --check src/ # lint + format
uv run mypy # strict type checking
uv run mcp dev src/soi_mcp/server.py # poke the tools in the MCP InspectorRoadmap
lookup_zip/get_income/get_agi_distribution— the income backboneget_tax/get_credits/get_deductions/get_filing_status— the tax sidecompare_zips— rank ZIPs by a metricget_state_totals— state rollups from the IRS 00000 rowget_soi_field— raw-field escape hatchsetup/refresh [year]— download once, re-pull or pick an older tax yearPublish to PyPI (
mcpwright-soi) + the official MCP Registry (io.github.mcpwright/soi-mcp)Multi-year queries in one call (trend a ZIP across tax years)
Privacy
soi-mcp runs entirely on your machine. It collects, stores, or transmits no personal data
— no accounts, no tracking, no telemetry. Its only outbound requests go to the U.S. IRS static
file host (www.irs.gov/pub/irs-soi) to download the public SOI ZIP-code CSV; no API key is
needed and nothing about your queries leaves your machine. The downloaded dataset is cached on
disk as a local SQLite file (under your OS cache dir, or SOI_MCP_STORE); delete it any time.
Full policy: https://mcpwright.com/privacy/
Questions & feedback
Questions, ideas, or "could it do X?" → Discussions
Bugs & concrete feature requests → Issues
Contributions welcome — and if you build something with it, I'd love to hear about it.
Part of mcpwright · built by Devender Gollapally
This server cannot be installed
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
- Your AI Chatbot Just Exposed Your CEO's Salary to an InternBy Om-Shree-0709 on .Agent IdentityMCP SecurityOAuth Delegation
- Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)By Om-Shree-0709 on .Agentic AiPrompt InjectionWebAssembly
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/mcpwright/soi-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server