estonian-mcp
Click on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@estonian-mcpCheck spelling of 'Tere hommikust'"
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
estonian-mcp
Claude is quite bad at Estonian, so this MCP is here to fix that. Give it a shot.
A small Model Context Protocol server that exposes EstNLTK — the Estonian NLP toolkit — as tools any LLM client can call in real time. Hand it Estonian text, get back correct lemmas, morphology, POS tags, spell-check + suggestions, syllables, named entities, WordNet synonyms, fastText-based related words, and a register hint (formal vs colloquial).
If your AI agent has to draft, edit, or proofread Estonian, this wires in ground truth so it stops guessing on the mechanical layer (spelling, case forms, conjugation) and gives it real Estonian synonyms instead of inventing them.
Benchmark: on TalTech's
inflection_etgold dataset (a noun-phrase inflection benchmark; Lillepalu & Alumäe, arXiv:2510.21193), our morphology engine scores 96.5% first-candidate / 99.1% any-candidate over 1,400 items. Reproduce:uv run python scripts/eval_inflection.py. (We're a tool server, not a rankable LLM — this scores our tools against published gold data.)
Three ways to use it:
👉 Paste a URL into your Claude app — the easiest path, no terminal, no install. See Get started in 30 seconds below.
One-click on Smithery — install from the estonian-mcp listing.
Self-host — clone, run locally as stdio, or deploy your own container to Fly.io / any host. See Self-host (advanced).
What it does
Tool | What it does |
| Split text into sentences and words |
| Lemma, POS, form, root, ending, clitic, compound parts, ambiguity count, and usage flags (archaic / foreign / interjection / abbreviation / proper-noun) per word |
| Full Vabamorf-generated inflection paradigm — 14 cases × 2 numbers for nominals, ~30 verb forms — with Estonian labels per form |
| Just the dictionary form per word |
| Just the part-of-speech tag per word |
| Spelling check + correction suggestions |
| Syllables with quantity + accent |
| People / places / organisations |
| Synsets from Estonian WordNet — synonymous lemmas + definition + examples per word sense |
| Top-N semantically nearby words via fastText embeddings (semantically related, not always synonymous) |
| Coarse formal/colloquial register hint with matched markers + consistency flag for register-mixed text (heuristic, phase 1) |
| Style metrics — lemma-aware repetition, passive-voice ratio, sentence-length variance, hedging-word density |
| Pleonasm check — flags semantic doubling like |
| Käändeõpetus — flags direct-object case errors under negation and after partitive-only verbs (armastama, vihkama, vajama, …) |
| Lühendiortograafia — flags abbreviations with case endings missing the EKI-mandated hyphen ( |
| Calque-risk diagnostic — for each compound noun, returns top fastText neighbours and flags compounds with weak similarity ( |
| Algustäheortograafia check — flags wrongly capitalized weekdays, months, nationalities, and language/culture adjectives per EKI's Reeglid |
| Liitsõnaõigekiri — flags common AI-generated splits of words that should be a single compound ( |
| Kirjavahemärgid — flags missing commas before subordinating conjunctions ( |
| Poolitamine — safe line-break positions for an Estonian word, syllable-boundary based with no-orphan-edge rule |
| Numbrite õigekirjutus — flags decimal separators ( |
POS tag set: S=noun, V=verb, A=adj, P=pron, D=adv, K=adp,
J=conj, N=numeral, I=interj, Y=abbrev, X=foreign, Z=punct.
Related MCP server: mhlabs-mcp-tools
✨ Get started in 30 seconds (no install)
This section is for everyone — including if you've never opened a terminal in your life. You'll be done before your tea is steeped.
The trick is that we run the server for you on the public internet at
https://estonian-mcp.fly.dev/mcp. You just need to tell your Claude
app to talk to it. Pick the app you use:
In Claude Cowork
Open Cowork and click your profile / Settings.
Find Connectors in the sidebar.
Click Add custom connector.
Paste this URL into the URL field:
https://estonian-mcp.fly.dev/mcpLeave any "Authentication" / "API key" / "Bearer token" fields empty. The server is public — no token needed.
Click Save / Connect.
Done. Start a new chat and write in Estonian — proofread an email, study a paragraph, draft a reply. Claude will reach for the EstNLTK tools whenever it needs to verify spelling, lemmas, or morphology rather than guessing.
In claude.ai (web Claude)
Click your profile in the bottom-left → Settings.
Find Connectors (sometimes called Custom Integrations).
Click Add custom connector.
Paste:
https://estonian-mcp.fly.dev/mcpAuthentication: none (leave fields blank).
Save. The new tools appear in your tool tray.
In Claude Desktop
If your Claude Desktop has a Settings → Connectors menu (newer versions), follow the same three steps as Cowork above.
If it doesn't, you have an older Desktop that needs a JSON config file edit — see Self-host (advanced) for the local-stdio path, which works on every version.
In Claude Code (CLI)
One command — no clone, no Python, no uv. Point Claude Code at the
hosted server over HTTP:
claude mcp add --transport http estnltk https://estonian-mcp.fly.dev/mcpThen run /mcp inside a session to confirm estnltk shows as
connected. The tools are live immediately — ask Claude to proofread
or lemmatize Estonian text and it'll reach for them.
Want a fully local, zero-network setup instead? See the stdio path in Self-host (advanced).
Don't see your client here?
Any tool that supports MCP over HTTPS can connect — just point it at
https://estonian-mcp.fly.dev/mcp with no auth. If your client only
speaks stdio (Cursor, VS Code MCP, Continue, Zed), jump to the
local-install path in Self-host.
💡 Pro tip — teach Claude your Estonian alongside the MCP
This MCP gives Claude correct linguistics: real lemmas, real case forms, real spelling. What it can't do is teach Claude your voice — the register, idioms, and tone you actually want when writing.
You handle the voice; the MCP handles the correctness. Layer them.
A few things to add to your Claude project / custom instructions / system prompt to get this right:
Set the register. "Always reply in formal officialese Estonian for legal and government topics, and in conversational Tallinn speech for chat replies. Never mix the two in one message."
Pin the dialect / region. "I'm from Tartu — prefer southern Estonian phrasings where there's a choice (e.g. 'kus sa lähed' rather than 'kuhu sa lähed' for casual speech)."
Show your tone with examples. Paste 3–4 short paragraphs of your own writing into the project instructions and ask Claude to match that voice. Real examples beat any abstract description.
Anchor common mistakes. "You always confuse
kasutama(to use) withkäsitlema(to handle / to deal with). Double-check those with the lemmatize tool before sending."Direct the MCP explicitly when it matters. "Before sending any Estonian email, run spell_check on every word. Show me misspelled words with suggestions before drafting."
Use
classify_registeras a sanity check. "After drafting, run classify_register on the final text and warn me if it lands in 'formal' or 'colloquial' when I asked for the opposite." The classifier is coarse but reliably catches drift into officialese (käesolev,vastavalt,sätestama) or slang (mõnus,vinge,kuule).Use
synonymsto break repetition. "This newsletter useskasutamafour times. Look up synonyms via the MCP and suggest natural-sounding swaps." You'll get real Estonian alternatives with definitions, not invented ones.Use
find_related_wordsfor richer rewrites. "What words pattern withkohvin Estonian? Use that to suggest three alternative phrasings for our café-launch ad copy." This is fastText-based, so it surfaces near-neighbours that aren't strict synonyms — useful when you want adjacent concepts, not just same-meaning swaps. (Quick rule of thumb:synonymsfor "say the same thing differently";find_related_wordsfor "what else belongs in this conceptual space.")
The MCP catches misspelled words and invented case forms; your prompt drives the style. Together they make Claude actually useful for writing in Estonian, not just plausible-looking.
How to prompt it once it's connected
Most prompts don't need to mention the tools by name — Claude picks the right one. A few patterns that work especially well:
Proofread this Estonian email and use spell_check on any words
you're unsure about: <text>Lemmatize this Estonian paragraph, then translate the lemmas to
English so I can study vocabulary: <text>Analyze the morphology of this sentence and explain the case
markings: "Tallinnas elavad eestlased räägivad eesti keelt."Extract the people and places from this Estonian news article,
then summarise in one paragraph.This Estonian draft uses "kasutama" three times — look up synonyms
via the MCP and rewrite each occurrence with a natural-sounding
alternative that preserves the meaning.Classify the register of this draft. If it scores formal, soften
it for a casual newsletter audience. If it scores colloquial,
tighten it for a B2B email.The model calls the tool, gets authoritative output, and bases its response on that — no more hallucinated lemmas or invented case forms.
All clients at a glance
Client | No-install path | Local-install path |
Claude Cowork | ✅ Paste URL | ✅ stdio via JSON |
Claude Desktop | ✅ Paste URL (newer) | ✅ stdio via JSON |
claude.ai web | ✅ Paste URL | — |
Claude Code (CLI) | ✅ | ✅ |
Cursor | — | ✅ stdio via JSON |
VS Code MCP / Continue / Zed | — | ✅ stdio via JSON |
"No-install path" = paste https://estonian-mcp.fly.dev/mcp in the
client's Connectors UI. "Local-install path" = clone the repo and
point the client at python server.py.
Reducing permission prompts
Claude clients ask for confirmation before calling a tool from a custom/third-party connector — that's the client's security default, not something the server controls (there's no MCP field a server can send to suppress it). You'll especially see it right after adding or updating the connector, since the client re-checks tools it hasn't seen before.
Good news: all 21 tools are marked readOnlyHint: true (they only
read text, never write or call out), so any well-behaved client can
safely let you allow them once and stop asking:
Claude Desktop / Cowork / claude.ai — when the prompt appears, choose "Always allow" for the connector (or toggle it in the connector's settings). One time, then it's quiet.
Claude Code — run
/permissionsand allow the estonian-mcp tools, or allow the whole server at once.
Re-releasing or updating the connector can reset that "always allow" state (the client sees changed tools and re-asks) — just allow it again. A verified listing in the Anthropic Connectors Directory also gets smoother permission UX than an unverified custom connector.
Self-host (advanced)
The hosted instance is convenient, but if you'd rather run your own (privacy, latency, custom auth, offline use), the same one-file server works locally and as a container.
Run locally as stdio (zero network)
EstNLTK requires Python 3.10–3.13.
git clone https://github.com/silly-geese/estonian-mcp.git
cd estonian-mcp
uv sync
uv run python tests/test_smoke.py # verifyThen wire it into your client.
Claude Code:
claude mcp add estnltk -- /absolute/path/to/uv \
--directory /absolute/path/to/estonian-mcp \
run python server.pyClaude Desktop / Cowork (local mode) — edit
~/Library/Application Support/Claude/claude_desktop_config.json:
{
"mcpServers": {
"estnltk": {
"command": "/absolute/path/to/uv",
"args": [
"--directory", "/absolute/path/to/estonian-mcp",
"run", "python", "server.py"
]
}
}
}Cursor — same JSON shape in ~/.cursor/mcp.json.
Run as a remote server (HTTP)
The same server.py speaks streamable-http over the network.
Two auth postures:
Public mode (
ESTNLTK_MCP_PUBLIC_MODE=1) — no bearer token, per-IP rate limit (default 120/min). This is how the silly-geese hosted instance runs.Bearer mode (default) — every request must carry
Authorization: Bearer <token>(or Smithery's?config=<base64>); per-token rate limit. Refuses to start withoutESTNLTK_MCP_AUTH_TOKEN≥16 chars.
Fly.io public deployment (matches silly-geese):
fly auth login
fly apps create my-estonian-mcp
# one-time: persistent volume for /metrics counters (~$0.15/month)
fly volumes create estonian_mcp_data --size 1 --region ams -a my-estonian-mcp
fly deployfly.toml already sets ESTNLTK_MCP_PUBLIC_MODE=1 and mounts the
volume at /data, so no token needed and /metrics counters survive
machine restarts. Endpoint: https://my-estonian-mcp.fly.dev/mcp.
Fly.io with bearer auth — remove
ESTNLTK_MCP_PUBLIC_MODE from fly.toml's [env] block, then:
fly secrets set ESTNLTK_MCP_AUTH_TOKEN="$(python3 -c 'import secrets;print(secrets.token_urlsafe(32))')"
fly deployGeneric Docker (any container host):
# Public
docker run -p 8081:8081 -e ESTNLTK_MCP_PUBLIC_MODE=1 \
ghcr.io/silly-geese/estonian-mcp # or build from source
# Bearer
docker run -p 8081:8081 \
-e ESTNLTK_MCP_AUTH_TOKEN="$(python3 -c 'import secrets;print(secrets.token_urlsafe(32))')" \
ghcr.io/silly-geese/estonian-mcpSmithery auto-builds from smithery.yaml and hosts the image
for you. Fork, connect on Smithery,
deploy. The shipped configSchema is empty (one-click install)
because the deployment runs in public mode; flip it back if you fork
to a bearer-mode setup.
Security
stdio mode: pure local subprocess. No network egress, no shell exec, no fs writes, no telemetry.
HTTP / public mode: no auth required (intentional for the free public service). Per-IP rate limit (300/min default). Same hardening as bearer mode: no shell exec, no fs writes, no telemetry, size-bounded inputs.
HTTP / bearer mode:
ESTNLTK_MCP_AUTH_TOKEN(≥16 chars) required, server refuses to start without it. Bearer auth on every request, constant-time comparison, per-token rate limit (120/min).Common to all HTTP:
/healthis the only unauthenticated path. No request or token logging.proxy_headers=Trueso client IPs reflect the originator, not the platform's edge.Inputs: 100 KB cap per text tool, 200 chars for
syllabify. Oversized inputs return a structured error rather than hanging.Supply chain: deps pinned + hashed in
uv.lock. Dependabot watches pip + GitHub Actions weekly. CI runs smoke + HTTP tests + Docker build/boot on Python 3.11 and 3.13 on every push.
Full threat model and disclosure path: SECURITY.md. Privacy policy (what we receive, what we don't store): PRIVACY.md. Terms of service for the hosted endpoint: TERMS.md.
Notes
Most EstNLTK models (morph, NER, spell-check) ship inside the wheel — no runtime downloads.
WordNet is a separate ~26 MB resource (used by
synonyms); the Docker image pre-downloads it at build time so the first call doesn't pause to fetch it.The fastText model used by
find_related_wordsandcheck_compound_familiarityis a ~33 MB compressed resource with a 100K-word vocabulary (built locally from Facebook's cc.et.300 via compress-fasttext, CC-BY-SA-3.0; see NOTICE); pre-downloaded at image-build time.Heavy neural taggers (
estnltk_neural, BERT-based NER) are intentionally not pulled in; this server stays lean and fast.First call after server start incurs a one-time tag-layer load (~1–2 s). Subsequent calls are millisecond-scale.
The hosted Fly instance scales to zero when idle; the first request after a quiet period takes ~5 s, then everything is fast again.
🤝 Contributing
Contributions are welcome — especially from Estonian speakers who can sharpen the linguistic rules. Here's how to get started:
Fork the repo and clone your fork.
Set up the environment (Python 3.10–3.13):
uv sync # the fastText-backed tools need the embedding model for tests: curl -fsSL -o ~/.cache/estnltk-mcp/fasttext-et-medium --create-dirs \ "https://github.com/silly-geese/estonian-mcp/releases/download/v0.1.0-models/fasttext-et-medium" export ESTNLTK_MCP_FASTTEXT_PATH=~/.cache/estnltk-mcp/fasttext-et-mediumCreate a feature branch (
git checkout -b feature/my-feature).Run the tests — both must pass:
uv run python tests/test_smoke.py # tool behaviour uv run python tests/test_http.py # transport, auth, /metricsCommit and open a pull request against
master. CI (smoke on Python 3.11 + 3.13, plus a Docker build/boot check) must be green before merge.
Please open an issue first for major changes so we can discuss the approach before you invest the work.
Especially wanted: linguistic corrections
The heuristic tools lean on small hand-curated lexicons in
server.py — marked/archaic words, register markers,
compound-split pairs, partitive-governing verbs, and the EKI
orthography rule sets. These are deliberately conservative and
incomplete. If you're a fluent Estonian speaker and spot a gap or a
wrong entry, that's the highest-value contribution you can make:
A missing calque AI agents produce, with the idiomatic native form
A verb that governs the partitive but isn't in the list
A compound that should (or shouldn't) be flagged
A register marker that's miscategorised
Open an issue with the English source (if it's a calque), the bad Estonian, and the better Estonian — or send a PR adding the entry to the relevant lexicon with a one-line test case.
License
Apache-2.0 for the source. Bundled data + models keep their own (copyleft) licenses — these apply to those files only, not to the Apache-2.0 code:
EstNLTK — dual-licensed GPL-2.0 OR Apache-2.0 (we use Apache-2.0).
Vabamorf analyzer — LGPL-2.1 with a separate commercial-use license.
Estonian fastText model (
find_related_words,check_compound_familiarity) — CC-BY-SA-3.0.Estonian Wordnet (
synonyms) — CC-BY-SA-4.0.
The CC-BY-SA model + Wordnet data carry share-alike obligations on those files when you redistribute them (the Docker image includes both). See NOTICE for full attribution and redistribution terms.
This server cannot be installed
Maintenance
Resources
Unclaimed servers have limited discoverability.
Looking for Admin?
If you are the server author, to access and configure the admin panel.
Latest Blog Posts
- Your AI Chatbot Just Exposed Your CEO's Salary to an InternBy Om-Shree-0709 on .Agent IdentityMCP SecurityOAuth Delegation
- Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)By Om-Shree-0709 on .Agentic AiPrompt InjectionWebAssembly
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/silly-geese/estonian-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server