@voxell/forge-mcp
OfficialClick on "Install Server".
Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state.
In the chat, type
@followed by the MCP server name and your instructions, e.g., "@@voxell/forge-mcpEmbed 'How to train a model?' with input_type query."
That's it! The server will respond to your query, and you can continue using it as needed.
Here is a step-by-step guide with screenshots.
@voxell/forge-mcp
An MCP server for Forge — Voxell's hosted text-embedding API. It exposes Forge to any MCP client (Claude, Cursor, Cline, Windsurf, VS Code, …) as two tools:
embed— turn text into vectorslist_models— list available models and their dimensions
You bring a Forge API key. The server is stateless, and Voxell does not store the text you send or the vectors it returns — only usage metadata (token counts) is recorded, for billing. It does embeddings only — no storage, no search, no RAG. Those are different products.
What you can do with it
Add semantic search — embed your documents with
input_type: "document"and each query withinput_type: "query", then rank by cosine similarity.Build RAG — embed a knowledge base, store the vectors, and retrieve the closest chunks to ground an LLM.
Find similar or duplicate text — embed two texts and compare their vectors.
Cluster or classify — embed a batch, then cluster or train a classifier on the vectors.
Shrink vector storage — set
dimto truncate (Matryoshka) and trade a little accuracy for smaller, cheaper vectors.Straight from your editor — ask your AI agent (Cursor, Claude, …) to embed a snippet, a batch, or a file via the
embedtool — no separate script.
Requirements
Node.js ≥ 18 (tested on 20)
A Forge API key — create one at https://dash.voxell.ai. New accounts start with 10M free tokens, no credit card.
Use it
Most MCP clients run it on demand with npx. Add this to your client's MCP config:
{
"mcpServers": {
"forge": {
"command": "npx",
"args": ["-y", "@voxell/forge-mcp"],
"env": { "FORGE_API_KEY": "your-key-here" }
}
}
}(Cursor, Claude Desktop, Cline, Windsurf, and VS Code all use this mcpServers shape.)
Tools
embed
arg | type | default | notes |
| string or string[] | — | text(s) to embed (required) |
| string |
|
|
| number | model default | truncate to N dimensions (Matryoshka) — works on every model |
|
|
| use |
Returns the vectors plus the model, dimension, and token count.
Default is turbo — the one you probably want. pro/ultra trade size and speed for more
dimensions.
list_models
Lists the available models and their dimensions.
Configuration
env | required | default |
| yes | — |
| no |
|
License
MIT © Voxell, Inc.
Maintenance
Tools
Latest Blog Posts
MCP directory API
We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/VoxellInc/forge-mcp'
If you have feedback or need assistance with the MCP directory API, please join our Discord server