Container infrastructure, purpose-built for MCP
- Dedicated machines per deployment – no noisy-neighbour hops
- Health checks against /ping with retry-on-failure
- Optional persistent volumes mounted at /data
Deploy private or public MCP servers on dedicated infrastructure – with a gateway, call logs, per-tool access control, and directory distribution built in.
Every call logged · Every tool gated · Every credential managed
You do not just need somewhere to run a process. You need authentication that matches the MCP specification, TLS that terminates correctly for Streamable HTTP, session handling that does not break behind a load balancer, OAuth credential refresh, per-user rate limits, secret rotation, audit logs, and runtime monitoring that understands JSON-RPC. Most teams underestimate every layer.
Ship an MCP server, not an ops project.
Four things every serious MCP deployment needs, shipped together so you do not have to stitch them up yourself.
Pick the path that matches where your MCP server lives today. The output is the same in all three cases: a Gateway endpoint your clients can point at.
Pick any server from the Glama registry and deploy it to your own instance in one click. Useful when you want a private copy of a community server under your control.
Connect a GitHub repository and Glama builds the image straight from source. Best for MCP servers you are actively developing in-house.
Ship a Dockerfile, an npm package, or a PyPI module. Configure Node or Python versions, build steps, and CMD arguments – or let Glama infer them.
Every deployment starts private and access-gated. When you decide to distribute – whether to your company or to the public – flip a single setting and the deployment gets a directory page where people are already searching.
Directory distribution is unique to Glama. Fly.io, Cloudflare Workers, and self-hosted stacks give you compute; they do not give you a place to be found. Glama’s directory is the single most trafficked index of MCP servers on the web – see the indexing methodology for how the ranking works.
Three layers of visibility that treat your server as an MCP server – not as an opaque Linux process that happens to listen on TCP.
Hundreds of the most recent runtime log events are tailed live from the machine. Filter by time window, grep for a string, or scroll to the moment a particular request arrived.
Every tool invocation is recorded as a telemetry event. Break calls down per tool, per user, and per time window – from the last minute up to the last ninety days.
Glama runs automated health tests that drive the MCP handshake – not a generic TCP or HTTP probe. When a deployment stops responding, the health flag flips before your users start seeing errors.
Fly.io and Cloudflare are great general-purpose compute. Glama is shaped specifically for MCP – which means everything below is built in, not bolted on.
| Capability | Glama | Fly.io | Cloudflare Workers | Self-hosted |
|---|---|---|---|---|
| Container hosting | —Workers runtime | you operate | ||
| Docker image support | ||||
| One-click deploy from an MCP registry | ||||
| Gateway included (auth · ACL · call logs) | —separate product | |||
| Managed OAuth 2.1 with refresh | — | |||
| Tool-level analytics | —generic metrics | —generic metrics | ||
| MCP-aware health checks | —generic HTTP | —generic HTTP | ||
| Browser-based inspector / playground | ||||
| Public directory listing with SEO | ||||
| Encrypted environment variables | —DIY |
Everything teams typically ask before putting an MCP server on Glama.
Yes. Every deployment is private by default – access is scoped to your account and to workspace members you explicitly add. Requests to the endpoint are gated by access tokens issued per connection profile. You can flip a deployment to public at any time and list it in the Glama directory.
The public endpoint speaks Streamable HTTP – the transport the MCP specification standardized in 2025, which every modern MCP client understands. Servers written for stdio are wrapped automatically, so you do not have to rewrite the transport layer yourself.
Dockerfile, npm, and PyPI. Pick a base image and Node version, or point Glama at an npm package or PyPI module and we handle the build. Custom build steps and CMD arguments are both configurable per deployment.
Install the Glama GitHub App and connect the repository that contains your MCP server. Glama pulls the source, builds the image, and provisions a machine. Re-deploys are triggered from the dashboard or via webhook.
Paste the Gateway URL into the client’s remote-server configuration. Claude, Cursor, ChatGPT, VS Code, Zed, and every other MCP-compatible client accept a single streamable-http endpoint – no special client setup is required.
Environment variables are encrypted at rest with a server-side key; plaintext values are never stored. They are decrypted in memory only at the moment the machine starts or a process reads them. Access tokens are hashed; the fingerprint shown in the UI is derived from the first and last characters of the original token.
Share the deployment with your workspace. Members of the workspace inherit access to the deployment’s endpoint and logs according to their workspace role, so you manage access to an MCP server the same way you manage access to any other workspace resource.
Machines run with a retry-on-failure restart policy and an HTTP health check against /ping. If the process keeps failing, the deployment is marked unhealthy and surfaces in the analytics dashboard. You can inspect the last hundreds of runtime log events from the deployment’s logs tab.
Hosting runs your code. The Gateway fronts the traffic. Every server you deploy on Glama gets a Gateway endpoint automatically, so the two are the same control plane from the operator’s point of view. See the Gateway overview for what the proxy layer adds on top of raw hosting.
You keep ownership. A public deployment is listed in the Glama registry under your account, indexed by Glama’s tool search, and shown on category and client pages. You can switch it back to private at any time; users already connected lose access immediately.
See the pricing page for the current rate card. Hosting is billed on top of your workspace plan; infrastructure cost is metered by machine hours and egress.
Plus a Gateway in front of every deployment, a browser-based MCP inspector, and directory distribution when you’re ready to go public.