Which integrations are available for this server?

Facilitates x402 USDC nanopayments on the Arc network for MCP server discovery and lookup. Enables orchestration of evaluations via Chainlink CRE and attestation using Chainlink's Confidential AI Attester.

How do I use GoldenMCP?

1. Click on "Install Server". 2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state. 3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@GoldenMCP evaluate lifi quote on Base" That's it! The server will respond to your query, and you can continue using it as needed. Here is a step-by-step guide with screenshots.

GoldenMCP

by vhspace

Overview Schema Related Servers Score Discussions

JavaScript

Remote

License: MIT Live demo

Chainlink CRE + CAI Arc x402 ENS Walrus

GoldenMCP

An onchain reputation layer for Web3 MCP servers. Evals score live MCPs on data accuracy / tool-path / token efficiency; results are attested by Chainlink Confidential AI, stored on Walrus, written to a registry on Arc, and made discoverable via ENS — queryable by agents for a USDC nanopayment.

Live demo: https://goldenmcp-e9l6.vercel.app/demo · Demo video: (coming soon) · Registry on Arc: 0x8db0…20e3

Bounties — find your code

We're targeting: Chainlink (Best workflow with CRE + Best usage of Confidential AI Attester) · Arc (Best Agentic Economy with Circle Agent Stack) · ENS (Best Integration for AI Agents). Walrus is a supporting integration, not a bounty submission.

Each bounty's integration lives in a small number of files. Links go straight to the relevant source on main.

ENS — MCP discovery via ENSIP-25/26

Submitting for: Best ENS Integration for AI Agents.

ENS is the identity and discovery layer for every scored MCP. Each MCP is a child.parent.eth subname under goldenmcp.eth (Sepolia), and its agent context, MCP endpoint, and Walrus eval-blob pointer are stored as text records (agent-context, agent-endpoint[mcp], goldenmcp/eval-blob, per ENSIP-25/26). We read each subname's ENSv2 TTL expiry (walking .eth registry → getSubregistry → findExpiry) and mark an MCP stale once its identity has lapsed — so a name with an expired registration is visibly out of date rather than silently trusted. Discovery is fully live: no hard-coded names or values.

On-chain (Sepolia): ENSv2 .eth registry 0xDEDB92913A25abE1f7BCDD85D8A344a43B398B67.

What	Code
ENS text-record resolver (`resolve_text`, `resolve_agent_context`, `resolve_eval_blob`, `resolve_mcp_endpoint`)	`packages/identity/src/goldenmcp_identity/registry.py`
ENSv2 TTL expiry / staleness check (`resolveENS`, `ensSubnameExpiry`)	`apps/web/src/lib/data.ts`
Registry SDK (`ens_name` field, register/lookup)	`packages/identity/`
Live ENS resolver UI	`apps/web/src/app/ens/page.tsx`

Chainlink — CRE eval orchestration + Confidential AI attestation

Submitting for: Best workflow with CRE + Best usage of Confidential AI Attester.

A Chainlink CRE workflow orchestrates the pipeline as two event-driven handlers. A hook (repo update, API change, or manual fire) hits the workflow's HTTP run trigger: it scores an MCP via the eval-runner and submits the score manifest to Confidential AI (CAI) with a cre_callback back to the workflow. When CAI finishes, its callback starts a fresh execution that publishes to Walrus and writes the score + attestation onchain via updateCapabilityScore + recordAttestation on the Arc registry. The CRE workflow drives a real onchain state change — it is the orchestration layer, not a frontend read. (See the eval-pipeline diagram for the full two-handler flow.)

The attestation is the completed TEE inference — there is no synthetic tx hash. CAI processes the eval manifest (sensitive scoring data) inside the enclave; the pipeline records the CAI inference_id and the bytes32 transcript hash (the enclave's response_digest, falling back to sha256(output)) onchain via recordAttestation, mirroring Chainlink's official undercollateralized-loan example.

The CRE workflow's onchain target is the same Arc registry — recordAttestation + updateCapabilityScore on 0x8db0…20e3.

What	Code
CRE pipeline steps (eval → CAI attest → Walrus → Arc)	`workflows/eval-pipeline/src/pipeline.ts`
Workflow triggers + two handlers (HTTP run / CAI callback / cron)	`workflows/eval-pipeline/src/workflow.ts`
CAI submit + callback parsing (`submitCaiInference`, `caiAttest`, `parseCaiAttestation`)	`workflows/eval-pipeline/src/pipeline.ts`
eval-runner HTTP service CRE calls	`packages/eval-runner/`
CRE workflow config	`workflows/eval-pipeline/workflow.yaml`

Arc — x402 USDC nanopayments for MCP lookup

Submitting for: Best Agentic Economy with Circle Agent Stack. GoldenMCP is a pay-per-query agent marketplace — agents pay gas-free USDC nanopayments on Arc to look up the best-scoring MCP for a capability, with no human in the loop.

The marketplace MCP is x402-gated: lookups return HTTP 402 with a USDC price until a payment header is present. Scores are written to an ERC-8004-inspired registry deployed on Arc, where USDC is the native gas token.

On-chain (Arc testnet, chain 5042002):

Contract	Address
MCPRegistry	`0x8db02877046c8fA3c8c6Abb2565094Ca29E820e3`
USDC (gas token)	`0x3600000000000000000000000000000000000000`
x402 payee	`0x1A067578b8d4f69eFB1B8b857c99d1b825E84e73`

What	Code
x402-gated lookup server — 402 challenge, dynamic price ladder, Circle Gateway settlement on Arc (`@circle-fin/x402-batching`, `eip155:5042002`)	`packages/marketplace-mcp-ts/src/server.ts`
Price ladder + registry/Walrus score index	`packages/marketplace-mcp-ts/src/pricing.ts`, `registry.ts`
x402 buyer agent demo (pays USDC on Arc, retries with `X-PAYMENT`)	`packages/marketplace-mcp-ts/demo/lookup_agent.ts`
MCP registry contract (`register`, `updateCapabilityScore`, `recordAttestation`)	`contracts/mcp-registry/src/MCPRegistry.sol`
Arc deploy script	`contracts/mcp-registry/script/Deploy.s.sol`
CRE → Arc registry write (`writeToArc`)	`workflows/eval-pipeline/src/pipeline.ts`

Walrus — decentralized eval-blob storage (supporting integration)

Eval results need durable, verifiable, content-addressed storage that any agent can read without trusting our server — so the eval store is Walrus, the Sui-native decentralized blob store. Every score manifest and raw Inspect .eval log is written to Walrus testnet via its publisher/aggregator HTTP API, and the resulting walrus://<blobId> is what ENS text records and the Arc registry point at. Walrus does genuine work in the stack: it's the storage layer the onchain reputation and the demo viewer both resolve against.

What	Code
Walrus publisher/aggregator client (`upload`, `download`, `*_json`)	`packages/walrus-client/src/goldenmcp_walrus/client.py`
`walrus://` fsspec adapter + index (Inspect View log dir)	`packages/walrus-client/`
Web demo Walrus manifest fetch	`apps/web/src/lib/data.ts`

Related MCP server: AgentStamp

Workflow diagrams

Eval pipeline (Chainlink CRE)

The CRE workflow is event-driven and runs in two async handlers, so any hook can kick it off — a GitHub repo update, an API-change webhook, or a manual fire. The workflow registers three triggers: an HTTP "run" trigger (the one a hook calls), an HTTP CAI-callback trigger, and a cron trigger (kept for production; it currently hangs in cre simulate, so the demo drives the HTTP run trigger instead). All three execute the same core logic.

Handler A (run trigger) asks the eval-runner for the next benchmark (round-robin, one per fire), scores that MCP/capability, then submits the score manifest to Confidential AI with a cre_callback pointing back at this workflow — and returns. It does not block or poll.

Handler B (CAI-callback trigger) is a fresh execution started when CAI POSTs its completed inference back. It parses the attestation (inference_id + transcript hash), resolves the run via the inference_id, publishes the manifest + raw .eval log to Walrus, then writes both recordAttestation and updateCapabilityScore to the Arc registry.

When CAI / the callback URL are not configured, Handler A falls back to the original inline path (runPipeline: score → poll → publish → write) so the pipeline stays simulatable without secrets.

flowchart TD
    Hook([Hook: repo update / API change / manual]) -->|POST run trigger| HA
    Cron([CRE cron trigger<br/>prod only — hangs in simulate]) -.->|same logic| HA

    subgraph HandlerA [Handler A — run trigger]
      HA[onRunTrigger] -->|GET /benchmarks/next| Next[next benchmark]
      Next -->|"POST /eval/inspect, poll until scored"| Score[score manifest]
      Score -->|"POST CAI /v1/inference<br/>cre_callback = this workflow"| Submit[submit + return]
    end
    Runner[(eval-runner HTTP)] -.->|runs Inspect eval| MCP[(Web3 MCP server)]

    Submit -.->|TEE inference| CAI[Confidential AI TEE]
    CAI -->|"POST completed status (callback)"| HB

    subgraph HandlerB [Handler B — CAI callback, fresh execution]
      HB[onAttestationCallback] -->|inference_id + transcript_hash| Resolve[resolve run by inference_id]
      Resolve -->|POST /eval/publish| Walrus[(Walrus: manifest + raw .eval log)]
      Walrus --> ArcAtt[recordAttestation]
      ArcAtt --> ArcScore[updateCapabilityScore + Walrus blob ptr]
    end

    ArcScore --> Registry[(MCPRegistry on Arc)]
    Registry --> ENS[ENS records point at Walrus + registry]

    HA -.->|"no CAI/callback configured"| Inline[inline runPipeline:<br/>score → publish → write]
    Inline --> Registry

x402 lookup + payment (Arc)

An agent asks the marketplace for the best MCP for a capability. The first call returns a 402 with a USDC price (it scales with min_score); the agent pays in USDC on Arc and retries with an X-PAYMENT header. The marketplace then builds a score index from the registry + Walrus and returns the top match.

sequenceDiagram
    participant Agent as lookup_agent.ts (GatewayClient)
    participant Market as marketplace-mcp-ts (x402 nanopayments)
    participant Reg as MCPRegistry (Arc)
    participant Wal as Walrus

    Agent->>Market: POST /tools/lookup (capability, min_score)
    Market-->>Agent: 402 Payment Required (price_usdc, payee, network arc-testnet)
    Note over Agent: pay USDC on Arc
    Agent->>Market: POST /tools/lookup + X-PAYMENT header

    Note over Market: _load_index builds the score index
    Market->>Reg: list_agent_ids + getCapabilityScore per capability
    Market->>Wal: download_json(manifest blob)
    Market->>Market: filter by min_score, sort by composite

    Market-->>Agent: results[] top MCP (ens_name, mcp_endpoint, composite,<br/>attestation_id, transcript_hash) + payment_settled

Setup

Prerequisites

Python 3.12, managed with uv (no pip)
bun for the web app and CRE TypeScript workflow
foundry (forge, cast) for contracts and wallet generation
An LLM API key (e.g. Anthropic) and reachable Web3 MCP endpoints

Install

# Python toolchain + workspace
uv python install 3.12
uv sync --all-packages

# Credentials — copy and fill in
cp .env.example .env

Or bootstrap a demo machine (generates a cast wallet, sets MCP URLs, runs uv sync):

chmod +x scripts/setup_eval_env.sh
./scripts/setup_eval_env.sh          # full bootstrap
./scripts/setup_eval_env.sh --check  # prerequisites only

Eval chain defaults: Base (8453) for quote evals; Fraxtal (252) for odos_swap. Fund EVM_EVAL_ADDRESS on Base (and Fraxtal for Odos swaps). ENS identity uses Sepolia separately.

Run

# Unit tests
uv run pytest packages/ -v

# Run an eval against a live MCP (needs LLM key + MCP endpoints in .env)
uv run inspect eval goldenmcp/lifi_quote --model anthropic/claude-3-5-haiku-20241022
uv run inspect eval goldenmcp/odos_quote --model anthropic/claude-3-5-haiku-20241022

# eval-runner HTTP service (the API the CRE workflow calls)
uv run python -m goldenmcp_eval_runner

# Marketplace seller (x402 nanopayments via Circle Gateway, Arc testnet)
(cd packages/marketplace-mcp-ts && bun install && bun src/server.ts)

# x402 buyer agent demo (EOA funded with Arc testnet USDC + native gas)
cd packages/marketplace-mcp-ts && DEMO_PAYER_PRIVATE_KEY=0x... bun demo/lookup_agent.ts --capability quote --min-score 0.9

# Web demo (leaderboard, eval viewer, ENS resolver)
cd apps/web && bun install && bun run dev

Walrus + Inspect View

GoldenMCP stores eval logs on Walrus with an indexed walrus:// path (S3-style keys over content-addressed blobs). After the first upload, set WALRUS_INDEX_BLOB_ID in .env from the walrus_index_blob_id field printed by post_eval_walrus.py.

# Upload scored eval + raw Inspect log bytes
uv run python scripts/post_eval_walrus.py --mcp lifi --capability quote --log ./logs/your-run.json

# List logs from Walrus (same as s3:// log-dir)
uv run inspect view start --log-dir walrus://evals/goldenmcp

Inspect View requires native .eval / JSON log files at indexed paths — not score-manifest JSON alone.

Scoring

Dimension	Weight
DataScore	0.45
PathScore	0.35
TokenEfficiency	0.20

Binary fail (composite 0.0) on prompt injection, disallowed tools, or policy violations.

See docs/scoring.md.

Structure

packages/inspect-web3     Inspect tasks + scorers
packages/walrus-client    walrus:// fsspec + HTTP client
packages/marketplace-mcp-ts  x402 MCP server (Arc, Circle Gateway) — current
packages/marketplace-mcp     legacy Python x402 server (superseded by -ts)
packages/identity         ENS + registry SDK
packages/eval-runner      HTTP service for CRE
apps/web                  Leaderboard, eval viewer, ENS resolver
workflows/eval-pipeline   Chainlink CRE workflow
contracts/mcp-registry    ERC-8004-inspired MCP registry (Arc)

Architecture overview: docs/architecture.md.

Planning artifacts (for judges)

This project was built spec-first. The full set of implementation plans / planning artifacts is in agent-plans/:

GoldenMCP eval marketplace — the core spec: evals → scoring → Walrus → attestation → Arc registry → x402 lookup.
DO eval-runner (Terraform) — eval-runner HTTP service infra the CRE workflow calls.
Vercel deploy — judge-facing web demo deployment.
Web concierge agent — the agent-driven demo flow.
Landing page — marketing/landing surface.

Deploy demo UI to Vercel (GH #106)

The hackathon judge demo (apps/web) deploys to Vercel with Root Directory apps/web. Eval-runner and marketplace stay on existing infra; set their public URLs in Vercel env vars.

Full checklist and variable list: docs/plans/2026-06-13-vercel-deploy.md.

# Local smoke (from apps/web)
bun install && bun test && bun run build

License

MIT

This server cannot be installed

license - not found

quality - not tested

maintenance

How are these scores calculated?

Maintenance

–Maintainers

–Response time

–Release cycle

–Releases (12mo)

Commit activity

Resources

GitHub Repository

Need Help?

Related Servers

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Latest Blog Posts

Who's Calling? MCP Hosts Are an Identity Blind Spot (And the Spec Knows It)
By Om-Shree-0709 on July 25, 2026.
mcp
Agent Identity
OAuth 2.1
Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/vhspace/goldenmcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server