Skip to main content
Glama

GoldenMCP

Web3 MCP evaluation marketplace: standardized Inspect evals, Walrus-backed results, ENS identity, Chainlink attestation, and x402 discovery on Arc.

Bounties

  • ENS — MCP discovery via ENSIP-25/26

  • Chainlink — CRE eval orchestration + Confidential AI attestation

  • Arc — x402 USDC nanopayments for MCP lookup

Related MCP server: AgentStamp

Quick start

# Python 3.12 + uv (no pip)
uv python install 3.12
uv sync --all-packages

# Run unit tests
uv run pytest packages/ -v

# Run eval (requires LLM key + MCP endpoints in .env)
uv run inspect eval goldenmcp/lifi_quote --model anthropic/claude-3-5-haiku-20241022
uv run inspect eval goldenmcp/odos_quote --model anthropic/claude-3-5-haiku-20241022

# Eval runner HTTP service (for CRE)
uv run python -m goldenmcp_eval_runner

# Web demo
cd apps/web && bun install && bun run dev

# Marketplace MCP
uv run python -m goldenmcp_marketplace

# Lookup agent demo (requires Arc wallet + x402)
uv run python demo/lookup_agent.py --capability quote --min-score 0.9

Copy .env.example to .env and fill in credentials, or bootstrap eval env on a demo machine:

chmod +x scripts/setup_eval_env.sh
./scripts/setup_eval_env.sh          # generates cast wallet, sets MCP URLs, uv sync
./scripts/setup_eval_env.sh --check  # prerequisites only

Eval chain defaults: Base (8453) for quote evals; Fraxtal (252) for odos_swap. Fund EVM_EVAL_ADDRESS on Base (+ Fraxtal for Odos swaps). ENS identity uses Sepolia separately.

Walrus + Inspect View

GoldenMCP stores eval logs on Walrus with an indexed walrus:// path (S3-style keys over content-addressed blobs). After the first upload, set WALRUS_INDEX_BLOB_ID in .env from the walrus_index_blob_id field printed by post_eval_walrus.py.

# Upload scored eval + raw Inspect log bytes
uv run python scripts/post_eval_walrus.py --mcp lifi --capability quote --log ./logs/your-run.json

# List logs from Walrus (same as s3:// log-dir)
uv run inspect view start --log-dir walrus://evals/goldenmcp

Inspect View requires native .eval / JSON log files at indexed paths — not score-manifest JSON alone.

Scoring

Dimension

Weight

DataScore

0.45

PathScore

0.35

TokenEfficiency

0.20

Binary fail (composite 0.0) on prompt injection, disallowed tools, or policy violations.

See docs/scoring.md.

Plans for judges

All implementation plans: docs/plans/

Project-local Chainlink Developer Agent Skills from smartcontractkit/chainlink-agent-skills:

Path

Agent

.agents/skills/

Cursor

.claude/skills/

Claude Code (this repo)

Installed skills: CRE, Confidential AI Attester, CCIP, Data Feeds, Data Streams, ACE, VRF. Pin file: skills-lock.json.

Invoke explicitly in chat, e.g. Using /chainlink-cre-skill, … or /chainlink-confidential-ai-attester-skill for CAI attestation work.

Refresh from upstream:

npx skills add smartcontractkit/chainlink-agent-skills --skill '*' --agent cursor --agent claude-code -y --copy

Structure

packages/inspect-web3   Inspect tasks + scorers
packages/walrus-client  walrus:// fsspec + HTTP client
packages/marketplace-mcp  x402 MCP server
packages/identity       ENS + registry SDK
packages/eval-runner    HTTP service for CRE
apps/web                Leaderboard, eval viewer, ENS resolver
workflows/eval-pipeline Chainlink CRE workflow
contracts/mcp-registry  ERC-8004 MCP registry (Arc)

License

MIT

F
license - not found
-
quality - not tested
D
maintenance

Maintenance

Maintainers
Response time
Release cycle
Releases (12mo)
Commit activity

Resources

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/vhspace/goldenmcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server