How do I use Echoes MCP Server?

1. Click on "Install Server". 2. Wait a few minutes for the server to deploy. Once ready, it will show a "Started" state. 3. In the chat, type @ followed by the MCP server name and your instructions, e.g., "@Echoes MCP Server search the bloom arc for Alice's first meeting" That's it! The server will respond to your query, and you can continue using it as needed. Here is a step-by-step guide with screenshots.

Echoes MCP Server

Official

by echoes-io

Overview Schema Related Servers Score Discussions

TypeScript

Local

Echoes MCP Server

npm Node License: MIT Coverage Badge

Model Context Protocol server for AI integration with Echoes storytelling platform.

Features

Narrative Knowledge Graph: Automatically extracts characters, locations, events, and their relationships using Gemini AI
Semantic Search: Find relevant chapters using natural language queries
Entity Search: Search for characters, locations, and events
Relation Search: Explore relationships between entities
Arc Isolation: Each arc is a separate narrative universe - no cross-arc contamination
Statistics: Aggregate word counts, POV distribution, and more
Dynamic Prompts: Reusable prompt templates with placeholder substitution

Related MCP server: CodeFlow MCP Server

Installation

npm install -g @echoes-io/mcp-server

Or run directly with npx:

npx @echoes-io/mcp-server --help

Requirements

Node.js 20+
Gemini API key (for entity extraction)

Usage

CLI

# Count words in a markdown file
echoes words-count ./content/arc1/ep01/ch001.md

# Index timeline content
echoes index ./content

# Index only a specific arc
echoes index ./content --arc bloom

# Get statistics
echoes stats
echoes stats --arc arc1 --pov Alice

# Search (filters by arc to avoid cross-arc contamination)
echoes search "primo incontro" --arc bloom
echoes search "Alice" --type entities --arc bloom

# Check narrative consistency
echoes check-consistency bloom
echoes check-consistency bloom --rules kink-firsts,outfit-claims

MCP Server

Configure in your MCP client (e.g., Claude Desktop, Kiro):

{
  "mcpServers": {
    "echoes": {
      "command": "npx",
      "args": ["@echoes-io/mcp-server"],
      "cwd": "/path/to/timeline",
      "env": {
        "GEMINI_API_KEY": "your_api_key"
      }
    }
  }
}

Environment Variables

Variable	Required	Default	Description
`GEMINI_API_KEY`	Yes	-	API key for Gemini entity extraction
`ECHOES_GEMINI_MODEL`	No	`gemini-2.5-flash`	Gemini model for extraction
`ECHOES_EMBEDDING_MODEL`	No	`Xenova/e5-small-v2`	HuggingFace embedding model
`ECHOES_EMBEDDING_DTYPE`	No	`fp32`	Quantization level: `fp32`, `q8`, `q4` (see Performance Notes)
`HF_TOKEN`	No	-	HuggingFace token for gated models

Available Tools

Tool	Description
`words-count`	Count words and statistics in a markdown file
`index`	Index timeline content into LanceDB
`search`	Search chapters, entities, or relations
`stats`	Get aggregate statistics
`check-consistency`	Analyze arc for narrative inconsistencies
`timeline-overview`	Quick overview of all arcs: status, chapters, words, POVs
`graph-export`	Export knowledge graph in various formats
`history`	Query character/arc history (kinks, outfits, locations, relations)
`review-generate`	Generate review file for pending entity/relation extractions
`review-status`	Show review statistics for an arc
`review-apply`	Apply corrections from review file to database

Available Prompts

Prompt	Arguments	Description
`arc-resume`	arc, episode?, lastChapters?	Load complete context for resuming work on an arc
`new-chapter`	arc, chapter	Create a new chapter
`revise-chapter`	arc, chapter	Revise an existing chapter
`expand-chapter`	arc, chapter, target	Expand chapter to target word count
`new-character`	name	Create a new character sheet
`new-episode`	arc, episode	Create a new episode outline
`new-arc`	name	Create a new story arc
`revise-arc`	arc	Review and fix an entire arc

Architecture

Content Hierarchy

Timeline (content directory)
└── Arc (story universe)
    └── Episode (story event)
        └── Chapter (individual .md file)

Arc Isolation

Each arc is treated as a separate narrative universe:

Entities are scoped to arcs: bloom:CHARACTER:Alice ≠ work:CHARACTER:Alice
Relations are internal to arcs
Searches can be filtered by arc to avoid cross-arc contamination

Data Flow

┌─────────────────────────────────────────────────────────────┐
│                     INDEXING PHASE                          │
├─────────────────────────────────────────────────────────────┤
│  1. Scan content/*.md (filesystem scanner)                  │
│  2. Parse frontmatter + content (gray-matter)               │
│  3. For each chapter:                                       │
│     a. Extract entities/relations with Gemini API           │
│     b. Generate embeddings (Transformers.js ONNX)           │
│     c. Calculate word count and statistics                  │
│  4. Save everything to LanceDB                              │
└─────────────────────────────────────────────────────────────┘

Development

# Install dependencies
npm install

# Run tests
npm test

# Run tests with coverage
npm run test:coverage

# Lint
npm run lint

# Type check
npm run typecheck

# Build
npm run build

Tech Stack

Purpose	Tool
Runtime	Node.js 20+
Language	TypeScript
Vector DB	LanceDB
Embeddings	@huggingface/transformers (ONNX)
Entity Extraction	Gemini AI
MCP SDK	@modelcontextprotocol/sdk
Testing	Vitest
Linting	Biome

Performance Notes

Embedding Quantization

The default embedding model (Xenova/e5-small-v2) supports different quantization levels via ECHOES_EMBEDDING_DTYPE:

Level	Speed	Quality	Memory	Recommendation
`fp32`	Baseline	Best (100%)	High	Production with ample resources
`q8`	2-3x faster	Excellent (99.6%)	50% less	Recommended - optimal balance
`q4`	3-4x faster	Good (99.1%)	75% less	Resource-constrained environments

Note: Some models like onnx-community/embeddinggemma-300m-ONNX don't support fp16. Always check model documentation.

Recommended setting:

export ECHOES_EMBEDDING_DTYPE=q8

License

MIT

Part of the Echoes project - a multi-POV digital storytelling platform.

This server cannot be installed

license - permissive license

quality - not tested

maintenance

How are these scores calculated?

Maintenance

–Maintainers

–Response time

2dRelease cycle

51Releases (12mo)

Commit activity

Resources

Need Help?

Related Servers

Unclaimed servers have limited discoverability.

Looking for Admin?

If you are the server author, to access and configure the admin panel.

Latest Blog Posts

Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly
Lightport: Open-Sourcing Glama's AI Gateway
By punkpeye on April 27, 2026.
OpenAI
open source

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/echoes-io/mcp-server'

If you have feedback or need assistance with the MCP directory API, please join our Discord server