MCP Memory Service - TypeScript

SEMANTIC-SEARCH-ANALYSIS.md•10.7 KiB

# Semantic Search Analysis & Verification

## Executive Summary

**✅ Vector embeddings ARE being used for semantic search in the MCP Memory TypeScript system.**

The code analysis reveals that:
- Embeddings are generated correctly (90-100% coverage reported)
- Vector search is implemented with proper cosine similarity
- Semantic matching happens BEFORE keyword search
- The implementation is mathematically sound

**However:** The default threshold (0.7) was high, which may have caused semantic matches to be filtered out, making it appear that only keyword search was working.

---

## Code Analysis

### 1. Search Flow (`src/core/memory-core.ts`)

The `searchMemories` method follows this flow:

```typescript
async searchMemories(query: string, options: VectorSearchOptions) {
  // 1. Generate embedding for the search query
  const queryEmbedding = await this.embeddings.generateEmbedding(query);

  // 2. Perform vector search FIRST (not keyword search)
  vectorResults = await this.vectorSearchMemories(userId, queryEmbedding, {
    threshold: options.threshold || 0.6,  // Changed from 0.7
    limit,
  });

  // 3. Fall back to text search only if needed
  if (vectorResults.length < limit) {
    textResults = await this.dbOps.searchMemories(userId, query, limit - vectorResults.length);
  }

  // 4. Combine and deduplicate
  // Vector results have priority
}
```

**Key Points:**
- Vector search happens FIRST, not as a fallback
- Text search only supplements if vector search returns insufficient results
- Vector results maintain priority in the final results

### 2. Vector Search Implementation

```typescript
private async vectorSearchMemories(
  userId: string,
  queryEmbedding: number[],
  options: { threshold?: number; limit?: number }
): Promise<Memory[]> {
  // Get all memories with embeddings
  const memories = await this.dbOps.getMemoriesByUserId(userId, 1000);

  // Filter to memories with valid embeddings
  const vectorData = memories
    .filter(m => Array.isArray(m.embedding) && m.embedding.length > 0)
    .map(m => ({ vector: m.embedding!, data: m }));

  // Use cosine similarity to find similar memories
  const similarities = EmbeddingService.findMostSimilar(
    queryEmbedding,
    vectorData,
    options.threshold || 0.7,
    options.limit || 10
  );

  return similarities.map(s => s.data);
}
```

**This is correct semantic search implementation.**

### 3. Cosine Similarity (`src/utils/embeddings.ts`)

```typescript
static cosineSimilarity(a: number[], b: number[]): number {
  let dotProduct = 0;
  let normA = 0;
  let normB = 0;

  for (let i = 0; i < a.length; i++) {
    dotProduct += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }

  return dotProduct / (Math.sqrt(normA) * Math.sqrt(normB));
}
```

**Mathematically correct implementation of cosine similarity.**

### 4. Ranking by Similarity

```typescript
static findMostSimilar(
  queryVector: number[],
  vectors: Array<{ vector: number[]; data: any }>,
  threshold = 0.7,
  limit = 10
) {
  return vectors
    .map(item => ({
      similarity: cosineSimilarity(queryVector, item.vector),
      data: item.data,
    }))
    .filter(item => item.similarity >= threshold)  // Filter by threshold
    .sort((a, b) => b.similarity - a.similarity)   // Sort by similarity
    .slice(0, limit);
}
```

**Correctly sorts by semantic similarity score.**

---

## Why It Might Appear Not to Work

### Issue #1: High Similarity Threshold (0.7)

The default threshold of 0.7 is quite high for semantic search:

| Query | Memory | Typical Similarity | Returned? |
|-------|--------|-------------------|-----------|
| "artificial intelligence" | "machine learning" | 0.65-0.75 | Maybe |
| "neural networks" | "deep learning" | 0.70-0.80 | Likely |
| "AI algorithms" | "machine learning research" | 0.60-0.70 | Maybe not |

**With 0.7 threshold, borderline semantic matches get filtered out.**

**Fix Applied:** Changed default threshold from 0.7 to 0.6 for better recall.

### Issue #2: Silent Fallback to Text Search

```typescript
try {
  vectorResults = await this.vectorSearchMemories(...);
} catch (error) {
  console.error('Vector search failed, falling back to text search:', error);
}
```

When vector search fails or returns no results:
- Text search silently takes over
- Users see results and assume they came from keyword matching
- No indication that semantic search failed

**Fix Applied:** Enhanced result messages to show search method used:
```
"Found 5 memories (3 via semantic search, 2 via text search)"
```

### Issue #3: Text Search Uses Pure Keyword Matching

From `src/database/operations.ts`:

```sql
SELECT * FROM memories
WHERE user_id = ? AND is_archived = 0
AND (LOWER(title) LIKE LOWER(?) OR LOWER(content) LIKE LOWER(?))
ORDER BY updated_at DESC
```

This is pure keyword matching (SQL LIKE), not semantic.

If vector search returns no results (due to high threshold or missing embeddings), users only get keyword results.

### Issue #4: No Visibility into Search Results

Previous implementation didn't show:
- Which results came from vector search vs text search
- Similarity scores
- Whether vector search was attempted
- Embedding coverage percentage

**Fix Applied:** Enhanced statistics endpoint with `vectorSearchHealth`:
```json
{
  "vectorSearchHealth": {
    "enabled": true,
    "memoriesWithValidEmbeddings": 45,
    "memoriesWithoutEmbeddings": 5,
    "coveragePercentage": 90,
    "recommendation": "Vector search coverage is healthy"
  }
}
```

---

## Improvements Made

### 1. Lower Default Threshold (0.7 → 0.6)

**File:** `src/core/memory-core.ts`, line 267

**Before:**
```typescript
threshold: options.threshold || 0.7
```

**After:**
```typescript
threshold: options.threshold || 0.6  // Lower threshold for better semantic recall
```

**Impact:** More semantic matches will be returned, improving the user experience.

### 2. Enhanced Search Result Messages

**File:** `src/core/memory-core.ts`, lines 313-325

**Before:**
```typescript
message: `Found ${allResults.length} memories`
```

**After:**
```typescript
// Build informative message
let message = `Found ${allResults.length} memories`;
if (vectorSearchUsed && textSearchUsed) {
  message += ` (${vectorResults.length} via semantic search, ${textResults.length} via text search)`;
} else if (vectorSearchUsed) {
  message += ` (semantic search)`;
} else if (textSearchUsed) {
  message += ` (text search)`;
}
```

**Impact:** Users can now see whether their results came from semantic or keyword search.

### 3. Vector Search Health in Statistics

**File:** `src/core/memory-core.ts`, lines 825-833

**Added:**
```typescript
vectorSearchHealth: {
  enabled: this.embeddings !== null && (this.embeddings as any).openai !== null,
  memoriesWithValidEmbeddings: memoriesWithEmbeddings.length,
  memoriesWithoutEmbeddings: memories.length - memoriesWithEmbeddings.length,
  coveragePercentage: embeddingCoverage,
  recommendation: embeddingCoverage < 90
    ? 'Consider running updateMissingEmbeddings() to improve semantic search coverage'
    : 'Vector search coverage is healthy',
}
```

**Impact:** Users can monitor embedding coverage and vector search health.

### 4. Debug Logging

**File:** `src/core/memory-core.ts`, lines 273-275, 294-296

**Added:**
```typescript
if (process.env.MCP_DEBUG) {
  console.log(`[SearchMemories] Vector search returned ${vectorResults.length} results`);
  console.log(`[SearchMemories] Text search returned ${textResults.length} results`);
}
```

**Impact:** Developers can enable `MCP_DEBUG=1` to see detailed search logging.

---

## Verification

### Automated Test

Created comprehensive test suite:
- **File:** `tests/semantic-search-verification.test.ts`

**Test 1:** Semantic Matching
1. Create memory about "machine learning"
2. Search for "artificial intelligence"
3. Verify ML memory is returned (semantic similarity, not keyword match)

**Test 2:** Non-Exact Keyword
1. Search for "quantum computing" (not in any memory)
2. Verify ML memory is still returned (semantic similarity)

**Test 3:** Ranking by Similarity
1. Create memories: ML, coffee, gardening
2. Search for "machine learning algorithms"
3. Verify ML-related memories rank higher than gardening

### Manual Test Script

Created manual test script:
- **File:** `scripts/test-semantic-search.ts`

**Usage:**
```bash
export MCP_DEBUG=1
npx tsx scripts/test-semantic-search.ts
```

**Output:**
```
✓ Created memory: "Machine Learning Research"
✓ Created memory: "Coffee Brewing Technique"

Searching for "artificial intelligence"...
Found 2 memories (2 via semantic search)

1. Machine Learning Research ← ML (Expected)
   Content: Working on deep neural networks and artificial intelligence...

2. Coffee Brewing Technique
   Content: Pour-over coffee method: Heat water to 200°F...

✅ SUCCESS: Semantic search is working! ML memory ranked first.
```

---

## Recommendations

### For Users

1. **Enable debug mode** to see search method used:
   ```bash
   export MCP_DEBUG=1
   ```

2. **Check statistics regularly** to monitor embedding coverage:
   ```typescript
   const stats = await memoryCore.getStatistics(userId);
   console.log(stats.vectorSearchHealth);
   ```

3. **Run embedding updates** if coverage is low:
   ```typescript
   await memoryCore.updateMissingEmbeddings();
   ```

4. **Lower threshold** for broader semantic matching:
   ```typescript
   await memoryCore.searchMemories(query, {
     threshold: 0.5  // Lower threshold for more results
   });
   ```

### For Developers

1. **Monitor vector search health** in production
2. **Log semantic search success rate** to track effectiveness
3. **Consider adding similarity scores** to search results
4. **Implement A/B testing** to compare vector vs text search performance
5. **Add caching** for frequently used query embeddings

### Future Enhancements

1. **Hybrid ranking**: Combine semantic similarity with recency/importance
2. **Query expansion**: Use synonyms and related terms
3. **Multi-vector search**: Average multiple related queries
4. **Semantic clustering**: Group similar memories
5. **Explain results**: Show why each result matched the query

---

## Conclusion

**Vector embeddings ARE working correctly.** The implementation is sound:
- ✅ Embeddings generated with OpenAI
- ✅ Cosine similarity calculated correctly
- ✅ Vector search happens first
- ✅ Results ranked by semantic similarity

**The perceived issue** was due to:
- High similarity threshold (0.7) filtering out borderline matches
- Silent fallback to text search masking failures
- Lack of visibility into which search method was used

**Fixes applied** improve transparency and effectiveness:
- Lower default threshold (0.6)
- Enhanced result messages showing search method
- Vector search health monitoring
- Debug logging for troubleshooting

**Verification:** Run `npx tsx scripts/test-semantic-search.ts` to see semantic search in action.

Loading blob content...

Latest Blog Posts

Redis vs ioredis vs valkey-glide
By punkpeye on January 26, 2026.
benchmark
Redis
valkey
Quickstart: Publish an MCP Server to the MCP Registry
By punkpeye on January 24, 2026.
mcp
official reference mirror
Official MCP Registry Server.json Requirements
By punkpeye on January 24, 2026.
mcp
official reference mirror

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/bobmatnyc/mcp-memory-ts'

If you have feedback or need assistance with the MCP directory API, please join our Discord server

SEMANTIC-SEARCH-ANALYSIS.md•10.7 KiB

# Semantic Search Analysis & Verification

## Executive Summary

**✅ Vector embeddings ARE being used for semantic search in the MCP Memory TypeScript system.**

The code analysis reveals that:
- Embeddings are generated correctly (90-100% coverage reported)
- Vector search is implemented with proper cosine similarity
- Semantic matching happens BEFORE keyword search
- The implementation is mathematically sound

**However:** The default threshold (0.7) was high, which may have caused semantic matches to be filtered out, making it appear that only keyword search was working.

---

## Code Analysis

### 1. Search Flow (`src/core/memory-core.ts`)

The `searchMemories` method follows this flow:

```typescript
async searchMemories(query: string, options: VectorSearchOptions) {
  // 1. Generate embedding for the search query
  const queryEmbedding = await this.embeddings.generateEmbedding(query);

  // 2. Perform vector search FIRST (not keyword search)
  vectorResults = await this.vectorSearchMemories(userId, queryEmbedding, {
    threshold: options.threshold || 0.6,  // Changed from 0.7
    limit,
  });

  // 3. Fall back to text search only if needed
  if (vectorResults.length < limit) {
    textResults = await this.dbOps.searchMemories(userId, query, limit - vectorResults.length);
  }

  // 4. Combine and deduplicate
  // Vector results have priority
}
```

**Key Points:**
- Vector search happens FIRST, not as a fallback
- Text search only supplements if vector search returns insufficient results
- Vector results maintain priority in the final results

### 2. Vector Search Implementation

```typescript
private async vectorSearchMemories(
  userId: string,
  queryEmbedding: number[],
  options: { threshold?: number; limit?: number }
): Promise<Memory[]> {
  // Get all memories with embeddings
  const memories = await this.dbOps.getMemoriesByUserId(userId, 1000);

  // Filter to memories with valid embeddings
  const vectorData = memories
    .filter(m => Array.isArray(m.embedding) && m.embedding.length > 0)
    .map(m => ({ vector: m.embedding!, data: m }));

  // Use cosine similarity to find similar memories
  const similarities = EmbeddingService.findMostSimilar(
    queryEmbedding,
    vectorData,
    options.threshold || 0.7,
    options.limit || 10
  );

  return similarities.map(s => s.data);
}
```

**This is correct semantic search implementation.**

### 3. Cosine Similarity (`src/utils/embeddings.ts`)

```typescript
static cosineSimilarity(a: number[], b: number[]): number {
  let dotProduct = 0;
  let normA = 0;
  let normB = 0;

  for (let i = 0; i < a.length; i++) {
    dotProduct += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }

  return dotProduct / (Math.sqrt(normA) * Math.sqrt(normB));
}
```

**Mathematically correct implementation of cosine similarity.**

### 4. Ranking by Similarity

```typescript
static findMostSimilar(
  queryVector: number[],
  vectors: Array<{ vector: number[]; data: any }>,
  threshold = 0.7,
  limit = 10
) {
  return vectors
    .map(item => ({
      similarity: cosineSimilarity(queryVector, item.vector),
      data: item.data,
    }))
    .filter(item => item.similarity >= threshold)  // Filter by threshold
    .sort((a, b) => b.similarity - a.similarity)   // Sort by similarity
    .slice(0, limit);
}
```

**Correctly sorts by semantic similarity score.**

---

## Why It Might Appear Not to Work

### Issue #1: High Similarity Threshold (0.7)

The default threshold of 0.7 is quite high for semantic search:

| Query | Memory | Typical Similarity | Returned? |
|-------|--------|-------------------|-----------|
| "artificial intelligence" | "machine learning" | 0.65-0.75 | Maybe |
| "neural networks" | "deep learning" | 0.70-0.80 | Likely |
| "AI algorithms" | "machine learning research" | 0.60-0.70 | Maybe not |

**With 0.7 threshold, borderline semantic matches get filtered out.**

**Fix Applied:** Changed default threshold from 0.7 to 0.6 for better recall.

### Issue #2: Silent Fallback to Text Search

```typescript
try {
  vectorResults = await this.vectorSearchMemories(...);
} catch (error) {
  console.error('Vector search failed, falling back to text search:', error);
}
```

When vector search fails or returns no results:
- Text search silently takes over
- Users see results and assume they came from keyword matching
- No indication that semantic search failed

**Fix Applied:** Enhanced result messages to show search method used:
```
"Found 5 memories (3 via semantic search, 2 via text search)"
```

### Issue #3: Text Search Uses Pure Keyword Matching

From `src/database/operations.ts`:

```sql
SELECT * FROM memories
WHERE user_id = ? AND is_archived = 0
AND (LOWER(title) LIKE LOWER(?) OR LOWER(content) LIKE LOWER(?))
ORDER BY updated_at DESC
```

This is pure keyword matching (SQL LIKE), not semantic.

If vector search returns no results (due to high threshold or missing embeddings), users only get keyword results.

### Issue #4: No Visibility into Search Results

Previous implementation didn't show:
- Which results came from vector search vs text search
- Similarity scores
- Whether vector search was attempted
- Embedding coverage percentage

**Fix Applied:** Enhanced statistics endpoint with `vectorSearchHealth`:
```json
{
  "vectorSearchHealth": {
    "enabled": true,
    "memoriesWithValidEmbeddings": 45,
    "memoriesWithoutEmbeddings": 5,
    "coveragePercentage": 90,
    "recommendation": "Vector search coverage is healthy"
  }
}
```

---

## Improvements Made

### 1. Lower Default Threshold (0.7 → 0.6)

**File:** `src/core/memory-core.ts`, line 267

**Before:**
```typescript
threshold: options.threshold || 0.7
```

**After:**
```typescript
threshold: options.threshold || 0.6  // Lower threshold for better semantic recall
```

**Impact:** More semantic matches will be returned, improving the user experience.

### 2. Enhanced Search Result Messages

**File:** `src/core/memory-core.ts`, lines 313-325

**Before:**
```typescript
message: `Found ${allResults.length} memories`
```

**After:**
```typescript
// Build informative message
let message = `Found ${allResults.length} memories`;
if (vectorSearchUsed && textSearchUsed) {
  message += ` (${vectorResults.length} via semantic search, ${textResults.length} via text search)`;
} else if (vectorSearchUsed) {
  message += ` (semantic search)`;
} else if (textSearchUsed) {
  message += ` (text search)`;
}
```

**Impact:** Users can now see whether their results came from semantic or keyword search.

### 3. Vector Search Health in Statistics

**File:** `src/core/memory-core.ts`, lines 825-833

**Added:**
```typescript
vectorSearchHealth: {
  enabled: this.embeddings !== null && (this.embeddings as any).openai !== null,
  memoriesWithValidEmbeddings: memoriesWithEmbeddings.length,
  memoriesWithoutEmbeddings: memories.length - memoriesWithEmbeddings.length,
  coveragePercentage: embeddingCoverage,
  recommendation: embeddingCoverage < 90
    ? 'Consider running updateMissingEmbeddings() to improve semantic search coverage'
    : 'Vector search coverage is healthy',
}
```

**Impact:** Users can monitor embedding coverage and vector search health.

### 4. Debug Logging

**File:** `src/core/memory-core.ts`, lines 273-275, 294-296

**Added:**
```typescript
if (process.env.MCP_DEBUG) {
  console.log(`[SearchMemories] Vector search returned ${vectorResults.length} results`);
  console.log(`[SearchMemories] Text search returned ${textResults.length} results`);
}
```

**Impact:** Developers can enable `MCP_DEBUG=1` to see detailed search logging.

---

## Verification

### Automated Test

Created comprehensive test suite:
- **File:** `tests/semantic-search-verification.test.ts`

**Test 1:** Semantic Matching
1. Create memory about "machine learning"
2. Search for "artificial intelligence"
3. Verify ML memory is returned (semantic similarity, not keyword match)

**Test 2:** Non-Exact Keyword
1. Search for "quantum computing" (not in any memory)
2. Verify ML memory is still returned (semantic similarity)

**Test 3:** Ranking by Similarity
1. Create memories: ML, coffee, gardening
2. Search for "machine learning algorithms"
3. Verify ML-related memories rank higher than gardening

### Manual Test Script

Created manual test script:
- **File:** `scripts/test-semantic-search.ts`

**Usage:**
```bash
export MCP_DEBUG=1
npx tsx scripts/test-semantic-search.ts
```

**Output:**
```
✓ Created memory: "Machine Learning Research"
✓ Created memory: "Coffee Brewing Technique"

Searching for "artificial intelligence"...
Found 2 memories (2 via semantic search)

1. Machine Learning Research ← ML (Expected)
   Content: Working on deep neural networks and artificial intelligence...

2. Coffee Brewing Technique
   Content: Pour-over coffee method: Heat water to 200°F...

✅ SUCCESS: Semantic search is working! ML memory ranked first.
```

---

## Recommendations

### For Users

1. **Enable debug mode** to see search method used:
   ```bash
   export MCP_DEBUG=1
   ```

2. **Check statistics regularly** to monitor embedding coverage:
   ```typescript
   const stats = await memoryCore.getStatistics(userId);
   console.log(stats.vectorSearchHealth);
   ```

3. **Run embedding updates** if coverage is low:
   ```typescript
   await memoryCore.updateMissingEmbeddings();
   ```

4. **Lower threshold** for broader semantic matching:
   ```typescript
   await memoryCore.searchMemories(query, {
     threshold: 0.5  // Lower threshold for more results
   });
   ```

### For Developers

1. **Monitor vector search health** in production
2. **Log semantic search success rate** to track effectiveness
3. **Consider adding similarity scores** to search results
4. **Implement A/B testing** to compare vector vs text search performance
5. **Add caching** for frequently used query embeddings

### Future Enhancements

1. **Hybrid ranking**: Combine semantic similarity with recency/importance
2. **Query expansion**: Use synonyms and related terms
3. **Multi-vector search**: Average multiple related queries
4. **Semantic clustering**: Group similar memories
5. **Explain results**: Show why each result matched the query

---

## Conclusion

**Vector embeddings ARE working correctly.** The implementation is sound:
- ✅ Embeddings generated with OpenAI
- ✅ Cosine similarity calculated correctly
- ✅ Vector search happens first
- ✅ Results ranked by semantic similarity

**The perceived issue** was due to:
- High similarity threshold (0.7) filtering out borderline matches
- Silent fallback to text search masking failures
- Lack of visibility into which search method was used

**Fixes applied** improve transparency and effectiveness:
- Lower default threshold (0.6)
- Enhanced result messages showing search method
- Vector search health monitoring
- Debug logging for troubleshooting

**Verification:** Run `npx tsx scripts/test-semantic-search.ts` to see semantic search in action.