Glama

Jina AI Remote MCP Server

by wlmwwx

deduplicate_strings

Remove duplicate strings and select a diverse subset using Jina embeddings and submodular optimization. Ideal for filtering similar content, extracting representative samples, or ensuring semantic uniqueness.

Instructions

Get top-k semantically unique strings from a list using Jina embeddings and submodular optimization. Use this when you have many similar strings and want to select the most diverse subset that covers the semantic space. Perfect for removing duplicates, selecting representative samples, or finding diverse content.
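The page does not say which submodular objective the tool optimizes or how it detects diminishing returns. As a rough illustration only, the sketch below greedily maximizes a facility-location objective over cosine similarities and, when k is omitted, stops once the marginal gain drops below a fraction of the first gain. The names select_diverse and stop_ratio and the toy 2-D vectors are hypothetical; real embeddings would come from a Jina embedding model.

```python
import math


def cosine(a, b):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def select_diverse(embeddings, k=None, stop_ratio=0.05):
    """Greedy facility-location selection; returns indices of the chosen items.

    Assumption: this objective and the stop_ratio rule are illustrative,
    not the tool's documented behavior.
    """
    n = len(embeddings)
    sim = [[cosine(a, b) for b in embeddings] for a in embeddings]
    covered = [0.0] * n  # best similarity of each item to anything selected so far
    selected = []
    first_gain = None
    limit = n if k is None else min(int(k), n)
    while len(selected) < limit:
        best_j, best_gain = None, -1.0
        for j in range(n):
            if j in selected:
                continue
            # Marginal gain: how much candidate j improves coverage of every item.
            gain = sum(max(sim[i][j] - covered[i], 0.0) for i in range(n))
            if gain > best_gain:
                best_j, best_gain = j, gain
        if first_gain is None:
            first_gain = best_gain
        elif k is None and best_gain < stop_ratio * first_gain:
            break  # diminishing returns: the next pick adds almost nothing
        selected.append(best_j)
        covered = [max(covered[i], sim[i][best_j]) for i in range(n)]
    return selected


# Toy vectors: items 0 and 1 are exact duplicates, item 2 is unrelated.
toy = [[1.0, 0.0], [1.0, 0.0], [0.0, 1.0]]
print(select_diverse(toy, k=2))  # [0, 2]
print(select_diverse(toy))       # [0, 2]
```

With an explicit k the loop runs exactly k rounds (capped at the list length); without it, the duplicate at index 1 is never picked because adding it yields zero marginal gain.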

Input Schema

Name | Required | Description | Default
k | No | Number of unique strings to return. If not provided, automatically finds optimal k by looking at diminishing return |
strings | Yes | Array of strings to deduplicate |

Input Schema (JSON Schema)

{
  "$schema": "http://json-schema.org/draft-07/schema#",
  "additionalProperties": false,
  "properties": {
    "k": {
      "description": "Number of unique strings to return. If not provided, automatically finds optimal k by looking at diminishing return",
      "type": "number"
    },
    "strings": {
      "description": "Array of strings to deduplicate",
      "items": {
        "type": "string"
      },
      "type": "array"
    }
  },
  "required": [
    "strings"
  ],
  "type": "object"
}
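To make the schema concrete, here is a minimal sketch of building arguments that satisfy it and wrapping them in an MCP tools/call JSON-RPC request. The helper name build_call and the sample strings are hypothetical, and how the request is sent (transport, endpoint, authentication) is not covered by this page and is left out.

```python
import json


def build_call(strings, k=None, request_id=1):
    """Return a JSON-RPC 2.0 request dict for the deduplicate_strings tool.

    Hypothetical helper: it mirrors the schema above ("strings" required,
    "k" optional, no other properties) and checks types before building.
    """
    if not isinstance(strings, list) or not all(isinstance(s, str) for s in strings):
        raise TypeError("strings must be a list of str")
    arguments = {"strings": strings}
    if k is not None:
        # bool is a subclass of int in Python, so reject it explicitly.
        if isinstance(k, bool) or not isinstance(k, (int, float)):
            raise TypeError("k must be a number")
        arguments["k"] = k
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "deduplicate_strings", "arguments": arguments},
    }


# Sample input: two near-identical greetings and one unrelated string.
request = build_call(["hello world", "Hello, world!", "goodbye"], k=2)
print(json.dumps(request, indent=2))
```

Omitting k leaves the key out of the arguments entirely, which is the case where the tool is described as choosing k on its own.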

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/wlmwwx/jina-mcp'
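The same request can be sketched in Python with only the standard library. The function names are hypothetical, and treating the response body as JSON is an assumption; the page does not describe the response format.

```python
import json
import urllib.request

SERVER_URL = "https://glama.ai/api/mcp/v1/servers/wlmwwx/jina-mcp"


def build_request(url=SERVER_URL):
    """Build (but do not send) the GET request shown in the curl command."""
    return urllib.request.Request(
        url, method="GET", headers={"Accept": "application/json"}
    )


def fetch_server_info(url=SERVER_URL, timeout=10):
    """Send the request and decode the body, assuming it is JSON."""
    with urllib.request.urlopen(build_request(url), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling fetch_server_info() performs the network request; build_request() alone is enough to inspect the URL and method without going online.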

If you have feedback or need assistance with the MCP directory API, please join our Discord server.