Skip to main content
Glama

Jina AI Remote MCP Server

Official
by jina-ai

deduplicate_strings

Remove duplicate strings and select a diverse subset using semantic embeddings and optimization. Ideal for reducing redundancy and identifying representative content from large text collections.

Instructions

Get top-k semantically unique strings from a list using Jina embeddings and submodular optimization. Use this when you have many similar strings and want to select the most diverse subset that covers the semantic space. Perfect for removing duplicates, selecting representative samples, or finding diverse content.

Input Schema

NameRequiredDescriptionDefault
kNoNumber of unique strings to return. If not provided, automatically finds optimal k by looking at diminishing return
stringsYesArray of strings to deduplicate

Input Schema (JSON Schema)

{ "$schema": "http://json-schema.org/draft-07/schema#", "additionalProperties": false, "properties": { "k": { "description": "Number of unique strings to return. If not provided, automatically finds optimal k by looking at diminishing return", "type": "number" }, "strings": { "description": "Array of strings to deduplicate", "items": { "type": "string" }, "type": "array" } }, "required": [ "strings" ], "type": "object" }
Install Server

Other Tools from Jina AI Remote MCP Server

Related Tools

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/jina-ai/MCP'

If you have feedback or need assistance with the MCP directory API, please join our Discord server