Skip to main content
Glama

Jina AI Remote MCP Server

by acchuang

deduplicate_strings

Select semantically unique strings from a list using Jina embeddings and submodular optimization. Ideal for removing duplicates, finding representative samples, or extracting diverse content. Returns chosen strings with indices for efficient analysis.

Instructions

Get top-k semantically unique strings from a list using Jina embeddings and submodular optimization. Use this when you have many similar strings and want to select the most diverse subset that covers the semantic space. Perfect for removing duplicates, selecting representative samples, or finding diverse content. Returns the selected strings with their indices.

Input Schema

NameRequiredDescriptionDefault
kNoNumber of unique strings to return. If not provided, automatically finds optimal k by looking at diminishing return
stringsYesArray of strings to deduplicate

Input Schema (JSON Schema)

{ "$schema": "http://json-schema.org/draft-07/schema#", "additionalProperties": false, "properties": { "k": { "description": "Number of unique strings to return. If not provided, automatically finds optimal k by looking at diminishing return", "type": "number" }, "strings": { "description": "Array of strings to deduplicate", "items": { "type": "string" }, "type": "array" } }, "required": [ "strings" ], "type": "object" }

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/acchuang/jina-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server