Skip to main content
Glama

Jina AI Remote MCP Server

by acchuang

deduplicate_strings

Extract top-k semantically unique strings from a list using Jina embeddings and submodular optimization. Ideal for deduplication, selecting representative samples, or finding diverse content.

Instructions

Get top-k semantically unique strings from a list using Jina embeddings and submodular optimization. Use this when you have many similar strings and want to select the most diverse subset that covers the semantic space. Perfect for removing duplicates, selecting representative samples, or finding diverse content.

Input Schema

NameRequiredDescriptionDefault
kNoNumber of unique strings to return. If not provided, automatically finds optimal k by looking at diminishing return
stringsYesArray of strings to deduplicate

Input Schema (JSON Schema)

{ "$schema": "http://json-schema.org/draft-07/schema#", "additionalProperties": false, "properties": { "k": { "description": "Number of unique strings to return. If not provided, automatically finds optimal k by looking at diminishing return", "type": "number" }, "strings": { "description": "Array of strings to deduplicate", "items": { "type": "string" }, "type": "array" } }, "required": [ "strings" ], "type": "object" }

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/acchuang/jina-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server