Skip to main content
Glama
acchuang

Jina AI Remote MCP Server

by acchuang

deduplicate_strings

Extract top-k semantically unique strings from a list using Jina embeddings and submodular optimization. Ideal for deduplication, selecting representative samples, or finding diverse content.

Instructions

Get top-k semantically unique strings from a list using Jina embeddings and submodular optimization. Use this when you have many similar strings and want to select the most diverse subset that covers the semantic space. Perfect for removing duplicates, selecting representative samples, or finding diverse content.

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
kNoNumber of unique strings to return. If not provided, automatically finds optimal k by looking at diminishing return
stringsYesArray of strings to deduplicate

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/acchuang/jina-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server