Skip to main content
Glama

Server Details

Entity resolution toolkit — deduplicate, match records, golden records, PPRL

Status
Healthy
Last Tested
Transport
Streamable HTTP
URL

See and control every tool call

Log every tool call with full inputs and outputs
Control which tools are enabled per connector
Manage credentials once, use from any MCP client
Monitor uptime and get alerted when servers go down

Available Tools

27 tools
agent_approve_rejectInspect

Approve or reject a review queue pair

ParametersJSON Schema
NameRequiredDescriptionDefault
id_aYes
id_bYes
reasonNo
decisionYes
job_nameYes
decided_byYes
agent_compare_strategiesInspect

Compare ER strategies on your data

ParametersJSON Schema
NameRequiredDescriptionDefault
file_pathYes
ground_truthNo
agent_deduplicateInspect

Run full ER pipeline with confidence gating and reasoning

ParametersJSON Schema
NameRequiredDescriptionDefault
configNo
file_pathYes
agent_explain_clusterInspect

Explain why records are in the same cluster

ParametersJSON Schema
NameRequiredDescriptionDefault
cluster_idYes
agent_explain_pairInspect

Natural language explanation for a record pair

ParametersJSON Schema
NameRequiredDescriptionDefault
exactNo
fuzzyNo
record_aYes
record_bYes
agent_match_sourcesInspect

Match two files with intelligent strategy selection

ParametersJSON Schema
NameRequiredDescriptionDefault
configNo
file_aYes
file_bYes
agent_review_queueInspect

Get borderline pairs awaiting approval

ParametersJSON Schema
NameRequiredDescriptionDefault
job_nameYes
analyze_dataInspect

Profile data, detect domain, recommend ER strategy

ParametersJSON Schema
NameRequiredDescriptionDefault
file_pathYes
auto_configureInspect

Generate optimal matching config from data analysis

ParametersJSON Schema
NameRequiredDescriptionDefault
file_pathYes
constraintsNo
create_domainInspect

Create a custom domain extraction rulebook. Define patterns for a specific data domain (medical devices, automotive parts, real estate, etc.).

ParametersJSON Schema
NameRequiredDescriptionDefault
nameYesDomain name (e.g. 'medical_devices', 'automotive_parts')
scopeNoSave locally (.goldenmatch/domains/) or globally (~/.goldenmatch/domains/). Default: local.local
signalsYesColumn name keywords that trigger this domain (e.g. ['ndc', 'fda', 'implant'])
stop_wordsNoWords to strip during name normalization
brand_patternsNoBrand/manufacturer names to extract (e.g. ['Medtronic', 'Abbott'])
attribute_patternsNoNamed regex patterns for domain attributes (e.g. {'size': '\\b(\\d+mm)\\b'})
identifier_patternsNoNamed regex patterns for domain identifiers (e.g. {'ndc': '\\b(\\d{5}-\\d{4}-\\d{2})\\b'})
explain_matchInspect

Explain why two records match or don't match. Shows per-field score breakdown.

ParametersJSON Schema
NameRequiredDescriptionDefault
record_aYesFirst record fields
record_bYesSecond record fields
export_resultsInspect

Export matching results to a file (CSV or JSON).

ParametersJSON Schema
NameRequiredDescriptionDefault
formatNoOutput format (default csv)csv
output_pathYesFile path to save results
find_duplicatesInspect

Find duplicate matches for a record. Provide field values to search against the loaded dataset.

ParametersJSON Schema
NameRequiredDescriptionDefault
top_kNoMax results to return (default 5)
recordYesRecord fields to match (e.g. {"name": "John Smith", "zip": "10001"})
get_clusterInspect

Get details of a specific cluster: all member records and their field values.

ParametersJSON Schema
NameRequiredDescriptionDefault
cluster_idYesCluster ID to look up
get_golden_recordInspect

Get the merged golden (canonical) record for a cluster.

ParametersJSON Schema
NameRequiredDescriptionDefault
cluster_idYesCluster ID
get_statsInspect

Get dataset statistics: record count, cluster count, match rate, cluster sizes.

ParametersJSON Schema
NameRequiredDescriptionDefault

No parameters

list_clustersInspect

List duplicate clusters found in the dataset. Returns cluster IDs, sizes, and member counts.

ParametersJSON Schema
NameRequiredDescriptionDefault
limitNoMax clusters to return (default 20)
min_sizeNoMinimum cluster size to include (default 2)
list_domainsInspect

List available domain extraction rulebooks (built-in + user-defined).

ParametersJSON Schema
NameRequiredDescriptionDefault

No parameters

match_recordInspect

Match a single record against the loaded dataset in real-time. Paste a record's fields and instantly see if it matches any existing record. Uses the configured matchkeys, scorers, and thresholds. Example: {"name": "John Smith", "email": "john@test.com", "zip": "10001"}

ParametersJSON Schema
NameRequiredDescriptionDefault
top_kNoMax matches to return (default 5)
recordYesRecord fields to match against the dataset
thresholdNoMinimum score to consider a match (default: use config threshold)
pprl_auto_configInspect

Analyze the loaded dataset and recommend optimal PPRL (privacy-preserving record linkage) configuration. Returns recommended fields, bloom filter parameters, threshold, and explanation.

ParametersJSON Schema
NameRequiredDescriptionDefault
use_llmNoUse LLM for enhanced recommendations (requires API key)
security_levelNoSecurity level (default: high)high
profile_dataInspect

Get data quality profile: column types, null rates, unique counts, sample values.

ParametersJSON Schema
NameRequiredDescriptionDefault

No parameters

shatter_clusterInspect

Break an entire cluster into individual records. All members become singletons. Use when a cluster is completely wrong.

ParametersJSON Schema
NameRequiredDescriptionDefault
cluster_idYesCluster ID to shatter
suggest_configInspect

Analyze bad merges and suggest config changes. Provide examples of incorrect merges (pairs that should NOT have matched) and GoldenMatch will identify which fields/thresholds to tighten. Example: [{"record_a": {...}, "record_b": {...}, "reason": "different people"}]

ParametersJSON Schema
NameRequiredDescriptionDefault
bad_mergesYesList of bad merge examples with record_a, record_b, and optional reason
suggest_pprlInspect

Check if data needs privacy-preserving matching

ParametersJSON Schema
NameRequiredDescriptionDefault
file_pathYes
test_domainInspect

Test a domain extraction rulebook against sample records. Shows what features would be extracted from the loaded data.

ParametersJSON Schema
NameRequiredDescriptionDefault
domain_nameYesName of the domain rulebook to test
sample_sizeNoNumber of records to test (default 10)
unmerge_recordInspect

Remove a record from its cluster. The record becomes a singleton. Remaining cluster members are re-clustered using stored pair scores. Use this to fix bad merges.

ParametersJSON Schema
NameRequiredDescriptionDefault
record_idYesRow ID of the record to unmerge

Verify Ownership

Claim this connector by publishing a /.well-known/glama.json file on your server's domain with the following structure:

{
  "$schema": "https://glama.ai/mcp/schemas/connector.json",
  "maintainers": [
    {
      "email": "your-email@example.com"
    }
  ]
}

The email address must match the email associated with your Glama account. Once verified, the connector will appear as claimed by you.

Sign in to verify ownership

Discussions

No comments yet. Be the first to start the discussion!

Try in Browser

Your Connectors

Sign in to create a connector for this server.