Skip to main content
Glama
zilliztech

Zilliz MCP Server

Official
by zilliztech

create_collection

Create vector collections in Milvus or Zilliz Cloud by specifying cluster details, dimensions, and metric types to organize data for similarity searches.

Instructions

Create a collection in a specified cluster using Quick Setup.

Args:
    cluster_id: ID of the cluster
    region_id: ID of the cloud region hosting the cluster
    endpoint: The cluster endpoint URL. Can be obtained by calling describe_cluster and using the connect_address field
    collection_name: The name of the collection to create
    dimension: The number of dimensions a vector value should have
    db_name: The name of the database. Pass explicit dbName or leave empty when cluster is free or serverless
    metric_type: The metric type (default: "COSINE", options: "L2", "IP", "COSINE") Ask the user to select the metric type, if user does not select, use default value "COSINE"
    id_type: The data type of the primary field (default: "Int64", options: "Int64", "VarChar")
    auto_id: Whether the primary field automatically increments (default: True)
    primary_field_name: The name of the primary field (default: "id")
    vector_field_name: The name of the vector field (default: "vector")
Returns:
    Dict containing the response
    Example:
    {
        "code": 0,
        "data": {}
    }
    

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
cluster_idYes
region_idYes
endpointYes
collection_nameYes
dimensionYes
db_nameNo
metric_typeNoCOSINE
id_typeNoInt64
auto_idNo
primary_field_nameNoid
vector_field_nameNovector
Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden of behavioral disclosure. It mentions the tool creates something ('Create a collection'), implying a write operation, but doesn't address permissions, side effects, error handling, or rate limits. The example response format adds some value, but key behavioral traits like whether this is idempotent or what happens on failure are missing.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness3/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is front-loaded with the core purpose, followed by a structured list of parameters and returns. However, it includes verbose examples and repetitive details (e.g., listing all default values in the description when they're already in the schema), making it longer than necessary. Some sentences, like the full example response, could be trimmed for efficiency.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the complexity (11 parameters, no annotations, no output schema), the description is moderately complete. It covers parameter semantics well but lacks behavioral context (e.g., error cases, performance). The example response helps, but without an output schema, more detail on return values would improve completeness. It's adequate but has gaps for a mutation tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 0%, so the description must compensate fully. It provides detailed semantics for all 11 parameters, including explanations (e.g., 'endpoint: The cluster endpoint URL. Can be obtained by calling describe_cluster'), default values, options for enums like 'metric_type', and usage notes (e.g., for 'db_name'). This adds significant meaning beyond the bare schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the action ('Create a collection') and resource ('in a specified cluster'), with the method 'using Quick Setup' providing additional context. It distinguishes from siblings like 'list_collections' or 'describe_collection' by focusing on creation. However, it doesn't explicitly differentiate from other creation tools like 'create_free_cluster' beyond the resource type.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage when needing to create a collection in a cluster, with a note that 'endpoint' can be obtained from 'describe_cluster', suggesting a prerequisite. However, it lacks explicit guidance on when to use this vs. alternatives (e.g., no mention of when to use 'create_free_cluster' first or how this relates to database operations), and doesn't specify exclusions or edge cases.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Install Server

Other Tools

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/zilliztech/zilliz-mcp-server'

If you have feedback or need assistance with the MCP directory API, please join our Discord server