Skip to main content
Glama
hydrolix

mcp-hydrolix

Official

Server Configuration

Describes the environment variables required to run the server.

NameRequiredDescriptionDefault
HYDROLIX_HOSTYesThe hostname of your Hydrolix server
HYDROLIX_PORTNoThe port number of your Hydrolix server. Usually doesn't need to be set unless using a non-standard port8088
HYDROLIX_USERYesThe username for authentication
HYDROLIX_VERIFYNoEnable/disable SSL certificate verification. Set to "false" to disable certificate verification (not recommended for production)true
HYDROLIX_DATABASENoDefault database to use. Set this to automatically connect to a specific database
HYDROLIX_PASSWORDYesThe password for authentication
HYDROLIX_MCP_BIND_HOSTNoHost to bind the MCP server to when using HTTP or SSE transport. Set to "0.0.0.0" to bind to all network interfaces (useful for Docker or remote access). Only used when transport is "http" or "sse"127.0.0.1
HYDROLIX_MCP_BIND_PORTNoPort to bind the MCP server to when using HTTP or SSE transport. Only used when transport is "http" or "sse"8000
HYDROLIX_MCP_SERVER_TRANSPORTNoSets the transport method for the MCP server. Valid options: "stdio", "http", "sse". This is useful for local development with tools like MCP Inspectorstdio

Capabilities

Features and capabilities supported by this server

CapabilityDetails
tools
{
  "listChanged": true
}
logging
{}
prompts
{
  "listChanged": false
}
resources
{
  "subscribe": false,
  "listChanged": false
}
extensions
{
  "io.modelcontextprotocol/ui": {}
}
experimental
{}

Tools

Functions exposed to the LLM to take actions

NameDescription
list_databasesA

List available Hydrolix databases

get_table_infoA

Get detailed metadata for a specific table including columns and summary table detection.

REQUIRED USAGE: Call this tool BEFORE querying ANY table to check if it's a summary table and get column metadata. This is mandatory to avoid query errors.

This tool provides:

  • is_summary_table: Boolean indicating if table has pre-aggregated data

  • columns: List of column objects, each with a column_category field:

    • column_category='Column': plain dimension column

    • column_category='AliasColumn': non-aggregate ALIAS column, has default_expr

    • column_category='AggregateColumn': AggregateFunction/SimpleAggregateFunction type, has base_function and merge_function

    • column_category='SummaryColumn': ALIAS column that transitively depends on aggregates, has default_expr

  • summary_table_info: Human-readable description for summary tables

  • total_rows, total_bytes: Table statistics

WORKFLOW for querying tables:

  1. Call get_table_info('database', 'table_name')

  2. Check is_summary_table field

  3. If is_summary_table=True:

    • Read column_category and merge_function for each column

    • Use merge_function to wrap aggregate columns in queries

    • Example: SELECT countMerge(count(vendor_id)) FROM table

  4. If is_summary_table=False:

    • Use standard SQL (SELECT count(*), sum(col), etc.)

  5. Execute query with run_select_query

For summary tables, aggregate columns MUST be wrapped with their corresponding -Merge functions from the merge_function field. Querying without checking this metadata first will cause errors.

list_tablesA

List all tables in a database for exploration and discovery.

Use this tool to:

  • Discover what tables exist in a database

  • Filter tables by name pattern (like/not_like)

  • Get basic table metadata (name, engine, row counts, sizes, primary keys)

Returns basic table information WITHOUT column details for performance. Tables are returned with empty columns lists and is_summary_table not set.

IMPORTANT: Always call get_table_info(database, table) before querying a specific table. Column metadata (types, categories, merge functions) is required to build correct queries, especially for summary tables which need special -Merge function syntax. list_tables() is intentionally lightweight to avoid loading schema for all tables at once.

run_select_queryA

Run a SELECT query in a Hydrolix time-series database using the Clickhouse SQL dialect. Queries run using this tool will timeout after 30 seconds.

RESULT TRUNCATION:

Query results are automatically truncated when the total cell count (rows * columns) exceeds the configured limit.

Response shape: - Always present: columns, rows, truncated (bool), row_count - Only when truncated=true: total_row_count, message Note: total_row_count is the number of rows fetched from the server, which is capped at 100,000. The actual table may contain more rows than this value suggests.

Note: if the cell limit is smaller than the number of columns, row_count will be 0 — in that case you must either refine the query (fewer columns, stricter filters) or increase max_cells.

MANDATORY PRE-QUERY CHECK:

Before running ANY query, call get_table_info(database, table_name) if you haven't already. Check is_summary_table and read column metadata (column_category, merge_function per column). If is_summary_table=True: follow summary_table_info from get_table_info response and rules below. If is_summary_table=False: use standard SQL (count, sum, avg, etc.).

The primary key on tables queried this way is always a timestamp. Queries should include either a LIMIT clause or a filter based on the primary key as a performance guard to ensure they return in a reasonable amount of time. Queries should select specific fields and avoid the use of SELECT * to avoid performance issues. The performance guard used for the query should be clearly communicated with the user, and the user should be informed that the query may take a long time to run if the performance guard is not used. When choosing a performance guard, the user's preference should be requested and used if available. When using aggregations, the performance guard should take form of a primary key filter, or else the LIMIT should be applied in a subquery before applying the aggregations.

When matching columns based on substrings, prefix or suffix matches should be used instead of full-text search whenever possible. When searching for substrings, the syntax column LIKE '%suffix' or column LIKE 'prefix%' should be used.

SUMMARY TABLE RULES (if is_summary_table=True):

Use column_category from get_table_info to determine column usage — do NOT infer from names.

  1. column_category='AggregateColumn': MUST be wrapped in its merge_function

    • Stores binary AggregateFunction state — direct SELECT causes deserialization errors

    • Use exact merge_function from column metadata (do NOT infer from column name)

    • count(vendor_id) → countMerge(count(vendor_id)), countIf(c) → countIfMerge(countIf(c))

    • Always use backticks for column names with special characters

  2. column_category='SummaryColumn': select directly, no wrapping

    • ALIAS that wraps -Merge internally — NEVER wrap in sum()/count()/avg() (ILLEGAL_AGGREGATION)

    • Per-row value — for grand totals use the corresponding AggregateColumn + merge_function

  3. column_category='Column'/'AliasColumn': dimension columns, use as-is

    • Many have function-like names (e.g., toStartOfMinute(dt)) — LITERAL names, not expressions

    • WRONG: SELECT toStartOfMinute(dt) RIGHT: SELECT toStartOfMinute(dt)

    • For time filters: use '2022-06-01' or '2022-06-01 00:00:00' — NOT partial '2022-06-01 00:00'

    • Use >= and < for ranges: WHERE col >= '2022-06-01' AND col < '2022-06-02'

  4. GROUP BY: required when SELECT mixes dimension columns with aggregates

    • Only Column/AliasColumn go in GROUP BY — never AggregateColumn or SummaryColumn

  5. NEVER use SELECT * on summary tables (causes deserialization errors)

Summary table query patterns (after calling get_table_info first):

Pattern 1: Aggregate entire table -- First: get_table_info('database', 'summary_table') -- Read column.merge_function for count(column_name) = "countMerge" SELECT countMerge(count(column_name)) as total FROM database.summary_table

Pattern 2: Aggregate with grouping by dimension and optional time range filter -- First: get_table_info('database', 'summary_table') -- Read merge_function for each aggregate column SELECT toStartOfMinute(datetime_field) as time_bucket, countMerge(count(column_name)) as total, avgMerge(avg(other_column)) as avg_value FROM database.summary_table WHERE toStartOfMinute(datetime_field) >= '2022-06-01' AND toStartOfMinute(datetime_field) < '2022-06-02' GROUP BY toStartOfMinute(datetime_field) ORDER BY time_bucket DESC

Pattern 3: Multiple aggregates (no dimensions, no GROUP BY) -- First: get_table_info('database', 'summary_table') SELECT countMerge(count(column_name)) as count_result, sumMerge(sum(other_column)) as sum_result FROM database.summary_table

Pattern 4: Using column_category='SummaryColumn' -- First: get_table_info('database', 'summary_table') -- SummaryColumns are per-row values — use with GROUP BY to break down by dimension SELECT cdn, cnt_all, sum_bytes FROM database.summary_table GROUP BY cdn -- No -Merge needed, these are pre-defined aliases -- For a grand total across all rows, use AggregateColumn + merge_function instead: SELECT countMerge(count()) AS grand_total FROM database.summary_table

Pattern 5: Using dimensions with function-like names (common pattern) -- First: get_table_info('database', 'summary_table') -- Dimension column named: toStartOfMinute(primary_datetime) — LITERAL name, not an expression -- WRONG: SELECT toStartOfMinute(primary_datetime) ... (tries to call function) -- RIGHT: Use the literal column name with backticks SELECT toStartOfMinute(primary_datetime) as time_bucket, countMerge(count()) as cnt, maxMerge(max(value)) as max_val FROM database.summary_table GROUP BY toStartOfMinute(primary_datetime) ORDER BY time_bucket DESC LIMIT 10

Regular table examples (non-summary):

Example query. Purpose: get logs from the application.logs table. Primary key: timestamp. Performance guard: 10 minute recency filter.

SELECT message, timestamp FROM application.logs WHERE timestamp > now() - INTERVAL 10 MINUTES

Example query. Purpose: get the median humidity from the weather.measurements table. Primary key: date. Performance guard: 1000 row limit, applied before aggregation.

SELECT median(humidity) FROM (SELECT humidity FROM weather.measurements LIMIT 1000)

Example query. Purpose: get the lowest temperature from the weather.measurements table over the last 10 years. Primary key: date. Performance guard: date range filter.

SELECT min(temperature) FROM weather.measurements WHERE date > now() - INTERVAL 10 YEARS

Example query. Purpose: get the app name with the most log messages from the application.logs table in the window between new year and valentine's day of 2024. Primary key: timestamp. Performance guard: date range filter. SELECT app, count(*) FROM application.logs WHERE timestamp > '2024-01-01' AND timestamp < '2024-02-14' GROUP BY app ORDER BY count(*) DESC LIMIT 1

Prompts

Interactive templates invoked by user choice

NameDescription

No prompts

Resources

Contextual data attached and managed by the client

NameDescription

No resources

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/hydrolix/mcp-hydrolix'

If you have feedback or need assistance with the MCP directory API, please join our Discord server