Schema | langfuse-mcp-java

langfuse-mcp-java

Server Configuration

Environment variables used to configure the server. Variables marked Yes under Required must be set; the others fall back to the default shown.

Name	Required	Description	Default
`LANGFUSE_HOST`	Yes	Langfuse base URL
`LANGFUSE_TIMEOUT`	No	HTTP request timeout (Spring Duration format, e.g. 30s, 1m)	30s
`LANGFUSE_PUBLIC_KEY`	Yes	Langfuse project public key
`LANGFUSE_SECRET_KEY`	Yes	Langfuse project secret key
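
As a minimal sketch of how a launcher script might check these variables before starting the server, the Python below reports missing required variables and converts the simple `30s` / `1m` timeout style to seconds. The function names are hypothetical, and the parser handles only a small number-plus-unit subset, not every format Spring accepts for a Duration.

```python
import os
import re

# Required variables taken from the table above.
REQUIRED_VARS = ("LANGFUSE_HOST", "LANGFUSE_PUBLIC_KEY", "LANGFUSE_SECRET_KEY")
DEFAULT_TIMEOUT = "30s"  # default listed for LANGFUSE_TIMEOUT

# Seconds per unit for the simple "<number><unit>" style (assumed subset).
_UNIT_SECONDS = {"ms": 0.001, "s": 1, "m": 60, "h": 3600}


def missing_required(env):
    """Return the required variable names that are unset or blank in env."""
    return [name for name in REQUIRED_VARS if not env.get(name, "").strip()]


def parse_simple_duration(text):
    """Convert a value such as '30s' or '1m' to seconds (hypothetical helper)."""
    match = re.fullmatch(r"(\d+)(ms|s|m|h)", text.strip().lower())
    if match is None:
        raise ValueError(f"unsupported duration: {text!r}")
    amount, unit = match.groups()
    return int(amount) * _UNIT_SECONDS[unit]


def check_environment(env=None):
    """Return (missing_names, timeout_seconds) for an environment mapping."""
    env = os.environ if env is None else env
    timeout = parse_simple_duration(env.get("LANGFUSE_TIMEOUT", DEFAULT_TIMEOUT))
    return missing_required(env), timeout
```

Calling `check_environment()` with no argument inspects the current process environment; passing a plain dict makes the check easy to exercise in isolation.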

Capabilities

Features and capabilities supported by this server

Capability	Details
`tools`	{ "listChanged": true }
`logging`	{}
`prompts`	{ "listChanged": true }
`resources`	{ "subscribe": false, "listChanged": true }
`completions`	{}
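
A client could read the capability table along the following lines. This is only a sketch: the dictionary mirrors the table above, the helper names are hypothetical, and the notification method names are those defined by the MCP specification for list-change events.

```python
# Capability object copied from the table above.
CAPABILITIES = {
    "tools": {"listChanged": True},
    "logging": {},
    "prompts": {"listChanged": True},
    "resources": {"subscribe": False, "listChanged": True},
    "completions": {},
}


def list_changed_notifications(capabilities):
    """Return the list-changed notification methods the server may send."""
    return [
        f"notifications/{feature}/list_changed"
        for feature in ("tools", "prompts", "resources")
        if capabilities.get(feature, {}).get("listChanged")
    ]


def can_subscribe_to_resources(capabilities):
    """True only if the server advertises per-resource update subscriptions."""
    return bool(capabilities.get("resources", {}).get("subscribe"))
```

With the values listed here, a client would expect list-change notifications for tools, prompts, and resources, but should not attempt per-resource subscriptions.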

Tools

Functions exposed to the LLM to take actions

Name	Description
create_annotation_queue	Creates a new annotation queue for human review workflows. Returns the created queue with its assigned ID. name is required. description and scoreConfigId are optional.
create_annotation_queue_item	Adds an item to an annotation queue for human review. queueId and objectId are required. objectType can be SESSION, TRACE, or OBSERVATION. status is optional: PENDING | COMPLETED.
delete_annotation_queue_item	Removes an item from an annotation queue. This action is irreversible. Both queueId and itemId are required.
get_annotation_queue	Returns a single annotation queue by its ID. Returns: id, name, description, scoreConfigId, projectId, createdAt, updatedAt. queueId is required.
get_annotation_queue_item	Returns a specific item from an annotation queue by queue ID and item ID. Returns: id, queueId, traceId, observationId, status, annotatorUserId, completedAt. Both queueId and itemId are required.
list_annotation_queue_items	Returns items in a specific annotation queue, optionally filtered by status. status values: PENDING | COMPLETED. Omit status to return all items regardless of status. Each item contains: id, queueId, traceId, observationId, status, annotatorUserId, completedAt. queueId is required.
list_annotation_queues	Returns a paginated list of annotation queues in the Langfuse project. Each queue contains: id, name, description, scoreConfigId, projectId, createdAt, updatedAt. Annotation queues are used for human-in-the-loop review workflows. Pagination: page is 1-based (default 1), limit controls page size (default 20).
update_annotation_queue_item	Updates the status of an annotation queue item. status values: PENDING | COMPLETED. Both queueId and itemId are required.
create_comment	Creates a comment attached to a trace, observation, session, or prompt. objectType values: TRACE | OBSERVATION | SESSION | PROMPT. Both objectType and objectId are required along with content. Returns the created comment with its assigned ID.
get_comment	Returns a single comment by its ID. Returns: id, objectType, objectId, content, authorUserId, createdAt, updatedAt. commentId is required.
get_comments	Lists comments filtered by objectType and objectId. objectType values: TRACE | OBSERVATION. Returns: id, objectType, objectId, content, authorUserId, createdAt. Read-only.
get_cost_metrics	Query Langfuse cost, token, latency, and usage analytics via the Metrics API.
	Mirrors: GET /api/public/metrics?query=
	Pass the full query as a JSON string. All aggregation is server-side.
	QUERY STRUCTURE:
	{
	  "view": string,           // REQUIRED. traces | observations | scores-numeric | scores-categorical
	  "metrics": [...],         // REQUIRED. At least one { measure, aggregation, alias? }
	  "fromTimestamp": string,  // REQUIRED. ISO-8601 e.g. "2026-03-18T00:00:00Z"
	  "toTimestamp": string,    // REQUIRED. ISO-8601 e.g. "2026-03-25T23:59:59Z"
	  "dimensions": [...],      // Optional. [{ "field": "..." }]
	  "filters": [...],         // Optional. [{ "column", "operator", "value", "type", "key"? }]
	  "timeDimension": {...},   // Optional. { "granularity": "hour|day|week|month|auto" }
	  "config": {...}           // Optional. { "bins": 10, "row_limit": 100 }
	}
	VIEW:
	traces → end-to-end cost, tokens, latency per request
	observations → per LLM call; USE for model breakdowns (providedModelName)
	scores-numeric → numeric/boolean evaluation scores
	scores-categorical → categorical evaluation scores
	MEASURES (by view):
	traces: count | observationsCount | scoresCount | latency | totalTokens | totalCost
	observations: count | latency | totalTokens | totalCost | timeToFirstToken | countScores
	scores-numeric: count | value
	scores-categorical: count
	⚠ NEVER use inputTokens / outputTokens / promptTokens / completionTokens → 400 error. Use totalTokens.
	AGGREGATIONS:
	sum | avg | count | max | min | p50 | p75 | p90 | p95 | p99
	sum → cost/token totals
	avg/p95/p99 → latency
	count → record counts
	DIMENSIONS (group-by, by view):
	traces: name | tags | userId | sessionId | release | version | environment | observationName | scoreName
	observations: providedModelName | type | name | level | version | environment | userId | sessionId | traceName | traceRelease | traceVersion | promptName | promptVersion | scoreName
	⚠ HIGH CARDINALITY: use in filters only, not dimensions: id | traceId | observationId
	FILTERS:
	Each filter: { "column", "operator", "value", "type", "key"? }
	type string/stringObject/boolean → operator: = | contains | does not contain | starts with | ends with
	type number/datetime → operator: = | < | > | <= | >=
	⚠ NEVER use != / not_contains / not_equals for string fields → 400 error.
	EXAMPLES:
	Total cost last 7 days:
	{"view":"traces","metrics":[{"measure":"totalCost","aggregation":"sum"}], "fromTimestamp":"2026-03-18T00:00:00Z","toTimestamp":"2026-03-25T23:59:59Z"}
	Daily cost trend this week:
	{"view":"traces","metrics":[{"measure":"totalCost","aggregation":"sum"},{"measure":"count","aggregation":"count"}], "timeDimension":{"granularity":"day"}, "fromTimestamp":"2026-03-18T00:00:00Z","toTimestamp":"2026-03-25T23:59:59Z"}
	Cost by model:
	{"view":"observations","dimensions":[{"field":"providedModelName"}], "metrics":[{"measure":"totalCost","aggregation":"sum"},{"measure":"totalTokens","aggregation":"sum"}], "fromTimestamp":"2026-03-18T00:00:00Z","toTimestamp":"2026-03-25T23:59:59Z"}
	Cost for a specific user:
	{"view":"traces","metrics":[{"measure":"totalCost","aggregation":"sum"}], "filters":[{"column":"userId","operator":"=","value":"user-123","type":"string"}], "fromTimestamp":"2026-03-18T00:00:00Z","toTimestamp":"2026-03-25T23:59:59Z"}
	Production environment only:
	filters: [{"column":"environment","operator":"=","value":"production","type":"string"}]
create_dataset_run_item	Creates a dataset run item and creates or updates the dataset run if needed. runName and datasetItemId are required. traceId is strongly recommended and observationId is optional. metadataJson must be valid JSON when provided.
delete_dataset_run	Deletes a dataset run and all its run items. This action is irreversible. Use this to clean up experiment runs you no longer need. Both datasetName and runName are required.
get_dataset_run	Returns a single dataset run including all its run items. Each run item links a dataset item to a trace and optional observation. Returns: id, name, datasetName, metadata, createdAt, updatedAt, datasetRunItems[]. Both datasetName and runName are required.
list_dataset_run_items	Returns a paginated list of items in a specific dataset run. Each run item links a dataset item to a trace and optional observation for evaluation. Returns: id, datasetRunId, datasetRunName, datasetItemId, traceId, observationId, createdAt. Both datasetId and runName are required.
list_dataset_runs	Returns a paginated list of runs for a specific dataset. Each run represents one experiment executed against a dataset. Returns: id, name, datasetId, datasetName, metadata, createdAt, updatedAt. datasetName is required. Pagination: page is 1-based (default 1), limit controls page size (default 20).
create_dataset	Creates a new dataset in Langfuse. name is required. description is optional. metadataJson, inputSchemaJson, and expectedOutputSchemaJson must be valid JSON when provided. Returns the created dataset definition.
create_dataset_item	Creates or upserts a dataset item in an existing dataset. datasetName is required. inputJson, expectedOutputJson, and metadataJson must be valid JSON when provided. Optional sourceTraceId or sourceObservationId can link the item back to Langfuse data.
delete_dataset_item	Deletes a dataset item by its ID. This action is irreversible.
get_dataset	Get a Langfuse dataset by name. Read-only.
get_dataset_item	Get a single dataset item by ID. Read-only.
list_dataset_items	List items in a dataset with pagination. Read-only.
list_datasets	List all evaluation datasets in the Langfuse project. Read-only.
list_llm_connections	Returns a paginated list of LLM provider connections configured in the Langfuse project. Each connection contains: id, provider, displaySecretKey (masked), baseURL, config. LLM connections define the provider credentials used by Langfuse for evaluations and playground. Pagination: page is 1-based (default 1), limit controls page size (default 20).
upsert_llm_connection	Creates or updates an LLM provider connection (upserted by provider name). If a connection for the given provider already exists, it is updated. provider and secretKey are required. provider examples: openai, anthropic, azure, google.
create_model	Creates a custom model definition for cost tracking and token pricing. modelName, matchPattern, and unit are required. unit values: TOKENS | CHARACTERS | MILLISECONDS | SECONDS | IMAGES | REQUESTS. Prices are per unit in USD (e.g. inputPrice=0.000001 means $1 per million tokens). Omit prices for models where you do not want cost tracking. startDate format: ISO-8601 date, e.g. 2025-01-01T00:00:00Z.
delete_model	Deletes a custom model definition by ID. Note: Langfuse-managed models cannot be deleted. Only custom models you created can be deleted. To override a Langfuse-managed model, create a new custom model with the same modelName instead. modelId is required. This action is irreversible.
get_model	Returns a single model definition by its ID. Returns: id, modelName, matchPattern, unit, inputPrice, outputPrice, totalPrice, startDate, tokenizerId, isLangfuseManaged, projectId. modelId is required.
list_models	Returns a paginated list of all models in the Langfuse project, including both Langfuse-managed models and custom models you have defined. Each model contains: id, modelName, matchPattern, unit, inputPrice, outputPrice, totalPrice, startDate, tokenizerId, isLangfuseManaged. Pagination: page is 1-based (default 1), limit controls page size (default 20).
get_projects_for_api_key	Returns the project or projects visible to the currently configured API key. With a project-scoped key this normally returns one project. With broader credentials, use this to confirm which project metadata is available.
get_prompt	Fetch a specific prompt by name. Optionally pin to a version number or a label (e.g. 'production', 'staging'). Returns: name, version, type (text|chat), prompt content, labels, tags, config. Read-only.
list_prompts	List all prompts in the Langfuse project with pagination. Read-only.
create_prompt	Creates a new version of a prompt. If the prompt name does not exist, a new prompt is created. If it does exist, a new version is appended. type values: text (plain string prompt) | chat (array of message objects). For text prompts, provide prompt as a plain string. For chat prompts, provide prompt as a JSON array of message objects with role and content fields. labels examples: production, staging, latest. The 'latest' label is managed by Langfuse automatically. Returns the created prompt version with its assigned version number. name, type, and prompt are required.
delete_prompt	Deletes prompt versions by name. Behaviour depends on which filters are supplied. Omit both label and version: deletes ALL versions of the prompt. Supply label only: deletes all versions carrying that label. Supply version only: deletes that specific version number. promptName is required. This action is irreversible.
update_prompt_labels	Replaces the labels on a specific prompt version. newLabels completely replaces the existing label set on that version. The 'latest' label is reserved and managed by Langfuse; do not include it. Both promptName and version are required. newLabels is also required; supply an empty string to remove all labels from this version.
get_data_schema	Returns the Langfuse data model schema: all entity types, fields, and valid enum values. Call this first before running any query to understand the available data structures. Read-only.
create_score_config	Creates a score config definition used to validate or structure future scores. name and dataType are required. For categorical configs, categoriesJson should be a JSON array of {label,value} objects. For numeric configs, minValue and maxValue are optional bounds.
get_score	Fetch a single evaluation score by ID. Read-only.
get_score_config	Get a specific score config schema by ID. Read-only.
get_score_configs	List all score config schemas. Configs define constraints for NUMERIC (min/max), CATEGORICAL (allowed categories), or BOOLEAN scores. Read-only.
get_scores	List evaluation scores with optional filters. dataType values: NUMERIC | CATEGORICAL | BOOLEAN. Returns: id, traceId, observationId, name, value, dataType, comment, source. Read-only.
update_score_config	Updates an existing score config. configId is required. Provide only the fields you want to change. categoriesJson must be a JSON array of {label,value} objects when supplied.
fetch_sessions	Paginated list of all sessions with optional time range filter. Read-only.
get_session_details	Full details of one session including all its traces. Read-only.
get_user_sessions	All sessions for a specific user with pagination. Read-only.
delete_trace	Deletes a single trace by ID. This action is irreversible. traceId is required.
delete_traces	Deletes multiple traces in one request. Pass a comma-separated list of trace IDs. This action is irreversible.
fetch_trace	Returns the full detail of a single Langfuse trace identified by its ID. The response includes all observations (spans, generations, events) nested under the trace, as well as input/output payloads, metadata, tags, latency, and token usage. Use this after fetch_traces to drill into a specific trace. traceId is required.
fetch_traces	Returns a paginated list of Langfuse traces. Each trace represents one end-to-end LLM pipeline execution. The response includes: id, name, userId, sessionId, level (DEFAULT | DEBUG | WARNING | ERROR), latency (seconds), totalTokens, totalCost (USD), tags, timestamp. All filter parameters are optional. Omit any filter you do not need; omitted filters are ignored and do not narrow the result set. Pagination: page is 1-based (default 1), limit controls page size (default 20, max 100). To page through results, increment page while keeping limit fixed.
find_exceptions	Returns only traces whose level field equals ERROR. Filtering is performed on the server before the response is returned, so the result set contains error traces only, never a mix of levels. Useful for surfacing pipeline failures and debugging production errors. Both time range parameters are optional. Omit them to search across all time. Pagination works the same way as fetch_traces.
find_exceptions_in_file	Returns ERROR-level traces whose metadata contains the given file name as a substring. Both conditions must be true for a trace to appear in the result: (1) the trace level is ERROR, and (2) the trace metadata JSON contains the fileName string anywhere inside it. Use this to isolate errors originating from a specific source file. fileName is required. Both time range parameters are optional; omit them to search across the full project history.
get_error_count	Returns the total count of ERROR-level traces within the specified time range. The server scans up to 500 traces (5 pages of 100) and counts those with level=ERROR. The response contains errorCount, fromTimestamp, and toTimestamp. Both time range parameters are optional. Omit them to count errors across all time. Use this for a quick health signal before drilling into individual traces with find_exceptions.
get_exception_details	Returns the full detail of a single ERROR-level trace identified by its ID. Equivalent to fetch_trace but semantically scoped to error traces. Use this after find_exceptions to inspect a specific failure in depth; the response includes all nested observations, input/output, metadata, and timing. traceId is required.
get_user_traces	All traces for a specific user with pagination. Read-only.
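
The get_cost_metrics rules above (measures per view, permitted string operators) can be checked before the query string is sent. The following is a rough sketch only: `build_metrics_query` is a hypothetical helper, not part of the server, and it encodes just the constraints listed in the tool description.

```python
import json

# Measures allowed per view, copied from the get_cost_metrics description.
MEASURES_BY_VIEW = {
    "traces": {"count", "observationsCount", "scoresCount", "latency",
               "totalTokens", "totalCost"},
    "observations": {"count", "latency", "totalTokens", "totalCost",
                     "timeToFirstToken", "countScores"},
    "scores-numeric": {"count", "value"},
    "scores-categorical": {"count"},
}
# Filter types that take the string-style operators, and those operators.
STRING_TYPES = {"string", "stringObject", "boolean"}
STRING_OPERATORS = {"=", "contains", "does not contain", "starts with", "ends with"}


def build_metrics_query(view, metrics, from_ts, to_ts, dimensions=(), filters=()):
    """Validate the inputs and return the query as a JSON string.

    metrics is a sequence of (measure, aggregation) pairs.
    """
    if view not in MEASURES_BY_VIEW:
        raise ValueError(f"unknown view: {view!r}")
    if not metrics:
        raise ValueError("at least one metric is required")
    for measure, _aggregation in metrics:
        if measure not in MEASURES_BY_VIEW[view]:
            raise ValueError(f"measure {measure!r} is not available on view {view!r}")
    for flt in filters:
        if flt["type"] in STRING_TYPES and flt["operator"] not in STRING_OPERATORS:
            raise ValueError(f"operator {flt['operator']!r} is not valid for this type")
    query = {
        "view": view,
        "metrics": [{"measure": m, "aggregation": a} for m, a in metrics],
        "fromTimestamp": from_ts,
        "toTimestamp": to_ts,
    }
    if dimensions:
        query["dimensions"] = [{"field": field} for field in dimensions]
    if filters:
        query["filters"] = list(filters)
    return json.dumps(query)
```

Rejecting `inputTokens` or a `!=` string filter locally avoids a round trip that the description says would end in a 400 error.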

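Several list tools page the same way: page is 1-based and limit stays fixed while page is incremented. A minimal sketch of that loop follows, assuming a hypothetical `call_tool(tool_name, arguments)` callable that returns the items of one page as a list. The real response envelope is not described here and may need unwrapping first, and the upper bound of 100 is documented only for fetch_traces.

```python
def fetch_all_pages(call_tool, tool_name, limit=20, max_pages=50, **arguments):
    """Collect items from a paginated tool by incrementing page with a fixed limit.

    call_tool is a hypothetical callable: (tool_name, arguments) -> list of items.
    max_pages is a safety cap chosen for this sketch, not a server setting.
    """
    if not 1 <= limit <= 100:  # 100 is the maximum stated for fetch_traces
        raise ValueError("limit must be between 1 and 100")
    items = []
    for page in range(1, max_pages + 1):  # pages are 1-based
        batch = call_tool(tool_name, {**arguments, "page": page, "limit": limit})
        items.extend(batch)
        if len(batch) < limit:  # a short page means nothing is left
            break
    return items
```

Stopping on the first short page keeps the loop independent of any total-count field, at the cost of one extra request when the item count is an exact multiple of limit.
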
Prompts

Interactive templates invoked by user choice

Name	Description
No prompts

Resources

Contextual data attached and managed by the client

Name	Description
No resources

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/Log-LogN/langfuse-mcp-java'
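
The same request can be issued from Python's standard library. This is a sketch under assumptions: the URL root comes from the curl example above, the function names are hypothetical, and the response is assumed to be a JSON body.

```python
import json
import urllib.request
from urllib.parse import quote

API_ROOT = "https://glama.ai/api/mcp/v1/servers"  # taken from the curl example


def server_url(owner, name):
    """Build the directory URL for one server, percent-encoding each path part."""
    return f"{API_ROOT}/{quote(owner, safe='')}/{quote(name, safe='')}"


def fetch_server(owner, name, timeout=30):
    """GET the server record; assumes the endpoint responds with a JSON body."""
    with urllib.request.urlopen(server_url(owner, name), timeout=timeout) as resp:
        return json.load(resp)
```

For example, `fetch_server("Log-LogN", "langfuse-mcp-java")` requests the same URL as the curl command.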

If you have feedback or need assistance with the MCP directory API, please join our Discord server.