
boruna_manage

Execute and manage Boruna evaluation framework operations: run benchmarks, validate results, retrieve evidence details, list capabilities, and handle skills for AI agent assessment.

Instructions

Manage the Boruna evaluation framework. Actions: run (agent_id, benchmark), validate (run_id: validate results), evidence (run_id: get evidence details), capability_list (list evaluation capabilities), skill_manage (manage Boruna skills).

Input Schema

| Name | Required | Description | Default |
| --- | --- | --- | --- |
| action | Yes | Action to perform: run, validate, evidence, capability_list, skill_manage | |
| mode | Yes | Execution mode: inline (run script directly) or skill (run a saved boruna_script skill) | |
| script | No | (inline mode) The .ax script source code to execute | |
| policy | No | (inline mode) Capability policy: allow-all or deny-all | deny-all |
| boruna_tool_id | No | (inline mode) UUID of the mcp_stdio Tool pointing to the Boruna binary. If omitted, auto-detects. | |
| input | No | Optional input data passed to the script as JSON | |
| skill_id | No | (skill mode) UUID of the boruna_script Skill to execute | |
| execution_id | Yes | UUID of the SkillExecution record from a boruna_script skill run | |
| name | No | (create) Skill name | |
| description | No | (create) Skill description | |
| limit | No | (list / executions) Max results | 20 |
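
As a rough illustration of how the parameters above fit together, the sketch below checks an arguments object against the required fields and the values named in the table, then wraps it in a standard MCP `tools/call` JSON-RPC request. The helper name `build_call`, the placeholder UUID, and the placeholder script text are assumptions made for this example; the server may apply validation rules beyond what the table states.

```python
import json

# Fields marked "Yes" in the Required column of the table above.
REQUIRED = ("action", "mode", "execution_id")
# Values named in the parameter descriptions.
ACTIONS = {"run", "validate", "evidence", "capability_list", "skill_manage"}
MODES = {"inline", "skill"}
POLICIES = {"allow-all", "deny-all"}


def build_call(arguments: dict, request_id: int = 1) -> dict:
    """Hypothetical helper: validate arguments, then build a tools/call request."""
    missing = [key for key in REQUIRED if key not in arguments]
    if missing:
        raise ValueError(f"missing required fields: {missing}")
    if arguments["action"] not in ACTIONS:
        raise ValueError(f"unknown action: {arguments['action']!r}")
    if arguments["mode"] not in MODES:
        raise ValueError(f"unknown mode: {arguments['mode']!r}")
    # The table gives deny-all as the default policy when none is supplied.
    if arguments.get("policy", "deny-all") not in POLICIES:
        raise ValueError(f"unknown policy: {arguments['policy']!r}")
    return {
        "jsonrpc": "2.0",
        "id": request_id,
        "method": "tools/call",
        "params": {"name": "boruna_manage", "arguments": arguments},
    }


# Placeholder values only: the UUID and script text are not real.
example = build_call(
    {
        "action": "run",
        "mode": "inline",
        "execution_id": "00000000-0000-0000-0000-000000000000",
        "script": "<.ax script source here>",
        "policy": "deny-all",
    }
)
print(json.dumps(example, indent=2))
```

Validating on the client side is optional; it simply surfaces a missing `action`, `mode`, or `execution_id` before the request reaches the server.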

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/escapeboy/agent-fleet-o'
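
The same request can be made from Python with only the standard library. This is a minimal sketch: the function names `build_request` and `fetch_server` and the parameter names `namespace` and `slug` are chosen for this example, and it assumes the endpoint returns a JSON body, which the page does not state explicitly.

```python
import json
import urllib.parse
import urllib.request

BASE_URL = "https://glama.ai/api/mcp/v1/servers"


def build_request(namespace: str, slug: str) -> urllib.request.Request:
    """Build a GET request for one server entry (hypothetical helper)."""
    path = "/".join(urllib.parse.quote(part, safe="") for part in (namespace, slug))
    return urllib.request.Request(
        f"{BASE_URL}/{path}",
        method="GET",
        headers={"Accept": "application/json"},
    )


def fetch_server(namespace: str, slug: str, timeout: float = 10.0) -> dict:
    """Send the request and decode the body, assuming it is JSON."""
    with urllib.request.urlopen(build_request(namespace, slug), timeout=timeout) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling `fetch_server("escapeboy", "agent-fleet-o")` targets the same URL as the curl command above.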

If you have feedback or need assistance with the MCP directory API, please join our Discord server.