Skip to main content
Glama

claude-skill-validator

Server Details

Cloudflare Workers MCP server: claude-skill-validator

Status
Healthy
Last Tested
Transport
Streamable HTTP
URL
Repository
lazymac2x/claude-skill-validator-api
GitHub Stars
0

Glama MCP Gateway

Connect through Glama MCP Gateway for full control over tool access and complete visibility into every call.

MCP client
Glama
MCP server

Full call logging

Every tool call is logged with complete inputs and outputs, so you can debug issues and audit what your agents are doing.

Tool access control

Enable or disable individual tools per connector, so you decide what your agents can and cannot do.

Managed credentials

Glama handles OAuth flows, token storage, and automatic rotation, so credentials never expire on your clients.

Usage analytics

See which tools your agents call, how often, and when, so you can understand usage patterns and catch anomalies.

100% free. Your data is private.
Tool DescriptionsA

Average 3.7/5 across 4 of 4 tools scored. Lowest: 3.1/5.

Server CoherenceA
Disambiguation4/5

Tools are mostly distinct: lint_frontmatter focuses on YAML frontmatter only, scan_security on security, score_skill gives a minimal CI/CD result, and validate_skill provides a full validation. However, score_skill and validate_skill both return score and gate, which could cause confusion if not read carefully.

Naming Consistency5/5

All tool names follow a consistent verb_noun pattern in snake_case: lint_frontmatter, scan_security, score_skill, validate_skill. The style is uniform and predictable.

Tool Count5/5

With only 4 tools, the server is well-scoped for its purpose of validating Claude Skill files. Each tool serves a distinct validation need without being redundant or excessive.

Completeness5/5

The tool set covers all key validation aspects: frontmatter parsing, security scanning, and a full validation that includes body quality and tool collisions. No obvious gaps are present.

Available Tools

4 tools
lint_frontmatterAInspect

Parse and validate only the YAML frontmatter of a Claude Skill. Returns parsed fields and any rule violations. Use during iterative authoring.

ParametersJSON Schema
NameRequiredDescriptionDefault
yamlYesRaw YAML content without --- delimiters (just key: value lines)
Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden. It states the tool returns parsed fields and violations but does not disclose whether it is read-only, if it has side effects, or any prerequisite conditions, which is insufficient for a tool with no annotations.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is only two sentences, front-loaded with the core action, and every word adds value with no redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's simplicity and good schema coverage, the description adequately explains what is returned (parsed fields and violations) and the scope (frontmatter only), though lacking detail on violation types is a minor gap.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The only parameter 'yaml' is well-described in the schema (100% coverage). The description repeats the schema description without adding new meaning, so baseline 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it parses and validates YAML frontmatter of a Claude Skill, which distinguishes it from sibling tools like scan_security or score_skill that have different purposes.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description suggests 'Use during iterative authoring' but does not provide explicit when-to-use/when-not-to-use guidance or compare to alternatives, leaving usage context somewhat vague.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

scan_securityAInspect

Scan a Claude Skill for dangerous shell commands, hardcoded secrets, or prompt-injection phrases. Returns findings with excerpts.

ParametersJSON Schema
NameRequiredDescriptionDefault
skill_mdYesThe skill text to scan for security issues
Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description carries the full burden. It discloses the tool scans (non-destructive) and returns findings with excerpts, which is transparent enough for a read-like operation. However, it could mention that it does not modify data, but this is implied.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is two sentences, front-loading the action and listing scanned items. Every word is necessary, with no redundancy.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's simplicity (one param, no output schema), the description adequately explains the input and output. It could specify the format of 'findings' or mention limitations, but overall it is sufficient.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with the parameter 'skill_md' having a clear description. The tool description adds no additional semantic value beyond the schema, so a baseline score of 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states it scans a Claude Skill for specific security issues (shell commands, secrets, prompt-injection) and returns findings with excerpts. This distinct verb+resource+scope differentiates it from siblings like lint_frontmatter, score_skill, and validate_skill.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides no guidance on when to use this tool versus alternatives, such as before deployment or after other checks. No exclusions or context are mentioned, so an AI agent lacks direction on selecting this tool.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

score_skillBInspect

Run full validation and return a 0–100 score plus a ship gate (pass/review/fail). Minimal payload suitable for CI/CD pipelines.

ParametersJSON Schema
NameRequiredDescriptionDefault
toolsNo
skill_mdYesFull SKILL.md contents
allowed_toolsNo
Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries full burden for behavioral disclosure. It states it runs 'full validation' but does not disclose side effects, prerequisites, or safety profile (e.g., whether it modifies state or requires specific permissions).

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Two sentences, no wasted words. Key information (validation, score, gate, CI/CD suitability) is front-loaded in the first sentence, making it easy to parse quickly.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given three parameters (two undocumented), no output schema, and no annotations, the description is too brief. It fails to explain what 'full validation' covers, the relationship to sibling tools, or the format of the returned score and gate.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters2/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is low (33%), with only 'skill_md' having a description in the schema. The tool description does not explain the 'tools' or 'allowed_tools' parameters beyond the schema, missing an opportunity to clarify their roles.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the tool performs 'full validation' and returns a numeric score with a ship gate, which differentiates it from sibling tools like 'validate_skill' that likely produce a pass/fail only. The CI/CD context adds specificity.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No explicit guidance on when to use this tool versus siblings like 'lint_frontmatter', 'scan_security', or 'validate_skill'. The mention of CI/CD pipelines is the only implicit context, but no alternatives or exclusions are provided.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

validate_skillAInspect

Fully validate a Claude SKILL.md file — frontmatter rules, body quality, security scan, tool collision check. Returns score (0–100), gate (pass/review/fail), and structured errors/warnings.

ParametersJSON Schema
NameRequiredDescriptionDefault
toolsNoOptional override tool list if not declared in frontmatter
skill_mdYesFull contents of the SKILL.md file including --- frontmatter ---
allowed_toolsNoOptional whitelist; non-listed tools will produce errors
Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations, the description fully carries the transparency burden. It discloses the validation operations and output format. It does not mention side effects (though validation likely has none) or potential rate limits, but it is sufficiently transparent for a validation tool.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

Single, complete sentence that front-loads the core purpose and lists key features. Every word adds value; no redundancy or fluff.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, the description clearly explains the return value (score, gate, errors/warnings) and covers all validation aspects. It is sufficiently complete for an AI agent to understand and select this tool.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema coverage is 100% with descriptions for all three parameters. The description does not add new semantic meaning beyond what the schema already provides, so baseline 3 is appropriate.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description explicitly states it fully validates a Claude SKILL.md file, listing specific checks (frontmatter, body quality, security, tool collision) and the output format (score 0-100, gate, errors/warnings). This clearly distinguishes it from sibling tools that perform individual checks.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies this is the comprehensive validation tool, but it does not explicitly state when to use this versus the sibling tools (lint_frontmatter, scan_security, score_skill). No 'when not to use' or direct comparison provided, leaving some ambiguity.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Discussions

No comments yet. Be the first to start the discussion!

Try in Browser

Your Connectors

Sign in to create a connector for this server.