Skip to main content
Glama

add_video_mask

Apply customizable mask effects to video segments in JianYing (CapCut) projects to control visibility and create visual overlays.

Instructions

为视频片段添加蒙版效果

Args: video_segment_id: 视频片段ID,通过add_video_segment获得 mask_type: 蒙版类型名称,可以使用find_effects_by_type工具,资源类型选择mask_type,从而获取蒙版类型有哪些 center_x: 蒙版中心点X坐标(以素材的像素为单位),默认0.0(素材中心) center_y: 蒙版中心点Y坐标(以素材的像素为单位),默认0.0(素材中心) size: 蒙版的主要尺寸,以占素材高度的比例表示,默认0.5 rotation: 蒙版顺时针旋转的角度,默认0.0 feather: 蒙版的羽化参数,取值范围0~100,默认0.0 invert: 是否反转蒙版,默认False rect_width: 矩形蒙版的宽度,仅在蒙版类型为矩形时有效,以占素材宽度的比例表示 round_corner: 矩形蒙版的圆角参数,仅在蒙版类型为矩形时有效,取值范围0~100

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
video_segment_idYes
mask_typeYes
center_xNo
center_yNo
sizeNo
rotationNo
featherNo
invertNo
rect_widthNo
round_cornerNo

Output Schema

TableJSON Schema
NameRequiredDescriptionDefault
dataNo
messageYes
successYes
Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries full burden for behavioral disclosure. While it documents parameters thoroughly, it doesn't describe what the tool actually does behaviorally: whether it modifies the video segment in-place, creates a new version, requires specific permissions, has side effects, or what the output looks like. '添加蒙版效果' (add mask effect) suggests a mutation but lacks critical behavioral context.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness3/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is appropriately sized but not optimally structured. The purpose statement is clear but brief, followed by a detailed parameter list. While all information is useful, it could be more front-loaded with higher-level context before diving into parameters. The parameter documentation is thorough but makes the description somewhat dense.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness4/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's complexity (10 parameters, mutation operation), no annotations, but with an output schema present, the description does reasonably well. It thoroughly documents all parameters and their semantics. The main gap is lack of behavioral context about what the tool actually does and how it affects the video segment. The output schema existence means return values don't need explanation, but operational behavior is underspecified.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

With 0% schema description coverage and 10 parameters, the description provides comprehensive semantic information beyond the bare schema. It explains what each parameter means, provides defaults, valid ranges (feather: 0~100), conditional validity (rect_width/round_corner only for rectangle masks), and references to other tools for obtaining values. This fully compensates for the schema's lack of descriptions.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states '为视频片段添加蒙版效果' (add mask effect to video segment), which is a specific verb+resource combination. It distinguishes this tool from siblings like add_video_effect or add_video_filter by focusing specifically on masks. However, it doesn't explicitly differentiate mask effects from other visual effects in the sibling list.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines3/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description implies usage context through parameter documentation (e.g., video_segment_id comes from add_video_segment, mask_type can be found using find_effects_by_type). However, it lacks explicit guidance on when to use this tool versus alternatives like add_video_effect or add_video_filter, and doesn't mention prerequisites or exclusions.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Install Server

Other Tools

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/hey-jian-wei/jianying-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server