Skip to main content
Glama
Boyan253

FL Studio 2025 AI Bridge

by Boyan253

voice_to_piano_roll

Record a melody by humming or playing into a microphone and transcribe it directly into FL Studio's piano roll with adjustable quantization, scale snapping, and confidence filtering.

Instructions

Hum/play a melody into the mic, get it written straight into FL's piano roll.

Args: duration_sec: how long to record. bpm: used to convert seconds -> bars. Match your FL tempo. device: mic index (see voice_list_devices); None = default. scale_root: e.g. "C", "F#", "Bb". If given, notes are snapped. scale: e.g. "minor", "major", "dorian". One of gen_list_scales. Used only when scale_root is given. transpose_semitones: shift all notes up/down after transcription. quantize_grid_sec: snap note starts to this grid (e.g. 0.125 = 1/32 at 120 BPM, 0.25 = 1/16, 0.5 = 1/8). None = off. min_confidence: drop notes where the engine was uncertain (0..1). min_note_sec: drop notes shorter than this many seconds. polyphonic: use Spotify Basic Pitch (chords / overlapping notes — needs the polyphonic extra). Default = monophonic pyin. clear_first: clear the open piano roll before writing.

Returns: {ok, notes_written, piano_roll_state, wav_path, transcription_count}

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
duration_secNo
bpmNo
deviceNo
scale_rootNo
scaleNo
transpose_semitonesNo
quantize_grid_secNo
min_confidenceNo
min_note_secNo
polyphonicNo
clear_firstNo
Behavior4/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden. It clearly discloses that the tool records audio, transcribes it using pyin or Spotify Basic Pitch, writes notes to the piano roll, and returns results including a WAV path. It also mentions optional scale snapping, quantization, and the clear_first behavior. However, it does not explicitly state that the tool is write-only (mutates the project) or that permission may be needed for mic input.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is well-structured with a one-liner summary, then an Args section, then a Returns section. It is front-loaded with a sentence that immediately conveys the tool's high-level function. While the Args block is lengthy, it is necessary for the 11 parameters and is clearly formatted.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness5/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the high parameter count (11), lack of output schema, and absence of annotations, the description is remarkably complete. It covers all parameters, explains return values, and even notes dependencies (polyphonic extra). It leaves little ambiguity about how the tool operates or what the agent should expect.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

The input schema has 0% coverage, so the description must explain all parameters. It does so comprehensively in the Args block, detailing each parameter's purpose, acceptable values, defaults, and constraints (e.g., quantize_grid_sec examples with BPM, polyphonic dependency). This fully compensates for the schema's lack of descriptions.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description starts with a clear and specific verb+resource: 'Hum/play a melody into the mic, get it written straight into FL's piano roll.' This immediately and precisely communicates the tool's purpose, distinguishing it as an audio-to-MIDI tool for FL Studio.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides no guidance on when to use this tool versus its siblings, such as 'audio_melody_to_piano_roll' or 'voice_record_and_transcribe.' It does not state prerequisites, limitations, or scenarios where alternatives would be preferred.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Install Server

Other Tools

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/Boyan253/fl-studio-2025-ai-bridge'

If you have feedback or need assistance with the MCP directory API, please join our Discord server