merge_content

Combine multiple files into one output file to simplify code repository management and processing.

Instructions

Merges content from multiple files into a single output file.

Input Schema

| Name | Required | Description | Default |
| --- | --- | --- | --- |
| path | Yes | The target file or directory path. | |
| compress | No | Whether to compress the output. | false |
| use_gitignore | No | Whether to use .gitignore rules. | false (disabled for performance) |
| ignore_git | No | Whether to ignore the .git directory. | true |
| custom_blacklist | No | Custom blacklist items. | [] |
| use_cache | No | Whether to use file content cache. | true |
| use_streams | No | Whether to use stream processing for large files. | true |
| max_files | No | Maximum number of files to process. | 1000 |
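As a concrete illustration, a request might look like the following. The path and blacklist values are hypothetical; the remaining values match the defaults documented above, and only `path` is required.

```javascript
// Hypothetical merge_content request parameters; only `path` is required.
const params = {
  path: "./src",                       // file or directory to merge
  compress: false,                     // keep output uncompressed
  use_gitignore: true,                 // honor .gitignore rules
  ignore_git: true,                    // skip the .git directory
  custom_blacklist: ["dist", "*.log"], // extra exclusions (illustrative)
  use_cache: true,                     // reuse cached file contents
  use_streams: true,                   // stream large files
  max_files: 1000                      // safety cap on files processed
};
```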

Implementation Reference

  • Primary handler function executing the merge_content tool: processes a directory or a single file, filters out binaries and gitignored paths, reads and merges file contents with separators, supports compression, caching, streaming, and memory monitoring, and returns the merged content plus performance stats.
    /**
     * Handles the 'merge_content' MCP request.
     * @param {object} parameters Request parameters.
     * @param {string} parameters.path The target file or directory path.
     * @param {boolean} [parameters.compress=false] Whether to compress the output.
     * @param {boolean} [parameters.use_gitignore=false] Whether to apply .gitignore rules (off by default for performance).
     * @param {boolean} [parameters.ignore_git=true] Whether to skip the .git directory.
     * @param {string[]} [parameters.custom_blacklist=[]] Additional blacklist entries.
     * @param {boolean} [parameters.use_cache=true] Whether to use the file content cache.
     * @param {boolean} [parameters.use_streams=true] Whether to stream large files.
     * @param {number} [parameters.max_files=1000] Maximum number of files to process.
     * @returns {Promise<object>} The result object containing the merged content string.
     */
    async function handleRequest(parameters) {
        console.error('merge_content: Starting execution');
        const startTime = Date.now();
    
        // Start memory monitoring
        memoryMonitor.start(10000); // check every 10 seconds
    
        const {
            path: targetPath,
            compress = false,
            use_gitignore,
            ignore_git,
            custom_blacklist,
            use_cache = true,
            use_streams = true,
        max_files = 1000 // cap the number of files processed
        } = parameters;
    
        if (!targetPath) {
            memoryMonitor.stop();
            throw new Error("Missing required parameter: 'path'.");
        }
    
        // Resolve to absolute path
        const absolutePath = path.resolve(targetPath);
        console.error(`merge_content: Resolved path to ${absolutePath}`);
    
        let rootPath = absolutePath; // Assume path is directory initially
        let filesToProcess = [];
    
        // Validate path existence and determine if it's a file or directory
        try {
            const stats = await fs.stat(absolutePath);
            console.error(`merge_content: Path exists, checking type`);
    
            if (stats.isDirectory()) {
                // Path is a directory, list files within it
                console.error(`merge_content: Path is a directory, listing files`);
                rootPath = absolutePath; // Keep rootPath as the directory itself
    
            // For performance, gitignore filtering defaults to off; `??` (rather
            // than `||`) preserves an explicit `false` passed by the caller.
            filesToProcess = await listFiles(absolutePath, {
                useGitignore: use_gitignore ?? false,
                ignoreGit: ignore_git ?? true,
                customBlacklist: custom_blacklist || []
            });
    
                // Cap the file count to avoid exhausting memory
                if (filesToProcess.length > max_files) {
                    console.error(`merge_content: Limiting files from ${filesToProcess.length} to ${max_files}`);
                    filesToProcess = filesToProcess.slice(0, max_files);
                }
    
                console.error(`merge_content: Found ${filesToProcess.length} files in directory`);
    
            } else if (stats.isFile()) {
                // Path is a single file
                console.error(`merge_content: Path is a file`);
                rootPath = path.dirname(absolutePath); // Set rootPath to the parent directory
                const relativeFilePath = path.basename(absolutePath);
    
                // Skip gitignore checks for single files to improve performance
                filesToProcess.push(relativeFilePath);
                console.error(`merge_content: Added single file to process: ${relativeFilePath}`);
    
                // Check if it's a binary file
                const fileExtension = path.extname(relativeFilePath).toLowerCase();
                if (BINARY_EXTENSIONS.has(fileExtension)) {
                    filesToProcess = []; // Clear if binary
                    console.error(`merge_content: Skipping binary file: ${relativeFilePath}`);
                }
            } else {
                // Path exists but is not a file or directory (e.g., socket, fifo)
                memoryMonitor.stop();
                throw new Error(`Path '${targetPath}' is not a file or directory.`);
            }
        } catch (error) {
            memoryMonitor.stop();
            if (error.code === 'ENOENT') {
                throw new Error(`Path '${targetPath}' not found.`);
            }
            throw new Error(`Error accessing path '${targetPath}': ${error.message}`);
        }
    
        if (filesToProcess.length === 0) {
            // If no files are left after filtering (or it was an ignored/binary single file)
            console.error(`merge_content: No files to process, returning empty content`);
            memoryMonitor.stop();
            return { merged_content: "" }; // Return empty content
        }
    
        // Read the content of the filtered files
        console.error(`merge_content: Reading content of ${filesToProcess.length} files`);
    
        // Use the optimized file reader, which supports caching and stream processing
        const fileContentsMap = await readFiles(filesToProcess, rootPath, {
            useCache: use_cache,
            useStreams: use_streams,
            progressCallback: (progress) => {
                if (progress.percent % 10 === 0) { // log progress at every 10% increment
                    console.error(`merge_content: Reading progress ${progress.percent}% (${progress.completed}/${progress.total})`);
                }
            }
        });
    
        // Combine contents with headers, ensuring consistent order
        console.error(`merge_content: Combining file contents`);
        let combinedContent = "";
    
        // Sort file paths before merging for deterministic output
        const sortedRelativePaths = [...fileContentsMap.keys()].sort((a, b) => a.localeCompare(b));
        let filesProcessed = 0;
    
        sortedRelativePaths.forEach(relativeFilePath => {
            const content = fileContentsMap.get(relativeFilePath);
            if (content !== undefined) { // Check if file read was successful
                // Add a header to distinguish file contents
                const safeFilePath = relativeFilePath.replace(/\\/g, '/');
                combinedContent += `=== File Path: ${safeFilePath} ===\n\n`;
    
                // Process content to ensure it's safe for JSON serialization
                // We don't need to apply makeJsonSafe here as we're just building the combined content
                // The final result will be handled by the MCP server's adaptToolResult function
                combinedContent += content;
                combinedContent += '\n\n' + '='.repeat(50) + '\n\n';
                filesProcessed++;
            }
        });
    
        console.error(`merge_content: Successfully processed ${filesProcessed} files`);
    
        // Apply compression if requested
        let finalContent = combinedContent.trim(); // Trim final whitespace
        if (compress) {
            console.error(`merge_content: Compressing content`);
            finalContent = compressContent(finalContent);
        }
    
        // Capture memory usage
        const memoryUsage = memoryMonitor.getMemoryUsage();
        const executionTime = Date.now() - startTime;
    
        // Stop memory monitoring
        memoryMonitor.stop();
    
        console.error(`merge_content: Execution completed in ${executionTime}ms, memory used: ${memoryUsage.heapUsedMB}MB`);
    
        // Return the result along with performance metrics
        return {
            merged_content: finalContent,
            performance: {
                executionTime,
                filesProcessed,
                totalFiles: filesToProcess.length,
                memoryUsedMB: memoryUsage.heapUsedMB,
                cacheStats: use_cache ? getFileCache().getStats() : null
            }
        };
    }
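The combining loop above can be exercised in isolation. A minimal sketch (file names and contents are made up) reproduces the sorted ordering, the per-file header, and the 50-character separator:

```javascript
// Sketch of the merge step: sort paths, emit a header per file, join with a separator.
const fileContentsMap = new Map([
  ["b.txt", "beta"], // deliberately inserted out of order
  ["a.txt", "alpha"],
]);

let combined = "";
for (const relPath of [...fileContentsMap.keys()].sort((a, b) => a.localeCompare(b))) {
  const safePath = relPath.replace(/\\/g, "/"); // normalize Windows separators
  combined += `=== File Path: ${safePath} ===\n\n`;
  combined += fileContentsMap.get(relPath);
  combined += "\n\n" + "=".repeat(50) + "\n\n";
}
combined = combined.trim();
```

Sorting before merging is what makes the output deterministic regardless of the order in which files were read.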
  • MCP server registration of the merge_content tool, including Zod input-schema validation and a wrapper async handler that calls the tool's handleRequest with logging and result adaptation.
    // Register the merge_content tool
    if (mergeContentHandler) {
        server.tool(
            'merge_content',
            'Merges content from multiple files into a single output file.',
            {
                path: z.string().describe('The target file or directory path.'),
                compress: z.boolean().optional().describe('Whether to compress the output.'),
                use_gitignore: z.boolean().optional().describe('Whether to use .gitignore rules.'),
                ignore_git: z.boolean().optional().describe('Whether to ignore the .git directory.'),
                custom_blacklist: z.array(z.string()).optional().describe('Custom blacklist items.'),
                use_cache: z.boolean().optional().describe('Whether to use file content cache.'),
                use_streams: z.boolean().optional().describe('Whether to use stream processing for large files.'),
                max_files: z.number().optional().describe('Maximum number of files to process.')
            },
            async (params) => {
                logInfo(`Executing merge_content tool with params: ${JSON.stringify(params)}`);
                try {
                    const startTime = Date.now();
                    const result = await mergeContentHandler(params);
                    const executionTime = Date.now() - startTime;
                    logDebug(`merge_content completed in ${executionTime}ms`);
    
                    // Ensure the merged content is safe for JSON serialization
                    if (result && result.merged_content) {
                        // We don't modify the content directly here
                        // The adaptToolResult function will handle the content safely
                        logDebug(`merge_content result size: ${result.merged_content.length} characters`);
                    }
    
                    return adaptToolResult(result);
                } catch (error) {
                    logError('Error in merge_content tool:', error);
                    throw error;
                }
            }
        );
    }
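One subtlety worth noting for handlers like this: optional booleans that default to true cannot be defaulted with `||`, because `flag || true` discards an explicit `false`. A generic illustration (not tied to any particular function in the source) using nullish coalescing:

```javascript
// Defaulting an optional boolean to true: `??` falls back only on null/undefined.
function withDefaultTrue(flag) {
  return flag ?? true;
}

// By contrast, `flag || true` evaluates to true for every input, including an explicit false.
function brokenDefaultTrue(flag) {
  return flag || true;
}
```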
  • The adaptToolResult helper formats merge_content results into the MCP 'content' array, including a performance-metrics display.
    // Handle merge_content result
    if (result.merged_content) {
        // Ensure merged content is safe for JSON serialization
        content.push({
            type: 'text',
            text: result.merged_content
        });
    
        // If performance metrics are present, append them to the output
        if (result.performance) {
            content.push({
                type: 'text',
                text: `\n\n--- Performance Metrics ---\n` +
                      `Execution time: ${result.performance.executionTime}ms\n` +
                      `Files processed: ${result.performance.filesProcessed}/${result.performance.totalFiles}\n` +
                      `Memory used: ${result.performance.memoryUsedMB}MB\n` +
                      (result.performance.cacheStats ?
                        `Cache hit rate: ${Math.round(result.performance.cacheStats.hitRate * 100)}%\n` +
                        `Cache size: ${result.performance.cacheStats.size} files (${result.performance.cacheStats.bytesStoredMB}MB)` :
                        `Cache: disabled`)
            });
        }
    }
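The metrics block appended above can be sketched as a standalone formatter. The field names are taken from the result object returned by the handler; the function itself is illustrative, not part of the source.

```javascript
// Sketch: format the performance section of a merge_content result.
function formatPerformance(perf) {
  let text =
    "\n\n--- Performance Metrics ---\n" +
    `Execution time: ${perf.executionTime}ms\n` +
    `Files processed: ${perf.filesProcessed}/${perf.totalFiles}\n` +
    `Memory used: ${perf.memoryUsedMB}MB\n`;
  text += perf.cacheStats
    ? `Cache hit rate: ${Math.round(perf.cacheStats.hitRate * 100)}%\n` +
      `Cache size: ${perf.cacheStats.size} files (${perf.cacheStats.bytesStoredMB}MB)`
    : "Cache: disabled";
  return text;
}
```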
Behavior 2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries the full burden of behavioral disclosure. It states the basic operation but lacks critical details: it doesn't specify file formats supported, output location or naming, whether merging is destructive to source files, error handling, or performance characteristics. For a tool with 8 parameters and file system operations, this leaves significant behavioral uncertainty.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness 5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single, efficient sentence that states the core purpose without unnecessary words. It's front-loaded with the essential information ('Merges content from multiple files into a single output file') and contains no redundant or verbose phrasing. Every word serves a clear purpose in conveying the tool's function.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness 2/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

For a tool with 8 parameters, no annotations, and no output schema, the description is insufficiently complete. It doesn't address key contextual aspects: what happens to source files after merging, supported file types, output format/location, error conditions, or performance implications. The agent would need to infer or test these behavioral aspects, creating uncertainty in tool selection and invocation.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters 3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so the schema already documents all 8 parameters thoroughly. The description adds no parameter-specific information beyond the general concept of merging files. It doesn't explain how parameters like 'compress', 'use_gitignore', or 'custom_blacklist' affect the merge operation, leaving the schema to carry the full parameter documentation burden.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose 4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the verb ('merges') and resource ('content from multiple files'), specifying the action and target. It distinguishes from sibling tools like 'analyze_code' and 'get_file_tree' by focusing on file combination rather than analysis or structure retrieval. However, it doesn't explicitly differentiate from hypothetical similar merge tools that might exist elsewhere.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines 2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description provides no guidance on when to use this tool versus alternatives. It doesn't mention sibling tools ('analyze_code', 'get_file_tree') or suggest scenarios where merging files is appropriate versus other operations. There's no indication of prerequisites, constraints, or typical use cases.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/yy1588133/code-merge-mcp'
