Image Tools - Background Removal, Upscaling & Face Restoration
Server Details
Background removal, 4x upscaling, and face restoration via GPU
- Status: Unhealthy
- Last Tested:
- Transport: Streamable HTTP
- URL:
- Repository: fasuizu-br/speech-ai-examples
- GitHub Stars: 0
Glama MCP Gateway
Connect through Glama MCP Gateway for full control over tool access and complete visibility into every call.
Full call logging
Every tool call is logged with complete inputs and outputs, so you can debug issues and audit what your agents are doing.
Tool access control
Enable or disable individual tools per connector, so you decide what your agents can and cannot do.
Managed credentials
Glama handles OAuth flows, token storage, and automatic rotation, so credentials never expire on your clients.
Usage analytics
See which tools your agents call, how often, and when, so you can understand usage patterns and catch anomalies.
Available Tools
4 tools

check_image_service (Check Image Service)
A | Read-only | Idempotent
Check health status of Image API services and loaded models.
Returns: dict with keys:
- status (str): 'healthy' or error state
- models (dict): Loaded model status per capability
- version (str): API version
| Name | Required | Description | Default |
|---|---|---|---|
| No parameters | | | |
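For orientation, here is a minimal sketch of calling this tool over Streamable HTTP with the MCP Python SDK. This listing does not publish the server URL, so the endpoint below is a placeholder.

```python
# Minimal health-check sketch using the MCP Python SDK (pip install mcp).
# SERVER_URL is a placeholder; this listing does not publish the endpoint.
import asyncio

from mcp import ClientSession
from mcp.client.streamable_http import streamablehttp_client

SERVER_URL = "https://example.com/mcp"  # placeholder endpoint


async def main() -> None:
    async with streamablehttp_client(SERVER_URL) as (read, write, _):
        async with ClientSession(read, write) as session:
            await session.initialize()
            # check_image_service takes no parameters.
            result = await session.call_tool("check_image_service", {})
            # Documented return keys: status, models, version.
            print(result.content)


asyncio.run(main())
```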
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Annotations cover safety profile (readOnly, idempotent, non-destructive, openWorld). Description adds valuable return value documentation detailing the dict structure with keys (status, models, version) and their semantics ('healthy' vs error states), which annotations do not cover.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Appropriately sized with clear docstring-style formatting. Returns section is structured and readable. No redundant text, though the multiline format with indentation consumes vertical space unnecessarily.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Comprehensive for a zero-parameter health check tool. The manual documentation of return values compensates adequately for the absence of a formal output schema, providing sufficient completeness for agent invocation.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Zero parameters present; baseline score applies per rubric. Description correctly does not invent parameter documentation where none exists.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description uses specific verb 'Check' with clear resource 'health status of Image API services and loaded models'. Effectively distinguishes from siblings (remove_background, restore_face, upscale_image) which are image processing operations, while this is a service health diagnostic.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
No explicit when-to-use or when-not-to-use guidance provided. Usage is implied by the nature of the function (health check vs image manipulation), but lacks guidance such as 'call before processing operations' or troubleshooting contexts.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
remove_background (Remove Background)
A | Read-only | Idempotent
Remove the background from an image.
Uses BiRefNet segmentation to precisely separate foreground from background. Returns a base64-encoded image with transparent background (PNG) or white background (WebP). Sub-500ms latency on GPU.
Args:
- image_base64: Base64-encoded image data (PNG, JPEG, or WebP).
- output_format: Output format -- 'png' (with transparency) or 'webp'.

Returns: dict with keys:
- image_base64 (str): Base64-encoded result image
- format (str): Output image format
- original_size (dict): Original width and height
- processing_ms (int): Processing time in milliseconds
| Name | Required | Description | Default |
|---|---|---|---|
| image_base64 | Yes | Base64-encoded image data. Supports PNG, JPEG, and WebP formats. | |
| output_format | No | Output image format: 'png' (default, with transparency) or 'webp' | png |
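To illustrate the base64 round trip this tool implies, here is a hedged sketch that encodes a local file into the documented arguments and decodes the documented result keys back to an image. The helper names and file paths are illustrative; transport is assumed to be a session like the one sketched under check_image_service.

```python
import base64


def build_remove_background_args(path: str, output_format: str = "png") -> dict:
    """Encode a local image into the documented tool arguments."""
    with open(path, "rb") as f:
        return {
            "image_base64": base64.b64encode(f.read()).decode("ascii"),
            # 'png' keeps transparency; 'webp' yields a white background.
            "output_format": output_format,
        }


def save_remove_background_result(result: dict, path: str) -> None:
    """Decode the documented result keys and write the image to disk."""
    with open(path, "wb") as f:
        f.write(base64.b64decode(result["image_base64"]))
    print(f"original: {result['original_size']}, took {result['processing_ms']} ms")
```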
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Adds substantial context beyond annotations: specifies BiRefNet algorithm, sub-500ms GPU latency, and detailed return structure (compensating for lack of output schema). Does not cover error cases or rate limits.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Well-structured with clear sections (summary, technical details, Args, Returns). The Returns dictionary is verbose but necessary given no output schema exists. Information is front-loaded with the primary action.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Comprehensive given constraints: provides full output structure documentation to compensate for missing output schema, covers all parameters via high-coverage schema, and includes implementation/performance details for informed usage.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
Schema coverage is 100%, providing complete parameter documentation. The Args section in the description largely mirrors schema definitions without adding significant semantic value (e.g., no usage examples or edge case guidance).
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
States specific action ('Remove the background') and resource ('from an image'), distinguishes from siblings (restore_face, upscale_image) by focusing on segmentation/removal rather than restoration or scaling.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Implies usage through output format specifications (PNG for transparency, WebP for white background), but lacks explicit when-to-use guidance or comparisons to sibling tools (e.g., when to choose this over other image processing options).
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
restore_face (Restore Face)
A | Read-only | Idempotent
Restore and enhance faces in an image using GFPGAN.
Detects all faces via RetinaFace, restores quality (fixes blur, noise, compression artifacts), and pastes them back. Optionally enhances the background using Real-ESRGAN. GPU-accelerated, sub-3s latency.
Args:
- image_base64: Base64-encoded image data containing faces (PNG, JPEG, WebP).
- upscale: Output upscale factor -- 1 to 4 (default: 2).
- enhance_background: Whether to enhance background with Real-ESRGAN (default: true).

Returns: dict with keys:
- image (str): Base64-encoded restored image
- format (str): Output image format
- width (int): Output width
- height (int): Output height
- upscale (int): Scale factor applied
- processing_time_ms (float): Processing time in milliseconds
| Name | Required | Description | Default |
|---|---|---|---|
| image_base64 | Yes | Base64-encoded image data containing one or more faces. | |
| upscale | No | Output upscale factor: 1-4 (default: 2) | 2 |
| enhance_background | No | Enhance background with Real-ESRGAN (default: true) | true |
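Note that the documented result keys differ from remove_background: the image comes back under 'image' rather than 'image_base64', and timing under 'processing_time_ms'. A hedged sketch that validates the documented 1-4 range and handles that asymmetry; helper names are illustrative.

```python
import base64


def build_restore_face_args(
    path: str, upscale: int = 2, enhance_background: bool = True
) -> dict:
    """Encode a local image and validate the documented upscale range."""
    if not 1 <= upscale <= 4:  # documented constraint: 1-4
        raise ValueError("upscale must be between 1 and 4")
    with open(path, "rb") as f:
        return {
            "image_base64": base64.b64encode(f.read()).decode("ascii"),
            "upscale": upscale,
            "enhance_background": enhance_background,
        }


def save_restore_face_result(result: dict, path: str) -> None:
    """Decode the result; note the key here is 'image', not 'image_base64'."""
    with open(path, "wb") as f:
        f.write(base64.b64decode(result["image"]))
    print(f"{result['width']}x{result['height']} in {result['processing_time_ms']} ms")
```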
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
Rich behavioral disclosure beyond annotations: details specific algorithms (RetinaFace detection, GFPGAN restoration, Real-ESRGAN background), lists artifact types fixed (blur, noise, compression), discloses performance characteristics (GPU-accelerated, sub-3s latency), and documents return structure. Annotations only cover safety/idempotency.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
Well-structured and front-loaded: purpose statement first, followed by technical mechanism, Args block, and Returns block. Every sentence earns its place; the Returns section is essential given no formal output schema exists. No redundant fluff.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Comprehensive for a 3-parameter image processing tool. Documents input requirements, operational constraints (GPU acceleration, latency), specific algorithms, and complete return value structure. Fully compensates for absence of output schema with detailed Returns documentation.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
With 100% schema coverage, baseline is 3. Description adds file format constraints (PNG, JPEG, WebP) not present in schema's image_base64 description, and clarifies the technical mechanism (Real-ESRGAN) for background enhancement. Provides useful parameter context beyond structured schema.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
Description opens with specific verb-resource combination ('Restore and enhance faces') and identifies the specific technology (GFPGAN). It clearly distinguishes from sibling 'upscale_image' by focusing specifically on face restoration rather than general image upscaling.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
Provides clear context that this tool is for face restoration specifically, implying when to use it (images containing faces needing quality fixes). However, lacks explicit comparison to siblings (e.g., 'use upscale_image instead for general upscaling without face detection').
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
upscale_image (Upscale Image)
A | Read-only | Idempotent
Upscale image resolution using Real-ESRGAN.
Enhances image resolution by 2x or 4x using GPU-accelerated Real-ESRGAN super-resolution. Processes in tiles (256x256) to manage VRAM. Maximum output dimension: 8192x8192.
Args:
- image_base64: Base64-encoded image data (PNG, JPEG, or WebP).
- scale: Upscale factor -- 2 or 4 (default: 4).

Returns: dict with keys:
- image (str): Base64-encoded upscaled image
- format (str): Output image format
- width (int): Output width
- height (int): Output height
- scale (int): Scale factor applied
- processing_time_ms (float): Processing time in milliseconds
| Name | Required | Description | Default |
|---|---|---|---|
| image_base64 | Yes | Base64-encoded image data. Supports PNG, JPEG, and WebP formats. | |
| scale | No | Upscale factor: 2 or 4 (default: 4) | 4 |
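Since the description caps output at 8192x8192, a client can pre-check dimensions before sending a large payload. A hedged sketch under that assumption; Pillow is used only to read local image dimensions, and the tool's behavior when the cap is exceeded is not documented.

```python
import base64

from PIL import Image  # assumption: Pillow, used only to read dimensions locally

MAX_DIM = 8192  # documented maximum output dimension


def build_upscale_args(path: str, scale: int = 4) -> dict:
    """Validate documented constraints, then encode the image for upscale_image."""
    if scale not in (2, 4):
        raise ValueError("scale must be 2 or 4")
    with Image.open(path) as im:
        if im.width * scale > MAX_DIM or im.height * scale > MAX_DIM:
            raise ValueError(f"output would exceed {MAX_DIM}x{MAX_DIM}")
    with open(path, "rb") as f:
        return {
            "image_base64": base64.b64encode(f.read()).decode("ascii"),
            "scale": scale,
        }
```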
Tool Definition Quality
Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?
The description adds substantial behavioral context beyond the annotations, including: tile-based processing (256x256) for VRAM management, GPU acceleration, and the maximum output dimension constraint (8192x8192). It also documents the return structure (dict with image, format, dimensions, timing) which is not present in structured annotations.
Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.
Is the description appropriately sized, front-loaded, and free of redundancy?
The description uses a clean docstring format with clear sections (one-line summary, technical details, Args, Returns). Every sentence provides distinct value: technology identification, implementation details (tiling/VRAM), constraints (max dimensions), and I/O specifications. No waste or redundancy.
Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.
Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?
Given the presence of comprehensive annotations and the detailed description covering inputs, constraints, and return values (despite no formal output_schema), the definition is nearly complete. It could benefit from mentioning error conditions (e.g., what happens if max dimension is exceeded) or authentication requirements, but covers the essential behavioral contract well.
Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.
Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?
With 100% schema description coverage, the baseline is 3. The Args section essentially mirrors the schema descriptions (base64 formats, scale values) without adding significant new semantic meaning such as validation rules, format nuances, or the relationship between input size and VRAM requirements. It meets but does not exceed what the schema provides.
Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.
Does the description clearly state what the tool does and how it differs from similar tools?
The description clearly states the specific action (upscale/enhance resolution), the technology used (Real-ESRGAN), and the specific scaling factors (2x or 4x). It distinguishes from siblings like 'remove_background' and 'restore_face' by focusing on general super-resolution rather than specific image editing tasks.
Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.
Does the description explain when to use this tool, when not to, or what alternatives exist?
While the description clearly explains what the tool does, it does not provide explicit guidance on when to use this versus the sibling 'restore_face' tool, which might also use super-resolution techniques. No 'when-not-to-use' or alternative selection criteria are provided, though the specific mention of Real-ESRGAN and general upscaling implies its intended use case.
Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.
Claim this connector by publishing a /.well-known/glama.json file on your server's domain with the following structure:
```json
{
  "$schema": "https://glama.ai/mcp/schemas/connector.json",
  "maintainers": [{ "email": "your-email@example.com" }]
}
```

The email address must match the email associated with your Glama account. Once published, Glama will automatically detect and verify the file within a few minutes.
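As a quick self-check before waiting on Glama's crawler, you can fetch the file yourself. A small standard-library sketch; the domain is a placeholder for your server's.

```python
# Self-check: confirm /.well-known/glama.json is served and parses as JSON.
# "example.com" is a placeholder for your server's domain.
import json
import urllib.request

url = "https://example.com/.well-known/glama.json"
with urllib.request.urlopen(url) as resp:
    data = json.load(resp)

assert data.get("$schema") == "https://glama.ai/mcp/schemas/connector.json"
print("maintainers:", [m.get("email") for m in data.get("maintainers", [])])
```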
Claiming this connector lets you:
- Control your server's listing on Glama, including description and metadata
- Access analytics and receive server usage reports
- Get monitoring and health status updates for your server
- Feature your server to boost visibility and reach more users
For users:
- Full audit trail – every tool call is logged with inputs and outputs for compliance and debugging
- Granular tool control – enable or disable individual tools per connector to limit what your AI agents can do
- Centralized credential management – store and rotate API keys and OAuth tokens in one place
- Change alerts – get notified when a connector changes its schema, adds or removes tools, or updates tool definitions, so nothing breaks silently

For server owners:
- Proven adoption – public usage metrics on your listing show real-world traction and build trust with prospective users
- Tool-level analytics – see which tools are being used most, helping you prioritize development and documentation
- Direct user feedback – users can report issues and suggest improvements through the listing, giving you a channel you would not have otherwise
The connector status is unhealthy when Glama is unable to connect to the server. This can happen for several reasons:
- The server is experiencing an outage
- The URL of the server is wrong
- Credentials required to access the server are missing or invalid
If you are the owner of this MCP connector and would like to make modifications to the listing, including providing test credentials for accessing the server, please contact support@glama.ai.