MCP Headless Gmail Server

gmail_get_email_body_chunk

Retrieve a 1,000-character segment of a Gmail email body from a specified starting point to handle large messages in manageable portions.

Instructions

Get a 1k character chunk of an email body starting from the specified offset

Input Schema

TableJSON Schema

Name	Required	Description
`google_access_token`	Yes	Google OAuth2 access token
`message_id`	No	ID of the message to retrieve
`thread_id`	No	ID of the thread to retrieve (will get the first message if multiple exist)
`offset`	No	Offset in characters to start from (default: 0)

Implementation Reference

src/mcp_server_headless_gmail/server.py:349-454 (handler)

Core handler function in GmailClient class that retrieves a 1KB chunk of the email body starting from the specified offset. Handles thread_id resolution to message_id, extracts plain text body using helper, applies offset, and returns JSON with chunk and metadata.

def get_email_body_chunk(self, message_id: str = None, thread_id: str = None, offset: int = 0) -> str:
    """Get a chunk of the email body
    
    Args:
        message_id: ID of the message to retrieve
        thread_id: ID of the thread to retrieve (will get the first message if multiple exist)
        offset: Offset in characters to start from (default: 0)
        
    Returns:
        JSON string with the body chunk and metadata
    """
    try:
        # Check if service is initialized
        if not hasattr(self, 'service'):
            logger.error("Gmail service not initialized. No valid access token provided.")
            return json.dumps({
                "error": "No valid access token provided. Please refresh your token first.",
                "status": "error"
            })
            
        # Define the operation
        def _operation():
            logger.debug(f"Fetching email body chunk with offset {offset}")
            
            # Store message_id in local variable to make it accessible within _operation scope
            local_message_id = message_id
            local_thread_id = thread_id
            
            # Validate inputs
            if not local_message_id and not local_thread_id:
                return json.dumps({
                    "error": "Either message_id or thread_id must be provided",
                    "status": "error"
                })
            
            try:
                # If thread_id is provided but not message_id, get the first message in thread
                if local_thread_id and not local_message_id:
                    logger.debug(f"Getting messages in thread {local_thread_id}")
                    thread = self.service.users().threads().get(
                        userId='me',
                        id=local_thread_id
                    ).execute()
                    
                    if not thread or 'messages' not in thread or not thread['messages']:
                        return json.dumps({
                            "error": f"No messages found in thread {local_thread_id}",
                            "status": "error"
                        })
                        
                    # Use the first message in the thread
                    local_message_id = thread['messages'][0]['id']
                    logger.debug(f"Using first message {local_message_id} from thread {local_thread_id}")
                
                # Get the message
                logger.debug(f"Getting message {local_message_id}")
                msg = self.service.users().messages().get(
                    userId='me',
                    id=local_message_id,
                    format='full'
                ).execute()
                
                # Extract the full plain text body
                body_text = ""
                body_size_bytes = 0
                
                if 'payload' in msg:
                    body_text, body_size_bytes = self.extract_plain_text_body(msg['payload'])
                
                # Apply offset and get chunk
                if offset >= len(body_text):
                    chunk = ""
                else:
                    chunk = body_text[offset:offset+1000]
                
                # Determine if this contains the full remaining body
                contains_full_body = (offset + len(chunk) >= len(body_text))
                
                return json.dumps({
                    "message_id": local_message_id,
                    "thread_id": msg.get('threadId', ''),
                    "body": chunk,
                    "body_size_bytes": body_size_bytes,
                    "offset": offset,
                    "chunk_size": len(chunk),
                    "contains_full_body": contains_full_body,
                    "status": "success"
                })
                
            except Exception as e:
                logger.error(f"Error processing message: {str(e)}", exc_info=True)
                return json.dumps({
                    "error": f"Error processing message: {str(e)}",
                    "status": "error"
                })
        
        # Execute the operation with token refresh handling
        return self._handle_token_refresh(_operation)
        
    except HttpError as e:
        logger.error(f"Gmail API Exception: {str(e)}")
        return json.dumps({"error": str(e)})
    except Exception as e:
        logger.error(f"Exception in get_email_body_chunk: {str(e)}", exc_info=True)
        return json.dumps({"error": str(e)})

src/mcp_server_headless_gmail/server.py:503-516 (registration)

Tool registration in list_tools() handler, including name, description, and input schema definition.

types.Tool(
    name="gmail_get_email_body_chunk",
    description="Get a 1k character chunk of an email body starting from the specified offset",
    inputSchema={
        "type": "object",
        "properties": {
            "google_access_token": {"type": "string", "description": "Google OAuth2 access token"},
            "message_id": {"type": "string", "description": "ID of the message to retrieve"},
            "thread_id": {"type": "string", "description": "ID of the thread to retrieve (will get the first message if multiple exist)"},
            "offset": {"type": "integer", "description": "Offset in characters to start from (default: 0)"}
        },
        "required": ["google_access_token"]
    },
),

src/mcp_server_headless_gmail/server.py:592-606 (handler)

Dispatch handler within @server.call_tool() that extracts arguments, initializes GmailClient, and invokes the core get_email_body_chunk method.

elif name == "gmail_get_email_body_chunk":
    # Initialize Gmail client with just access token
    gmail = GmailClient(
        access_token=access_token
    )
    
    message_id = arguments.get("message_id")
    thread_id = arguments.get("thread_id")
    offset = int(arguments.get("offset", 0))
    
    if not message_id and not thread_id:
        raise ValueError("Either message_id or thread_id must be provided")
    
    results = gmail.get_email_body_chunk(message_id=message_id, thread_id=thread_id, offset=offset)
    return [types.TextContent(type="text", text=results)]

src/mcp_server_headless_gmail/server.py:125-173 (helper)

Helper method used by the handler to recursively extract plain text body from Gmail message payload parts, decoding base64url data and calculating size.

def extract_plain_text_body(self, msg_payload):
    """Extract plain text body from message payload
    
    Args:
        msg_payload: Gmail API message payload
        
    Returns:
        tuple: (plain_text_body, body_size_in_bytes)
    """
    body_text = ""
    body_size = 0
    
    # Helper function to process message parts recursively
    def extract_from_parts(parts):
        nonlocal body_text, body_size
        
        if not parts:
            return
            
        for part in parts:
            mime_type = part.get('mimeType', '')
            
            # If this part is plain text
            if mime_type == 'text/plain':
                body_data = part.get('body', {}).get('data', '')
                if body_data:
                    # Decode base64url encoded data
                    decoded_bytes = base64.urlsafe_b64decode(body_data)
                    body_size += len(decoded_bytes)
                    body_part = decoded_bytes.decode('utf-8', errors='replace')
                    body_text += body_part
            
            # If this part has child parts, process them
            if 'parts' in part:
                extract_from_parts(part['parts'])
    
    # If body data is directly in the payload
    if 'body' in msg_payload and 'data' in msg_payload['body']:
        body_data = msg_payload['body']['data']
        if body_data:
            decoded_bytes = base64.urlsafe_b64decode(body_data)
            body_size += len(decoded_bytes)
            body_text = decoded_bytes.decode('utf-8', errors='replace')
    
    # If message has parts, process them
    if 'parts' in msg_payload:
        extract_from_parts(msg_payload['parts'])
        
    return body_text, body_size

Tool Definition Quality

B3.1/5.0

Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

With no annotations provided, the description carries full burden for behavioral disclosure. It mentions the 1k character chunking behavior, which is valuable, but doesn't address authentication needs (though implied by google_access_token parameter), error handling, rate limits, or what happens with invalid offsets/message_ids. For a tool with no annotation coverage, this leaves significant gaps.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness5/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single, efficient sentence that immediately conveys the core functionality without any wasted words. It's appropriately sized and front-loaded with the essential information.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given the tool's moderate complexity (4 parameters, no output schema, no annotations), the description is minimally adequate. It explains the chunking behavior but lacks details about authentication requirements, error conditions, and how this tool relates to siblings. Without annotations or output schema, more behavioral context would be helpful for safe usage.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so parameters are well-documented in the schema. The description adds context about the 'offset' parameter (default: 0) and clarifies that thread_id retrieves the first message if multiple exist, providing some value beyond the schema. However, it doesn't explain parameter interactions or provide additional semantic context.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states the action ('Get'), resource ('email body chunk'), and key constraint ('1k character chunk starting from specified offset'), making the purpose immediately understandable. However, it doesn't explicitly differentiate from sibling tools like gmail_get_recent_emails, which retrieves multiple emails rather than a specific body chunk.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

No guidance is provided about when to use this tool versus alternatives. The description doesn't mention prerequisites (like needing a message_id or thread_id), nor does it explain when this tool is appropriate compared to gmail_get_recent_emails for retrieving email content.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Install Server

Other Tools

Latest Blog Posts

Your AI Chatbot Just Exposed Your CEO's Salary to an Intern
By Om-Shree-0709 on July 2, 2026.
Agent Identity
MCP Security
OAuth Delegation
Why MCP Servers Need Execution Sandboxing (And Why Your Current Stack Isn't Enough)
By Om-Shree-0709 on June 30, 2026.
Agentic Ai
Prompt Injection
WebAssembly
Lightport: Open-Sourcing Glama's AI Gateway
By punkpeye on April 27, 2026.
OpenAI
open source

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/baryhuang/mcp-headless-gmail'

If you have feedback or need assistance with the MCP directory API, please join our Discord server