Skip to main content
Glama
Lumos-Labs-HQ

Amazon Q Web Documentation Reader

๐ŸŒ Amazon Q Web Documentation Reader

MCP Server for Intelligent Web Content Extraction

Python MCP License

Features โ€ข Installation โ€ข Setup โ€ข Usage โ€ข Tools


โœจ Features

  • ๐Ÿง  Intelligent Navigation - Amazon Q (Claude 4.5) decides which documentation pages to visit

  • ๐Ÿงน Clean Content Extraction - Removes navigation, ads, scripts, and other non-content elements

  • ๐Ÿ“ Multiple Output Formats - Supports both Markdown and plain text output

  • ๐Ÿ’ป Code Block Extraction - Specifically extracts code examples from documentation

  • ๐Ÿ“Š Page Structure Analysis - Extracts heading hierarchy and table of contents

  • ๐Ÿ”— Link Discovery - Finds and filters documentation links

  • ๐Ÿ“š Batch Processing - Read multiple documentation pages at once


๐ŸŽฏ How It Works

User: "I'm having issues with Razorpay routes" Documentation: https://razorpay.com/docs Amazon Q (Claude 4.5): 1. Reads main docs page 2. Sees links: ["Payments", "Routes", "Webhooks", ...] 3. Intelligently decides: "Routes link is relevant!" 4. Navigates to Routes documentation 5. Extracts content and solves your problem All navigation decisions = Amazon Q's Claude brain ๐Ÿง  MCP Server = Clean content extraction tool ๐Ÿ› ๏ธ

๐Ÿ“ฆ Installation

Prerequisites

Step 1: Clone the Repository

git clone https://github.com/yourusername/amazon-q-web_search.git cd amazon-q-web_search

Step 2: Install Dependencies

Using uv (Recommended):

uv sync

Using pip:

pip install -e .

๐Ÿ”ง Setup with Amazon Q

Step 1: Locate Your MCP Configuration File

Amazon Q looks for MCP server configuration in:

  • Linux/WSL: ~/.aws/amazonq/mcp.json

  • macOS: ~/.aws/amazonq/mcp.json

  • Windows: %USERPROFILE%\.aws\amazonq\mcp.json

Step 2: Create/Edit the Configuration File

Create the directory if it doesn't exist:

mkdir -p ~/.aws/amazonq

Edit or create ~/.aws/amazonq/mcp.json:

For Linux/WSL:

{ "mcpServers": { "doc_reader": { "command": "/full/path/to/amazon-q-web_search/.venv/bin/python", "args": ["/full/path/to/amazon-q-web_search/main.py"] } } }

For macOS:

{ "mcpServers": { "doc_reader": { "command": "/full/path/to/amazon-q-web_search/.venv/bin/python", "args": ["/full/path/to/amazon-q-web_search/main.py"] } } }

For Windows:

{ "mcpServers": { "doc_reader": { "command": "C:\\full\\path\\to\\amazon-q-web_search\\.venv\\Scripts\\python.exe", "args": ["C:\\full\\path\\to\\amazon-q-web_search\\main.py"] } } }

๐Ÿ’ก Tip: Replace /full/path/to/ with the actual path where you cloned the repository.

Step 3: Verify Installation

  1. Start Amazon Q CLI:

    q chat
  2. Check if MCP server is loaded:

    /mcp

    You should see:

    doc_reader - read_web_documentation - get_documentation_links - get_page_structure - extract_code_examples - read_multiple_docs
  3. If not loaded:

    • Check the file path in mcp.json is correct

    • Restart Amazon Q CLI

    • Check logs: q chat logdump


๐Ÿš€ Usage

Basic Example

In Amazon Q CLI, simply ask about documentation:

I'm having issues with Razorpay routes. Can you help me understand how they work? Documentation: https://razorpay.com/docs/

Amazon Q will:

  1. โœ… Read the main documentation page

  2. โœ… Extract all available links

  3. โœ… Intelligently identify the "Routes" link

  4. โœ… Navigate to the Routes documentation

  5. โœ… Provide you with accurate information

More Examples

Python Documentation:

Can you explain Python asyncio event loops? Documentation: https://docs.python.org/3/library/asyncio.html

FastAPI Tutorial:

How do I create a basic FastAPI application? Documentation: https://fastapi.tiangolo.com/

AWS Lambda:

How do I create a Lambda function with Python? Documentation: https://docs.aws.amazon.com/lambda/

๐Ÿ›  Available Tools

Amazon Q intelligently chains these tools to navigate documentation:

1. read_web_documentation

Fetches and extracts clean documentation content from a web page.

Parameters:

  • url (required): The URL of the documentation page

  • output_format (optional): "markdown" (default) or "text"

Returns: Extracted documentation content with title and metadata


2. get_documentation_links

Extracts all links from a documentation page with optional filtering.

Parameters:

  • url (required): The URL of the documentation page

  • filter_pattern (optional): Pattern to filter links (e.g., "api", "guide")

Returns: List of links found on the page


3. get_page_structure

Extracts the heading structure and table of contents from a documentation page.

Parameters:

  • url (required): The URL of the documentation page

Returns: Hierarchical structure of headings on the page


4. extract_code_examples

Extracts all code blocks from a documentation page.

Parameters:

  • url (required): The URL of the documentation page

Returns: All code blocks found with their detected languages


5. read_multiple_docs

Reads multiple documentation pages and combines their content.

Parameters:

  • urls (required): List of documentation URLs (max 10)

Returns: Combined content from all pages


๐Ÿ“ Project Structure

amazon-q-web_search/ โ”œโ”€โ”€ main.py # Entry point โ”œโ”€โ”€ pyproject.toml # Project configuration โ”œโ”€โ”€ README.md # This file โ”œโ”€โ”€ run_mcp.sh # Startup script (Linux/macOS) โ””โ”€โ”€ src/ โ”œโ”€โ”€ __init__.py # Package initialization โ”œโ”€โ”€ server.py # MCP server initialization โ”œโ”€โ”€ config.py # Configuration constants โ”œโ”€โ”€ fetcher.py # HTTP fetching logic โ”œโ”€โ”€ extractor.py # HTML content extraction โ”œโ”€โ”€ formatters.py # Output formatting โ””โ”€โ”€ tools.py # MCP tool definitions

โš™๏ธ Configuration

Edit src/config.py to customize behavior:

Setting

Default

Description

HTTP_TIMEOUT

30.0s

Request timeout in seconds

MAX_CONTENT_LENGTH

10MB

Maximum content size in bytes

USER_AGENT

Custom

HTTP User-Agent string

REMOVE_TAGS

Various

HTML tags to remove during extraction

CONTENT_SELECTORS

Various

Selectors for finding main content


๐Ÿ› Troubleshooting

MCP Server Not Loading

Check configuration:

cat ~/.aws/amazonq/mcp.json

Verify paths are correct:

  • Use absolute paths, not relative

  • Check that Python executable exists

  • Check that main.py exists

Test server manually:

cd /path/to/amazon-q-web_search .venv/bin/python main.py

Check Amazon Q logs:

q chat logdump

Server Starts But Tools Don't Work

Verify dependencies are installed:

cd /path/to/amazon-q-web_search .venv/bin/python -c "import httpx, bs4, markdownify; print('OK')"

Reinstall dependencies:

uv sync --reinstall

Connection Timeout

Increase timeout in settings:

q settings mcp.initTimeout 60000

๐Ÿ“š Dependencies

Package

Purpose

httpx

Async HTTP client for fetching web pages

beautifulsoup4

HTML parsing and navigation

lxml

Fast XML/HTML parser

markdownify

HTML to Markdown conversion

mcp

Model Context Protocol SDK


โš ๏ธ Limitations

Limit

Value

Maximum content size

10MB per page

Maximum URLs per batch

10

Request timeout

30 seconds

Content type

HTML only


๐Ÿค Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

  1. Fork the repository

  2. Create your feature branch (git checkout -b feature/AmazingFeature)

  3. Commit your changes (git commit -m 'Add some AmazingFeature')

  4. Push to the branch (git push origin feature/AmazingFeature)

  5. Open a Pull Request


๐Ÿ“„ License

This project is licensed under the MIT License - see the LICENSE file for details.


๐Ÿ’ฌ Support

  • ๐Ÿ“ซ Open an Issue for bug reports or feature requests

  • โญ Star this repo if you find it useful!


-
security - not tested
F
license - not found
-
quality - not tested

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/Lumos-Labs-HQ/amazon-q-docMCP'

If you have feedback or need assistance with the MCP directory API, please join our Discord server