deep-research-tool
Conduct comprehensive web research using Tavily Search and Crawl, generating aggregated JSON data with detailed findings, search summaries, and markdown formatting instructions.
Instructions
Performs extensive web research using Tavily Search and Crawl. Returns aggregated JSON data including the query, search summary (if any), detailed research findings, and documentation instructions. The documentation instructions will guide you on how the user wants the research data to be formatted into markdown.
Input Schema
Name | Required | Description | Default |
---|---|---|---|
chunks_per_source | No | For 'advanced' search: number of content chunks from each source (1-3). | |
crawl_allow_external | No | Allow crawler to follow links to external domains. | |
crawl_categories | No | Filter crawl URLs by categories (e.g., 'Blog', 'Documentation'). | |
crawl_exclude_domains | No | Regex for domains/subdomains to exclude. | |
crawl_exclude_paths | No | Regex for URL paths to exclude. | |
crawl_extract_depth | No | Extraction depth for crawl ('basic' or 'advanced'). | basic |
crawl_include_images | No | Extract image URLs from crawled pages. | |
crawl_instructions | No | Natural language instructions for the crawler. | |
crawl_limit | No | Total links crawler will process per root URL (1-20). | |
crawl_max_breadth | No | Max links to follow per page level during crawl (1-10). | |
crawl_max_depth | No | Max crawl depth from base URL (1-2). Higher values increase processing time significantly. | |
crawl_select_domains | No | Regex for domains/subdomains to crawl (e.g., '^docs\.example\.com$'). Overrides auto-domain focus. | |
crawl_select_paths | No | Regex for URLs paths to crawl (e.g., '/docs/.*'). | |
crawl_timeout | No | Timeout in seconds for Tavily crawl requests. | |
days | No | For 'news' topic: number of days back from current date to include results. | |
documentation_prompt | No | Optional. Custom prompt for LLM documentation generation. Overrides 'DOCUMENTATION_PROMPT' env var and default. If none set, a comprehensive default is used. | |
exclude_domains_search | No | List of domains to specifically exclude from search. | |
hardware_acceleration | No | Try to use hardware acceleration (WebGPU) if available. | |
include_answer | No | Include an LLM-generated answer from Tavily search (true implies 'basic'). | |
include_domains_search | No | List of domains to specifically include in search. | |
include_raw_content_search | No | Include cleaned HTML from initial search results. | |
include_search_image_descriptions | No | Include image descriptions from initial search results. | |
include_search_images | No | Include image URLs from initial search results. | |
max_search_results | No | Max search results to retrieve for crawling (1-20). | |
output_path | No | Optional. Path where generated research documents and images should be saved. If not provided, a default path in user's Documents folder with timestamp will be used. | |
query | Yes | The main research topic or question. | |
search_depth | No | Depth of the initial Tavily search ('basic' or 'advanced'). | advanced |
search_timeout | No | Timeout in seconds for Tavily search requests. | |
time_range | No | Time range for search results (e.g., 'd' for day, 'w' for week, 'm' for month, 'y' for year). | |
topic | No | Category for the Tavily search ('general' or 'news'). | general |
Input Schema (JSON Schema)
You must be authenticated.
Other Tools from Deep Research MCP
Related Tools
- @jimbul/youdotcom_MCP
- @spences10/mcp-tavily-search
- @spences10/mcp-tavily-search
- @marcopesani/mcp-server-serper
- @mcp2everything/mcp2brave
- @ameeralns/DeepResearchMCP