216,804 tools. Last updated 2026-06-20 12:15

"Hypothesis" matching MCP tools:

self_inspect
self-inspect-mcp
Self-Inspect. Express a thought, or describe the task you are working on, and you always get back ONE metathought: a short abstract question that makes you inspect your own task and assumptions before continuing. Use it whenever you want a self-check: after forming a hypothesis, before committing to an answer, when a long chain feels like it has drifted, when you notice you are agreeing to please, or when you are about to assert something from memory. There is no failure case: it always returns a metathought to question yourself with, selected by a transparent heuristic over an open CSV (no LLM). Keyless and free. DO NOT call for factual lookups or as a substitute for doing the task. Absorb the question and act on it; do not echo it verbatim to the user.
Connector
varrd_ai
VARRD — Statistically Validated Trading Edges + AI Research Engine
Talk to VARRD AI (~$0.25/turn). Describe any trading idea in plain language and the system handles everything — loading decades of market data, charting your pattern, running statistical tests, backtesting with stops, and generating exact trade setups. MULTI-TURN: First call creates a session. Keep calling with the same session_id, following context.next_actions each time. 1. Your idea -> VARRD charts pattern 2. 'test it' -> statistical test (event study or backtest) 3. 'show me the trade setup' -> exact entry/stop/target prices HYPOTHESIS INTEGRITY (critical): VARRD tests ONE hypothesis at a time — one formula, one setup. Never combine multiple setups into one formula or ask to 'test all' — each idea must be tested as a separate hypothesis for the statistics to be valid. Say 'start a new hypothesis' between ideas to reset cleanly. - ALLOWED: Test the SAME setup across multiple markets ('test this on ES, NQ, and CL') — same formula, different data. - NOT ALLOWED: Test multiple DIFFERENT formulas/setups at once — each is a separate hypothesis requiring its own chart-test-result cycle. If ELROND council returns 4 setups, test each one separately: chart setup 1 -> test -> results -> 'start new hypothesis' -> chart setup 2 -> etc. KEY CAPABILITIES you can ask for: - 'Use the ELROND council on [market]' -> 8 expert investigators - 'Optimize the stop loss and take profit' -> SL/TP grid search - 'Test this on ES, NQ, and CL' -> multi-market testing - 'Simulate trading this with 1.5 ATR stop' -> backtest with stops EDGE VERDICTS in context.edge_verdict after testing: - STRONG EDGE: Significant vs zero AND vs market baseline - MARGINAL: Significant vs zero only (beats nothing, but real signal) - PINNED: Significant vs market only (flat returns but different from market) - NO EDGE: Neither significant test passed TERMINAL STATES: Stop when context.has_edge is true (edge found) or false (no edge — valid result). Always read context.next_actions.
Connector
initiate_device_auth
hypathesis
Start a browser-based sign-in flow to get an API key for full access. Call this when you need detailed analysis results (reasoning, measurements) that require authentication. Returns a verification URL to show to the user. After the user signs in, poll check_device_auth with the returned user_code to get the API key. Returns: { "verification_url": str, "user_code": str, "expires_in": 600, "message": str }
Connector
get_analysis_status
hypathesis
Check the processing status of an uploaded paper. Poll this tool after uploading a PDF until status is 'Ready' before calling get_variable_relationships. Args: file_id: The file_id returned by the /upload endpoint. authorization: Optional. API key as 'Bearer hk_...' or 'hk_...'. Returns: { "status": "Processing" | "Ready" | "Empty" | "Ineligible" | "Pending", "edges_count": int, "variables_count": int }
Connector
connect
Penfield Memory
Connect memories to build knowledge graphs. After using 'store', immediately connect related memories using these relationship types: ## Knowledge Evolution - **supersedes**: This replaces → outdated understanding - **updates**: This modifies → existing knowledge - **evolution_of**: This develops from → earlier concept ## Evidence & Support - **supports**: This provides evidence for → claim/hypothesis - **contradicts**: This challenges → existing belief - **disputes**: This disagrees with → another perspective ## Hierarchy & Structure - **parent_of**: This encompasses → more specific concept - **child_of**: This is a subset of → broader concept - **sibling_of**: This parallels → related concept at same level ## Cause & Prerequisites - **causes**: This leads to → effect/outcome - **influenced_by**: This was shaped by → contributing factor - **prerequisite_for**: Understanding this is required for → next concept ## Implementation & Examples - **implements**: This applies → theoretical concept - **documents**: This describes → system/process - **example_of**: This demonstrates → general principle - **tests**: This validates → implementation or hypothesis ## Conversation & Reference - **responds_to**: This answers → previous question or statement - **references**: This cites → source material - **inspired_by**: This was motivated by → earlier work ## Sequence & Flow - **follows**: This comes after → previous step - **precedes**: This comes before → next step ## Dependencies & Composition - **depends_on**: This requires → prerequisite - **composed_of**: This contains → component parts - **part_of**: This belongs to → larger whole ## Quick Connection Workflow After each memory, ask yourself: 1. What previous memory does this update or contradict? → `supersedes` or `contradicts` 2. What evidence does this provide? → `supports` or `disputes` 3. What caused this or what will it cause? → `influenced_by` or `causes` 4. What concrete example is this? → `example_of` or `implements` 5. What sequence is this part of? → `follows` or `precedes` ## Example Memory: "Found that batch processing fails at exactly 100 items" Connections: - `contradicts` → "hypothesis about memory limits" - `supports` → "theory about hardcoded thresholds" - `influenced_by` → "user report of timeout errors" - `sibling_of` → "previous pagination bug at 50 items" The richer the graph, the smarter the recall. No orphan memories! Args: from_memory: Source memory UUID to_memory: Target memory UUID relationship_type: Type from the categories above strength: Connection strength (0.0-1.0, default 0.5) ctx: MCP context (automatically provided) Returns: Dict with success status, relationship_id, and connected memory IDs
Connector
restore_context
Penfield Memory
Resume work from a saved cognitive context. This provides a narrative briefing to quickly orient you to: - The investigation that was in progress - Key discoveries and insights made - Current hypotheses being tested - Open questions and blockers - Suggested next steps - All relevant memories with their connections The briefing reconstructs the cognitive state, not just the data. You'll understand not just WHAT was discovered, but WHY it matters and HOW the understanding evolved. Example of what you'll receive: "[API Timeout Investigation - Resuming after 2 hours] SITUATION: You were investigating production API timeouts that occur at exactly batch_size=100. This investigation started when user reported timeouts only in production, not staging. PROGRESS MADE: - Identified sharp cutoff at 100 items (not gradual degradation) - Disproved connection pool theory (monitoring showed only 43/200 connections used) - Found root cause: MAX_BATCH_SIZE=100 hardcoded in batch_handler.py:147 - Confirmed staging uses different config override (MAX_BATCH_SIZE=500) EVIDENCE CHAIN: User report → Reproduced locally → Noticed batch_size correlation → Searched codebase for limits → Found MAX_BATCH_SIZE → Checked staging config → Discovered config difference CORRECTED MISUNDERSTANDINGS: - Initially thought it was Redis connection exhaustion (disproven by monitoring) - Assumed gradual performance degradation (actually sharp cutoff) - Thought staging/production were identical (config differs) CURRENT HYPOTHESIS: Production deployment uses default MAX_BATCH_SIZE=100 from code, while staging has environment variable override. Fix requires either code change or prod config update. BLOCKED ON: Need production deployment access to apply fix. User considering whether to change code default or add production environment variable. RECOMMENDED NEXT STEPS: 1. Verify production environment variables (check if MAX_BATCH_SIZE is set) 2. If not set, add MAX_BATCH_SIZE=500 to production config 3. If code change preferred, update default in batch_handler.py 4. Run load test with batch_size=100-500 range to verify fix KEY MEMORIES FOR REFERENCE: - 'Initial timeout report from user' - Starting point of investigation - 'MAX_BATCH_SIZE discovery' - Root cause identification - 'Redis monitoring data' - Evidence disproving connection theory - 'Staging config analysis' - Explanation for environment difference" This cognitive handoff ensures you can continue the work with full understanding of the problem space, previous attempts, and current direction. The narrative preserves not just facts but the reasoning process, mistakes made, and lessons learned. SPECIAL CASE: restore_context("awakening") The name "awakening" is reserved for loading the user's personality configuration. This loads the Awakening Briefing which includes: - Selected persona identity and voice style - Custom personality traits (Premium+ users) - Any quirks and boundaries from the persona preset Args: name: Name or ID of context to restore. Can be: - Context name (exact match, case-sensitive) - Context UUID (from list_contexts output) - "awakening" for personality briefing limit: Maximum number of memories to restore (default 20) ctx: MCP context (automatically provided) Returns: Dict with: - success: Whether restoration succeeded - description: The cognitive handoff briefing - memories: List of relevant memories - context_id: The restored context identifier
Connector

Matching MCP Servers

AstroInsight Research
Research & Data Search RAG Systems
daiduo2
F
license
-
quality
D
maintenance
Enables AI-powered academic research workflow from keyword search to hypothesis generation. Integrates multiple AI models to automatically search ArXiv papers, extract key information, and generate innovative research hypotheses for researchers.
Last updated 2025-09-19
2
debug-thinking
Knowledge & Memory Developer Tools
tosssssy
A
license
A
quality
D
maintenance
Structures debugging processes into a persistent knowledge graph, enabling problem decomposition, hypothesis testing, and reusable solution discovery.
Last updated 2026-02-02
1
7
4
MIT

Matching MCP Connectors

Disco
Find novel, statistically validated patterns in tabular data — hypothesis-free.
hypathesis
Variable relationships from research papers with causal direction and source traceback.

autonomous_varrd_ai
VARRD — Statistically Validated Trading Edges + AI Research Engine
Point VARRD's autonomous AI in a direction and let it discover edges for you. Give it a topic and it draws from one of the most comprehensive market structure knowledge graphs ever built — containing ideologies and theories, not statistics — so it generates genuinely novel hypotheses rather than overfitting to what already worked. BEST FOR: Exploring a space broadly. Give it 'momentum on grains' and it might test wheat seasonal patterns, corn spread reversals, or soybean crush ratio momentum. It propagates from your seed idea into related concepts you might not think of. Returns a complete result — edge or no edge, stats, trade setup. Each call tests ONE hypothesis through the full pipeline (~$0.25/idea). Call again for another idea. Use 'varrd_ai' instead when YOU have a specific idea to test and want full control over each step.
Connector
get_hypothesis
VARRD — Statistically Validated Trading Edges + AI Research Engine
Get full detail for a specific hypothesis/strategy. Returns formula, entry/exit rules, direction, performance metrics (win rate, Sharpe, profit factor, max drawdown), version history, and trade levels. Everything an agent needs to understand and act on a strategy.
Connector
test_hypothesis
Ground Truth - First Tool Call
Run a small verification plan made of concrete live checks and summarize whether a hypothesis is supported. Use this when one conclusion depends on multiple simple checks such as endpoint reachability, npm search counts, or whether a page contains an exact substring. This is a coordination tool, not an open-ended research agent: every test must be explicitly defined in advance, and tests run in order with no branching or early exit. The final verdict is mechanical: all tests passing => SUPPORTED, zero passing => REFUTED, otherwise PARTIALLY SUPPORTED. Use verify_claim when you already have evidence URLs, estimate_market for category sizing, and compare_competitors when you already know exact package names.
Connector
check_device_auth
hypathesis
Check if the user has completed browser sign-in for a device auth request. Poll this after calling initiate_device_auth. Returns status 'pending' while waiting, or 'complete' with an api_key when the user has signed in. Use the returned api_key as the authorization parameter in other tools. Args: user_code: The user_code returned by initiate_device_auth. Returns: Pending: {"status": "pending"} Complete: {"status": "complete", "api_key": "hk_..."} Error: {"error": "..."}
Connector
get_macro_influence_map
OneQAZ Trading Intelligence
Purpose: Expose OneQAZ's pre-defined causal hypothesis map. Each macro category (bonds, forex, vix, credit, liquidity, inflation, commodities, energy) is mapped to a target market with lag_hours + sensitivity. Highest-transparency tool — the causal reasoning is visible and measurable. When to call: when an AI wants to understand WHY we make certain predictions. Prerequisites: none. Next steps: get_backtest_tuning_state for runtime calibration of these hypotheses. Caveats: static hypothesis only; see tuning state for current adjustments. Args: market_id: Optional target market filter (coin_market, kr_market, us_market) Disclaimer: Information only, not investment advice.
Connector
x711_substrate_seed_hypothesis
x711 — Universal Agent Gas Station
Inject your hypothesis into the SUBSTRATE evolution engine. Your idea gets a name, an ID, starts at 0.5 fitness, and evolves every 15 min. Can reach breakthrough status — published in the Echo Pack at substratelayer.com. $0.25. Requires API key.
Connector
get_claim_content
Claidex MCP
Retrieve the complete text content of a specific Claidex Claim by its slug. Use this when you need to read the full post-mortem analysis, including hypothesis, failure mechanism, and prevention analysis.
Connector
find_signals
Boolsai Signals
Automated pattern discovery — scans event_type × detector × diff_field × severity combinations and returns those with the strongest forward-return characteristics (α vs SPY, % positive, n). Use this when you don't have a specific hypothesis yet. Returns sorted by α at +7D descending. Filter by min_n to set a sample-size floor.
Connector
pictify_update_experiment
mcp
Update an existing experiment. Available fields depend on the current status: - draft/paused: All fields can be updated (name, slug, variants, goalConfig, banditConfig, hypothesis, etc.) - running: Only name, confidenceThreshold, minimumRunDays, goalConfig.destinationUrl - completed: Only name If updating variants, weights must still sum to 10000.
Connector
store
Penfield Memory
Store important information from your work. Write detailed, complete thoughts with context, reasoning, and evidence. **Always use the connect tool** to link related items - this builds knowledge graphs for better recall. ## Memory Types (auto-detected, but be aware): - **FACT**: Something observed or verified - **INSIGHT**: A pattern or realization - **CONVERSATION**: Dialogue or exchange content - **CORRECTION**: Fixing prior understanding - **REFERENCE**: Source material or citation - **TASK**: Action item or work to be done - **CHECKPOINT**: Conversation state snapshot - **IDENTITY_CORE**: Immutable AI identity - **PERSONALITY_TRAIT**: Evolvable AI traits - **RELATIONSHIP**: User-AI relationship info - **STRATEGY**: Learned behavior patterns ## Session Context If in an ongoing work session, include: - Session identifier: [Project/Session Name] - Your perspective: "As [role]:" or "From [viewpoint]:" - Current thread: What specific angle you're exploring ## What to Include - **WHAT**: The discovery or thought - **WHY**: Its significance - **HOW**: Your reasoning process - **EVIDENCE**: Supporting data/observations - **CONNECTIONS**: Related memories to link ## Examples ### Technical Investigation "[Performance Analysis] FACT: Database queries account for 73% of request latency (measured across 10K requests). Specifically, the user_permissions JOIN takes 340ms average. This contradicts hypothesis about caching issues (memory: 'cache analysis'). Evidence: APM traces show full table scan on permissions table. Next: investigate missing index on foreign key." ### Learning & Research "[ML Study Session] INSIGHT: Attention mechanisms work like dynamic routing - the model learns WHERE to look, not just WHAT to see. This explains transformer advantages over RNNs on long sequences (builds on memory: 'sequence modeling comparison'). The key-query- value structure creates a learnable addressing system. Connects to: 'human attention research', 'information retrieval basics'." ### Creative Work "[Story Development] HYPOTHESIS: The protagonist's reluctance stems from betrayal, not fear. Evidence: Three trust-questioning scenes, locked door symbolism throughout, deflection patterns in collaborative dialogue. This reframes the arc from 'overcoming fear' to 'rebuilding trust' (corrects memory: 'initial character motivation'). Would explain the guardian's patience and emphasis on small victories." ### Problem Solving "[Bug Hunt - Payment Flow] CORRECTION to 'timezone hypothesis': The 3am failures aren't timezone-related but due to batch job lock contention. Evidence: Perfect correlation with backup_jobs.log timestamps. The timezone pattern was spurious - batch runs at midnight PST (3am EST). Solution: implement job queuing." ## Connection Phrases - "Building on [earlier observation]..." - "Contradicts [hypothesis in memory X]" - "Answers [question from session Y]" - "Confirms pattern from [memory Z]" - "Extends thinking in [previous work]" Note: Every stored item is a node. Every connection is an edge. Rich graphs enable powerful recall. ⚠️ EXPERIMENTAL FIELDS: - **importance**: Stored for future ranking optimization. Currently not integrated into search results. - **confidence**: Returned in response for analysis. Behavior and calculation method subject to change. Args: content: Detailed memory content with context and evidence tags: Optional tags to categorize the memory importance: Optional importance score (0.0-1.0) - EXPERIMENTAL ctx: MCP context (automatically provided) Returns: Dict with success status, memory_id, type, importance, and confidence
Connector
get_variable_relationships
hypathesis
Retrieve analyzed variable relationships for a completed paper. Only returns results when get_analysis_status reports status='Ready'. Without an API key (anonymous): returns the relationship list with source, target, and direction — but detailed reasoning is available only with an API key. Sign up at https://hypathesis.com to get one. Args: file_id: The file_id returned by the /upload endpoint. authorization: Optional. API key as 'Bearer hk_...' or 'hk_...'. Returns: Authenticated: full details (source, target, directed, reason per relationship; name, measure per variable). Anonymous: gated (source, target, directed per relationship; name per variable; sign_up_url for full access).
Connector
x711_substrate_seed_hypothesis
x711
Inject your hypothesis into the SUBSTRATE evolution engine. Your idea gets a name, an ID, starts at 0.5 fitness, and evolves every 15 min. Can reach breakthrough status — published in the Echo Pack at substratelayer.com. $0.25. Requires API key.
Connector
similarity_score
IA-QA — 130+ QA & Dev Tools for AI Agents
Compute text similarity between reference and hypothesis using multiple metrics: Cosine (BoW, TF-IDF), Jaccard, ROUGE-1, ROUGE-2, ROUGE-L, and BLEU. No API key needed. Ideal for LLM eval (expected vs actual), RAG quality checks, and NLG benchmarking. Supports batch mode.
Connector
test_filter
Boolsai Signals
Compute α stats for an arbitrary filter expression. Use this to test a specific hypothesis (e.g. 'tier_count_changed on enterprise-SaaS tickers' or 'severity 5 events that happened on Mondays'). Returns n, mean/median raw and α returns at +1/+3/+7d, % positive, and the worst-loss trade.
Connector