We provide all the information about MCP servers via our MCP API.
curl -X GET 'https://glama.ai/api/mcp/v1/servers/letsgomaslow/think'
If you have feedback or need assistance with the MCP directory API, please join our Discord server
tool-accuracy-scenarios.jsonl•7.03 KiB
{"scenarioId":"accuracy-trace-001","userQuestion":"Walk me through solving this step by step - I need to figure out why our database queries are slow","expectedTool":"trace","acceptableTools":["trace","debug"],"category":"sequential-reasoning"}
{"scenarioId":"accuracy-trace-002","userQuestion":"Let me think through this problem carefully. I need to decide whether to use a monolith or microservices","expectedTool":"trace","acceptableTools":["trace","decide"],"category":"sequential-reasoning"}
{"scenarioId":"accuracy-trace-003","userQuestion":"Help me reason through the implications of this architectural change, one thought at a time","expectedTool":"trace","acceptableTools":["trace"],"category":"sequential-reasoning"}
{"scenarioId":"accuracy-model-001","userQuestion":"Apply first principles thinking to redesign our authentication system from scratch","expectedTool":"model","acceptableTools":["model"],"category":"mental-models"}
{"scenarioId":"accuracy-model-002","userQuestion":"What mental model should I use to analyze the cost-benefit of this infrastructure upgrade?","expectedTool":"model","acceptableTools":["model","decide"],"category":"mental-models"}
{"scenarioId":"accuracy-model-003","userQuestion":"Help me apply Occam's razor to simplify this overly complex solution","expectedTool":"model","acceptableTools":["model"],"category":"mental-models"}
{"scenarioId":"accuracy-pattern-001","userQuestion":"What design pattern should I use for handling multiple payment providers?","expectedTool":"pattern","acceptableTools":["pattern"],"category":"design-patterns"}
{"scenarioId":"accuracy-pattern-002","userQuestion":"How should I architect this service to handle state management across components?","expectedTool":"pattern","acceptableTools":["pattern","paradigm"],"category":"design-patterns"}
{"scenarioId":"accuracy-pattern-003","userQuestion":"I need a pattern for integrating with external APIs that might be unreliable","expectedTool":"pattern","acceptableTools":["pattern"],"category":"design-patterns"}
{"scenarioId":"accuracy-paradigm-001","userQuestion":"Should I use OOP or functional programming for this data transformation pipeline?","expectedTool":"paradigm","acceptableTools":["paradigm"],"category":"programming-paradigms"}
{"scenarioId":"accuracy-paradigm-002","userQuestion":"What programming approach fits best for a real-time event processing system?","expectedTool":"paradigm","acceptableTools":["paradigm","pattern"],"category":"programming-paradigms"}
{"scenarioId":"accuracy-paradigm-003","userQuestion":"Help me understand if reactive programming is right for this use case","expectedTool":"paradigm","acceptableTools":["paradigm"],"category":"programming-paradigms"}
{"scenarioId":"accuracy-debug-001","userQuestion":"Why is my code failing intermittently in production? Help me find the root cause","expectedTool":"debug","acceptableTools":["debug","hypothesis"],"category":"debugging"}
{"scenarioId":"accuracy-debug-002","userQuestion":"I have a memory leak somewhere in this application. How should I track it down?","expectedTool":"debug","acceptableTools":["debug"],"category":"debugging"}
{"scenarioId":"accuracy-debug-003","userQuestion":"The login flow is broken but I can't figure out where. Can you help me debug systematically?","expectedTool":"debug","acceptableTools":["debug"],"category":"debugging"}
{"scenarioId":"accuracy-council-001","userQuestion":"I need multiple perspectives on whether we should migrate to Kubernetes","expectedTool":"council","acceptableTools":["council","decide"],"category":"collaborative-reasoning"}
{"scenarioId":"accuracy-council-002","userQuestion":"What would different experts say about this API versioning strategy?","expectedTool":"council","acceptableTools":["council"],"category":"collaborative-reasoning"}
{"scenarioId":"accuracy-council-003","userQuestion":"Let's get diverse viewpoints on this security architecture decision","expectedTool":"council","acceptableTools":["council","decide"],"category":"collaborative-reasoning"}
{"scenarioId":"accuracy-decide-001","userQuestion":"Help me choose between AWS, GCP, and Azure for our infrastructure","expectedTool":"decide","acceptableTools":["decide","model"],"category":"decision-making"}
{"scenarioId":"accuracy-decide-002","userQuestion":"What's the best decision here? I need to weigh the tradeoffs between speed and quality","expectedTool":"decide","acceptableTools":["decide"],"category":"decision-making"}
{"scenarioId":"accuracy-decide-003","userQuestion":"I need to make a structured decision about which database to use - help me analyze options","expectedTool":"decide","acceptableTools":["decide"],"category":"decision-making"}
{"scenarioId":"accuracy-reflect-001","userQuestion":"Am I thinking about this correctly? I want to verify my assumptions about caching","expectedTool":"reflect","acceptableTools":["reflect","hypothesis"],"category":"metacognition"}
{"scenarioId":"accuracy-reflect-002","userQuestion":"What am I missing in my reasoning about this performance optimization?","expectedTool":"reflect","acceptableTools":["reflect"],"category":"metacognition"}
{"scenarioId":"accuracy-reflect-003","userQuestion":"Help me assess my knowledge gaps about Kubernetes networking","expectedTool":"reflect","acceptableTools":["reflect"],"category":"metacognition"}
{"scenarioId":"accuracy-hypothesis-001","userQuestion":"Let me test my assumption that the bottleneck is in the database layer","expectedTool":"hypothesis","acceptableTools":["hypothesis","debug"],"category":"scientific-method"}
{"scenarioId":"accuracy-hypothesis-002","userQuestion":"How can I validate my theory about why users are abandoning checkout?","expectedTool":"hypothesis","acceptableTools":["hypothesis"],"category":"scientific-method"}
{"scenarioId":"accuracy-hypothesis-003","userQuestion":"I think the issue is network latency - help me design an experiment to verify","expectedTool":"hypothesis","acceptableTools":["hypothesis","debug"],"category":"scientific-method"}
{"scenarioId":"accuracy-debate-001","userQuestion":"What are the arguments for and against using GraphQL vs REST?","expectedTool":"debate","acceptableTools":["debate","decide"],"category":"argumentation"}
{"scenarioId":"accuracy-debate-002","userQuestion":"Challenge my position that we should build this in-house instead of buying","expectedTool":"debate","acceptableTools":["debate"],"category":"argumentation"}
{"scenarioId":"accuracy-debate-003","userQuestion":"Present both sides of the serverless vs containers argument","expectedTool":"debate","acceptableTools":["debate","decide"],"category":"argumentation"}
{"scenarioId":"accuracy-map-001","userQuestion":"Help me visualize the data flow in this distributed system","expectedTool":"map","acceptableTools":["map"],"category":"visual-reasoning"}
{"scenarioId":"accuracy-map-002","userQuestion":"Draw out the relationships between these microservices","expectedTool":"map","acceptableTools":["map"],"category":"visual-reasoning"}
{"scenarioId":"accuracy-map-003","userQuestion":"I need a diagram to understand the state transitions in this workflow","expectedTool":"map","acceptableTools":["map"],"category":"visual-reasoning"}