Skip to main content
Glama
sifter-ai

sifter-mcp

Official

list_records

Fetch records from a sift by ID, with pagination options: limit, offset, or cursor for iterating through large result sets.

Instructions

Get extracted records from a sift.

Args:
    sift_id: The sift identifier
    limit: Maximum number of records to return (default 20, max 100)
    offset: Number of records to skip (ignored when cursor is provided)
    cursor: Opaque pagination cursor from a previous call's next_cursor field

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
sift_idYes
limitNo
offsetNo
cursorNo

Implementation Reference

  • MCP tool handler for list_records. Uses FastMCP @mcp.tool() decorator. Fetches a sift by ID and returns paginated extracted records with items, total, limit, offset, and next_cursor.
    @mcp.tool()
    async def list_records(sift_id: str, limit: int = 20, offset: int = 0, cursor: str = "") -> dict:
        """Get extracted records from a sift.
    
        Args:
            sift_id: The sift identifier
            limit: Maximum number of records to return (default 20, max 100)
            offset: Number of records to skip (ignored when cursor is provided)
            cursor: Opaque pagination cursor from a previous call's next_cursor field
        """
        async with _get_client() as client:
            handle = await client.get_sift(sift_id)
            page = await handle.find(limit=min(limit, 100), cursor=cursor or None)
        return {"items": page.items, "total": page.total, "limit": page.limit, "offset": page.offset, "next_cursor": page.next_cursor}
  • Agent loop handler for list_records. Dispatched by AgentToolRunner._dispatch(). Calls SiftResultsService.get_results() and returns total + records with extracted_data.
    if name == "list_records":
        limit = min(args.get("limit", 20), 100)
        offset = args.get("offset", 0)
        results, total = await self.results_svc.get_results(args["sift_id"], skip=offset, limit=limit)
        return {
            "total": total,
            "records": [{"id": r.id, "filename": r.filename, **r.extracted_data} for r in results],
        }
  • OpenAI/LLM function-calling schema for list_records, defining parameters: sift_id (required), limit (optional integer), offset (optional integer).
    {
        "type": "function",
        "function": {
            "name": "list_records",
            "description": "Get a paginated list of extracted records from a sift.",
            "parameters": {
                "type": "object",
                "properties": {
                    "sift_id": {"type": "string", "description": "The sift identifier"},
                    "limit": {"type": "integer", "description": "Max records to return (default 20, max 100)"},
                    "offset": {"type": "integer", "description": "Records to skip for pagination"},
                },
                "required": ["sift_id"],
            },
        },
    },
  • Helper function _infer_sift_id_from_trace that searches for sift_id from list_records calls in tool traces to infer context for widgets.
    def _infer_sift_id_from_trace(trace: list) -> Optional[str]:
        """Return the first sift_id seen in aggregate_sift or get_sift calls."""
        for t in trace:
            if t.tool in ("aggregate_sift", "get_sift", "list_records", "query_sift"):
                sid = t.args.get("sift_id")
                if sid:
Behavior2/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

No annotations are provided, so the description carries the full burden. It does not disclose whether the operation is read-only, destructive, or requires special permissions. The pagination behavior is implied but not explicitly stated as safe.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is structured as a clear list with parameter details. It is moderately concise, though the overall length could be reduced by combining purpose and parameters.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

While parameters are well-covered, the description omits the return value format, any output schema, and safety information. Without annotations, it feels incomplete for a tool with 4 parameters.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters5/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

With 0% schema description coverage, the description fully compensates by explaining each parameter's purpose, defaults, and interactions (e.g., offset ignored when cursor provided). This adds significant value beyond the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose4/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description opens with 'Get extracted records from a sift,' which clearly states the verb and resource. However, it does not differentiate from sibling tools like 'find_records' or 'query_sift' that may return similar data.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines2/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

There is no guidance on when to use this tool versus alternatives, when not to use it, or what prerequisites are needed. The description only lists parameters without context.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Install Server

Other Tools

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/sifter-ai/sifter'

If you have feedback or need assistance with the MCP directory API, please join our Discord server