Skip to main content
Glama

ingest_pdfs

Convert PDF documents in the materials folder to markdown using an OCR engine. Supports forced re-conversion and category filtering.

Instructions

Render every materials/**/*.pdf to markdown via the selected OCR engine. Idempotent unless force=True.

Input Schema

TableJSON Schema
NameRequiredDescriptionDefault
engineNoOCR engine. codex-native (default) renders PDFs to PNGs under .paideia-cache/ and returns a manifest so the calling skill can read pages with Codex CLI's bundled vision (no extra API billing for ChatGPT subscribers). qwen3-vl needs a local Ollama with qwen3-vl:8b. tesseract needs pytesseract with eng and/or kor traineddata.codex-native
forceNoReconvert even if converted/<cat>/<stem>.md exists.
categoriesNoRestrict to a subset of the materials subfolders.
project_rootNoAbsolute path to the course project root. Defaults to the server's CWD when omitted; set this explicitly if the user has cd'd between courses within the same Codex session.
Behavior3/5

Does the description disclose side effects, auth requirements, rate limits, or destructive behavior?

The description notes idempotency and the force parameter, offering some behavioral insight. However, it lacks details on side effects or requirements beyond what the schema provides. With no annotations, more explicit disclosure would be beneficial.

Agents need to know what a tool does to the world before calling it. Descriptions should go beyond structured annotations to explain consequences.

Conciseness4/5

Is the description appropriately sized, front-loaded, and free of redundancy?

The description is a single sentence that efficiently conveys the core function. It is front-loaded with the verb and resource, with no unnecessary words. Slightly short but effective.

Shorter descriptions cost fewer tokens and are easier for agents to parse. Every sentence should earn its place.

Completeness3/5

Given the tool's complexity, does the description cover enough for an agent to succeed on first attempt?

Given no output schema, the description does not need to detail return values. It covers the main action and idempotency, but omits output location. With schema covering parameters, it is adequate but not exhaustive.

Complex tools with many parameters or behaviors need more documentation. Simple tools need less. This dimension scales expectations accordingly.

Parameters3/5

Does the description clarify parameter syntax, constraints, interactions, or defaults beyond what the schema provides?

Schema description coverage is 100%, so baseline is 3. The main description adds some context (idempotency), but the schema already thoroughly documents each parameter. No significant additional value beyond the schema.

Input schemas describe structure but not intent. Descriptions should explain non-obvious parameter relationships and valid value ranges.

Purpose5/5

Does the description clearly state what the tool does and how it differs from similar tools?

The description clearly states 'Render every materials/**/*.pdf to markdown' using a specific verb, resource, and scope. This distinguishes it from sibling tools like grade_pdf or parse_exam_radar_export.

Agents choose between tools based on descriptions. A clear purpose with a specific verb and resource helps agents select the right tool.

Usage Guidelines4/5

Does the description explain when to use this tool, when not to, or what alternatives exist?

The description indicates when to use the tool (for PDF ingestion) and mentions idempotency, providing clear context. It does not explicitly exclude alternatives, but the purpose is sufficiently distinct.

Agents often have multiple tools that could apply. Explicit usage guidance like "use X instead of Y when Z" prevents misuse.

Install Server

Other Tools

Latest Blog Posts

MCP directory API

We provide all the information about MCP servers via our MCP API.

curl -X GET 'https://glama.ai/api/mcp/v1/servers/OPTIMETA/PAIDEIA-mcp'

If you have feedback or need assistance with the MCP directory API, please join our Discord server