Extract typed fields from document text using a caller-defined schema. Uses a quality AI model with retry logic. Use when you need specific data points from a document rather than full text. For invoices with known fields, parse_invoice (prebuilt schema) may be simpler. For general summarization, use summarize_document instead.
Schema format: { "field_name": "type hint or description" } — e.g. { "contract_date": "ISO date", "party_a": "string", "penalty_usd": "number" }.
Returns: {
data: { <field>: value },
data_cited: { <field>: { value, confidence: "high"|"medium"|"low", citations: [{ quote, paragraphs[] }] } }
}
Example prompts:
- "Extract the contract date, parties, and penalty amount from this agreement."
- "Pull the vendor name, PO number, and total from this document."
- "Get me all named fields from this form using my custom schema."
Connector