ingest_file
Ingest documents (PDF, DOCX, TXT, MD) into a vector database for semantic search. Supports updating existing documents and visual captioning for PDF figures.
Instructions
Ingest a single document file (PDF, DOCX, TXT, or MD) into the vector database for semantic search. The file path must be absolute. Re-ingesting a file that was already ingested updates the existing document.
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| filePath | Yes | Absolute path to the file to ingest. Example: "/Users/user/documents/manual.pdf" | |
| visual | No | If true and the file is a PDF, run vision-language model (VLM) captioning on figure pages. Has no effect on non-PDF files. | |
| visualQuality | No | VLM profile to use when visual is true. "fast" (default) selects the lightweight SmolVLM-256M. "quality" selects Qwen2.5-VL-3B-Instruct-ONNX, which has higher fidelity on figures containing in-image text at roughly 10x the model-cache footprint and roughly 2x the per-page inference cost. An empty string is treated the same as omitting the parameter and is normalized to "fast". Silently ignored when visual is false. | fast |
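
The schema above can be illustrated with a small client-side sketch that assembles an arguments object before calling the tool. The helper name `build_ingest_args` is hypothetical and not part of the server. The extension check (including its case-insensitivity) is an assumption based on the supported file types listed above, and the server performs its own validation regardless.

```python
import json
from pathlib import PurePosixPath, PureWindowsPath

# Assumption: these mirror the file types named in the tool description.
SUPPORTED_EXTENSIONS = {".pdf", ".docx", ".txt", ".md"}
VISUAL_QUALITIES = {"fast", "quality"}


def build_ingest_args(file_path, visual=False, visual_quality=None):
    """Build an arguments dict for ingest_file (hypothetical helper)."""
    # filePath must be absolute; accept either POSIX or Windows style.
    is_absolute = (
        PurePosixPath(file_path).is_absolute()
        or PureWindowsPath(file_path).is_absolute()
    )
    if not is_absolute:
        raise ValueError(f"filePath must be absolute: {file_path!r}")

    # Client-side convenience check only; the server decides what it accepts.
    suffix = PurePosixPath(file_path).suffix.lower()
    if suffix not in SUPPORTED_EXTENSIONS:
        raise ValueError(f"unsupported file type: {suffix!r}")

    args = {"filePath": file_path}

    # Only send optional parameters that the caller actually set.
    if visual:
        args["visual"] = True

    if visual_quality is not None:
        # The server treats "" like an omitted value, i.e. "fast".
        normalized = visual_quality or "fast"
        if normalized not in VISUAL_QUALITIES:
            raise ValueError(f"visualQuality must be one of {sorted(VISUAL_QUALITIES)}")
        args["visualQuality"] = normalized

    return args


if __name__ == "__main__":
    example = build_ingest_args(
        "/Users/user/documents/manual.pdf",
        visual=True,
        visual_quality="quality",
    )
    print(json.dumps(example, indent=2))
```

The resulting dict can be serialized as JSON and passed as the tool's arguments by whatever client you use. Note that the sketch still forwards `visualQuality` when `visual` is not set, since the server silently ignores it in that case.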