index_directory
Scans a directory to extract text from PDFs, documents, images, and media, then generates semantic embeddings for search.
Instructions
Scans a local directory, extracts text from files (PDF, DOCX, TXT, MD), reads raw video, audio, and image bytes, generates semantic embeddings using Gemini 2, and stores them for searching.
Args:
- directory_path: Absolute path to the directory.
- ignore: Optional list of glob patterns to ignore (e.g., [".log", "drafts", "temp"]).
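How the tool applies the `ignore` patterns is not documented here. As a rough illustration only, the sketch below assumes each pattern is tested against every component of a file's relative path using Python's `fnmatch` rules. The helper name `is_ignored`, the sample patterns, and the sample file list are all hypothetical and are not part of the tool.

```python
from fnmatch import fnmatchcase
from pathlib import PurePosixPath


def is_ignored(relative_path: str, ignore: list[str]) -> bool:
    """Return True if any path component matches any ignore pattern.

    Assumption: patterns are compared component by component with
    case-sensitive fnmatch semantics. The real tool may match differently.
    """
    parts = PurePosixPath(relative_path).parts
    return any(fnmatchcase(part, pattern) for part in parts for pattern in ignore)


# Hypothetical patterns and files, chosen only to exercise the helper.
patterns = ["*.log", "drafts", "temp*"]
files = [
    "notes/report.pdf",
    "drafts/outline.md",
    "logs/server.log",
    "tempfiles/a.txt",
]

# Keep only the files that no pattern excludes.
kept = [f for f in files if not is_ignored(f, patterns)]
print(kept)
```

Matching per component means a bare directory name such as `drafts` excludes everything beneath it, which is one plausible reading of the example patterns above.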
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| directory_path | Yes | Absolute path to the directory. | |
| ignore | No | Optional list of glob patterns to ignore. | |
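A minimal sketch of assembling arguments that fit this schema follows. It assumes the caller wants to reject relative paths before sending the request, since `directory_path` is described as absolute. The function `build_index_arguments` is a hypothetical client-side helper, not something the tool provides, and how the resulting dictionary is sent to the tool depends on the client in use.

```python
import os


def build_index_arguments(directory_path, ignore=None):
    """Build an arguments dict for index_directory (hypothetical helper).

    Raises ValueError when directory_path is not absolute, because the
    schema describes it as an absolute path.
    """
    if not os.path.isabs(directory_path):
        raise ValueError(f"directory_path must be absolute: {directory_path!r}")
    arguments = {"directory_path": directory_path}
    # ignore is optional, so it is only included when patterns are given.
    if ignore:
        arguments["ignore"] = list(ignore)
    return arguments


# Example: index a folder named "docs" under the current working directory.
example = build_index_arguments(os.path.abspath("docs"), ignore=["drafts"])
print(example)
```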
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| result | Yes | | |