create_batch
Submit multiple queries for asynchronous batch processing at an approximately 50% cost discount. Queries run within 24 hours; results are retrieved after the batch completes.
Instructions
Submit a batch of queries for async processing (~50% cost discount).
Batches run asynchronously and can take up to 24 hours. Batch queries cannot use tools, so inline all relevant context in each prompt. Use get_batch or list_batches to check status, and call get_batch_results once the job completes.
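The check-status-then-fetch flow described above can be sketched as a small polling loop. This is only an illustration: the source does not document what get_batch returns, so the status strings ("pending", "running", "succeeded", "failed") and the `wait_for_batch` helper are hypothetical, and the status lookup is passed in as a callable rather than tied to any real client.

```python
import time
from typing import Callable


def wait_for_batch(
    get_status: Callable[[], str],
    is_finished: Callable[[str], bool],
    poll_seconds: float = 60.0,
    timeout_seconds: float = 24 * 60 * 60,  # batches may take up to 24 hours
    sleep: Callable[[float], None] = time.sleep,
) -> str:
    """Poll until is_finished(status) is true, then return that status.

    get_status stands in for a get_batch call; its return values are
    assumptions, not documented behaviour of the tool.
    """
    deadline = time.monotonic() + timeout_seconds
    while True:
        status = get_status()
        if is_finished(status):
            return status
        if time.monotonic() >= deadline:
            raise TimeoutError(f"batch still {status!r} after {timeout_seconds} seconds")
        sleep(poll_seconds)


# Stand-in for repeated get_batch calls (hypothetical status values).
fake_statuses = iter(["pending", "running", "succeeded"])
final = wait_for_batch(
    get_status=lambda: next(fake_statuses),
    is_finished=lambda s: s in {"succeeded", "failed"},
    sleep=lambda _seconds: None,  # skip real sleeping in this demo
)
print(final)
```

In real use, `get_status` would wrap whatever call your MCP client exposes for get_batch, and get_batch_results would be called only after the loop returns a finished status.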
The system prompt follows the same configuration as consult_gemini (config.toml, --system-prompt CLI flags).
Args:
  queries: List of {custom_id: str, prompt: str} dicts.
  model: Model alias or ID (default: "flash"). "pro" enables deep thinking.
  temperature: Sampling temperature (default: 0.2).
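As a rough illustration of the argument shape, the sketch below assembles a create_batch payload from a mapping of IDs to prompts, inlining the source text into each prompt because batch queries cannot use tools. The helper name `build_create_batch_args`, the sample documents, and the non-empty checks are assumptions made for this example; only the `queries`, `model`, and `temperature` fields and their defaults come from the description above.

```python
from typing import Any


def build_create_batch_args(
    prompts: dict[str, str],
    model: str = "flash",
    temperature: float = 0.2,
) -> dict[str, Any]:
    """Build the arguments dict for create_batch from {custom_id: prompt}.

    Dict keys are unique, so every custom_id appears exactly once.
    The emptiness checks are a local sanity check, not a documented rule.
    """
    if not prompts:
        raise ValueError("at least one query is required")
    queries = []
    for custom_id, prompt in prompts.items():
        if not prompt.strip():
            raise ValueError(f"empty prompt for custom_id {custom_id!r}")
        queries.append({"custom_id": custom_id, "prompt": prompt})
    return {"queries": queries, "model": model, "temperature": temperature}


# Hypothetical documents; the full text is inlined into each prompt.
documents = {
    "doc-1": "Batch jobs trade latency for cost.",
    "doc-2": "Results are fetched after the job completes.",
}
args = build_create_batch_args(
    {doc_id: f"Summarize in one sentence:\n\n{text}" for doc_id, text in documents.items()}
)
print(args["queries"][0]["custom_id"], args["model"], args["temperature"])
```

Keying the prompts by custom_id makes it straightforward to match each result back to its input once the batch finishes.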
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
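| queries | Yes | List of {custom_id: str, prompt: str} dicts. | |
| model | No | Model alias or ID. "pro" enables deep thinking. | flash |
| temperature | No | Sampling temperature. | 0.2 |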
Output Schema
| Name | Required | Description | Default |
|---|---|---|---|
| No arguments | | | |