Run a prediction on a Replicate deployment
replicate_run_deploymentRun a prediction on a deployment's current release, wait for completion, and automatically download outputs for immediate use.
Instructions
Run a prediction against a deployment's current release. WAITS for the prediction to finish and (by default) auto-downloads the outputs locally — same UX as the curated generate_* tools.
Args:
deployment: "owner/name" of the deployment to run.
input: model input parameters as a JSON object (same shape the deployment's underlying model expects).
download (default true): download output files locally.
timeout_ms (optional): max ms to wait before returning a pending result you can poll with replicate_get_prediction.
Returns the standard prediction result (inline image preview / text output, URLs, local_paths, prediction_id).
Input Schema
| Name | Required | Description | Default |
|---|---|---|---|
| input | No | Model input parameters as a JSON object — same shape the deployment's underlying model expects. | |
| download | No | Whether to download the generated files locally. Default true. When false, only Replicate URLs are returned (URLs expire after ~24h). | |
| deployment | Yes | Deployment to run, as "owner/name". Inspect it first with replicate_get_deployment. | |
| timeout_ms | No | Max ms to wait for the prediction. If exceeded, returns the prediction ID so you can poll via replicate_get_prediction. Default: 300000 (5min). |