create-endpoint

Create scalable GPU or CPU endpoints on RunPod by specifying template configurations, worker counts, and compute resources for deploying containerized applications.

Input Schema

Name	Required	Description
`computeType`	No	GPU or CPU endpoint
`dataCenterIds`	No	List of data centers
`gpuCount`	No	Number of GPUs per worker
`gpuTypeIds`	No	List of acceptable GPU types
`name`	No	Name for the endpoint
`templateId`	Yes	Template ID to use
`workersMax`	No	Maximum number of workers
`workersMin`	No	Minimum number of workers

Input Schema (JSON Schema)

{ "properties": { "computeType": { "description": "GPU or CPU endpoint", "enum": [ "GPU", "CPU" ], "type": "string" }, "dataCenterIds": { "description": "List of data centers", "items": { "type": "string" }, "type": "array" }, "gpuCount": { "description": "Number of GPUs per worker", "type": "number" }, "gpuTypeIds": { "description": "List of acceptable GPU types", "items": { "type": "string" }, "type": "array" }, "name": { "description": "Name for the endpoint", "type": "string" }, "templateId": { "description": "Template ID to use", "type": "string" }, "workersMax": { "description": "Maximum number of workers", "type": "number" }, "workersMin": { "description": "Minimum number of workers", "type": "number" } }, "required": [ "templateId" ], "type": "object" }

RunPod MCP Server

create-endpoint

Input Schema

Input Schema (JSON Schema)

Other Tools from RunPod MCP Server

Related Tools

MCP directory API