RunPod Jobs

Submit asynchronous RunPod job

Authorization

BearerAuth

AuthorizationBearer <token>

Use Authorization: Bearer <api-key>.

In: header

Path Parameters

surface*|||

Inference surface used to select the provider endpoint.

Query Parameters

endpointId?string

Optional provider endpoint override. If omitted, routing uses surface/model mapping.

Length1 <= length

Request Body

application/json

TypeScript Definitions

Use the request body type in TypeScript.

Response Body

`application/json`

curl -X POST "https://loading/v1/runpod/chat/run" \  -H "Content-Type: application/json" \  -d '{}'

{
  "id": "f3de27f8-61d5-4d58-aad1-a7d63f8a6e0f",
  "status": "IN_QUEUE"
}

Empty

Submit synchronous RunPod job

Authorization

BearerAuth

AuthorizationBearer <token>

Use Authorization: Bearer <api-key>.

In: header

Path Parameters

surface*|||

Inference surface used to select the provider endpoint.

Query Parameters

endpointId?string

Optional provider endpoint override. If omitted, routing uses surface/model mapping.

Length1 <= length

Request Body

application/json

TypeScript Definitions

Use the request body type in TypeScript.

Response Body

`application/json`

curl -X POST "https://loading/v1/runpod/chat/runsync" \  -H "Content-Type: application/json" \  -d '{}'

{
  "id": "f3de27f8-61d5-4d58-aad1-a7d63f8a6e0f",
  "status": "COMPLETED",
  "output": {
    "data": [
      {
        "embedding": [
          0.132,
          -0.018,
          0.004
        ]
      }
    ]
  }
}

Empty

Get RunPod job status

Authorization

BearerAuth

AuthorizationBearer <token>

Use Authorization: Bearer <api-key>.

In: header

Path Parameters

surface*|||

Inference surface used to select the provider endpoint.

jobId*string

Provider job identifier returned when a request is accepted.

Length1 <= length

Query Parameters

endpointId?string

Optional provider endpoint override. If omitted, routing uses surface/model mapping.

Length1 <= length

Response Body

curl -X GET "https://loading/v1/runpod/images/status/f3de27f8-61d5-4d58-aad1-a7d63f8a6e0f"

Empty

Stream RunPod job output

Authorization

BearerAuth

AuthorizationBearer <token>

Use Authorization: Bearer <api-key>.

In: header

Path Parameters

surface*|||

Inference surface used to select the provider endpoint.

jobId*string

Provider job identifier returned when a request is accepted.

Length1 <= length

Query Parameters

endpointId?string

Optional provider endpoint override. If omitted, routing uses surface/model mapping.

Length1 <= length

Response Body

curl -X GET "https://loading/v1/runpod/images/stream/f3de27f8-61d5-4d58-aad1-a7d63f8a6e0f"

Empty

Cancel RunPod job

Authorization

BearerAuth

AuthorizationBearer <token>

Use Authorization: Bearer <api-key>.

In: header

Path Parameters

surface*|||

Inference surface used to select the provider endpoint.

jobId*string

Provider job identifier returned when a request is accepted.

Length1 <= length

Query Parameters

endpointId?string

Optional provider endpoint override. If omitted, routing uses surface/model mapping.

Length1 <= length

Response Body

curl -X POST "https://loading/v1/runpod/images/cancel/f3de27f8-61d5-4d58-aad1-a7d63f8a6e0f"

Empty

Retry RunPod job

Authorization

BearerAuth

AuthorizationBearer <token>

Use Authorization: Bearer <api-key>.

In: header

Path Parameters

surface*|||

Inference surface used to select the provider endpoint.

jobId*string

Provider job identifier returned when a request is accepted.

Length1 <= length

Query Parameters

endpointId?string

Optional provider endpoint override. If omitted, routing uses surface/model mapping.

Length1 <= length

Response Body

curl -X POST "https://loading/v1/runpod/images/retry/f3de27f8-61d5-4d58-aad1-a7d63f8a6e0f"

Empty

Purge RunPod pending queue

Authorization

BearerAuth

AuthorizationBearer <token>

Use Authorization: Bearer <api-key>.

In: header

Path Parameters

surface*|||

Inference surface used to select the provider endpoint.

Query Parameters

endpointId?string

Optional provider endpoint override. If omitted, routing uses surface/model mapping.

Length1 <= length

Response Body

curl -X POST "https://loading/v1/runpod/chat/purge-queue"

Empty

Get RunPod endpoint health

Authorization

BearerAuth

AuthorizationBearer <token>

Use Authorization: Bearer <api-key>.

In: header

Path Parameters

surface*|||

Inference surface used to select the provider endpoint.

Query Parameters

endpointId?string

Optional provider endpoint override. If omitted, routing uses surface/model mapping.

Length1 <= length

Response Body

curl -X GET "https://loading/v1/runpod/chat/health"

Empty

Last updated on 21 March 2026

Submit asynchronous RunPod job

Authorization

BearerAuth

AuthorizationBearer <token>

Use Authorization: Bearer <api-key>.

In: header

Path Parameters

surface*|||

Inference surface used to select the provider endpoint.

Query Parameters

endpointId?string

Optional provider endpoint override. If omitted, routing uses surface/model mapping.

Length1 <= length

Request Body

application/json

TypeScript Definitions

Use the request body type in TypeScript.

endpointId?string

Explicit RunPod endpoint ID. Overrides default endpoint mapping when provided.

Length1 <= length

model?string

Model slug used for endpoint routing and cost estimation.

Value in

"AceStep_1_5_Turbo" | "Ben2" | "Bge_M3_INT8" | "Flux_2_Klein_4B_BF16" | "Ltx2_3_22B_Dist_INT8" | "Nanonets_Ocr_S_F16" | "Qwen3_TTS_12Hz_1_7B_CustomVoice" | "RealESRGAN_x4" | "WhisperLargeV3" | "ZImageTurbo_INT8"

worker_type?|

Optional worker type hint for dynamic pricing (active or flex).

input?

Provider input payload forwarded to RunPod as input.

webhook?string

Deprecated webhook alias. Prefer webhook_url.

Formaturi

webhook_url?string

HTTPS URL that receives signed status updates for this job.

Formaturi

policy?

Optional policy hints used by gateway routing and execution controls.

s3Config?

Optional object-storage output configuration for provider artifacts.

[key: string]?any

Response Body

`application/json`

curl -X POST "https://loading/v1/runpod/chat/run" \  -H "Content-Type: application/json" \  -d '{}'

{
  "id": "f3de27f8-61d5-4d58-aad1-a7d63f8a6e0f",
  "status": "IN_QUEUE"
}

Empty

Submit synchronous RunPod job

Authorization

BearerAuth

AuthorizationBearer <token>

Use Authorization: Bearer <api-key>.

In: header

Path Parameters

surface*|||

Inference surface used to select the provider endpoint.

Query Parameters

endpointId?string

Optional provider endpoint override. If omitted, routing uses surface/model mapping.

Length1 <= length

Request Body

application/json

TypeScript Definitions

Use the request body type in TypeScript.

endpointId?string

Explicit RunPod endpoint ID. Overrides default endpoint mapping when provided.

Length1 <= length

model?string

Model slug used for endpoint routing and cost estimation.

Value in

"AceStep_1_5_Turbo" | "Ben2" | "Bge_M3_INT8" | "Flux_2_Klein_4B_BF16" | "Ltx2_3_22B_Dist_INT8" | "Nanonets_Ocr_S_F16" | "Qwen3_TTS_12Hz_1_7B_CustomVoice" | "RealESRGAN_x4" | "WhisperLargeV3" | "ZImageTurbo_INT8"

worker_type?|

Optional worker type hint for dynamic pricing (active or flex).

input?

Provider input payload forwarded to RunPod as input.

webhook?string

Deprecated webhook alias. Prefer webhook_url.

Formaturi

webhook_url?string

HTTPS URL that receives signed status updates for this job.

Formaturi

policy?

Optional policy hints used by gateway routing and execution controls.

s3Config?

Optional object-storage output configuration for provider artifacts.

[key: string]?any

Response Body

`application/json`

curl -X POST "https://loading/v1/runpod/chat/runsync" \  -H "Content-Type: application/json" \  -d '{}'

{
  "id": "f3de27f8-61d5-4d58-aad1-a7d63f8a6e0f",
  "status": "COMPLETED",
  "output": {
    "data": [
      {
        "embedding": [
          0.132,
          -0.018,
          0.004
        ]
      }
    ]
  }
}

Empty

Get RunPod job status

Authorization

BearerAuth

AuthorizationBearer <token>

Use Authorization: Bearer <api-key>.

In: header

Path Parameters

surface*|||

Inference surface used to select the provider endpoint.

jobId*string

Provider job identifier returned when a request is accepted.

Length1 <= length

Query Parameters

endpointId?string

Optional provider endpoint override. If omitted, routing uses surface/model mapping.

Length1 <= length

Response Body

curl -X GET "https://loading/v1/runpod/images/status/f3de27f8-61d5-4d58-aad1-a7d63f8a6e0f"

Empty

Stream RunPod job output

Authorization

BearerAuth

AuthorizationBearer <token>

Use Authorization: Bearer <api-key>.

In: header

Path Parameters

surface*|||

Inference surface used to select the provider endpoint.

jobId*string

Provider job identifier returned when a request is accepted.

Length1 <= length

Query Parameters

endpointId?string

Optional provider endpoint override. If omitted, routing uses surface/model mapping.

Length1 <= length

Response Body

curl -X GET "https://loading/v1/runpod/images/stream/f3de27f8-61d5-4d58-aad1-a7d63f8a6e0f"

Empty

Cancel RunPod job

Authorization

BearerAuth

AuthorizationBearer <token>

Use Authorization: Bearer <api-key>.

In: header

Path Parameters

surface*|||

Inference surface used to select the provider endpoint.

jobId*string

Provider job identifier returned when a request is accepted.

Length1 <= length

Query Parameters

endpointId?string

Optional provider endpoint override. If omitted, routing uses surface/model mapping.

Length1 <= length

Response Body

curl -X POST "https://loading/v1/runpod/images/cancel/f3de27f8-61d5-4d58-aad1-a7d63f8a6e0f"

Empty

Retry RunPod job

Authorization

BearerAuth

AuthorizationBearer <token>

Use Authorization: Bearer <api-key>.

In: header

Path Parameters

surface*|||

Inference surface used to select the provider endpoint.

jobId*string

Provider job identifier returned when a request is accepted.

Length1 <= length

Query Parameters

endpointId?string

Optional provider endpoint override. If omitted, routing uses surface/model mapping.

Length1 <= length

Response Body

curl -X POST "https://loading/v1/runpod/images/retry/f3de27f8-61d5-4d58-aad1-a7d63f8a6e0f"

Empty

Purge RunPod pending queue

Authorization

BearerAuth

AuthorizationBearer <token>

Use Authorization: Bearer <api-key>.

In: header

Path Parameters

surface*|||

Inference surface used to select the provider endpoint.

Query Parameters

endpointId?string

Optional provider endpoint override. If omitted, routing uses surface/model mapping.

Length1 <= length

Response Body

curl -X POST "https://loading/v1/runpod/chat/purge-queue"

Empty

Get RunPod endpoint health

Authorization

BearerAuth

AuthorizationBearer <token>

Use Authorization: Bearer <api-key>.

In: header

Path Parameters

surface*|||

Inference surface used to select the provider endpoint.

Query Parameters

endpointId?string

Optional provider endpoint override. If omitted, routing uses surface/model mapping.

Length1 <= length

Response Body

curl -X GET "https://loading/v1/runpod/chat/health"

Empty

Last updated on 21 March 2026

RunPod Jobs

200application/json

400

401

429

500

502

200application/json

400

401

429

500

502

200

400

401

404

429

500

502

200

400

401

429

500

502

200

400

401

404

429

500

502

200

400

401

404

429

500

502

200

400

401

429

500

502

200

400

401

429

500

502

RunPod Jobs

200application/json

400

401

429

500

502

200application/json

400

401

429

500

502

200

400

401

404

429

500

502

200

400

401

429

500

502

200

400

`application/json`

`application/json`

`application/json`

`application/json`