OpenAI Compatible
OpenAI/OpenRouter-style inference enqueue endpoints.
Queue a chat completion request
Authorization
BearerAuth (in: header). Send the API key as Authorization: Bearer <api-key>.
Request Body
application/json
Response Body
application/json
application/json
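For callers who prefer Python to curl, the request for this endpoint can be sketched with the standard library alone. This is a minimal illustration, not an official client: `BASE_URL`, `build_enqueue_request`, and `send` are hypothetical names, the host is the placeholder that appears in this page's curl examples, and the only body field used is the `messages` array shown on this page.

```python
import json
import urllib.request

# Placeholder host copied from this page's curl examples (assumption):
# replace it with the base URL of your own deployment.
BASE_URL = "https://loading"


def build_enqueue_request(path: str, payload: dict, api_key: str) -> urllib.request.Request:
    """Build (but do not send) an authenticated JSON POST request."""
    return urllib.request.Request(
        url=BASE_URL + path,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            # BearerAuth, as described in the Authorization section.
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> dict:
    """Send the request and decode the JSON body of a successful response."""
    with urllib.request.urlopen(request) as response:
        return json.load(response)


# Build the chat enqueue request; call send(chat_request) to actually submit it.
chat_request = build_enqueue_request(
    "/v1/chat/completions",
    {"messages": [{"role": "user", "content": "Summarize this document in 5 bullets."}]},
    api_key="<api-key>",
)
```

Keeping request construction separate from sending makes the headers and body easy to inspect before anything goes over the network. The same helper should work for the other enqueue paths on this page by changing `path` and `payload`.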
curl -X POST "https://loading/v1/chat/completions" \
  -H "Authorization: Bearer <api-key>" \
  -H "Content-Type: application/json" \
  -d '{
    "messages": [
      {
        "role": "user",
        "content": "Summarize this document in 5 bullets."
      }
    ]
  }'

{
  "id": "f3de27f8-61d5-4d58-aad1-a7d63f8a6e0f",
  "object": "chat.completion.enqueue",
  "created": 1773504000,
  "status": "queued",
  "surface": "chat",
  "model": "Llama3_8B_Instruct",
  "endpoint_id": "o6r4i5q9j8k7l6",
  "runpod": {
    "id": "f3de27f8-61d5-4d58-aad1-a7d63f8a6e0f",
    "status": "IN_QUEUE"
  }
}

{
  "error": {
    "code": "missing_endpoint_id",
    "message": "No RunPod endpoint configured for surface chat"
  }
}

Queue an image generation request
Authorization
BearerAuth (in: header). Send the API key as Authorization: Bearer <api-key>.
Request Body
application/json
Response Body
application/json
curl -X POST "https://loading/v1/images/generations" \
  -H "Authorization: Bearer <api-key>" \
  -H "Content-Type: application/json" \
  -d '{
    "prompt": "A moody cyberpunk alley at night with rain reflections, 35mm film look"
  }'

{
  "id": "f3de27f8-61d5-4d58-aad1-a7d63f8a6e0f",
  "object": "image.generation.enqueue",
  "created": 1773504000,
  "status": "queued",
  "surface": "images",
  "model": "Flux1schnell",
  "endpoint_id": "o6r4i5q9j8k7l6",
  "runpod": {
    "id": "f3de27f8-61d5-4d58-aad1-a7d63f8a6e0f",
    "status": "IN_QUEUE"
  }
}

Queue an audio transcription request
Authorization
BearerAuth (in: header). Send the API key as Authorization: Bearer <api-key>.
Request Body
application/json
Response Body
application/json
curl -X POST "https://loading/v1/audio/transcriptions" \
  -H "Authorization: Bearer <api-key>" \
  -H "Content-Type: application/json" \
  -d '{
    "audioUrl": "https://cdn.example.com/audio/customer-call-2026-03-15.mp3"
  }'

{
  "id": "f3de27f8-61d5-4d58-aad1-a7d63f8a6e0f",
  "object": "audio.transcription.enqueue",
  "created": 1773504000,
  "status": "queued",
  "surface": "transcribe",
  "model": "WhisperLargeV3",
  "endpoint_id": "o6r4i5q9j8k7l6",
  "runpod": {
    "id": "f3de27f8-61d5-4d58-aad1-a7d63f8a6e0f",
    "status": "IN_QUEUE"
  }
}

Queue an embeddings request
Authorization
BearerAuth (in: header). Send the API key as Authorization: Bearer <api-key>.
Request Body
application/json
Response Body
application/json
curl -X POST "https://loading/v1/embeddings" \
  -H "Authorization: Bearer <api-key>" \
  -H "Content-Type: application/json" \
  -d '{
    "input": "How to optimize cold starts for serverless GPUs"
  }'

{
  "id": "f3de27f8-61d5-4d58-aad1-a7d63f8a6e0f",
  "object": "embedding.enqueue",
  "created": 1773504000,
  "status": "queued",
  "surface": "embeddings",
  "model": "BGE_Large",
  "endpoint_id": "o6r4i5q9j8k7l6",
  "runpod": {
    "id": "f3de27f8-61d5-4d58-aad1-a7d63f8a6e0f",
    "status": "IN_QUEUE"
  }
}
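
The sample responses on this page share one envelope: top-level `id`, `object`, `created`, `status`, `surface`, `model`, and `endpoint_id`, plus a nested `runpod` object. The Python sketch below shows one way a client might parse that envelope and surface the error shape from the chat example. `EnqueueResult`, `EnqueueError`, and `parse_enqueue_response` are hypothetical names. The field list is taken only from the samples here, so treating every field as always present, and assuming the other endpoints return the same error shape as chat, are both assumptions.

```python
import json
from dataclasses import dataclass


class EnqueueError(Exception):
    """Raised when the response body carries an "error" object."""

    def __init__(self, code: str, message: str):
        super().__init__(f"{code}: {message}")
        self.code = code
        self.message = message


@dataclass
class EnqueueResult:
    # Fields mirror the sample responses on this page (assumed always present).
    id: str
    object: str
    created: int
    status: str
    surface: str
    model: str
    endpoint_id: str
    runpod_id: str
    runpod_status: str


def parse_enqueue_response(body: str) -> EnqueueResult:
    """Parse a JSON response body into an EnqueueResult or raise EnqueueError."""
    data = json.loads(body)
    if "error" in data:
        err = data["error"]
        raise EnqueueError(err["code"], err["message"])
    return EnqueueResult(
        id=data["id"],
        object=data["object"],
        created=data["created"],
        status=data["status"],
        surface=data["surface"],
        model=data["model"],
        endpoint_id=data["endpoint_id"],
        runpod_id=data["runpod"]["id"],
        runpod_status=data["runpod"]["status"],
    )


# Sample body copied from the embeddings example on this page.
sample = """{
  "id": "f3de27f8-61d5-4d58-aad1-a7d63f8a6e0f",
  "object": "embedding.enqueue",
  "created": 1773504000,
  "status": "queued",
  "surface": "embeddings",
  "model": "BGE_Large",
  "endpoint_id": "o6r4i5q9j8k7l6",
  "runpod": {"id": "f3de27f8-61d5-4d58-aad1-a7d63f8a6e0f", "status": "IN_QUEUE"}
}"""

result = parse_enqueue_response(sample)
print(result.status, result.runpod_status)  # prints: queued IN_QUEUE
```

In every sample the response reports `"status": "queued"` and a `runpod` status of `IN_QUEUE`, with no completion, image, transcript, or embedding data. A client should therefore treat these calls as job submission and keep the returned `id` for later tracking.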