Image-to-Text (OCR)

Technical documentation for dryAPI APIs, integration guides, and operational references.

Endpoint for requesting image-to-text (OCR) inference.

NOTE

Prerequisite: To ensure a successful request, you must first consult the Model Selection endpoint to identify a valid model slug, check specific limits and features, and verify LoRA availability.

OpenAPI

Request Img2 Txt

POST /api/v1/client/img2txt

Authorization

bearerAuth
Authorization: Bearer <token>

In: header

Header Parameters

Accept* (string)
Default: "application/json"
Value in: "application/json"

Request Body

multipart/form-data

Image to text conversion parameters

Response Body

application/json

curl -X POST "https://api.dryapi.dev/api/v1/client/img2txt" \
  -H "Authorization: Bearer <token>" \
  -H "Accept: application/json" \
  -F image="string" \
  -F model="Nanonets_Ocr_S_F16"
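For callers who prefer not to shell out to curl, the same request can be sketched in Python using only the standard library. This is a minimal sketch under assumptions, not an official client: the helper name `build_img2txt_request` is hypothetical, and it assumes the `image` field is sent as a file part of the multipart body, which the parameter description suggests but the curl sample does not confirm. The function only builds the request; it does not send it.

```python
import uuid
import urllib.request

API_URL = "https://api.dryapi.dev/api/v1/client/img2txt"


def build_img2txt_request(token, image_name, image_bytes, model, extra_fields=None):
    """Build (but do not send) a multipart/form-data POST for the img2txt endpoint.

    Hypothetical helper. Assumes the image is uploaded as a file part.
    """
    boundary = uuid.uuid4().hex
    fields = {"model": model, **(extra_fields or {})}

    parts = []
    # Plain text form fields (model plus any optional parameters).
    for name, value in fields.items():
        parts.append(
            f'--{boundary}\r\n'
            f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
            f'{value}\r\n'.encode()
        )
    # File part carrying the raw image bytes under the "image" field name.
    parts.append(
        (
            f'--{boundary}\r\n'
            f'Content-Disposition: form-data; name="image"; filename="{image_name}"\r\n'
            f'Content-Type: application/octet-stream\r\n\r\n'
        ).encode()
        + image_bytes
        + b"\r\n"
    )
    # Closing boundary terminates the multipart body.
    parts.append(f"--{boundary}--\r\n".encode())

    headers = {
        "Authorization": f"Bearer {token}",
        "Accept": "application/json",
        "Content-Type": f"multipart/form-data; boundary={boundary}",
    }
    return urllib.request.Request(
        API_URL, data=b"".join(parts), headers=headers, method="POST"
    )
```

The returned object can be passed to `urllib.request.urlopen` and the response body decoded with `json.load`. Keeping construction separate from sending makes the request easy to inspect before any quota is spent.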
{
  "data": {
    "request_id": "c08a339c-73e5-4d67-a4d5-231302fbff9a"
  }
}
{
  "data": {},
  "message": "string",
  "errors": [
    null
  ],
  "statusCode": 0
}
{
  "data": {},
  "message": "string",
  "errors": [
    null
  ],
  "statusCode": 0
}
{
  "message": "The selected model does not support Text To Image.",
  "errors": {
    "model": [
      "The selected model does not support Text To Image."
    ]
  }
}
{
  "message": "Too Many Attempts."
}
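The sample bodies above come in three shapes: a success body carrying `data.request_id`, an error body whose `errors` is an object keyed by field name, and a body with only a `message`. A minimal sketch of telling them apart follows. The function name `summarize_response` and the labels `"accepted"`, `"invalid"`, and `"error"` are hypothetical, chosen for illustration; they are not part of the API. The sketch inspects only the JSON body, because the samples do not say which HTTP status each shape is returned with.

```python
def summarize_response(payload):
    """Classify a decoded JSON body from the img2txt endpoint.

    Returns a (label, detail) tuple. Labels are illustrative, not API values.
    """
    data = payload.get("data")
    # Success shape: {"data": {"request_id": "..."}}
    if isinstance(data, dict) and "request_id" in data:
        return ("accepted", data["request_id"])

    errors = payload.get("errors")
    # Validation shape: {"errors": {"field": ["message", ...]}}
    if isinstance(errors, dict) and errors:
        detail = "; ".join(
            f"{field}: {' '.join(str(m) for m in messages)}"
            for field, messages in errors.items()
        )
        return ("invalid", detail)

    # Anything else: fall back to the top-level message, if present.
    return ("error", payload.get("message", "unknown error"))
```

Applied to the sample bodies, the first yields the request ID, the model validation body yields a per-field summary, and the "Too Many Attempts." body falls through to the generic case.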
Last updated on 21 March 2026

Request Body Parameters

image* (string)
Image file to extract text from. Supported formats: JPG, JPEG, PNG, GIF, BMP, WebP. Maximum file size: 10 MB.

model* (string)
The OCR model to use for text extraction.

language (optional)
Language code for OCR processing.

format (optional)
Output format for extracted text.
Value in: "text" | "json"

return_result_in_response (optional)
If true, the result is returned directly in the response instead of only as a download URL.
Default: false

webhook_url (optional)
HTTPS URL to receive webhook notifications for job status changes (processing, completed, failed). Must be HTTPS. Max 2048 characters. See Webhook Documentation for payload structure and authentication details.
Format: uri
Length: <= 2048
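The constraints listed for these parameters can be checked on the client before a request is sent. The sketch below is one way to do that under stated assumptions: `validate_img2txt_params` is a hypothetical helper, "10 MB" is read as 10 × 1024 × 1024 bytes, the image format is judged by file extension alone, and the boolean is encoded as the string `"true"`. None of these choices is confirmed by the reference above.

```python
from urllib.parse import urlparse

SUPPORTED_EXTENSIONS = {"jpg", "jpeg", "png", "gif", "bmp", "webp"}
MAX_IMAGE_BYTES = 10 * 1024 * 1024  # assumption: "10 MB" means 10 MiB
MAX_WEBHOOK_URL_LENGTH = 2048
ALLOWED_FORMATS = {"text", "json"}


def validate_img2txt_params(image_name, image_size, model, language=None,
                            fmt=None, return_result_in_response=False,
                            webhook_url=None):
    """Check documented limits and return the non-file form fields.

    Hypothetical helper. Raises ValueError listing every problem found.
    """
    problems = []

    # Format check by extension only; the file contents are not inspected.
    ext = image_name.rsplit(".", 1)[-1].lower() if "." in image_name else ""
    if ext not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported image extension: {ext or '(none)'}")
    if image_size > MAX_IMAGE_BYTES:
        problems.append("image exceeds the 10 MB limit")
    if not model:
        problems.append("model is required")
    if fmt is not None and fmt not in ALLOWED_FORMATS:
        problems.append('format must be "text" or "json"')
    if webhook_url is not None:
        if len(webhook_url) > MAX_WEBHOOK_URL_LENGTH:
            problems.append("webhook_url exceeds 2048 characters")
        if urlparse(webhook_url).scheme != "https":
            problems.append("webhook_url must use HTTPS")

    if problems:
        raise ValueError("; ".join(problems))

    # Only include optional fields the caller actually set.
    fields = {"model": model}
    if language:
        fields["language"] = language
    if fmt:
        fields["format"] = fmt
    if return_result_in_response:
        fields["return_result_in_response"] = "true"  # assumed encoding
    if webhook_url:
        fields["webhook_url"] = webhook_url
    return fields
```

Collecting every problem before raising lets a caller fix all of them in one pass rather than discovering them one request at a time.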