llms.txtllms-full.txt
DashboardStatusGet API Key
IntroductionQuickstartModelsPricingArchitecture & SecurityLimits & Quotas
Execution Modes & HTTP QueueWebhooksWebSocketsMCP Servern8n Integrationn8n dryAPI node
API OverviewErrorsText-to-ImagePOSTText-to-Image Price CalculationPOSTText-to-VideoPOSTText-to-Video Price CalculationPOSTImage-to-VideoPOSTImage-to-Video Price CalculationPOSTAudio-to-VideoPOSTAudio-to-Video Price CalculationPOSTText-to-Speech (TTS)POSTText-to-Speech Price CalculationPOSTText-to-MusicPOSTText-to-Music Price CalculationPOSTText-to-EmbeddingPOSTText-to-Embedding Price CalculationPOSTImage-to-ImagePOSTImage-to-Image Price CalculationPOSTImage Background RemovalPOSTImage Background Removal Price CalculationPOSTImage UpscalePOSTImage Upscale Price CalculationPOST
OpenAPI
SDKs & IntegrationsPayment MethodsFAQ — Frequently Asked QuestionsSupport & Contact
dAdryAPI
DashboardStatusGet API Key
API
Technical Reference

Text-to-Music

Technical documentation for dryAPI APIs, integration guides, and operational references.

Endpoint for requesting text2music inference

Text-to-Music generates music tracks from text descriptions. You can control genre, tempo, key, time signature, and even provide lyrics. Optionally upload a reference_audio file for style transfer — the model will use it as a stylistic reference for the generated track. The endpoint returns a task ID to track processing status. Ideal for apps needing automated music creation — background tracks, jingles, or full songs with vocals.

NOTE

Prerequisite: To ensure a successful request, you must first consult the Model Selection endpoint to identify a valid model slug and check specific limits.

WARNING

Reference audio requirements (optional):

  • Supported formats: MP3, WAV, FLAC, OGG, M4A
  • Maximum file size: 10 MB
  • Duration must be within model-specific limits

This endpoint uses multipart/form-data content type to support file uploads.

OpenAPI

Request Txt2 Music

POST
/api/v1/client/txt2music

Authorization

bearerAuth
AuthorizationBearer <token>

In: header

Header Parameters

Accept*string
Default"application/json"
Value in"application/json"

Request Body

multipart/form-data

Music generation parameters

TypeScript Definitions

Use the request body type in TypeScript.

Response Body

application/json

application/json

application/json

application/json

application/json

curl -X POST "https://api.dryapi.dev/api/v1/client/txt2music" \  -H "Accept: application/json" \  -F caption="upbeat electronic dance music with energetic synths" \  -F model="ACE-Step-v1.5-turbo" \  -F lyrics="[Instrumental]" \  -F duration="30" \  -F inference_steps="8" \  -F guidance_scale="7" \  -F seed="-1" \  -F format="flac"
{
  "data": {
    "request_id": "c08a339c-73e5-4d67-a4d5-231302fbff9a"
  }
}
{
  "data": {},
  "message": "string",
  "errors": [
    null
  ],
  "statusCode": 0
}
{
  "data": {},
  "message": "string",
  "errors": [
    null
  ],
  "statusCode": 0
}
{
  "message": "The selected model does not support Text To Image.",
  "errors": {
    "model": [
      "The selected model does not support Text To Image."
    ]
  }
}
{
  "message": "Too Many Attempts."
}
Last updated on 21 March 2026

POST Text-to-Speech Price Calculation

Previous Page

POST Text-to-Music Price Calculation

Next Page

caption*string

Text description of the music to generate

model*string

The model to use for music generation. Available models can be retrieved via the GET /api/v1/client/models endpoint.

lyrics*string

Lyrics for the music. Use "[Instrumental]" for instrumental tracks without vocals.

duration*number

Duration in seconds (10-600)

bpm?|

Beats per minute (30-300)

keyscale?|

Musical key/scale (e.g. "C major", "F# minor")

timesignature?|

Time signature. Must be 2, 3, 4, or 6.

vocal_language?|

Language code for vocals (e.g. "en", "es", "fr")

inference_steps*integer

Number of diffusion inference steps (1-100). Use 8 for turbo models, 32+ for base models.

guidance_scale*number

Classifier-free guidance scale (0-20)

seed*integer

Random seed. Use -1 for random.

format*string

Audio output format

reference_audio?|

Optional reference audio file for style transfer. Supported formats: mp3, wav, flac, ogg, m4a. Max size configurable (default 10MB). Duration must be within model-specific limits.

Formatbinary
webhook_url?|

Optional HTTPS URL to receive webhook notifications for job status changes (processing, completed, failed). Must be HTTPS. Max 2048 characters.

Formaturi
Lengthlength <= 2048