Text-to-Music

Endpoint for requesting text2music inference

Text-to-Music generates music tracks from text descriptions. You can control genre, tempo, key, and time signature, and you can supply lyrics. Optionally upload a reference_audio file for style transfer; the model uses it as a stylistic reference for the generated track. The endpoint returns a task ID (the request_id field in the response) that you use to track processing status. It suits apps that need automated music creation, such as background tracks, jingles, or full songs with vocals.

NOTE

Prerequisite: Before sending a request, consult the Model Selection endpoint to find a valid model slug and check that model's limits.

WARNING

Reference audio requirements (optional):

Supported formats: MP3, WAV, FLAC, OGG, M4A

Maximum file size: 10 MB

Duration must be within model-specific limits

This endpoint uses multipart/form-data content type to support file uploads.
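The format and size requirements above can be checked on the client before uploading. The following is a minimal sketch, not part of the API: the function name `check_reference_audio` is hypothetical, the format is inferred from the file extension only, and "10 MB" is assumed to mean 10 × 1024 × 1024 bytes. The duration limit is model-specific, so it is not checked here.

```python
import os

# Limits copied from the reference audio requirements above.
SUPPORTED_EXTENSIONS = {".mp3", ".wav", ".flac", ".ogg", ".m4a"}
MAX_SIZE_BYTES = 10 * 1024 * 1024  # assumption: "10 MB" means 10 MiB


def check_reference_audio(filename: str, size_bytes: int) -> list[str]:
    """Return a list of problems; an empty list means these checks passed."""
    problems = []
    # Heuristic: judge the format by file extension, case-insensitively.
    extension = os.path.splitext(filename)[1].lower()
    if extension not in SUPPORTED_EXTENSIONS:
        problems.append(f"unsupported format: {extension or '(none)'}")
    if size_bytes > MAX_SIZE_BYTES:
        problems.append(f"file is {size_bytes} bytes; limit is {MAX_SIZE_BYTES}")
    return problems
```

In practice the size would come from something like `os.path.getsize(path)`. The server remains the authority on what it accepts; this only catches obvious mistakes early.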

OpenAPI

Request Txt2 Music

Authorization

bearerAuth

Authorization: Bearer <token>

In: header

Header Parameters

Accept (required, string)

Default: "application/json"

Allowed value: "application/json"

Request Body

multipart/form-data

Music generation parameters


Response Body

`application/json`

curl -X POST "https://api.dryapi.dev/api/v1/client/txt2music" \
  -H "Accept: application/json" \
  -H "Authorization: Bearer <token>" \
  -F caption="upbeat electronic dance music with energetic synths" \
  -F model="ACE-Step-v1.5-turbo" \
  -F lyrics="[Instrumental]" \
  -F duration="30" \
  -F inference_steps="8" \
  -F guidance_scale="7" \
  -F seed="-1" \
  -F format="flac"
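For callers not using curl, the same request can be assembled in Python. This is a rough sketch using only the standard library; `encode_multipart`, `build_request`, and the `YOUR_TOKEN` placeholder are hypothetical names, and only text fields are encoded (a reference_audio upload would need an extra file part). The request is built but not sent.

```python
import urllib.request
import uuid

API_URL = "https://api.dryapi.dev/api/v1/client/txt2music"


def encode_multipart(fields: dict[str, str]) -> tuple[bytes, str]:
    """Encode text fields as multipart/form-data; return (body, content_type)."""
    boundary = uuid.uuid4().hex
    parts = []
    for name, value in fields.items():
        parts.append(
            f"--{boundary}\r\n"
            f'Content-Disposition: form-data; name="{name}"\r\n\r\n'
            f"{value}\r\n"
        )
    parts.append(f"--{boundary}--\r\n")  # closing boundary
    body = "".join(parts).encode("utf-8")
    return body, f"multipart/form-data; boundary={boundary}"


def build_request(token: str, fields: dict[str, str]) -> urllib.request.Request:
    """Build (but do not send) the POST request with bearer authorization."""
    body, content_type = encode_multipart(fields)
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Accept": "application/json",
            "Authorization": f"Bearer {token}",
            "Content-Type": content_type,
        },
    )


# Same field values as the curl example.
fields = {
    "caption": "upbeat electronic dance music with energetic synths",
    "model": "ACE-Step-v1.5-turbo",
    "lyrics": "[Instrumental]",
    "duration": "30",
    "inference_steps": "8",
    "guidance_scale": "7",
    "seed": "-1",
    "format": "flac",
}
request = build_request("YOUR_TOKEN", fields)  # placeholder token
```

Passing `request` to `urllib.request.urlopen` would send it. An HTTP client library with built-in multipart support would shorten this considerably; the manual encoding is shown only to keep the sketch dependency-free.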

200

{
  "data": {
    "request_id": "c08a339c-73e5-4d67-a4d5-231302fbff9a"
  }
}

401

{
  "data": {},
  "message": "string",
  "errors": [
    null
  ],
  "statusCode": 0
}

404

{
  "data": {},
  "message": "string",
  "errors": [
    null
  ],
  "statusCode": 0
}

422

{
  "message": "The selected model does not support Text To Image.",
  "errors": {
    "model": [
      "The selected model does not support Text To Image."
    ]
  }
}

429

{
  "message": "Too Many Attempts."
}
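The response bodies above fall into two shapes: a success body carrying data.request_id, and error bodies carrying a message and sometimes an errors field. A client might handle both as sketched below. The names `Txt2MusicError` and `parse_response` are hypothetical, and the sketch assumes the caller passes in the HTTP status code and that 200 is the only success status.

```python
import json


class Txt2MusicError(Exception):
    """Hypothetical exception carrying the HTTP status and the API message."""

    def __init__(self, status: int, message: str, errors=None):
        super().__init__(f"{status}: {message}")
        self.status = status
        self.message = message
        self.errors = errors  # field-level details, when the API provides them


def parse_response(status: int, body: str) -> str:
    """Return the request_id from a success body, or raise Txt2MusicError."""
    payload = json.loads(body)
    if status == 200:
        return payload["data"]["request_id"]
    raise Txt2MusicError(status, payload.get("message", ""), payload.get("errors"))
```

The returned request_id is the value to keep for tracking the job's processing status.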

Last updated on 21 March 2026

Request Body Parameters

multipart/form-data

caption (required, string)

Text description of the music to generate.

model (required, string)

The model to use for music generation. Available models can be retrieved via the GET /api/v1/client/models endpoint.

lyrics (required, string)

Lyrics for the music. Use "[Instrumental]" for instrumental tracks without vocals.

duration (required, number)

Duration in seconds (10-600).

bpm (optional)

Beats per minute (30-300).

keyscale (optional)

Musical key/scale (e.g. "C major", "F# minor").

timesignature (optional)

Time signature. Must be 2, 3, 4, or 6.

vocal_language (optional)

Language code for vocals (e.g. "en", "es", "fr").

inference_steps (required, integer)

Number of diffusion inference steps (1-100). Use 8 for turbo models, 32+ for base models.

guidance_scale (required, number)

Classifier-free guidance scale (0-20).

seed (required, integer)

Random seed. Use -1 for random.

format (required, string)

Audio output format.

reference_audio (optional)

Optional reference audio file for style transfer. Supported formats: mp3, wav, flac, ogg, m4a. Max size configurable (default 10MB). Duration must be within model-specific limits.

Format: binary

webhook_url (optional)

Optional HTTPS URL to receive webhook notifications for job status changes (processing, completed, failed). Must be HTTPS. Max 2048 characters.

Format: uri

Length: <= 2048
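The constraints listed for these parameters can be checked on the client before a request is sent. The sketch below is illustrative only: `validate_params` is a hypothetical helper, the numeric ranges are assumed to be inclusive, and values are assumed to be Python numbers rather than form strings. Model-specific limits are not covered.

```python
from urllib.parse import urlparse

# Fields marked as required in the parameter list above.
REQUIRED = ("caption", "model", "lyrics", "duration",
            "inference_steps", "guidance_scale", "seed", "format")

# Documented ranges; treated as inclusive (an assumption).
RANGES = {
    "duration": (10, 600),
    "bpm": (30, 300),
    "inference_steps": (1, 100),
    "guidance_scale": (0, 20),
}


def validate_params(params: dict) -> list[str]:
    """Return a list of problems; an empty list means these checks passed."""
    problems = [f"{name} is required" for name in REQUIRED if name not in params]

    for name, (low, high) in RANGES.items():
        if name in params and not low <= params[name] <= high:
            problems.append(f"{name} must be between {low} and {high}")

    if "timesignature" in params and params["timesignature"] not in (2, 3, 4, 6):
        problems.append("timesignature must be 2, 3, 4, or 6")

    url = params.get("webhook_url")
    if url is not None:
        if urlparse(url).scheme != "https":
            problems.append("webhook_url must use HTTPS")
        if len(url) > 2048:
            problems.append("webhook_url must be at most 2048 characters")
    return problems
```

Returning a list rather than raising on the first failure lets a caller report every problem at once, similar in spirit to the per-field errors in the 422 response.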


Response status codes: 200, 401, 404, 422, 429 (each returns application/json)
