This guide uses the v2 endpoints and your API key.
Get an API key
Create an account and copy your API key from the dashboard.
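The commands in this guide read the key from an environment variable named AICOUSTICS_API_KEY. A minimal sketch of setting it and failing early when it is missing; the placeholder value and the helper name require_api_key are illustrative and not part of the API:

```shell
# Placeholder value: replace with the key copied from the dashboard.
export AICOUSTICS_API_KEY="your-api-key-here"

# Hypothetical helper: returns non-zero with a message if the key is unset or empty.
require_api_key() {
  if [ -z "${AICOUSTICS_API_KEY:-}" ]; then
    echo "AICOUSTICS_API_KEY is not set" >&2
    return 1
  fi
}
```

Calling require_api_key at the top of a script stops it before any request is sent without credentials.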
Upload media for enhancement
Send a media file (audio or video) and optional enhancement parameters.
curl -sS -X POST "https://api.ai-coustics.io/v2/medias" \
-H "X-API-Key: $AICOUSTICS_API_KEY" \
-F file=@"./example.wav" \
-F enhancement='{ "loudness_target": -14, "true_peak": -1, "enhancement_level": 100, "enhancement_model": "LARK_V2", "transcode": "WAV" }'
Example response:
{
"uid": "med_7f6c7b1c2d",
"uploaded_at": "2025-09-11T10:30:00Z",
"file_name": "example.wav",
"file_size": 4821934,
"audio_codec": "PCM",
"video_codec": null,
"duration": 12345,
"enhancement_level": 100,
"enhancement_model": "LARK_V2",
"enhancement_status": "PROCESSING",
"enhancement_params": { "loudness_target": -14, "true_peak": -1, "transcode": "WAV" }
}
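The uid in this response identifies the media in the following steps. A minimal sketch of extracting it with sed; the helper name json_string_field is hypothetical, and the pattern only handles simple string fields like the ones shown here (a JSON-aware tool such as jq is more robust where it is installed):

```shell
# Hypothetical helper: print the value of a string field from JSON on stdin.
# Assumes the value contains no escaped quotes; this is not a general JSON parser.
json_string_field() {
  sed -n 's/.*"'"$1"'"[[:space:]]*:[[:space:]]*"\([^"]*\)".*/\1/p' | head -n 1
}

# Example using a shortened copy of the response above.
response='{ "uid": "med_7f6c7b1c2d", "enhancement_status": "PROCESSING" }'

# Stored as MEDIA_UID rather than UID, because bash treats UID as a read-only variable.
MEDIA_UID=$(printf '%s\n' "$response" | json_string_field uid)
echo "$MEDIA_UID"   # prints med_7f6c7b1c2d
```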
Supported audio codecs include PCM, MPEG Audio, Vorbis, Opus, AAC LC, and FLAC. The maximum file size is 512 MB, and the maximum audio duration is 120 minutes. See the API Reference for details.
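Checking the file size locally before uploading avoids sending a file the API would reject. A sketch, assuming 512 MB means 512 * 1024 * 1024 bytes (this guide does not say whether the limit is binary or decimal); the variable and function names are hypothetical:

```shell
# Assumed limit in bytes. Adjust if the API counts decimal megabytes.
MAX_UPLOAD_BYTES=$((512 * 1024 * 1024))

# Hypothetical helper: succeeds only if the file exists and is within the limit.
check_upload_size() {
  [ -f "$1" ] || { echo "no such file: $1" >&2; return 1; }
  # wc -c prints the byte count; arithmetic expansion strips any padding spaces.
  size=$(( $(wc -c < "$1") ))
  if [ "$size" -gt "$MAX_UPLOAD_BYTES" ]; then
    echo "$1 is $size bytes, over the $MAX_UPLOAD_BYTES byte limit" >&2
    return 1
  fi
}
```

For example, check_upload_size ./example.wav can run before the upload command, with the upload skipped when it returns non-zero. The duration limit is not checked here.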
Poll metadata until processing completes
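Use the uid from the upload response to check status. The variable is named MEDIA_UID here because bash reserves UID as a read-only variable holding the numeric user ID.
curl -sS -X GET "https://api.ai-coustics.io/v2/medias/$MEDIA_UID/metadata" \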
-H "X-API-Key: $AICOUSTICS_API_KEY"
Example response:
{
"uid": "med_7f6c7b1c2d",
"enhancement_status": "COMPLETED",
"enhancement_model": "LARK_V2",
"enhancement_level": 100,
"duration": 12345,
"enhancement_params": { "loudness_target": -14, "true_peak": -1, "transcode": "WAV" }
}
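The polling step can be wrapped in a loop. A minimal sketch, not an official client: the function names, the 5-second interval, and the 120-attempt cap are assumptions. Only the PROCESSING and COMPLETED values shown in this guide are recognized; any other status stops the loop rather than guessing its meaning.

```shell
# Fetch the metadata JSON for one media item (same request as above).
# --fail makes curl exit non-zero on HTTP errors instead of printing the error body.
fetch_metadata() {
  curl -sS --fail "https://api.ai-coustics.io/v2/medias/$1/metadata" \
    -H "X-API-Key: $AICOUSTICS_API_KEY"
}

# Hypothetical helper: poll until COMPLETED, an unexpected status, or a timeout.
# Usage: wait_for_enhancement <uid> [interval_seconds] [max_attempts]
# Returns 0 on COMPLETED, 1 on timeout, 2 on an unexpected or missing status.
wait_for_enhancement() {
  media_uid=$1
  interval=${2:-5}
  max_attempts=${3:-120}
  attempt=0
  while [ "$attempt" -lt "$max_attempts" ]; do
    # Crude extraction of the status string; jq would be more robust.
    status=$(fetch_metadata "$media_uid" |
      sed -n 's/.*"enhancement_status"[[:space:]]*:[[:space:]]*"\([^"]*\)".*/\1/p' |
      head -n 1)
    case $status in
      COMPLETED) return 0 ;;
      PROCESSING) ;;  # still running, keep waiting
      *) echo "unexpected status: '$status'" >&2; return 2 ;;
    esac
    attempt=$((attempt + 1))
    sleep "$interval"
  done
  echo "timed out waiting for $media_uid" >&2
  return 1
}
```

A call such as wait_for_enhancement med_7f6c7b1c2d 10 60 would check every 10 seconds for up to 60 attempts.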
Download the enhanced file
When enhancement_status is COMPLETED, download the processed media, with MEDIA_UID set to the media's uid.
curl -sS -L -X GET "https://api.ai-coustics.io/v2/medias/$MEDIA_UID/file" \
-H "X-API-Key: $AICOUSTICS_API_KEY" \
--output enhanced.wav
You should now have enhanced.wav saved locally.
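Without extra flags, curl saves whatever body the server returns, so an HTTP error response could end up inside enhanced.wav. A hedged sketch of a download wrapper that uses curl's --fail option and then checks that the output file is non-empty; the function name download_enhanced is hypothetical:

```shell
# Hypothetical helper: download the enhanced file and verify something was saved.
# Usage: download_enhanced <uid> <output_path>
download_enhanced() {
  media_uid=$1
  out=$2
  # --fail makes curl exit non-zero on HTTP errors instead of saving the error body.
  curl -sS -L --fail "https://api.ai-coustics.io/v2/medias/$media_uid/file" \
    -H "X-API-Key: $AICOUSTICS_API_KEY" \
    --output "$out" || return 1
  # -s is true only if the file exists and is non-empty.
  [ -s "$out" ]
}
```

For example, download_enhanced med_7f6c7b1c2d enhanced.wav returns non-zero if the request fails or the saved file is empty. It does not validate that the content is playable audio.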
The legacy v1 endpoints (/v1/media/..., /v1/videos/...) are deprecated. Prefer v2 (/v2/medias).