Skip to main content
This documentation was AI-generated. If you find any errors or have suggestions for improvement, please feel free to contribute! Edit on GitHub
Transforms part of an existing audio sample using text instructions. This node allows you to modify specific sections of audio by providing descriptive prompts, effectively “inpainting” or regenerating selected portions while preserving the rest of the audio.

Inputs

ParameterData TypeRequiredRangeDescription
modelCOMBOYes”stable-audio-2.5”
The AI model to use for audio inpainting.
promptSTRINGYesText description guiding how the audio should be transformed (default: empty).
audioAUDIOYesInput audio file to transform. Audio must be between 6 and 190 seconds long.
durationINTNo1-190Controls the duration in seconds of the generated audio (default: 190).
seedINTNo0-4294967294The random seed used for generation (default: 0).
stepsINTNo4-8Controls the number of sampling steps (default: 8).
mask_startINTNo0-190Starting position in seconds for the audio section to transform (default: 30).
mask_endINTNo0-190Ending position in seconds for the audio section to transform (default: 190).
Note: The mask_end value must be greater than the mask_start value. The input audio must be between 6 and 190 seconds in duration.

Outputs

Output NameData TypeDescription
audioAUDIOThe transformed audio output with the specified section modified according to the prompt.