This documentation was AI-generated. If you find any errors or have suggestions for improvement, please feel free to contribute!
The Wan Text to Video node generates video from a text description. It supports 480p, 720p, and 1080p output in several aspect ratios, durations of 5 or 10 seconds, and an optional audio input. When no audio is supplied, the node can generate audio automatically, and it also provides options for prompt enhancement and watermarking.

Inputs

| Parameter | Data Type | Required | Range | Description |
|-----------|-----------|----------|-------|-------------|
| `model` | COMBO | Yes | "wan2.5-t2v-preview" | Model to use (default: "wan2.5-t2v-preview") |
| `prompt` | STRING | Yes | - | Prompt describing the elements and visual features of the video; supports English and Chinese (default: "") |
| `negative_prompt` | STRING | No | - | Negative text prompt describing what to avoid (default: "") |
| `size` | COMBO | No | "480p: 1:1 (624x624)"<br>"480p: 16:9 (832x480)"<br>"480p: 9:16 (480x832)"<br>"720p: 1:1 (960x960)"<br>"720p: 16:9 (1280x720)"<br>"720p: 9:16 (720x1280)"<br>"720p: 4:3 (1088x832)"<br>"720p: 3:4 (832x1088)"<br>"1080p: 1:1 (1440x1440)"<br>"1080p: 16:9 (1920x1080)"<br>"1080p: 9:16 (1080x1920)"<br>"1080p: 4:3 (1632x1248)"<br>"1080p: 3:4 (1248x1632)" | Video resolution and aspect ratio (default: "480p: 1:1 (624x624)") |
| `duration` | INT | No | 5-10 | Available durations: 5 and 10 seconds (default: 5) |
| `audio` | AUDIO | No | - | Audio must contain a clear, loud voice, without extraneous noise or background music |
| `seed` | INT | No | 0-2147483647 | Seed to use for generation (default: 0) |
| `generate_audio` | BOOLEAN | No | - | If there is no audio input, generate audio automatically (default: False) |
| `prompt_extend` | BOOLEAN | No | - | Whether to enhance the prompt with AI assistance (default: True) |
| `watermark` | BOOLEAN | No | - | Whether to add an "AI generated" watermark to the result (default: True) |
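
Each `size` option packs a resolution tier, an aspect ratio, and pixel dimensions into one string. The sketch below shows one way such a string could be split into its parts, for example when a script needs the width and height of the chosen option. `parse_size` is a hypothetical helper written for illustration; it is not part of the node or of ComfyUI, and it assumes the option strings follow the format listed above.

```python
import re

# Hypothetical helper for illustration only; not part of the node's API.
# Assumes options look like "720p: 16:9 (1280x720)", as listed in the table.
_SIZE_PATTERN = re.compile(r"^(\d+p): (\d+:\d+) \((\d+)x(\d+)\)$")


def parse_size(option: str) -> dict:
    """Split a size option into tier, aspect ratio, width, and height."""
    match = _SIZE_PATTERN.match(option)
    if match is None:
        raise ValueError(f"Unrecognized size option: {option!r}")
    tier, aspect, width, height = match.groups()
    return {
        "tier": tier,
        "aspect": aspect,
        "width": int(width),
        "height": int(height),
    }


print(parse_size("720p: 16:9 (1280x720)"))
# {'tier': '720p', 'aspect': '16:9', 'width': 1280, 'height': 720}
```
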
Note: The `duration` parameter accepts only 5 or 10 seconds. Audio input, when provided, must be between 3.0 and 29.0 seconds long and contain a clear voice without background noise or music.
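
The constraints on `duration`, `seed`, and audio length can be checked before a workflow is submitted. The following is a minimal sketch, not the node's actual validation logic: `check_inputs` and the constant names are hypothetical, and the audio limits are assumed to be inclusive, which the documentation does not state explicitly.

```python
from typing import List, Optional

# Values taken from the parameter table and note; names are hypothetical.
ALLOWED_DURATIONS = (5, 10)
SEED_MIN, SEED_MAX = 0, 2147483647
AUDIO_MIN_SECONDS, AUDIO_MAX_SECONDS = 3.0, 29.0  # assumed inclusive


def check_inputs(
    duration: int = 5,
    seed: int = 0,
    audio_seconds: Optional[float] = None,
) -> List[str]:
    """Return a list of problems found; an empty list means no issues."""
    problems = []
    if duration not in ALLOWED_DURATIONS:
        problems.append(f"duration must be 5 or 10, got {duration}")
    if not SEED_MIN <= seed <= SEED_MAX:
        problems.append(f"seed must be in [0, 2147483647], got {seed}")
    # Audio is optional, so its length is checked only when it is supplied.
    if audio_seconds is not None and not (
        AUDIO_MIN_SECONDS <= audio_seconds <= AUDIO_MAX_SECONDS
    ):
        problems.append(
            f"audio must be 3.0 to 29.0 seconds long, got {audio_seconds}"
        )
    return problems


print(check_inputs(duration=7, audio_seconds=2.5))
```

Returning a list of messages rather than raising on the first failure lets a caller report every problem at once.
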

Outputs

| Output Name | Data Type | Description |
|-------------|-----------|-------------|
| `output` | VIDEO | The generated video based on the input parameters |