WanMoveTrackToVideo - ComfyUI Built-in Node Documentation

This documentation was AI-generated. If you find any errors or have suggestions for improvement, please feel free to contribute! Edit on GitHub

The WanMoveTrackToVideo node prepares conditioning and latent space data for video generation, incorporating optional motion tracking information. It encodes a starting image sequence into a latent representation and can blend in positional data from object tracks to guide the motion in the generated video. The node outputs modified positive and negative conditioning along with an empty latent tensor ready for a video model.

Inputs

Parameter	Data Type	Required	Range	Description
`positive`	CONDITIONING	Yes	-	The positive conditioning input to be modified.
`negative`	CONDITIONING	Yes	-	The negative conditioning input to be modified.
`vae`	VAE	Yes	-	The VAE model used to encode the starting image into the latent space.
`tracks`	TRACKS	No	-	Optional motion tracking data containing object paths.
`strength`	FLOAT	No	0.0 - 100.0	Strength of the track conditioning. (default: 1.0)
`width`	INT	No	16 - MAX_RESOLUTION	The width of the output video. Must be divisible by 16. (default: 832)
`height`	INT	No	16 - MAX_RESOLUTION	The height of the output video. Must be divisible by 16. (default: 480)
`length`	INT	No	1 - MAX_RESOLUTION	The number of frames in the video sequence. (default: 81)
`batch_size`	INT	No	1 - 4096	The batch size for the latent output. (default: 1)
`start_image`	IMAGE	Yes	-	The starting image or image sequence to encode.
`clip_vision_output`	CLIPVISIONOUTPUT	No	-	Optional CLIP vision model output to add to the conditioning.

Note: The strength parameter only has an effect when tracks are provided. If tracks are not provided or strength is 0.0, the track conditioning is not applied. The start_image is used to create a latent image and mask for the conditioning; if it is not provided, the node only passes through the conditioning and outputs an empty latent.

Outputs

Output Name	Data Type	Description
`positive`	CONDITIONING	The modified positive conditioning, potentially containing `concat_latent_image`, `concat_mask`, and `clip_vision_output`.
`negative`	CONDITIONING	The modified negative conditioning, potentially containing `concat_latent_image`, `concat_mask`, and `clip_vision_output`.
`latent`	LATENT	An empty latent tensor with dimensions shaped by the `batch_size`, `length`, `height`, and `width` inputs.

WanMoveTracksFromCoords - ComfyUI Built-in Node Documentation

WanMoveVisualizeTracks - ComfyUI Built-in Node Documentation

Nodes

​Inputs

​Outputs

Inputs

Outputs