workflow

Model Name	Function	File Source
Wan2.1-I2V-14B	Main video generator (480P)	`Wan2_1-I2V-14B-480P_fp8_e4m3fn.safetensors`
UMT5-XXL Text Encoder	Handles multilingual prompts	`umt5-xxl-enc-fp8_e4m3fn.safetensors`
OpenCLIP Vision Encoder	Extracts image semantics	`open-clip-xlm-roberta-large-vit-huge-14_visual_fp16.safetensors`
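
A missing or misnamed model file is a common reason a workflow fails to load. The sketch below checks that the three files listed in the table are present. It assumes the `models/wanvideo/` location mentioned in the notes of this guide and that the script is run from the ComfyUI root. The helper name `missing_models` is hypothetical, not part of any plugin.

```python
from pathlib import Path

# Filenames exactly as listed in the model table.
EXPECTED_MODELS = [
    "Wan2_1-I2V-14B-480P_fp8_e4m3fn.safetensors",
    "umt5-xxl-enc-fp8_e4m3fn.safetensors",
    "open-clip-xlm-roberta-large-vit-huge-14_visual_fp16.safetensors",
]


def missing_models(model_dir):
    """Return the expected filenames that are not present in model_dir."""
    root = Path(model_dir)
    return [name for name in EXPECTED_MODELS if not (root / name).is_file()]


if __name__ == "__main__":
    # Assumption: relative path from the ComfyUI root; adjust to your install.
    missing = missing_models("models/wanvideo")
    if missing:
        print("Missing model files:")
        for name in missing:
            print("  " + name)
    else:
        print("All expected model files found.")
```

The check only confirms that each file exists; it does not verify that a download is complete or uncorrupted.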

3. Key Nodes

Node Name	Function	Installation	Dependencies
WanVideoSampler	Controls video sampling (frames/CFG)	Requires WanVideo plugin	Main model + VAE
WanVideoImageClipEncode	Encodes input image to latent	Requires WanVideo plugin	CLIP vision model
VHS_VideoCombine	Combines frames (supports audio)	Install `ComfyUI-VideoHelperSuite`	FFmpeg required
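
Because the table lists FFmpeg as a requirement for VHS_VideoCombine, it can help to confirm that an `ffmpeg` executable is reachable before running the workflow. This is a minimal sketch using only the Python standard library. It checks the system PATH only; how the plugin itself locates FFmpeg is not covered here, and the function name `ffmpeg_available` is illustrative.

```python
import shutil


def ffmpeg_available():
    """Return True if an `ffmpeg` executable is found on the PATH."""
    return shutil.which("ffmpeg") is not None


print("FFmpeg found" if ffmpeg_available() else "FFmpeg not found on PATH")
```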

4. Workflow Structure

Group 1: Input Processing
- LoadImage: Loads input image (e.g., 576x1024)
- WanVideoTextEncode: Processes prompts (e.g., "A smiling ancient beauty")
Group 2: Model Loading
- LoadWanVideoT5TextEncoder: Loads T5 encoder
- WanVideoModelLoader: Loads 14B video model
Group 3: Video Generation
- WanVideoSampler: Generates latent (30 frames, CFG=6)
- WanVideoDecode: Decodes to image sequence via VAE
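
To give a feel for what the CFG value does, the sketch below applies the standard classifier-free guidance formula to toy numbers. It assumes the sampler combines a prediction for the negative (or empty) prompt with one for the positive prompt in the usual way. Whether WanVideoSampler does exactly this internally is not confirmed here, and `apply_cfg` is a hypothetical helper, not a plugin function.

```python
def apply_cfg(uncond, cond, scale):
    """Combine two predictions element-wise.

    Standard classifier-free guidance: uncond + scale * (cond - uncond).
    """
    return [u + scale * (c - u) for u, c in zip(uncond, cond)]


# Toy numbers for illustration, not real model outputs.
uncond = [0.0, 1.0, -1.0]   # prediction for the negative prompt
cond = [1.0, 1.0, 0.0]      # prediction for the positive prompt

guided = apply_cfg(uncond, cond, scale=6.0)
print(guided)  # [6.0, 1.0, 5.0]
```

With a scale of 1 the result equals the positive-prompt prediction. Larger values push the output further away from the negative-prompt prediction, which is why a high CFG follows the prompt more strongly.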

5. Inputs & Outputs

Required Inputs:
- Image file (PNG/JPG)
- Positive prompt (e.g., style description)
- Negative prompt (e.g., "low quality, static")
Outputs:
- Animated WEBP (default) or MP4
- Resolution: 272x272 (adjustable)
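
The required inputs above can be checked before queuing a run. The following is a minimal sketch: the function name `validate_inputs` and its messages are hypothetical, and treating `.jpeg` as equivalent to JPG is an assumption.

```python
from pathlib import Path

# Assumption: .jpeg is accepted alongside the PNG/JPG formats listed above.
ALLOWED_EXTENSIONS = {".png", ".jpg", ".jpeg"}


def validate_inputs(image_path, positive_prompt, negative_prompt):
    """Return a list of problems; an empty list means the inputs look usable."""
    problems = []
    if Path(image_path).suffix.lower() not in ALLOWED_EXTENSIONS:
        problems.append("image must be a PNG or JPG file")
    if not positive_prompt.strip():
        problems.append("positive prompt is empty")
    if not negative_prompt.strip():
        problems.append("negative prompt is empty")
    return problems


print(validate_inputs("portrait.png", "A smiling ancient beauty", "low quality, static"))
```

The check looks only at the file extension and prompt text; it does not open the image or inspect its resolution.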

6. Notes

⚠️ Troubleshooting:
- VRAM: The 14B model requires a GPU with ≥16GB of VRAM; enable bf16 precision.
- Plugin: Manual install required. Run the following inside ComfyUI's `custom_nodes/` directory: `git clone https://github.com/AI-ModelScope/comfyui-wanvideo-plugin`
- Models: Place all `.safetensors` files in `models/wanvideo/`.
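
The VRAM note is easier to interpret with a back-of-the-envelope estimate of how much memory the weights alone occupy at different precisions. The sketch below assumes exactly 14 billion parameters and counts only the main model's weights. Activations, the text encoder, the vision encoder, and the VAE are ignored, so real usage will be higher.

```python
# Bytes needed to store one parameter at each precision.
BYTES_PER_PARAM = {"fp8": 1, "bf16": 2, "fp16": 2, "fp32": 4}


def weight_size_gib(num_params, dtype):
    """Approximate size of the weights alone, in GiB."""
    return num_params * BYTES_PER_PARAM[dtype] / 2**30


# Assumption: "14B" is taken as exactly 14e9 parameters.
for dtype in ("fp8", "bf16"):
    print(f"{dtype}: {weight_size_gib(14e9, dtype):.1f} GiB")
```

Under these assumptions the weights come to roughly 13 GiB in fp8 and roughly 26 GiB in bf16. This suggests why the fp8 checkpoint is the one listed for a 16GB card.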

From Images to Videos: A Deep Dive into the Wan2.1-I2V Workflow

Unlock AI-powered video generation with Alibaba's Wan2.1 model! Learn how to create stunning videos from static images using this workflow guide.

