workflow

Text-to-Video (T2V): Generates videos from text prompts.
Image-to-Video (I2V): Converts input images to animated sequences (e.g., anime-style transitions).
Optimizations: Includes VRAM management, inference acceleration, and resolution control.

2. Core Models

Wan2.1-I2V-14B: Primary video generation model (supports dual input: text/image).
umt5-xxl-enc: Text encoder for prompt processing.
open-clip-xlm-roberta: Encodes input images for I2V mode.

3. Key Nodes

Input & Encoding

LoadImage: Uploads input images (I2V mode).
WanVideoImageClipEncode: Encodes images into embeddings.
WanVideoTextEncode: Processes text prompts (T2V mode).

Model & Inference

WanVideoModelLoader: Loads Wan2.1 model (supports LoRA adapters).
WanVideoSampler: Generates videos (steps=25, CFG=6, etc.).

Optimizations

WanVideoBlockSwap: VRAM optimization (model chunking).
WanVideoTeaCache: Speeds up inference (caches intermediate results).
WanVideoSLG: Dynamic generation strategy (e.g., staged sampling).

Post-Processing

WanVideoDecode: Decodes latent frames to images.
VHS_VideoCombine: Renders final video (30FPS MP4).

4. Workflow Structure (Groups)

Image Input Zone
- Input: Images (e.g., 透明.png), recommended size ≤480x480.
- Key Nodes: LoadImage, WanVideoImageClipEncode.
Loader Zone
- Loads models/encoders:
  - WanVideoVAELoader (VAE).
  - LoadWanVideoT5TextEncoder (text encoder).
Workspace (Core Logic)
- Text/image encoding → Model inference → Optimizations.
- Key Nodes: WanVideoSampler, WanVideoSLG.
Post-Processing Zone
- Video decoding & synthesis: WanVideoDecode, VHS_VideoCombine.

5. Inputs & Outputs

Inputs:
- Image (I2V) or text prompt (T2V).
- Resolution: Default 832x480 (set in WanVideoImageClipEncode).
Output:
- Video file (MP4, 30FPS), e.g., WanVideo2_1_T2V_00256.mp4.

6. Notes & Tips

VRAM: 14B model requires 16GB+ GPU; enable BlockSwap and TeaCache.
Image Size: Resize large images with ImageResizeKJ to avoid OOM.
LoRA: Optional adapters like 馨染_Wan2.1 for style control.
Parameter Tips:
- SLG: For 14B, use blocks=16-20, strat_percent=0.1-0.15.
- TeaCache: For 14B, set rel_l1_thresh=0.2, mode=speed.

Unleash Dynamic Videos with Angry Facial Expressions: A Step-by-Step Workflow

Unlock Next-Level Animation: First-Frame Controlled Video Generation Pipeline

Recommend

Unlock Stunning Images: A Step-by-Step Guide to Flux.1-Based Text-to-Image Generation

Unlock high-quality image generation with Flux.1! Discover a Text-to-Image workflow integrating LoRA enhancement and multilingual support, producing stunning 1024x1280 images. Learn how to harness Flux.1-dev, T5-XXL, CLIP-L, and VAE for artistic and professional photography-style applications.

Beyond the Frame: A Step-by-Step Workflow for FLUX Model Image Outpainting

Unlock the full potential of your images with FLUX model outpainting. Extend borders, fill missing parts, and enhance quality using Stable Diffusion techniques and AI-powered tools. Learn how in this workflow guide.

Unlock Professional-Grade Poster Design with Miluo Advanced Aesthetic Workflow

Unlock stunning poster designs with Miluo Advanced Aesthetic Poster Design workflow, featuring Flux and Lora models for high-end aesthetics and artistic quality. Try now!

Boost Your Visual Content with AI-Driven Image Generation Workflow

Unlock precision image generation with our workflow, featuring multi-modal inputs, precision control, and bilingual processing. Discover key applications and core models for advertising, social media, and product visualization.

Transforming Line Art into 3D-Style Renders: A Deep Dive into ControlNet and Dual CLIP Encoding

Unlock Stunning Art: Transform line art into vibrant illustrations & 3D-style renders with ControlNet-guided generation & super-resolution. Learn how to use this AI workflow for breathtaking results.

Summary

Unlock AI-powered video creation with Wan2.1 Model Inference (T2V & I2V)! Generate videos from text prompts, convert images to animated sequences, and optimize with VRAM management, acceleration, and resolution control.

Chapter

workflow:

CustomNodes:

WanVideoImageClipEncode Note L...