Anime Video Magic: Unleashing the Potential of WAN2.1 for Stunning Visuals
1. Workflow Overview

This workflow uses Alibaba’s WAN2.1 model for Image-to-Video (I2V) generation, transforming a 2D anime-style character image (e.g., a girl cycling) into a dynamic video with camera motion.
Core Models:
Wan2.1-I2V-14B: Video generation model (480P output).UMT5-XXL: Multilingual text encoder for Chinese prompts.OpenCLIP-ViT-H: Image encoder for input analysis.
2. Key Nodes
Node | Function | Installation |
|---|---|---|
| Loads WAN2.1 model (requires | Manual download to |
| Processes bilingual prompts (e.g., "cycling scene") | Requires |
| Frame interpolation for smoother video | Install via ComfyUI Manager |
| Combines frames into MP4 | Install |
Dependencies:
Manual model downloads:
Wan2_1-I2V-14B-480P_fp8_e4m3fn.safetensorsumt5-xxl-enc-bf16.safetensorsSave to:
ComfyUI/models/wan_video/
3. Workflow Groups
Load Image: Input character image (e.g.,
00028-2257557710.png).Model Load: Load WAN2.1, VAE, and text encoders.
Prompt Processing: Positive/Negative prompts (e.g., "speed effect").
Video Generation:
WanVideoSamplergenerates latent frames.Post-Processing: Upscaling (
CR Upscale Image).Video Export: MP4 output (
VHS_VideoCombine).Frame Interpolation: Boosts FPS to 32 via
RIFE VFI.
4. Inputs & Outputs
Inputs:
Image: 480x768 PNG (anime character).
Prompts: Chinese/English descriptions.
Params: Frame rate (16→32FPS), CFG=8.0, Steps=30.
Output:
MP4 video (default:
xiao.mp4).
5. Notes
VRAM: ≥16GB GPU recommended (use
bf16for lower usage).Debugging:
Missing model → Check
wan_videofolder path.RIFE error → Ensure
rife47.pthis inComfyUI/models/rife/.
Optimization:
Reduce resolution (e.g., 360P) for lower VRAM.
Disable experimental features to speed up.