Bringing Figurines to Life: The WAN2.1 I2V Workflow Guide

CN
ComfyUI.org
2025-04-21 09:06:29

1. Workflow Overview

m9qupwqnsncjbxws1eda9a04622243bb876270948bfa0866eded68772956a644ee8c3628d8ebffb61e.gif

This WAN2.1 I2V workflow specializes in transforming static figurine images into animated videos. Key features:

  • Input: A PNG image (e.g., ComfyUI_temp_sydys_00036_.png) β†’ Output: 3D rotation/transformation video

  • Supports background preservation and motion control (via SLGArgs node)

  • Exports MP4 video (720P, 16FPS by default)

2. Core Models

Model Name

Function

Wan2_1-I2V-14B-720P_fp8_e4m3fn.safetensors

Main I2V temporal generation model

umt5-xxl-enc-bf16.safetensors

Multilingual text encoder (supports Chinese)

Wan2_1_VAE_bf16.safetensors

Video latent space decoder

3. Key Nodes

Node Name

Function

Installation

WanVideoModelLoader

Loads main model + LoRA (figurine style)

Requires ComfyUI-WanVideoWrapper plugin

WanVideoTextEncode

Processes prompts (e.g., "transform into 3D figurine")

Same as above

WanVideoSLG

Controls motion amplitude (0.1~0.8)

Same as above

VHS_VideoCombine

Video rendering (requires VideoHelperSuite)

Install via ComfyUI Manager

4. Workflow Groups

  1. Model Loading Group

    • Nodes: LoadWanVideoT5TextEncoder, WanVideoVAELoader

    • Input: Model paths (e.g., umt5-xxl-enc-bf16.safetensors)

    • Output: Initialized encoders + VAE

  2. Image Processing Group

    • Node: WanVideoImageClipEncode

    • Input: 1024x1440 PNG image (transparent background recommended)

    • Output: Image embeddings

  3. Video Generation Group

    • Nodes: WanVideoSampler + WanVideoDecode

    • Params: Seed 34660692369907, Steps 20

    • Output: Latent video sequence

  4. Video Export Group

    • Node: VHS_VideoCombine

    • Params: FPS 16, CRF 19

    • Output: WanVideo2_1_00001.mp4

5. Inputs & Outputs

  • Inputs:

    • Image: 1024x1440 PNG (transparent background)

    • Prompt: "A girl rotates 360Β° and transforms into a 3D figurine"

    • Negative Prompt: "Low quality, background change, distorted limbs"

  • Output:

    • 720P MP4 video (H.264, YUV420P)

6. Notes

  1. Hardware Requirements:

    • VRAM β‰₯16GB (due to bf16 + fp8_e4m3fn optimization)

    • Launch with --highvram flag

  2. Troubleshooting:

    • Model Path Error: Ensure .safetensors are in ComfyUI/models/wanvideo/

    • Video Flickering: Adjust WanVideoTeaCache parameter 0.3