workflow

This workflow leverages the Wan2.1-I2V-14B model to generate dynamic videos with "Avatar Summoning" effects (e.g., semi-transparent phantom synchronized with character movements). It combines text prompts + input images and custom LoRAs (e.g., spell effects).

2. Core Models

Wan2.1-I2V-14B-480P_fp8_e4m3fn.safetensors
- Main model for video generation (image-to-video). Requires BF16 precision.
umt5-xxl-enc-bf16.safetensors
- T5 text encoder for processing complex prompts (supports Chinese).
Wan2.1_VAE_bf16.safetensors
- Decodes latent frames to images.

3. Key Nodes

WanVideoModelLoader
- Loads the main model. Manual download required (place in ComfyUI/models/wan_video).
WanVideoTextEncode
- Processes text prompts (positive/negative) using T5.
WanVideoSampler
- Uses DPM++ SDE sampler (25 steps default).
WanVideoLoraSelect
- Applies custom LoRAs (e.g., Avatar Summoning_beta).
VHS_VideoCombine
- Renders frames into MP4 (16 FPS).

4. Workflow Structure

Input Group
- Text prompts (e.g., "A woman swings a sword, summoning a purple phantom").
- Reference image (e.g., "修仙女子.png").
Generation Group
- Model initialization via WanVideoModelLoader and WanVideoVAELoader.
- Frame generation via WanVideoSampler.
Output Group
- Video synthesis with VHS_VideoCombine (480x832 resolution).

5. Inputs & Outputs

Inputs: Text prompts, image, seed (e.g., 1057359483639287).
Outputs: MP4 video (H.264, with metadata).

6. Notes

Dependencies: Manually download Wan2.1 models and LoRAs.
VRAM: 16GB+ GPU recommended. Use BF16 to reduce usage.
Compatibility: Requires ComfyUI-WanVideoWrapper (install via ComfyUI Manager).
Troubleshooting:
- FileNotFoundError if models are missing.
- Reduce resolution in WanVideoBlockSwap for CUDA OOM errors.

Unlock Advanced Lighting Optimization: A Step-by-Step Workflow for Stunning Images

From Brushstrokes to Pixels: A Deep Dive into Stable Diffusion's Graffiti Capabilities

Recommend

Boost Your Image Generation Game with Stable Diffusion, JOY Caption Two, and LORA

Unlock AI-powered image generation with Stable Diffusion, JOY Caption Two, and FLUX. Discover how to reverse-engineer prompts from reference images and create stunning new visuals. Learn more and start creating now!

Transforming Line Art into 3D-Style Renders: A Deep Dive into ControlNet and Dual CLIP Encoding

Unlock Stunning Art: Transform line art into vibrant illustrations & 3D-style renders with ControlNet-guided generation & super-resolution. Learn how to use this AI workflow for breathtaking results.

Create Adorable Cat Videos with AI: A Low-VRAM Workflow

Generate cute cat videos from static images with this workflow! Learn how to create high-quality MP4s using low VRAM and fast local processing. Discover the power of image-to-video, super-resolution, and frame interpolation.

Unlock Liquid Magic: Advanced I2V Workflow for Stunning Visual Effects

Generate Stunning Liquid Collision Videos with I2V Workflow! Discover how to combine WanVideo's custom models with GIMM-VFI for breathtaking effects. Learn more and start creating now!

Unlock Time-Lapse Aging Videos with Wan2.1 I2V Model: A Step-by-Step Guide

Generate stunning time-lapse aging videos from portraits with Wan2.1 I2V model & Aging LoRA. Learn how to create realistic facial aging effects with multimodal control.

Summary

Unlock dynamic video generation with the Wan2.1-I2V-14B model! Learn how to create stunning "Avatar Summoning" effects with text prompts, input images, and custom LoRAs. Discover the workflow, core models, and key nodes to get started

Chapter

workflow:

CustomNodes:

WanVideoDecode WanVideoModelLo...