Boost Your Video Creation with Wan2.1, RIFE, and CR Upscaling
1. Workflow Overview

This workflow leverages Wan2.1 model for Image-to-Video (I2V) generation, with:
Input: Single image + text prompt β Low-res video generation
RIFE interpolation + CR upscaling for higher FPS (32fps) and resolution
KJ acceleration (TeaCache/BlockSwap) to optimize VRAM usage
Outputs: Original (16fps) and enhanced videos (32fps)
2. Core Models
Model Name | Function | Source |
---|---|---|
| Main I2V model | Manual download (e.g., HuggingFace) |
| T5 text encoder | Required companion model |
| Super-resolution upscaler | Install via ComfyUI Manager |
| Frame interpolation (RIFE) | Manual GitHub install |
3. Key Nodes
Node | Purpose | Installation |
---|---|---|
| Controls video sampling (UniPC) | Built-in |
| Loads LoRA for style tuning | Requires Wan plugin |
| Frame interpolation (32fps output) | Install |
| 3x resolution upscaling | Install |
4. Workflow Groups
Group 1: Wan2.1 Model Loading
Input: Model files, VAE, LoRA
Output: Initialized video generation model
Group 2: Text & Image Encoding
Input: Prompt (e.g., "1girl, golden hair"), negative prompt, uploaded image
Output: CLIP image embeds + T5 text embeds
Group 3: Acceleration Nodes
Critical params:
BlockSwap=20
(VRAM safety),TeaCache=0.04
(speed boost)
Group 4: Initial Video Synthesis
Output: 480P raw video (16fps)
Group 5: Upscale + Interpolation
Pipeline: CR upscale β RIFE interpolation β 1080P output (32fps)
5. Inputs & Outputs
Input Parameters:
Required:
Image path
,Prompt
,Seed (41387343190862)
Optional:
LoRA model
,RIFE multiplier (10)
Output:
teacache_00002.mp4
(raw video)xiao_00001.mp4
(HD interpolated video)
6. Notes
VRAM: Recommended β₯12GB GPU (e.g., RTX 3060+). Enable BlockSwap to reduce usage.
Compatibility: Wan models must be bf16/fp8 format.
Troubleshooting: If
umt5-xxl-enc-bf16.safetensors
is missing, download from HuggingFace toComfyUI/models/wan_video
.Optimization: Adjust
TeaCache
(0.01~0.05) for speed/quality trade-off.