"Wan2.1 Multiverse Workflow: Generate Stunning Cooking Cat Videos"
1. Workflow Overview

This workflow generates viral "Cooking Cat" videos for Douyin using Wan2.1 Multiverse's image-to-video capabilities. Key features:
Dynamic video generation from a single image (e.g., cat cooking)
LoRA fine-tuning (e.g., "Fat Cat Chef" style)
4K upscaling (via 4x-UltraSharp)
Frame interpolation (10x with RIFE VFI)
Multi-format output (MP4 + GIF)
2. Core Models
Model Name | Function | Source/Installation |
---|---|---|
Wan2.1-I2V-14B | Image-to-video backbone (FP8) | Manual |
Fat_Cat_Cook_MZF04 LoRA | Stylized cat chef tuning | Load via |
4x-UltraSharp | Super-resolution | Install via ComfyUI Manager |
RIFE 4.7 | Frame interpolation | Requires |
3. Key Nodes
Node Name | Function | Installation |
---|---|---|
WanVideoImageClipEncode | Encodes images to video latent | Requires |
WanVideoSampler | Controls video sampling (15 steps) | Same as above |
RIFE VFI | 10x frame interpolation | Requires |
ImageResizeAdvanced | Dynamic image resizing | Built-in |
4. Workflow Groups
Group 1: Image Preprocessing
Input: 1152x2048 PNG (cat chef image)
Nodes:
LoadImage → ImageResizeAdvanced (1024x1024)
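The recommended 1152x2048 input is 9:16, while the 1024x1024 resize target is 1:1, so the resize step has to crop or stretch the image. As a rough sketch of the alternative, the helper below computes a ratio-preserving target size. The function name `fit_to_budget` is hypothetical, and the assumption that both sides should be multiples of 16 is illustrative rather than something the workflow states.

```python
def fit_to_budget(width, height, max_side=1024, multiple=16):
    """Scale (width, height) so the longer side is at most max_side,
    keep the aspect ratio, and snap each side to a multiple (assumed 16)."""
    # Never upscale here: cap the scale factor at 1.0.
    scale = min(1.0, max_side / max(width, height))

    def snap(value):
        # Round to the nearest multiple, but never below one multiple.
        return max(multiple, round(value * scale / multiple) * multiple)

    return snap(width), snap(height)


# The recommended source image keeps its 9:16 ratio at 576x1024.
print(fit_to_budget(1152, 2048))
```

Entering the resulting size in ImageResizeAdvanced instead of a square target is one way to follow the "match image ratio" advice in the notes.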
Group 2: Video Generation
Core Nodes:
WanVideoImageClipEncode
WanVideoSampler (DPM++, 15 steps)
WanVideoDecode
Group 3: Post-Processing
Upscale: ImageUpscaleWithModel (4x)
Interpolation: RIFE VFI (10x)
Output: VHS_VideoCombine (MP4 24fps)
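To get a feel for what the post-processing chain produces, the arithmetic can be sketched as follows. The source frame count (17 here) is a placeholder, since the workflow description does not state it. The frame formula assumes the interpolator inserts frames only between existing neighbours, and the actual RIFE VFI node may count differently.

```python
def post_process_plan(width, height, frames, upscale=4, interp=10, fps=24):
    """Estimate output size, frame count, and duration after post-processing."""
    out_w, out_h = width * upscale, height * upscale
    # Assumption: (interp - 1) new frames between each pair of neighbours.
    out_frames = (frames - 1) * interp + 1
    return {
        "size": (out_w, out_h),
        "frames": out_frames,
        "seconds": out_frames / fps,
    }


# 1024x1024 frames from Group 1; 17 source frames is a hypothetical value.
plan = post_process_plan(1024, 1024, frames=17)
print(plan["size"], plan["frames"], round(plan["seconds"], 2))
```

Under these assumptions a 4x upscale of 1024x1024 yields 4096x4096 frames, which is why the upscale and interpolation steps dominate both runtime and output file size.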
5. Inputs & Outputs
Inputs:
Required:
Source image (recommended 1152x2048)
Prompt (e.g., "A cat eating with a spoon")
Optional:
Seed (random by default)
LoRA strength (default: 0.9)
Outputs:
WanVideo2_1.mp4 (H.264, 24fps)
Upscaled GIF animation
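The inputs listed above can be captured in a small settings object before they are handed to the workflow. This is only an illustrative sketch: the class `WorkflowInputs`, its field names, and the file name `cat_chef.png` are hypothetical and not part of the workflow itself. Only the defaults (random seed, LoRA strength 0.9) come from the list above.

```python
import random
from dataclasses import dataclass
from typing import Optional


@dataclass
class WorkflowInputs:
    image_path: str               # required: source image
    prompt: str                   # required: text prompt
    seed: Optional[int] = None    # optional: random when left unset
    lora_strength: float = 0.9    # optional: default from the workflow

    def resolved_seed(self) -> int:
        """Return the fixed seed, or draw a random one if none was given."""
        if self.seed is None:
            return random.randint(0, 2**32 - 1)
        return self.seed

    def validate(self) -> None:
        """Reject missing required inputs early."""
        if not self.image_path:
            raise ValueError("image_path is required")
        if not self.prompt.strip():
            raise ValueError("prompt must not be empty")


inputs = WorkflowInputs("cat_chef.png", "A cat eating with a spoon")
inputs.validate()
```

Fixing the seed (for example `seed=7`) makes a run repeatable, which helps when comparing LoRA strength values.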
6. Notes
VRAM: ≥12GB recommended (reduced with fp8_e4m3fn quantization)
Special Features:
Fat_Cat_Cook_MZF04 LoRA for viral Douyin style
Note node hints: "Test sdpa/flaash optimizations"
Troubleshooting:
Match image ratio to ImageResizeAdvanced settings
Verify rife47.pth path if interpolation fails
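The effect of fp8_e4m3fn quantization on the VRAM note can be estimated with simple arithmetic: one byte per parameter instead of two. The sketch below covers weight storage only. It ignores activations, the text and image encoders, and the VAE, so it is a lower bound rather than a measured figure, and the helper name `weight_gib` is hypothetical.

```python
# Bytes used to store one parameter at each precision.
BYTES_PER_PARAM = {"fp32": 4, "fp16": 2, "bf16": 2, "fp8_e4m3fn": 1}


def weight_gib(params_billion, dtype):
    """Approximate weight storage in GiB for a model of the given size."""
    return params_billion * 1e9 * BYTES_PER_PARAM[dtype] / 2**30


# The 14B backbone at half precision versus FP8.
for dtype in ("fp16", "fp8_e4m3fn"):
    print(f"{dtype}: {weight_gib(14, dtype):.1f} GiB")
```

By this estimate the 14B backbone needs roughly 26 GiB of weights in fp16 and roughly 13 GiB in FP8, so running it on a 12GB card would presumably also rely on offloading part of the model, which the source does not describe.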