"Wan2.1 Multiverse Workflow: Generate Stunning Cooking Cat Videos"

CN
ComfyUI.org
2025-04-08 11:47:24

1. Workflow Overview

m98fqe3gcwhwpda4nl1444529fea95307e53cfb766cd5384e4a5283a45e764ac80671d003b91cf3997.gif

This workflow generates viral "Cooking Cat" videos for Douyin using Wan2.1 Multiverse's image-to-video capabilities. Key features:

  • Dynamic video generation from a single image (e.g., cat cooking)

  • LoRA fine-tuning (e.g., "Fat Cat Chef" style)

  • 4K upscaling (via 4x-UltraSharp)

  • Frame interpolation (10x with RIFE VFI)

  • Multi-format output (MP4 + GIF)

2. Core Models

Model Name

Function

Source/Installation

Wan2.1-I2V-14B

Image-to-video backbone (FP8)

Manual .safetensors download

Fat_Cat_Cook_MZF04 LoRA

Stylized cat chef tuning

Load via LoraLoader node

4x-UltraSharp

Super-resolution

Install via ComfyUI Manager

RIFE 4.7

Frame interpolation

Requires rife47.pth weights

3. Key Nodes

Node Name

Function

Installation

WanVideoImageClipEncode

Encodes images to video latent

Requires ComfyUI-WanVideoWrapper

WanVideoSampler

Controls video sampling (15 steps)

Same as above

RIFE VFI

10x frame interpolation

Requires VideoHelperSuite

ImageResizeAdvanced

Dynamic image resizing

Built-in

4. Workflow Groups

  • Group 1: Image Preprocessing

    • Input: 1152x2048 PNG (cat chef image)

    • Nodes: LoadImage β†’ ImageResizeAdvanced (1024x1024)

  • Group 2: Video Generation

    • Core Nodes:

      • WanVideoImageClipEncode

      • WanVideoSampler (DPM++, 15 steps)

      • WanVideoDecode

  • Group 3: Post-Processing

    • Upscale: ImageUpscaleWithModel (4x)

    • Interpolation: RIFE VFI (10x)

    • Output: VHS_VideoCombine (MP4 24fps)

5. Inputs & Outputs

  • Inputs:

    • Required:

      • Source image (recommended 1152x2048)

      • Prompt (e.g., "A cat eating with a spoon")

    • Optional:

      • Seed (random by default)

      • LoRA strength (default: 0.9)

  • Outputs:

    • WanVideo2_1.mp4 (H.264, 24fps)

    • Upscaled GIF animation

6. Notes

  • VRAM: β‰₯12GB recommended (reduced with fp8_e4m3fn quantization)

  • Special Features:

    • Fat_Cat_Cook_MZF04 LoRA for viral Douyin style

    • Note node hints: "Test sdpa/flaash optimizations"

  • Troubleshooting:

    • Match image ratio to ImageResizeAdvanced settings

    • Verify rife47.pth path if interpolation fails