Master Video Creation: A Workflow for First/Last Frame Generation and Enhancement
1. Workflow Overview

This workflow is designed for first/last frame video generation and enhancement, featuring:
Dynamic video synthesis from start/end images
4x super-resolution upscaling (using
4xRealWebPhoto_v4
)Frame interpolation (10x via RIFE VFI)
Auto frame labeling (e.g., "start_frame"/"end_frame")
Dual-format output (MP4 + GIF)
2. Core Models
Model Name | Function | Source/Installation |
---|---|---|
Wan2.1-Fun-InP-14B | Main video gen model (FP8) | Manual |
umt5-xxl-enc-bf16 | Multilingual text encoder | Loaded via |
4xRealWebPhoto_v4 | Image upscaler | Install via ComfyUI Manager |
RIFE 4.7 | Optical flow interpolation | Requires |
3. Key Nodes
Node Name | Function | Installation |
---|---|---|
| Encodes frames to latent space | Built-in |
| Controls video sampling process | Requires |
| Frame interpolation (10x) | Requires |
| Dynamic image resizing | Requires |
4. Workflow Groups
Group 1: Start Frame Processing
Input: 1288x1024 PNG
Nodes:
LoadImage
βImageResizeKJ
(480x640) βAddLabel
("start_frame")
Group 2: End Frame Processing
Input: 1288x1024 PNG
Nodes:
LoadImage
βImageResizeKJ
βAddLabel
("end_frame")
Group 3: Video Generation
Core:
WanVideoSampler
(30 steps, UniPC) βWanVideoDecode
(to image sequence)
Group 4: Post-Processing
Upscale:
ImageUpscaleWithModel
(4x)Interpolation:
RIFE VFI
(10x)Output:
VHS_VideoCombine
(MP4 16fps + GIF 32fps)
5. Inputs & Outputs
Inputs:
Required: Start/End frame images (drag-and-drop)
Optional: Seed (default: 999), Prompt (e.g., "change clothes"), Negative prompt (pre-configured)
Outputs:
output.mp4
(H.264, 16fps)AnimateDiff.gif
(32fps loop)
6. Notes
VRAM: Recommended β₯16GB GPU (FP8 optimization reduces usage to ~12GB)
Troubleshooting:
Check
ImageResizeKJ
parameters if image dimensions mismatchVerify
rife47.pth
path if RIFE fails
Optimization:
Enable
torch.compile
(configured withinductor
mode)Use
easy cleanGpuUsed
for manual VRAM release