workflow

This workflow generates e-commerce product videos from static images (e.g., jewelry, apparel), highlighting details and wearing effects. Key features:

Image-to-Video (I2V): Converts product images into dynamic clips (e.g., subtle bracelet movement on a model's wrist).
Smart Cropping: Auto-adjusts input aspect ratio (e.g., 350x350 → 832x480) for video output.
Commercial-Grade Output: 30fps H.264 encoding (CRF19) balances quality and file size.

Core Models:

Wan2.1-I2V-14B: Main video model (480P, bf16/fp8 mixed precision).
UMT5-XXL Text Encoder: Processes Chinese product descriptions.
CLIP Vision Encoder: Analyzes image composition/color.

2. Key Components

Critical Nodes:

WanVideoModelLoader:
- Loads Wan2.1 model (Wan2_1-I2V-14B-480P_fp8_e4m3fn.safetensors) with sdpa attention optimization.
WanVideoTextEncode:
- Input product prompts, e.g.:
  - Positive: "Bracelet showcase video, highlighting craftsmanship."
  - Negative: "low quality, blurry, distortion"
ImageScaleByAspectRatio V2:
- Resizes input (e.g., 847b03d5...jpg) with letterbox padding.
WanVideoSampler:
- Settings:
  - Steps: 25
  - Sampler: unipc (fast convergence)
  - Seed: Random (can fix to 773185989414318).
VHS_VideoCombine:
- Exports 30fps MP4 (default: AnimateDiff.mp4).

Dependencies:

Install ComfyUI-VideoHelperSuite.
Download models to:
- Main model: ComfyUI/models/wan_video
- VAE: Wan2_1_VAE_bf16.safetensors

3. Workflow Structure

Model Loading:
- Load video model, text encoder, VAE.
Input Processing (Operation Group):
- Upload product image → Resize → CLIP encode.
Video Generation:
- Fuse image features + text → Sampler → Latent frames.
Output:
- Decode latent → MP4 synthesis (with metadata).

Key Parameters:

Resolution: Fixed 480P (832x480 landscape).
Frame Rate: 30fps (set in VHS_VideoCombine).

4. Input & Output

Input Parameters:

Image: Square/vertical (e.g., 1440x1920), clear product focus.
Text Prompt: Concise product highlights (Chinese preferred).

Output:

480P MP4 video (e.g., bracelet animation), saved to ComfyUI/output.

5. Notes

VRAM: ≥12GB required (16GB recommended).
Image Tips:
- Clear subject, simple background (pre-cropped).
- Avoid text/watermarks interfering with CLIP.
Troubleshooting:
- Choppy video? Reduce WanVideoSampler steps (e.g., 20).
- Distortion? Check ImageScaleByAspectRatio mode is letterbox.

From Abstract to Stunning: Mastering AI-Driven Image Generation with LoRA Style Control and Captioning

Transform Portraits into Anime Masterpieces with AI-Powered Workflow

Recommend

MimicMotion Explained: How to Use Diffusion Models for Animation in ComfyUI

Generate animated videos with MimicMotion: Transform reference images and pose sequences into seamless MP4 animations. Explore the workflow now!

Unlock Stunning Images: A Step-by-Step Guide to Flux.1-Based Text-to-Image Generation

Unlock high-quality image generation with Flux.1! Discover a Text-to-Image workflow integrating LoRA enhancement and multilingual support, producing stunning 1024x1280 images. Learn how to harness Flux.1-dev, T5-XXL, CLIP-L, and VAE for artistic and professional photography-style applications.

"Unlocking Artistic Potential: A Deep Dive into the Flux.1 and Florence-2 Workflow"

Generate stunning oil painting-style images with Flux.1 & Florence-2. Learn how to harness AI for art creation & discover the power of image-to-text captioning. Dive into this workflow now!

Beyond the Frame: A Step-by-Step Workflow for FLUX Model Image Outpainting

Unlock the full potential of your images with FLUX model outpainting. Extend borders, fill missing parts, and enhance quality using Stable Diffusion techniques and AI-powered tools. Learn how in this workflow guide.

Boost Your Image Generation Game with Stable Diffusion, JOY Caption Two, and LORA

Unlock AI-powered image generation with Stable Diffusion, JOY Caption Two, and FLUX. Discover how to reverse-engineer prompts from reference images and create stunning new visuals. Learn more and start creating now!

Summary

Unlock stunning e-commerce product videos from static images with our innovative workflow, featuring AI-powered image-to-video conversion, smart cropping, and commercial-grade output. Discover how to elevate your product showcases!

Chapter

workflow:

CustomNodes:

WanVideoBlockSwap LoadWanVideo...