Unlock the Power of Lip-Synced Talking Avatars with Sonic Digital Human Workflow

ComfyUI.org
2025-05-12 10:19:13

1. Workflow Overview

(Workflow demo GIF)

This "Sonic Digital Human" workflow generates lip-synced talking-avatar videos by combining an input image (e.g. a portrait) with an audio clip (e.g. speech). Built on the Stable Video Diffusion (SVD) framework, it outputs MP4 videos with synchronized facial animation.

2. Core Models

| Model/Component | Function | Source |
| --- | --- | --- |
| svd_xt_1_1 | Base video diffusion model | Download to models/checkpoints |
| Sonic model (unet.pth) | Lip-sync control | Quark/Baidu links in workflow |
| CLIP Vision | Image feature extraction | Built-in |
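
Before running the workflow, it is worth confirming the model files are in place. Below is a minimal sketch; the exact filenames (`svd_xt_1_1.safetensors`) and the `models/sonic` directory are assumptions based on the table above, so adjust them to match your install:

```python
from pathlib import Path

def check_models(comfyui_root: str) -> list[str]:
    """Return the expected model files that are missing under a ComfyUI root."""
    root = Path(comfyui_root)
    expected = [
        root / "models" / "checkpoints" / "svd_xt_1_1.safetensors",  # assumed filename
        root / "models" / "sonic" / "unet.pth",                      # assumed location
    ]
    return [str(p) for p in expected if not p.exists()]
```

If the returned list is non-empty, download the missing files from the links in the workflow before queuing a generation.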

3. Key Nodes

| Node | Purpose | Installation |
| --- | --- | --- |
| SONICTLoader | Load Sonic adapter | Install ComfyUI_Sonic |
| SONIC_PreData | Fuse audio/image data | Install ComfyUI_Sonic |
| VHS_VideoCombine | Video compositing | VideoHelperSuite plugin |
| LoadAudio | Audio file loader | Built-in |

4. Pipeline Structure

  1. Input Group

    • Image: LoadImage (e.g. image.png)

    • Audio: LoadAudio (e.g. April28.MP3)

  2. Processing Group

    • Data fusion: SONIC_PreData encodes temporal data

    • Config: Image size 768x768, audio weight=0.5

  3. Generation Group

    • SONICSampler: 25 steps, 25fps

    • Output: 8fps H.264 video (CRF=19)
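
The settings above can be summarized in one place. This is an illustrative sketch, not ComfyUI node parameters: the dict keys are my own names, and `plan_generation` simply shows the arithmetic relating audio length to frame counts (the sampler generates at 25 fps, the combiner writes at 8 fps):

```python
# Illustrative configuration; key names are assumptions, not node inputs.
PIPELINE_CONFIG = {
    "image_size": (768, 768),  # SONIC_PreData resize target
    "audio_weight": 0.5,       # how strongly audio drives the animation
    "steps": 25,               # SONICSampler diffusion steps
    "sample_fps": 25,          # frame rate the sampler generates at
    "output_fps": 8,           # frame rate VHS_VideoCombine writes
    "crf": 19,                 # H.264 quality (lower = higher quality)
}

def plan_generation(audio_seconds: float, cfg: dict = PIPELINE_CONFIG) -> dict:
    """Estimate frame counts for an audio clip of the given length."""
    return {
        "sampled_frames": round(audio_seconds * cfg["sample_fps"]),
        "output_frames": round(audio_seconds * cfg["output_fps"]),
    }
```

For a 10-second clip this plans 250 sampled frames but only 80 written frames, which is why dropping the output FPS to 8 (see the notes below on performance) keeps file sizes and encode times modest.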

5. I/O Specifications

  • Input Requirements:

    • Image: 1139x1151 PNG recommended

    • Audio: MP3/WAV with clear speech

  • Output:

    • Video: ComfyUI/output/AnimateDiff_xxxx-audio.mp4
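
To verify an input image matches the recommended dimensions before loading it, you can read a PNG's size directly from its IHDR chunk with the standard library. This is a generic PNG-parsing sketch, not part of the workflow itself:

```python
import struct

def png_size(path: str) -> tuple[int, int]:
    """Read (width, height) from a PNG file's IHDR chunk."""
    with open(path, "rb") as f:
        header = f.read(24)  # 8-byte signature + length + "IHDR" + width + height
    if header[:8] != b"\x89PNG\r\n\x1a\n":
        raise ValueError("not a PNG file")
    width, height = struct.unpack(">II", header[16:24])
    return width, height
```

A mismatch is not fatal (SONIC_PreData resizes to 768x768), but starting from a portrait close to the recommended aspect ratio avoids distortion.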

6. Critical Notes

  1. Model Setup:

    • Download Sonic model from provided cloud links

    • Verify svd_xt_1_1 model path

  2. Performance:

    • VRAM ≥16GB required

    • Reduce FPS to 8 for lower resource usage

  3. Troubleshooting:

    • Lips out of sync: check the audio sample rate (44.1 kHz expected)

    • Choppy video: Adjust CRF (18-23)
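
For the sample-rate check, WAV files can be inspected with the standard library's `wave` module (MP3 headers need a third-party parser, so this sketch covers WAV only):

```python
import wave

def check_wav_rate(path: str, expected: int = 44100) -> bool:
    """Return True if the WAV file's sample rate matches the expected rate."""
    with wave.open(path, "rb") as w:
        rate = w.getframerate()
    if rate != expected:
        print(f"lip-sync risk: {path} is {rate} Hz; resample to {expected} Hz")
        return False
    return True
```

If the rate is wrong, resample the audio (e.g. with ffmpeg's `-ar 44100`) before loading it with LoadAudio.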
