workflow

This workflow leverages FLUX ControlNet V3.0 for multi-condition controlled generation, combining HED soft-edge, Depth, and Canny edge preprocessing to precisely guide image synthesis. Outputs adhere to input structure and text prompts.

Key Features:

Multi-ControlNet Integration (HED + Depth + Canny).
Depth Anything V2 for depth map generation.
WD14 Tagger for auto-prompt generation.
Flux Sampler with FP8 optimization for efficiency.

Output:

High-res images (default 1024x1024), stylized by prompts and ControlNet conditions.

2. Core Models

Model Name	Function
Stable Diffusion XL	Base image model (Flux.1-Dev fp16 variant).
Depth Anything V2	Generates depth maps from input images.
ControlNet V3	Provides HED, Depth, and Canny controls.
WD14 Tagger	Auto-generates tags from input images.

3. Key Nodes & Installation

Node Name	Function	Installation	Dependencies
DownloadAndLoadDepthAnythingV2Model	Loads Depth Anything V2 model.	Manual download to `/models/depth_anything/`.	Model Link
ApplyFluxControlNet	Applies Flux-optimized ControlNet.	Install `Flux Nodes` via ComfyUI Manager.	Requires ControlNet V3 models (e.g., `XLabs-flux-hed-controlnet_v3`).
XlabsSampler	Flux sampler with FP8 support.	Part of `Flux Nodes`.	FP8-compatible GPU (e.g., RTX 40 series).
WD14Tagger\|pysssss	Auto-tagging for prompts.	Install `WD14 Tagger` via ComfyUI Manager.	Requires `wd-v1-4-moat-tagger-v2` model.

4. Workflow Groups

HED Group (Blue)
- Input: Source image (resized via ImageResize+).
- Preprocess: HEDPreprocessor extracts soft edges.
- Control Weight: 0.8 (set in ApplyFluxControlNet).
Depth Group (Orange)
- Input: Same image.
- Preprocess: DepthAnything_V2 generates depth map.
- Control Weight: 0.7.
Canny Group (Purple)
- Input: Same image.
- Preprocess: CannyEdgePreprocessor (thresholds 100/200).
- Control Weight: 0.6.
Generation Group
- Prompts: Processed by CLIPTextEncode (e.g., "Makoto Shinkai style").
- Sampling: XlabsSampler merges multi-ControlNet conditions.

5. Inputs & Outputs

Input Parameters:

Image: Loaded via LoadImage (e.g., sample Redbook image).
Prompts: Manual input or auto-generated by WD14Tagger.
Resolution: Default 1024x1024 (set in EmptyLatentImage).

Output:

Final images saved in /ComfyUI/output/ with metadata.

6. Notes

Hardware:
- RTX 40 series recommended (FP8 support), VRAM ≥12GB.
- Depth Anything V2 is VRAM-intensive; may crash at high resolutions.
Model Setup:
- Download ControlNet V3 models to /models/controlnet/.
- Missing models trigger download prompts.
Tips:
- Total ControlNet weights should ideally ≤2.0 (e.g., HED 0.8 + Depth 0.7 + Canny 0.6).
- FP8 mode may introduce noise; adjust denoise in ModelSamplingFlux.

Create 6 Emotions from 1 Portrait: A Comprehensive Guide to ExpressionEditor

Unlock Stunning 360° Panoramas with AI: A Step-by-Step Guide

Recommend

Boost Your Image Generation Game with Stable Diffusion, JOY Caption Two, and LORA

Unlock AI-powered image generation with Stable Diffusion, JOY Caption Two, and FLUX. Discover how to reverse-engineer prompts from reference images and create stunning new visuals. Learn more and start creating now!

Transforming Line Art into 3D-Style Renders: A Deep Dive into ControlNet and Dual CLIP Encoding

Unlock Stunning Art: Transform line art into vibrant illustrations & 3D-style renders with ControlNet-guided generation & super-resolution. Learn how to use this AI workflow for breathtaking results.

Create Adorable Cat Videos with AI: A Low-VRAM Workflow

Generate cute cat videos from static images with this workflow! Learn how to create high-quality MP4s using low VRAM and fast local processing. Discover the power of image-to-video, super-resolution, and frame interpolation.

Unlock Liquid Magic: Advanced I2V Workflow for Stunning Visual Effects

Generate Stunning Liquid Collision Videos with I2V Workflow! Discover how to combine WanVideo's custom models with GIMM-VFI for breathtaking effects. Learn more and start creating now!

Unlock Time-Lapse Aging Videos with Wan2.1 I2V Model: A Step-by-Step Guide

Generate stunning time-lapse aging videos from portraits with Wan2.1 I2V model & Aging LoRA. Learn how to create realistic facial aging effects with multimodal control.

Summary

Unlock advanced image synthesis with FLUX ControlNet V3.0! Combine HED, Depth, and Canny edge preprocessing for precise control. Discover key features, core models, and installation details. Get started now!

Chapter

workflow:

CustomNodes:

DownloadAndLoadDepthAnythingV2...