workflow

Model Name	Function
Stable Diffusion XL	Base text-to-image model for high-quality generation
ControlNet (Depth)	Controls pose and composition via depth maps
Florence-2	Generates image captions to refine prompts
Flux Guidance	Enhances character consistency across views

2. Key Components & Installation

Required Nodes:

ControlNetApplySD3: Applies ControlNet constraints (requires FLUX.1-dev-Controlnet-Depth model)
FluxGuidance: Ensures character consistency (install Flux plugin)
TTP_Tile: Processes large images in tiles (install Tiled Diffusion via ComfyUI Manager)
Florence2Run: Generates image captions (download HuggingFace model)

Dependencies:

Lora: 苏-FLUX小红书极致真实_v1.0 (place in models/loras)
ControlNet Model: FLUX.1-dev-Controlnet-Depth-InstantX.safetensors

3. Workflow Structure

Group 1: Input Control

Nodes: LoadImage (pose map), easy positive (prompts)
Inputs: Pose map (e.g., skeleton image), character description prompts
Outputs: Encoded conditioning vectors

Group 2: Image Generation

Nodes: KSampler + ControlNetApplySD3 + FluxGuidance
Logic: Generates latent images with pose consistency via Flux

Group 3: Tiled Processing (TTP Tile)

Nodes: TTP_Image_Tile_Batch → SamplerCustomAdvanced → TTP_Image_Assy
Function: Splits high-res images into tiles for VRAM efficiency

Group 4: Post-Processing

Nodes: ImageCrop+ (view cropping), Image Overlay (multi-view merge), SaveImage
Output: Final PNG with 4 aligned model views

4. Inputs & Outputs

Input Parameters:

Required: Pose map (e.g., POSE2.png), positive prompts
Optional: Seed value, resolution (default 1152x896), ControlNet strength (0.6)

Output:

A single PNG with 4 model views (left/front/back/right), saved to ComfyUI/output

5. Notes

VRAM: ≥12GB GPU recommended (reduce tile size for lower usage)
Troubleshooting:
- Missing ControlNet model → Download to models/controlnet
- Flux plugin not found → Install via ComfyUI Manager
Optimization:
- Lower TTP tile size (e.g., 512x512) for better performance
- Use easy cleanGpuUsed to free VRAM manually

Anime-ify Your Videos: A Step-by-Step Guide to Studio Ghibli-Style Animations

Unlock FLUX: The Ultimate Multimodal Workflow for Text-to-Image and Image Captioning

Recommend

Discover the Ultimate Eastern Art Creation Workflow with AI

Unlock Eastern Pixar-style art creation with this workflow! Generate high-quality images with Flux.1 and Lora models. Download now and enhance your digital illustrations!

Boost Your Image Generation Game with Stable Diffusion, JOY Caption Two, and LORA

Unlock AI-powered image generation with Stable Diffusion, JOY Caption Two, and FLUX. Discover how to reverse-engineer prompts from reference images and create stunning new visuals. Learn more and start creating now!

"Revolutionizing 3D Generation: ComfyUI Now Supports Hunyuan3D 2.0!"

Unlock 3D Generation with Hunyuan3D 2.0! Discover how ComfyUI's native support for Tencent's open-source model empowers high-fidelity 3D creation - try it now!

Transforming Line Art into 3D-Style Renders: A Deep Dive into ControlNet and Dual CLIP Encoding

Unlock Stunning Art: Transform line art into vibrant illustrations & 3D-style renders with ControlNet-guided generation & super-resolution. Learn how to use this AI workflow for breathtaking results.

Unveiling the Past: Transforming Ancient Paintings into Hyper-Realistic Photos

Transform Ancient Portraits into Hyper-Realistic Photos with AI. Discover how to use SDXL models & multi-ControlNet guidance to bring historical figures to life.

Summary

Unlock AI model image generation with this workflow! Learn how to create multi-view consistent images using ControlNet, Flux, and TTP Tile technology. Discover the power of Stable Diffusion XL, ControlNet, and Florence-2 models. Get started now and enhance your AI image generation skills!

Chapter

workflow:

CustomNodes:

ControlNetApplySD3 CLIPTextEnc...