workflow

Model	Function	Source
F.1_Depth-fp16_1.0	Base UNet for depth-aware generation	Custom
flux1-redux-dev	Style transfer model	Requires installation
HyperL-F.1-加速器-PAseer_加速FLUX_AcceleratorV3.1 (LoRA)	Generation accelerator	Requires installation
depth_anything_v2_vitl.pth	Depth estimation	ControlNet Aux
unsloth/Meta-Llama-3.1-8B-Instruct-bnb-4bit	Prompt generation	Joy_caption_two

3. Key Components

Node	Function	Installation
DepthAnythingV2Preprocessor	Depth map generation	ComfyUI-ControlNet-Aux
StyleModelLoader	Loads style transfer model	Built-in
Joy_caption_two	AI-powered prompt generation	Custom/GitHub
InstructPixToPixConditioning	Image-to-image conditioning	Built-in
FluxGuidance	Enhanced guidance scaling	Built-in
DF_Image_scale_to_side	Smart image scaling	Derfuu's Modded Nodes

4. Workflow Structure

Input Section:
- "照片——放这里" ("Photo: place here"): Source image input (658×1170)
- "风格参考图——放这里" ("Style reference image: place here"): Style reference image input (452×600)
Prompt Generation:
- Joy_caption_two uses Meta-Llama-3.1-8B-Instruct (4-bit) to analyze the images and generate descriptive prompts
Depth Control:
- Creates depth maps for controlled generation
- Processes at 1216px resolution
Style Transfer:
- Applies style model with CLIP vision encoding
- Uses flux1-redux-dev style model
Generation:
- 8 sampling steps with Euler method
- 1216px output resolution
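The scaling step can be illustrated with a small sketch. It is not the code of DF_Image_scale_to_side; it only shows the arithmetic of resizing an image so that one side reaches a target length while keeping the aspect ratio. The function name `scale_to_side`, its parameters, and the assumption that 1216px refers to the longest side are all illustrative.

```python
def scale_to_side(width: int, height: int, side_length: int,
                  side: str = "longest") -> tuple[int, int]:
    """Return (new_width, new_height) so that the chosen side equals side_length.

    Hypothetical helper: it mimics the idea of scaling to a side, not the
    actual node implementation.
    """
    if side == "longest":
        reference = max(width, height)
    elif side == "shortest":
        reference = min(width, height)
    elif side == "width":
        reference = width
    elif side == "height":
        reference = height
    else:
        raise ValueError(f"unknown side: {side!r}")

    # One scale factor for both axes keeps the aspect ratio.
    factor = side_length / reference
    return round(width * factor), round(height * factor)


# Source image from the Input Section, assuming 1216px is the longest side.
print(scale_to_side(658, 1170, 1216))  # (684, 1216)
# Style reference image scaled the same way.
print(scale_to_side(452, 600, 1216))   # (916, 1216)
```

In practice the resulting dimensions may additionally be snapped to a multiple required by the model; that step is left out here because the source does not describe it.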

5. Input/Output

Inputs:
- Source image (PNG/JPG)
- Style reference image (optional)
- Prompt (can be AI-generated or manual)

Outputs:
- Stylized image with depth control
- Intermediate depth maps
- Generated prompts

6. Technical Notes

- Requires significant VRAM (12 GB or more recommended)
- Some models are loaded at fp8 precision
- Depth processing at 1216px may be memory-intensive
- The style model is applied with multiply blending
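To see why precision matters for the VRAM recommendation, here is a rough back-of-the-envelope sketch of weight memory alone. The parameter counts are assumptions, not values from this workflow: about 12 billion for a FLUX.1-class UNet and about 8 billion for the Llama 3.1 8B captioning model. Activations, the VAE, text encoders, and framework overhead are not included, so real usage will be higher.

```python
def weight_gib(num_params: float, bits_per_param: int) -> float:
    """Approximate memory for model weights only, in GiB."""
    return num_params * bits_per_param / 8 / 2**30


# Assumed parameter counts (not stated in the workflow itself).
FLUX_PARAMS = 12e9   # FLUX.1-class diffusion model
LLAMA_PARAMS = 8e9   # Llama 3.1 8B used for prompt generation

for label, params, bits in [
    ("FLUX-class model, fp16", FLUX_PARAMS, 16),
    ("FLUX-class model, fp8", FLUX_PARAMS, 8),
    ("Llama 3.1 8B, 4-bit", LLAMA_PARAMS, 4),
]:
    print(f"{label}: {weight_gib(params, bits):.1f} GiB")
```

Under these assumptions, halving the bits per weight halves the weight footprint, which is the main reason lower-precision variants are attractive on cards near the 12 GB mark.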

7. Installation Requirements

Required custom nodes:
- ComfyUI-ControlNet-Aux (for depth processor)
- Derfuu's Modded Nodes (for smart scaling)
- Joy_caption_two (for prompt generation)
Model downloads needed:
- depth_anything_v2_vitl.pth
- flux1-redux-dev style model
- HyperL-F.1 LoRA
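Before loading the workflow, it can help to check that the required nodes and models are actually on disk. The sketch below is one way to do that. The ComfyUI root path, the folder names under `custom_nodes/`, and the model locations and file extensions in `EXPECTED` are assumptions for illustration; adjust them to match your own installation.

```python
from pathlib import Path


def find_missing(root: Path, relative_paths: list[str]) -> list[str]:
    """Return the entries of relative_paths that do not exist under root."""
    return [p for p in relative_paths if not (root / p).exists()]


# Hypothetical layout: every path below is an assumption, not taken
# from the workflow. Replace with the paths used by your installation.
EXPECTED = [
    "custom_nodes/comfyui_controlnet_aux",
    "custom_nodes/Derfuu_ComfyUI_ModdedNodes",
    "custom_nodes/ComfyUI_SLK_joy_caption_two",
    "models/style_models/flux1-redux-dev.safetensors",
]

comfy_root = Path("ComfyUI")  # assumed install location
for path in find_missing(comfy_root, EXPECTED):
    print("missing:", path)
```

An empty output means every listed path was found; any printed line points to a node pack or model that still needs to be installed.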


Summary

This workflow combines depth-controlled generation, style transfer from a reference image, and AI-generated prompts. It can be used to enhance portraits and apply artistic styles while producing high-quality output.
