1. Workflow Overview

Purpose: Generate stylized images from input photos (e.g., model poses), using ControlNet to preserve structure and LoRAs to apply style (e.g., ethnic costumes, autumn forest themes).
Core Models:
- 基础算法_F.1 ("Base Algorithm F.1"): Base text-to-image model. The name and the paired FLUX.1-dev ControlNet suggest a FLUX.1 variant rather than SDXL.
- FLUX.1-dev-ControlNet-Union-Pro-InstantX: Union ControlNet that supports several control types in one model; used here for pose/structure control.
- Meta-Llama-3.1-8B-bnb-4bit: 4-bit quantized language model used by Joy_caption to caption the reference image (prompt reverse engineering).

2. Key Nodes

Node Name	Function	Installation	Dependencies
`ControlNetLoader`	Loads ControlNet model.	Manually place in `ComfyUI/models/controlnet`.	`FLUX.1-dev-ControlNet-Union-Pro-InstantX.safetensors`
`LoraLoader`	Applies style LoRAs (ethnic/autumn).	Place files in `ComfyUI/models/loras`.	`少数民族服饰_V1.0.safetensors`
`Joy_caption`	Reverse-engineers prompts from the reference image via Llama-3.1.	Download `unsloth/Meta-Llama-3.1-8B-bnb-4bit` from Hugging Face.	Requires 4-bit quantization libraries (the `bnb` suffix refers to bitsandbytes).
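
Since missing files are the most common failure, it can help to check the model folders before loading the workflow. The following is a minimal sketch that uses only the folder and file names from the table above; the `ComfyUI` root path and the helper name `find_missing_models` are assumptions for illustration and are not part of ComfyUI itself.

```python
from pathlib import Path

# Files named in the table above, keyed by their ComfyUI subfolder.
REQUIRED_MODELS = {
    "models/controlnet": ["FLUX.1-dev-ControlNet-Union-Pro-InstantX.safetensors"],
    "models/loras": ["少数民族服饰_V1.0.safetensors"],
}


def find_missing_models(comfy_root, required=REQUIRED_MODELS):
    """Return the relative paths of required files that are not on disk."""
    root = Path(comfy_root)
    missing = []
    for subdir, names in required.items():
        for name in names:
            if not (root / subdir / name).is_file():
                missing.append(f"{subdir}/{name}")
    return missing


if __name__ == "__main__":
    # "ComfyUI" is an assumed install location; adjust it to your setup.
    for path in find_missing_models("ComfyUI"):
        print("missing:", path)
```

Any path printed here should be downloaded and placed in the listed folder before the workflow is queued.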

3. Workflow Groups

Reference Image Group
- Input: User-uploaded photo (e.g., lQDPKGyzHiAGKAfNB9DNBQOwJksZaqj6fsIH2j_m_4e8AA_1283_2000.jpg).
- Process: Generates a depth map with DepthAnythingV2, which is passed to ControlNet as the control image.
LoRA Group
- Loads two LoRAs: 少数民族服饰_V1.0 ("ethnic minority costume", weight=0.2) and 秋日森林_秋天女孩_V1.0 ("autumn forest, autumn girl", weight=0.7).
Generation Group
- Output: 1280x2000 image after latent upscaling and VAE decoding.
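
The LoRA Group above stacks two LoRAs, which in ComfyUI means chaining `LoraLoader` nodes so each one receives the model and CLIP outputs of the previous one. Below is a rough sketch of that chain in ComfyUI's API-format JSON, built as a Python dict. The node ids, the upstream loader ids `"1"` and `"2"`, the `.safetensors` extension on the second LoRA, and the choice to use the same weight for both `strength_model` and `strength_clip` are all assumptions; the source only gives a single weight per LoRA.

```python
def chain_loras(model_ref, clip_ref, loras, first_id=100):
    """Build chained LoraLoader nodes in ComfyUI API-format JSON.

    model_ref / clip_ref are [node_id, output_index] links to upstream loaders.
    Returns (nodes, model_ref, clip_ref); the refs point at the last LoRA.
    """
    nodes = {}
    for offset, (name, weight) in enumerate(loras):
        node_id = str(first_id + offset)
        nodes[node_id] = {
            "class_type": "LoraLoader",
            "inputs": {
                "lora_name": name,
                "strength_model": weight,  # assumed equal to the stated weight
                "strength_clip": weight,   # assumed equal to the stated weight
                "model": model_ref,
                "clip": clip_ref,
            },
        }
        # The next LoRA consumes this node's MODEL (0) and CLIP (1) outputs.
        model_ref, clip_ref = [node_id, 0], [node_id, 1]
    return nodes, model_ref, clip_ref


# Hypothetical upstream ids: "1" = UNETLoader, "2" = DualCLIPLoader.
lora_nodes, model_out, clip_out = chain_loras(
    ["1", 0],
    ["2", 0],
    [("少数民族服饰_V1.0.safetensors", 0.2), ("秋日森林_秋天女孩_V1.0.safetensors", 0.7)],
)
```

The returned `model_out` and `clip_out` links would then feed the sampler and text-encoding nodes in place of the raw loader outputs.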

4. Inputs & Outputs

Inputs:
- Reference image (required).
- Resolution: Default 768x1024 (adjustable via EmptyLatentImage).
- Negative prompt: "Imperfect, non-standard, poor quality".
Output: Stylized model image (e.g., in ethnic costume).
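
When changing the resolution, it is worth checking that the pixel size maps cleanly onto the latent grid. The sketch below assumes the usual 8x spatial downscaling between pixels and latents, which this page does not state explicitly; `latent_size` is a hypothetical helper, not a ComfyUI function.

```python
def latent_size(width, height, downscale=8):
    """Return the (width, height) of the latent grid for a pixel resolution.

    Assumes the VAE downsamples each spatial dimension by `downscale`.
    """
    if width % downscale or height % downscale:
        raise ValueError(f"{width}x{height} is not a multiple of {downscale}")
    return width // downscale, height // downscale


print(latent_size(768, 1024))   # default input resolution -> (96, 128)
print(latent_size(1280, 2000))  # final output resolution -> (160, 250)
```

Resolutions that are not multiples of the downscale factor raise an error here, so they can be corrected before the value is entered in EmptyLatentImage.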

5. Tips & Warnings

⚠️ Errors:
- Missing ControlNet/LoRA files trigger "Missing model" errors.
- The Llama-3.1 captioning model requires ≥8 GB VRAM; disable Joy_caption on low-end devices.
✅ Optimization:
- Load the base model with fp8_e4m3fn weight precision (e.g., the weight_dtype option of UNETLoader) to save VRAM.
- Adjust ControlNet weight (default: 0.75) in ControlNetApplyAdvanced.
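
If the workflow is exported in ComfyUI's API format, the two optimization settings above can be applied programmatically. This is a minimal sketch, assuming the export is a dict of nodes with `class_type` and `inputs` keys; `apply_overrides` and the sample node ids and file name are hypothetical.

```python
import copy


def apply_overrides(workflow, overrides):
    """Return a copy of an API-format workflow with inputs overridden per class_type."""
    patched = copy.deepcopy(workflow)
    for node in patched.values():
        for key, value in overrides.get(node.get("class_type"), {}).items():
            node.setdefault("inputs", {})[key] = value
    return patched


# Settings taken from the tips above.
OVERRIDES = {
    "UNETLoader": {"weight_dtype": "fp8_e4m3fn"},
    "ControlNetApplyAdvanced": {"strength": 0.75},
}

# Tiny stand-in for an exported workflow; ids and unet_name are placeholders.
sample = {
    "1": {"class_type": "UNETLoader",
          "inputs": {"unet_name": "example.safetensors", "weight_dtype": "default"}},
    "2": {"class_type": "ControlNetApplyAdvanced", "inputs": {"strength": 1.0}},
}
patched = apply_overrides(sample, OVERRIDES)
```

The function leaves the original dict untouched, so the unpatched export can still be queued for comparison.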

Summary

Generate stylized images from photos with ControlNet and LoRAs: preserve structure, apply ethnic or autumn styles, and more.

CustomNodes:

DualCLIPLoader UNETLoader Int ...