Discover the Magic of AI Art Generation: A Step-by-Step Workflow Guide
1. Workflow Overview

Purpose: Automatically reverse-engineer prompts from input images and redraw them with stylized LoRAs (e.g., anime style).
Key Features:
Captioning: Uses
Florence-2-base-PromptGen-v2.0to generate tags.Redraw: Applies
Flux洗图神器.safetensorsLoRA andsvdq-int4-flux.1-devbase model for image-to-image generation.Custom Control: Supports manual trigger words (e.g.,
kk3d ip) fused with auto-generated tags.
2. Key Nodes
Node Name | Function | Installation | Dependencies |
|---|---|---|---|
| Generates image tags via Florence-2. | Install |
|
| Loads quantized Flux-DiT model. | Requires |
|
| Enhances conditioning strength. | Built-in (requires Flux models). | None |
3. Workflow Groups
Captioning Group
Input: User-uploaded image (e.g.,
ComfyUI_temp_btare_00003_.png).Output: Tags like
1girl, solo, long hair....
Prompt Fusion Group
Merges auto-tags with manual triggers via
LayerUtility: TextJoin.
Generation Group
Models:
svdq-int4-flux.1-dev+Flux洗图神器LoRA (weight=0.8).Output: Resolution matches input (default 512x512).
4. Inputs & Outputs
Inputs:
Image (required).
Optional trigger words (e.g.,
kk3d ip).Denoising strength: Default 0.7 (via
BasicScheduler).
Outputs:
Generated image (saved to
ComfyUI/output).Caption text (displayed in
ShowTextnode).
5. Tips & Warnings
⚠️ Compatibility:
Requires
ComfyUI-nunchakuandComfyUI-Florence2extensions.Florence-2 model must be manually downloaded.
✅ Optimization:
Use
svdq-int4for low-VRAM GPUs (8GB).Adjust
FluxGuidancestrength (default: 3.5) for style intensity.