Unlock Creative Image Fusion: A Step-by-Step Workflow Guide
1. Workflow Overview

This workflow is designed for multi-image fusion and semantic redrawing, leveraging FluxKontextMaxImageNode to merge input images based on text prompts. Key applications include:
Person-scene composition (e.g., blending portraits with sculptures/architecture)
Creative collages (e.g., KTV scenes, wedding photo background replacement)
Image upscaling & detail enhancement
2. Core Models
Flux Dev Model: Base generation model with integrated
4-step Lorafor fast inference.4x_NMKD-Siax_200k: Upscaling model for resolution enhancement.
Bilingual CLIP Encoder: Supports EN/CN prompts (loaded via
DualCLIPLoader).
3. Key Nodes
Node Name | Function | Installation | Dependencies |
|---|---|---|---|
| Core image fusion | Requires |
|
| Horizontal/vertical image merge | Built-in | None |
| Batch image input | Install via ComfyUI Manager | None |
| CN↔EN prompt translation | Manual install (GitHub) | None |
| Tiled processing (upscale/repair) | Requires | Upscale models |
4. Workflow Structure
Group 1: Dual-Person Pose
Input: Girl portrait + David sculpture
Process:
ImageConcanatemerge →FluxKontextgenerates "embracing with church background"
Group 2: Person-Scene Merge
Input: Bride + Castle image
Process: Semantic prompt-guided fusion
Group 3: Upscale & Repair
Pipeline:
ImageUpscaleWithModel→TTP_Image_Tiletiling → Reassembly
5. Inputs & Outputs
Input Parameters:
Images: Multiple formats (
.png,.jpeg)Prompts: Bilingual (auto-translated)
Seed: Fixed or randomized
Output: High-res PNG (default 3:4 or 16:9 ratio)
6. Notes
VRAM: ≥12GB recommended; use
--medvramfor large outputs.Plugin Dependencies:
FluxKontext: GitHub LinkTiled Diffusion: Install via ComfyUI Manager
Troubleshooting:
Chinese prompts must be translated via
LibLibTranslateResize mismatched images with
ImageResizeKJnode