1. Workflow Overview

This workflow performs image outpainting with Flux Diffusion and Janus image understanding, extending an image beyond its original borders. It supports expansion in any direction (top, bottom, left, right) while keeping the new regions consistent with the original style.

2. Core Models

Model Name	Description
`F.1-Fill-fp16_Inpaint&Outpaint`	UNet optimized for inpainting/outpainting at high resolution.
`deepseek-ai/Janus-Pro-1B`	Multimodal model for image captioning (auto-prompt generation).
`ae.sft`	VAE (autoencoder) that decodes latents into the final image.
`Flux Guidance`	Not a model file but a conditioning node (FluxGuidance) that sets the guidance strength for natural, coherent outpainting.

3. Key Nodes & Installation

JanusModelLoader
- Function: Loads the Janus-Pro model for image analysis.
- Install: Get Janus-Nodes through ComfyUI Manager, or clone its GitHub repo.
ImagePadForOutpaint
- Function: Defines expansion area (in pixels) and generates mask.
- Install: Built-in node (no installation needed).
FluxGuidance
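- Function: Sets the guidance strength passed to the Flux model (default=30) to reduce artifacts in the generated regions.
- Install: Included in recent ComfyUI builds; if the node is missing, update ComfyUI or search for it in ComfyUI Manager.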
DifferentialDiffusion
- Function: Patches the model so the outpaint mask acts as a per-pixel change strength, which softens the seam between the original and generated areas.
- Dependency: Download F.1-Fill-fp16 and place it in models/unet.
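
The padding-and-mask step that ImagePadForOutpaint performs can be pictured with a small sketch. This is not the node's actual implementation: the function name `pad_for_outpaint` is hypothetical, the mask is a plain nested list, and feathering of the mask edge is left out. It only shows how the expansion pixels determine the new canvas size and which pixels are marked for generation.

```python
def pad_for_outpaint(width, height, left=0, top=0, right=0, bottom=0):
    """Return the padded canvas size and a binary mask (1 = area to generate).

    Hypothetical helper for illustration; not the ComfyUI node's code.
    """
    if min(width, height) <= 0 or min(left, top, right, bottom) < 0:
        raise ValueError("sizes must be positive and padding non-negative")

    new_w = width + left + right
    new_h = height + top + bottom

    # Pixels inside the original image rectangle are kept (0);
    # everything in the added border is to be generated (1).
    mask = [
        [0 if (left <= x < left + width and top <= y < top + height) else 1
         for x in range(new_w)]
        for y in range(new_h)
    ]
    return (new_w, new_h), mask


# Tiny example: an 8x6 image expanded by 2 pixels on the left and right.
size, mask = pad_for_outpaint(8, 6, left=2, right=2)
print(size)     # (12, 6)
print(mask[0])  # [1, 1, 0, 0, 0, 0, 0, 0, 0, 0, 1, 1]
```

With the values used elsewhere in this workflow (left=104, right=104), the same arithmetic widens the canvas by 208 pixels and leaves the height unchanged.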

4. Workflow Structure

Group Name	Description
Upload Image	Load input image (PNG/JPG).
Max Resolution	Constrains output size (default: 1024x1024) to avoid VRAM issues.
Outpaint Area	Set expansion pixels (e.g., left=104, right=104) to generate mask.
Prompt Generation	Janus auto-generates a caption, or you can enter an English prompt manually.
Batch Control	Repeats the latent batch (default=3) so each run produces several candidates to choose from.
Flux Workspace	Core nodes (KSampler, VAE Decode) with default optimized parameters.
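
As a rough illustration of the Max Resolution step, the sketch below scales an image size down to fit within 1024x1024 while keeping its aspect ratio. The function name `constrain_size` and the rounding to a multiple of 8 are assumptions made for this example; the actual node in the workflow may round differently.

```python
def constrain_size(width, height, max_width=1024, max_height=1024, multiple=8):
    """Scale (width, height) down to fit the limits, keeping aspect ratio.

    Hypothetical helper; rounding down to `multiple` is an assumption.
    """
    if width <= 0 or height <= 0:
        raise ValueError("width and height must be positive")

    # Never upscale: cap the scale factor at 1.0.
    scale = min(max_width / width, max_height / height, 1.0)

    # Round down to the nearest multiple, but never below one multiple.
    new_w = max(multiple, int(width * scale) // multiple * multiple)
    new_h = max(multiple, int(height * scale) // multiple * multiple)
    return new_w, new_h


print(constrain_size(2048, 1536))  # (1024, 768)
print(constrain_size(800, 600))    # (800, 600), already within the limit
```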

5. Inputs & Outputs

Inputs:
- Image file (e.g., output (2).png).
- Pixel values for expansion (e.g., left=104).
- Optional text prompts (auto-generated if empty).
Output:
- Expanded image (PNG) with the outpainted regions filled in.

6. Notes

VRAM: ≥12GB GPU recommended (e.g., RTX 3060 12GB).
Tips:
- Limit expansion to ≤300 pixels per step; split larger expansions into multiple steps.
- Avoid expanding in only one direction (e.g., only downward); expanding on opposite sides keeps the composition balanced.
Troubleshooting:
- If a CUDA out-of-memory (OOM) error occurs, lower the resolution in ConstrainImage or reduce the batch size.
- If Janus fails to generate a caption, enter the prompt manually.
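
The tip about keeping each expansion at or below 300 pixels can be turned into a small planning helper. This is only a sketch: `plan_expansion` is a hypothetical name, and splitting into near-equal steps is one reasonable choice among several, not something the workflow prescribes.

```python
def plan_expansion(total_px, max_step=300):
    """Split total_px into near-equal steps, each at most max_step pixels."""
    if total_px < 0 or max_step <= 0:
        raise ValueError("total_px must be >= 0 and max_step > 0")
    if total_px == 0:
        return []

    steps = -(-total_px // max_step)       # ceiling division
    base, extra = divmod(total_px, steps)  # spread the remainder evenly
    return [base + 1] * extra + [base] * (steps - extra)


print(plan_expansion(700))  # [234, 233, 233]
print(plan_expansion(104))  # [104]
```

Each value in the returned list would be used as the expansion for one pass, feeding the output of one pass back in as the input of the next.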
