1. Workflow Overview

This workflow is designed for image padding and style enhancement, integrating image captioning, LoRA style control, and text-to-image generation. Key uses:

Style Transfer: Generate stylized images from a reference input (e.g., abstract art).
Detail Enhancement: Apply LoRAs (e.g., Anime-Chinese Beauty FLUX_1.0) for specific styles.
Multilingual: Supports mixed Chinese/English prompts.

Core Models:

F.1-fp8 11G: Base model (VRAM-optimized).
Meta-Llama-3.1-8B: Image captioning.
CatPaw_Anime-ChineseBeauty_FLUX_1.0: Style LoRA.

2. Key Components

Critical Nodes:

Joy_caption_two:
- Uses Meta-Llama-3.1-8B to generate image descriptions (e.g., for abstract line art).
- Install via ComfyUI Manager; the captioning model is unsloth/Meta-Llama-3.1-8B-Instruct.
LoraLoader:
- Loads style LoRAs (e.g., Anime-Chinese Beauty) with adjustable strength (default: 0.8).
CLIPTextEncodeFlux:
- Merges user prompts (e.g., miluo_cjsj, cloth) with captions for conditioning.
KSampler:
- Settings:
  - Steps: 20
  - Sampler: euler
  - Seed: Random (can be fixed, e.g., to 6368394736575, for reproducible results).
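
The KSampler settings above can be written down as a node entry in ComfyUI's API (JSON) workflow format. This is a minimal sketch, not the workflow's actual export: the node ids in the usage example are hypothetical, and the cfg, scheduler, and denoise values are assumptions, since this page lists only steps, sampler, and seed.

```python
import json
import random


def ksampler_node(model_ref, positive_ref, negative_ref, latent_ref, seed=None):
    """Build a KSampler entry for a ComfyUI API-format workflow.

    Each *_ref is a [node_id, output_index] pair pointing at the node
    that produces that input. The ids used below are hypothetical.
    """
    if seed is None:
        # "Random" seed; the upper bound here is an arbitrary choice.
        seed = random.randint(0, 2**32 - 1)
    return {
        "class_type": "KSampler",
        "inputs": {
            "seed": seed,
            "steps": 20,               # from this page
            "sampler_name": "euler",   # from this page
            "cfg": 1.0,                # assumption, not stated on this page
            "scheduler": "simple",     # assumption, not stated on this page
            "denoise": 1.0,            # assumption, not stated on this page
            "model": model_ref,
            "positive": positive_ref,
            "negative": negative_ref,
            "latent_image": latent_ref,
        },
    }


if __name__ == "__main__":
    # Hypothetical node ids; fixing the seed makes the run repeatable.
    node = ksampler_node(["10", 0], ["11", 0], ["12", 0], ["13", 0],
                         seed=6368394736575)
    print(json.dumps(node, indent=2))
```

Passing seed=None keeps the random behaviour, while passing the fixed value reproduces the same image for the same inputs.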

Dependencies:

Download F.1-fp8 and ae.sft VAE to ComfyUI/models.
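
A quick way to confirm the downloads landed is to scan the models folder for the expected file names. The sketch below is an illustration only: the function name is made up, the base-model file name in the usage example is a placeholder (this page does not give the exact name), and the search is recursive because the page does not say which subfolder each file belongs in.

```python
from pathlib import Path


def missing_model_files(models_root, filenames):
    """Return the names from `filenames` not found anywhere under models_root."""
    root = Path(models_root)
    if not root.is_dir():
        # Nothing can be present if the folder itself does not exist.
        return list(filenames)
    # Collect the names of all regular files below the models folder.
    present = {p.name for p in root.rglob("*") if p.is_file()}
    return [name for name in filenames if name not in present]


if __name__ == "__main__":
    # "F.1-fp8.safetensors" is a placeholder name, not taken from this page.
    wanted = ["F.1-fp8.safetensors", "ae.sft"]
    for name in missing_model_files("ComfyUI/models", wanted):
        print(f"missing: {name}")
```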

3. Workflow Structure

Input Group (Group 2):
- Load image (e.g., @rawandrendered.jpg) → Caption → Translate.
Generation Group (Group 1):
- Fuse prompts + captions → Apply LoRA → Generate image (600x800).
Output:
- Decode latent → Preview/save image.
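
The "Fuse prompts + captions" step amounts to joining the optional user keywords with the generated caption before the text is encoded. The helper below is a hypothetical illustration of that idea in plain Python; the workflow itself does this with nodes, and the comma separator and keyword-first ordering are assumptions.

```python
def fuse_prompt(keywords, caption, separator=", "):
    """Join user keywords and a generated caption into one prompt string.

    Empty or whitespace-only entries are dropped, so the keywords can be
    omitted entirely (the text prompt is optional).
    """
    parts = [k.strip() for k in keywords if k.strip()]
    caption = caption.strip()
    if caption:
        parts.append(caption)
    return separator.join(parts)


if __name__ == "__main__":
    caption = "Digital artwork with abstract colorful lines, deep blue background"
    print(fuse_prompt(["miluo_cjsj", "cloth"], caption))
```

Putting the keywords first keeps LoRA trigger words such as miluo_cjsj at the start of the prompt; whether ordering matters for this model is not something this page states.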

Key Parameters:

Resolution: Set via EmptyLatentImage (default: 600x800).
LoRA Strength: Adjust via ReroutePrimitive (default: 0.8).

4. Input & Output

Input Parameters:

Image: JPG/PNG (e.g., 1440x1440 abstract art).
Text Prompt: Optional keywords (e.g., miluo_cjsj, cloth).
LoRA: Select from preset styles.

Output:

Stylized image (e.g., Chinese anime style) in PreviewImage.
Example caption:
"Digital artwork with abstract colorful lines, deep blue background, reflective effects..."

5. Notes

VRAM: ≥8GB required (FP8 optimization).
Troubleshooting:
- Missing Joy_caption_two? Install comfyui_slk_joy_caption_two.
- Match image size to EmptyLatentImage (e.g., 600x800).
Style Control:
- Adjust LoRA strength (0-1) for intensity.
- Modify the guidance value (default: 3.5) in CLIPTextEncodeFlux; for Flux this is set on the text-encode node rather than as the sampler's CFG.
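
One way to follow the Troubleshooting note about matching the image size to EmptyLatentImage is to center-crop the input to the target aspect ratio before resizing. The function below is a sketch under that assumption (center-cropping is a choice made here, not something this page prescribes) and its name is hypothetical.

```python
def center_crop_box(src_w, src_h, dst_w=600, dst_h=800):
    """Return (left, top, right, bottom) for the largest centered region
    of a src_w x src_h image that has the dst_w:dst_h aspect ratio."""
    if min(src_w, src_h, dst_w, dst_h) <= 0:
        raise ValueError("all dimensions must be positive")
    # Compare aspect ratios by cross-multiplication to stay in integers.
    if src_w * dst_h > src_h * dst_w:
        # Source is wider than the target: keep full height, trim the sides.
        crop_w = src_h * dst_w // dst_h
        crop_h = src_h
    else:
        # Source is taller than (or equal to) the target: keep full width.
        crop_w = src_w
        crop_h = src_w * dst_h // dst_w
    left = (src_w - crop_w) // 2
    top = (src_h - crop_h) // 2
    return (left, top, left + crop_w, top + crop_h)


if __name__ == "__main__":
    # The 1440x1440 example input, cropped toward the 600x800 default.
    print(center_crop_box(1440, 1440))  # (180, 0, 1260, 1440)
```

The returned box uses the (left, top, right, bottom) convention, so with Pillow it could be passed to Image.crop and the result resized to 600x800.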

Summary

Transform Images with AI: Style Transfer, Detail Enhancement & Multilingual Support. Discover how to generate stylized images with LoRA and text-to-image models.

workflow:

CustomNodes:

Joy_caption_two ShowText|pysss...