Advanced model with fast text control, precision edits, and consistent visual fidelity.
canny (edge detection), depth (3D distance), or pose (human keypoints) preprocessors to tell the model exactly what to respect from your reference image.Control Scale (how much the reference image matters) and LoRA Scale (how much the style matters) for the perfect balance.Image slot. This will serve as the structural guide.- Canny: For keeping strict outlines and details.
- Depth: For architectural renders or maintaining 3D volume.
- Pose: For changing a character's outfit or background while keeping their position fixed.
LoRAs list.Control Scale (Default 0.9). Lower it if you want the model to be more creative; raise it for strict adherence.Control End (Default 0.4) to let the structure be defined early, but allow the LoRA style to take over the details in the later steps.Aspect Ratio matches your input image's shape to avoid stretching or cropping.Magic Prompt if you want the model to add more detail to your scene automatically, useful when your manual prompt is simple.Advanced model with fast text control, precision edits, and consistent visual fidelity.
Delivers refined image remastering and brand-consistent visual edits with scalable control.
Precise text rendering & multilingual edits for visual pros
Transforms images into editable RGBA layers for precise object isolation and seamless design control.
Fast, photorealistic image repair and refinements for product visuals.
Refine images with adaptive style control, LoRA merging, and high-res rendering for consistent design output.
Yes, Z Image Turbo ControlNet is distributed under the Apache 2.0 open-source license, which generally allows commercial use. However, using it on RunComfy does not override or bypass the model’s original license terms. If you plan to deploy Z Image Turbo ControlNet commercially for text-to-image generation at scale, review the official license from the model creators to ensure proper compliance.
Yes. Z Image Turbo ControlNet currently supports maximum output resolutions of up to about 1536×1536 pixels. The prompt input is limited to approximately 200–250 tokens, and users can apply up to 4 simultaneous reference conditions through the Fun ControlNet Union (Canny, HED, Depth, Pose, or MLSD). These constraints balance quality, speed, and GPU resource efficiency.
Z Image Turbo ControlNet stands out for its bilingual English and Chinese comprehension, improved prompt fidelity, and text rendering within generated images. Using a 6-billion-parameter Single-Stream DiT structure, it performs image generation in just 8 inference steps, delivering fast, high-quality text-to-image outcomes while being more VRAM-efficient than larger competitors like SDXL or Flux.
Yes, Z Image Turbo ControlNet supports the Base and Edit variants for fine-tuning and image editing tasks. Developers can adapt these for domain-specific text-to-image generation while still benefiting from the core DiT efficiency. Note that fine-tuned derivatives must also respect the original Apache 2.0 licensing conditions.
After your free trial credits (usd) are used, you’ll need to purchase additional usd to continue generating text-to-image outputs with Z Image Turbo ControlNet. Pricing is listed under the 'Generation' section of your account. You can monitor usage and costs directly within your RunComfy dashboard.
If you encounter issues using Z Image Turbo ControlNet for text-to-image generation, you can reach RunComfy’s support team via hi@runcomfy.com. They assist with API integration, usage limits, and troubleshooting performance or licensing-related questions.
RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.