RunComfy

Z Image | Ultra-Fast Photorealistic Generator

Generate ultra-clear visuals fast with unmatched real-time detail.

Wan2.2 Animate | Photo to Realistic Motion Video

Turn images into lifelike, moving characters with natural body and face motion.

Omni Kontext | Seamless Scene Integration

Perfect scene fits. Unique style. Identity stays. Kontext keeps it real.

FLUX.2 [klein] 4B & 9B | Ultra-Fast Flux Image Generator

Blazing-fast visual creation with unified editing control.

ComfyUI > Nodes > ComfyUI-QwenImageWanBridge > Qwen Spatial Token Generator

ComfyUI Node: Qwen Spatial Token Generator

Class Name

QwenSpatialTokenGenerator

Category
Qwen/Spatial

Author
fblissjr (Account age: 3903days) Extension
ComfyUI-QwenImageWanBridge Latest Updated
2025-12-15 Github Stars
0.16K

Github Ask fblissjr Current Questions Past Questions

Table of Content

Description
QwenSpatialTokenGenerator:
QwenSpatialTokenGenerator Input Parameters:
QwenSpatialTokenGenerator Output Parameters:
QwenSpatialTokenGenerator Usage Tips:
QwenSpatialTokenGenerator Common Errors and Solutions:
Related Nodes

How to Install ComfyUI-QwenImageWanBridge

Install this extension via the ComfyUI Manager by searching for ComfyUI-QwenImageWanBridge

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI-QwenImageWanBridge in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

Qwen Spatial Token Generator Description

Generate spatial tokens from image coordinates and labels for AI image editing and annotation tasks.

Qwen Spatial Token Generator:

The QwenSpatialTokenGenerator is a powerful tool designed to generate spatial tokens from image coordinates and labels, facilitating spatial editing and annotation tasks. This node is particularly beneficial for AI artists and developers who need to manipulate or analyze images based on spatial data. By converting image coordinates into various token formats, it allows for flexible integration into different workflows, whether for structured data processing or natural language applications. The generator operates without relying on templates or assumptions, ensuring a pure and adaptable approach to spatial token generation. Its main goal is to provide a seamless and efficient way to handle spatial data, enhancing the capabilities of image-based projects.

Qwen Spatial Token Generator Input Parameters:

image

This parameter represents the input image that you wish to use for spatial editing. It is crucial as it serves as the base from which spatial tokens are generated. The image should be in a compatible format, and its quality can impact the accuracy and detail of the spatial tokens produced.

prompt

The prompt is a string input that can be multiline and is auto-populated from the spatial editor. It serves as a guide or instruction set for generating spatial tokens, allowing for customized and context-specific token generation. The default value is an empty string, and it can be adjusted to fit the specific needs of your project.

output_format

This parameter determines the format in which the spatial tokens will be output. Options include structured_json, xml_tags, natural_language, and traditional_tokens. The default is structured_json, which is recommended for its structured and easily parseable nature. Each format offers different benefits, such as xml_tags for HTML-like elements, natural_language for human-readable sentences, and traditional_tokens for legacy compatibility.

debug_mode

A boolean parameter that, when enabled, provides additional debugging information during the token generation process. The default value is False. Activating debug mode can be helpful for troubleshooting and understanding the internal workings of the node, especially if unexpected results occur.

Qwen Spatial Token Generator Output Parameters:

annotated_image

This output is the image with visual annotations based on the generated spatial tokens. It provides a visual reference for the spatial data, allowing you to see how the tokens correspond to specific areas or features within the image.

prompt

The output prompt is a string that reflects the final set of instructions or descriptions generated during the tokenization process. It can be used to understand the context and details of the spatial tokens produced.

debug_info

This output provides a string containing detailed debugging information, which is especially useful if the debug mode is enabled. It includes logs and messages that can help diagnose issues or understand the processing steps taken by the node.

Qwen Spatial Token Generator Usage Tips:

To optimize performance, ensure that the input image is of high quality and in a compatible format, as this can significantly affect the accuracy of the spatial tokens generated.
Experiment with different output_format options to find the one that best suits your project's needs. For structured data processing, structured_json is recommended, while natural_language may be more suitable for applications requiring human-readable output.
Enable debug_mode if you encounter unexpected results or need to understand the internal processing of the node. This can provide valuable insights and help troubleshoot potential issues.

Qwen Spatial Token Generator Common Errors and Solutions:

CRITICAL ERROR in generate_tokens

Explanation: This error occurs when there is a critical issue during the token generation process, possibly due to incompatible input data or internal processing errors.
Solution: Check the input image and prompt for compatibility and correctness. Ensure that the image is in a supported format and that the prompt is properly structured. Enabling debug_mode can provide additional information to help identify the root cause of the error.

JSONDecodeError or ValueError

Explanation: These errors indicate that the input data could not be parsed as JSON, which may happen if the prompt is not correctly formatted or if there is an issue with the spatial token data.
Solution: Verify that the prompt and any spatial token data are correctly formatted as JSON if using structured_json output. If using other formats, ensure that the data adheres to the expected structure for those formats.

Qwen Spatial Token Generator Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI-QwenImageWanBridge

Table of Content

Description
QwenSpatialTokenGenerator:
QwenSpatialTokenGenerator Input Parameters:
QwenSpatialTokenGenerator Output Parameters:
QwenSpatialTokenGenerator Usage Tips:
QwenSpatialTokenGenerator Common Errors and Solutions:
Related Nodes

Qwen Image Edit 2509 | Multi-Image Editor

Turn 2–3 images into one seamless, edited masterpiece instantly.

Hunyuan Image to Video | Breathtaking Motion Creator

Create magnificent movies out of still images through cinematic motion and customizable effects.

HiDream E1.1 | AI Image Editing

Edit images with natural language using HiDream E1.1 model

Flux Kontext Pulid | Consistent Character Generation

Create consistent characters using FLUX Kontext with a single face reference image.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy