RunComfy

Flux Klein Face Swap | Realistic AI Face Editor

Swap faces perfectly. Natural, lifelike, and fast AI-powered editing.

FLUX.2 [klein] 4B & 9B | Ultra-Fast Flux Image Generator

Blazing-fast visual creation with unified editing control.

MatAnyone Video Matting | Single Mask Removal

Remove video backgrounds with one mask frame for perfect subject isolation.

FLUX ControlNet Depth-V3 & Canny-V3

Achieve better control with FLUX-ControlNet-Depth & FLUX-ControlNet-Canny for FLUX.1 [dev].

ComfyUI > Nodes > ComfyUI-FL-Qwen3TTS > FL Qwen3 TTS Transcribe

ComfyUI Node: FL Qwen3 TTS Transcribe

Class Name

FL_Qwen3TTS_Transcribe

Category
FL/Qwen3TTS

Author
filliptm (Account age: 2372days) Extension
ComfyUI-FL-Qwen3TTS Latest Updated
2026-03-18 Github Stars
0.12K

Github Ask filliptm Current Questions Past Questions

Table of Content

Description
FL_Qwen3TTS_Transcribe:
FL_Qwen3TTS_Transcribe Input Parameters:
FL_Qwen3TTS_Transcribe Output Parameters:
FL_Qwen3TTS_Transcribe Usage Tips:
FL_Qwen3TTS_Transcribe Common Errors and Solutions:
Related Nodes

How to Install ComfyUI-FL-Qwen3TTS

Install this extension via the ComfyUI Manager by searching for ComfyUI-FL-Qwen3TTS

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI-FL-Qwen3TTS in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

FL Qwen3 TTS Transcribe Description

Converts audio to text using Whisper model, supporting multiple languages for transcription.

FL Qwen3 TTS Transcribe:

The FL_Qwen3TTS_Transcribe node is designed to convert audio input into text using the Whisper model, which is part of the Qwen3-TTS framework. This node is particularly useful for generating reference text from audio files, making it an essential tool for tasks that require audio-to-text transcription. By leveraging the capabilities of the Whisper model, the node provides accurate and efficient transcription services, supporting multiple languages and offering flexibility in terms of device usage. Its primary goal is to facilitate seamless audio transcription, thereby enhancing the workflow for AI artists and developers who need to process audio data into textual format.

FL Qwen3 TTS Transcribe Input Parameters:

audio

The audio parameter is the primary input for the node, representing the audio data that needs to be transcribed. This parameter is crucial as it directly influences the transcription output. The audio should be in a format compatible with the node's processing capabilities, typically a waveform that can be converted into a tensor for analysis. There are no specific minimum or maximum values, but the quality and clarity of the audio can impact the accuracy of the transcription.

model

The model parameter specifies which Whisper model to use for transcription. It offers a selection of models, including "openai/whisper-large-v3-turbo", "openai/whisper-large-v3", "openai/whisper-medium", "openai/whisper-small", "openai/whisper-base", and "openai/whisper-tiny". The default model is "openai/whisper-large-v3-turbo". The choice of model affects the transcription's speed and accuracy, with larger models generally providing more accurate results at the cost of increased computational resources.

language

The language parameter determines the language in which the transcription should be performed. Options include "auto", "en", "zh", "ja", "ko", "de", "fr", "es", "pt", "ru", and "it", with "auto" being the default setting. This parameter is important for ensuring that the transcription is accurate and contextually appropriate for the language of the audio input.

device

The device parameter allows you to specify the computational device to be used for processing, with options including "auto", "cuda", and "cpu". The default setting is "auto", which automatically selects the most suitable device based on availability and performance considerations. Choosing the right device can optimize the node's performance, especially for large models or high-volume transcription tasks.

FL Qwen3 TTS Transcribe Output Parameters:

transcription

The transcription output parameter provides the text result of the audio transcription process. This output is a string that represents the spoken content of the input audio, converted into text. The transcription is essential for applications that require textual analysis or further processing of audio data, such as generating subtitles, creating transcripts for accessibility, or feeding into other text-based AI models.

FL Qwen3 TTS Transcribe Usage Tips:

Ensure that the audio input is clear and free from excessive noise to improve transcription accuracy.
Select a Whisper model that balances your need for speed and accuracy; larger models are more accurate but require more computational resources.
Use the language parameter to specify the correct language for transcription, especially if the audio contains multilingual content.
If you have access to a GPU, set the device parameter to "cuda" to accelerate the transcription process.

FL Qwen3 TTS Transcribe Common Errors and Solutions:

No tokenizer provided. Please connect a Tokenizer Loader node.

Explanation: This error occurs when the node is unable to access a tokenizer, which is necessary for processing the audio input.
Solution: Ensure that a Tokenizer Loader node is connected and properly configured before running the transcription process.

No audio codes provided. Please connect an Audio Encode node.

Explanation: This error indicates that the node did not receive the necessary audio codes for decoding.
Solution: Verify that an Audio Encode node is connected and that it is supplying the required audio codes to the transcription node.

Audio decoding failed: `<error_message>`

Explanation: This error suggests that there was an issue during the audio decoding process, possibly due to incompatible audio formats or corrupted data.
Solution: Check the audio input for compatibility and integrity, and ensure that the audio format is supported by the node. If the problem persists, review the error message for specific details and adjust the input accordingly.

FL Qwen3 TTS Transcribe Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI-FL-Qwen3TTS

Table of Content

Description
FL_Qwen3TTS_Transcribe:
FL_Qwen3TTS_Transcribe Input Parameters:
FL_Qwen3TTS_Transcribe Output Parameters:
FL_Qwen3TTS_Transcribe Usage Tips:
FL_Qwen3TTS_Transcribe Common Errors and Solutions:
Related Nodes

Wan2.2 Fun Camera | Cinematic Motion from Images

Turn still images into lively cinematic shots with smooth camera moves.

Qwen-Image Lightning | 8-Step Speed Boost

Cut generation time in half.

Cosmos-Predict2 | Text2Image & Video2World

Fast and real! NVIDIA Cosmos with true physics.

Qwen Image Edit Plus 2511 LoRA Inference | AI Toolkit ComfyUI

Keep AI Toolkit-trained Qwen Image Edit Plus 2511 LoRA edits in ComfyUI preview-aligned using a single RCQwenImageEditPlus2511 custom node.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy

Save 4 hours! We auto-setup your workflow! Free!

ComfyUI Node: FL Qwen3 TTS Transcribe

FL_Qwen3TTS_Transcribe

How to Install ComfyUI-FL-Qwen3TTS

FL Qwen3 TTS Transcribe Description

FL Qwen3 TTS Transcribe:

FL Qwen3 TTS Transcribe Input Parameters:

audio

model

language

device

FL Qwen3 TTS Transcribe Output Parameters:

transcription

FL Qwen3 TTS Transcribe Usage Tips:

FL Qwen3 TTS Transcribe Common Errors and Solutions:

No tokenizer provided. Please connect a Tokenizer Loader node.

No audio codes provided. Please connect an Audio Encode node.

Audio decoding failed: <error_message>

FL Qwen3 TTS Transcribe Related Nodes

Audio decoding failed: `<error_message>`