ComfyUI > Nodes > ComfyUI-FL-Qwen3TTS > FL Qwen3 TTS Transcribe

ComfyUI Node: FL Qwen3 TTS Transcribe

Class Name

FL_Qwen3TTS_Transcribe

Category
FL/Qwen3TTS
Author
filliptm (Account age: 2372days)
Extension
ComfyUI-FL-Qwen3TTS
Latest Updated
2026-03-18
Github Stars
0.12K

How to Install ComfyUI-FL-Qwen3TTS

Install this extension via the ComfyUI Manager by searching for ComfyUI-FL-Qwen3TTS
  • 1. Click the Manager button in the main menu
  • 2. Select Custom Nodes Manager button
  • 3. Enter ComfyUI-FL-Qwen3TTS in the search bar
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

  • Free trial available
  • 16GB VRAM to 80GB VRAM GPU machines
  • 400+ preloaded models/nodes
  • Freedom to upload custom models/nodes
  • 200+ ready-to-run workflows
  • 100% private workspace with up to 200GB storage
  • Dedicated Support

Run ComfyUI Online

FL Qwen3 TTS Transcribe Description

Converts audio to text using Whisper model, supporting multiple languages for transcription.

FL Qwen3 TTS Transcribe:

The FL_Qwen3TTS_Transcribe node is designed to convert audio input into text using the Whisper model, which is part of the Qwen3-TTS framework. This node is particularly useful for generating reference text from audio files, making it an essential tool for tasks that require audio-to-text transcription. By leveraging the capabilities of the Whisper model, the node provides accurate and efficient transcription services, supporting multiple languages and offering flexibility in terms of device usage. Its primary goal is to facilitate seamless audio transcription, thereby enhancing the workflow for AI artists and developers who need to process audio data into textual format.

FL Qwen3 TTS Transcribe Input Parameters:

audio

The audio parameter is the primary input for the node, representing the audio data that needs to be transcribed. This parameter is crucial as it directly influences the transcription output. The audio should be in a format compatible with the node's processing capabilities, typically a waveform that can be converted into a tensor for analysis. There are no specific minimum or maximum values, but the quality and clarity of the audio can impact the accuracy of the transcription.

model

The model parameter specifies which Whisper model to use for transcription. It offers a selection of models, including "openai/whisper-large-v3-turbo", "openai/whisper-large-v3", "openai/whisper-medium", "openai/whisper-small", "openai/whisper-base", and "openai/whisper-tiny". The default model is "openai/whisper-large-v3-turbo". The choice of model affects the transcription's speed and accuracy, with larger models generally providing more accurate results at the cost of increased computational resources.

language

The language parameter determines the language in which the transcription should be performed. Options include "auto", "en", "zh", "ja", "ko", "de", "fr", "es", "pt", "ru", and "it", with "auto" being the default setting. This parameter is important for ensuring that the transcription is accurate and contextually appropriate for the language of the audio input.

device

The device parameter allows you to specify the computational device to be used for processing, with options including "auto", "cuda", and "cpu". The default setting is "auto", which automatically selects the most suitable device based on availability and performance considerations. Choosing the right device can optimize the node's performance, especially for large models or high-volume transcription tasks.

FL Qwen3 TTS Transcribe Output Parameters:

transcription

The transcription output parameter provides the text result of the audio transcription process. This output is a string that represents the spoken content of the input audio, converted into text. The transcription is essential for applications that require textual analysis or further processing of audio data, such as generating subtitles, creating transcripts for accessibility, or feeding into other text-based AI models.

FL Qwen3 TTS Transcribe Usage Tips:

  • Ensure that the audio input is clear and free from excessive noise to improve transcription accuracy.
  • Select a Whisper model that balances your need for speed and accuracy; larger models are more accurate but require more computational resources.
  • Use the language parameter to specify the correct language for transcription, especially if the audio contains multilingual content.
  • If you have access to a GPU, set the device parameter to "cuda" to accelerate the transcription process.

FL Qwen3 TTS Transcribe Common Errors and Solutions:

No tokenizer provided. Please connect a Tokenizer Loader node.

  • Explanation: This error occurs when the node is unable to access a tokenizer, which is necessary for processing the audio input.
  • Solution: Ensure that a Tokenizer Loader node is connected and properly configured before running the transcription process.

No audio codes provided. Please connect an Audio Encode node.

  • Explanation: This error indicates that the node did not receive the necessary audio codes for decoding.
  • Solution: Verify that an Audio Encode node is connected and that it is supplying the required audio codes to the transcription node.

Audio decoding failed: <error_message>

  • Explanation: This error suggests that there was an issue during the audio decoding process, possibly due to incompatible audio formats or corrupted data.
  • Solution: Check the audio input for compatibility and integrity, and ensure that the audio format is supported by the node. If the problem persists, review the error message for specific details and adjust the input accordingly.

FL Qwen3 TTS Transcribe Related Nodes

Go back to the extension to check out more related nodes.
ComfyUI-FL-Qwen3TTS
RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

FL Qwen3 TTS Transcribe