RunComfy

Flux Upscaler - Ultimate 32k | Image Upscaler

Flux Upscaler – Achieve 4k, 8k, 16k, and Ultimate 32k Resolution!

MatAnyone Video Matting | Single Mask Removal

Remove video backgrounds with one mask frame for perfect subject isolation.

Consistent Character Creator

Create consistent, high-resolution character designs from multiple angles with full control over emotions, lighting, and environments.

Consistent Face 3x3 Generator

Generate 3x3 consistent character faces using FLUX and Depth LoRA

ComfyUI > Nodes > ComfyUI_Prompt-All-In-One > API Qwen Audio2Text

ComfyUI Node: API Qwen Audio2Text

Class Name

APIQwenAudio2Text

Category
🎤MW/MW-Prompt-All-In-One

Author
billwuhao (Account age: 2576days) Extension
ComfyUI_Prompt-All-In-One Latest Updated
2026-03-20 Github Stars
0.05K

Github Ask billwuhao Current Questions Past Questions

Table of Content

Description
APIQwenAudio2Text:
APIQwenAudio2Text Input Parameters:
APIQwenAudio2Text Output Parameters:
APIQwenAudio2Text Usage Tips:
APIQwenAudio2Text Common Errors and Solutions:
Related Nodes

How to Install ComfyUI_Prompt-All-In-One

Install this extension via the ComfyUI Manager by searching for ComfyUI_Prompt-All-In-One

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter ComfyUI_Prompt-All-In-One in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

API Qwen Audio2Text Description

Converts audio to text using AI for accurate transcription, aiding content creators and researchers.

API Qwen Audio2Text:

APIQwenAudio2Text is a powerful node designed to convert audio inputs into text, leveraging advanced AI capabilities to facilitate seamless transcription. This node is particularly beneficial for users who need to transcribe spoken content into written form, making it an essential tool for content creators, researchers, and anyone dealing with audio data. By utilizing this node, you can efficiently transform audio files into text, enabling easier analysis, editing, and sharing of information. The node's integration with AI technology ensures high accuracy and reliability, making it a valuable asset for various applications where audio-to-text conversion is required.

API Qwen Audio2Text Input Parameters:

audio

The audio parameter is the primary input for the node, representing the audio file that you wish to transcribe into text. This parameter is crucial as it directly influences the transcription process, determining the content that will be converted into text. The audio file should be in a compatible format and of sufficient quality to ensure accurate transcription results.

api_key

The api_key parameter is a string input that serves as your authentication credential for accessing the transcription service. This key is essential for authorizing your requests and ensuring secure communication with the API. It does not have a default value, and you must provide a valid key to use the node's capabilities.

system_prompt

The system_prompt parameter is a string input that allows you to provide specific instructions or context to the transcription model. This can be useful for tailoring the transcription process to your specific needs, such as emphasizing certain words or phrases. The parameter supports multiline input, enabling you to provide detailed prompts to guide the transcription.

prompt

The prompt parameter is another string input that lets you specify additional instructions or context for the transcription process. Similar to the system_prompt, this parameter can help refine the transcription output by providing guidance on how the audio should be interpreted. It also supports multiline input, allowing for comprehensive prompts to enhance the transcription accuracy.

API Qwen Audio2Text Output Parameters:

transcribed_text

The transcribed_text parameter is the primary output of the node, representing the text version of the input audio. This output is crucial as it provides the written content derived from the audio file, enabling further analysis, editing, or sharing. The accuracy and quality of the transcribed text depend on the input audio's clarity and the effectiveness of the provided prompts.

API Qwen Audio2Text Usage Tips:

Ensure that your audio input is clear and free from excessive background noise to improve transcription accuracy.
Utilize the system_prompt and prompt parameters to provide context or specific instructions that can guide the transcription process and enhance the quality of the output.
Regularly update your api_key to maintain secure access to the transcription service and avoid interruptions.

API Qwen Audio2Text Common Errors and Solutions:

Invalid API Key

Explanation: This error occurs when the provided api_key is incorrect or expired, preventing access to the transcription service.
Solution: Verify that your api_key is correct and active. If necessary, obtain a new key from the service provider.

Unsupported Audio Format

Explanation: The audio file format is not compatible with the node, leading to a failure in processing the input.
Solution: Convert your audio file to a supported format, such as WAV or MP3, before using it as input for the node.

Poor Audio Quality

Explanation: Low-quality audio with background noise or distortion can result in inaccurate transcription.
Solution: Use audio files with clear sound and minimal background noise to ensure better transcription results.

API Qwen Audio2Text Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI_Prompt-All-In-One

Table of Content

Description
APIQwenAudio2Text:
APIQwenAudio2Text Input Parameters:
APIQwenAudio2Text Output Parameters:
APIQwenAudio2Text Usage Tips:
APIQwenAudio2Text Common Errors and Solutions:
Related Nodes

Qwen Image 2512 LoRA Inference | AI Toolkit ComfyUI

Use an AI Toolkit-trained LoRA with Qwen Image 2512 in ComfyUI via one RCQwenImage2512 node for preview-aligned generations.

Generate ENTIRE AI WORLDS Video Scene Builder

Turn simple footage into immersive cinematic AI landscapes instantly

Wan2.2 Fun Inp | Cinematic Video Generator

From 2 images to stunning videos with smooth, controllable transitions.

Pose Control LipSync S2V | Expressive Video Generator

Turn images into talking, moving characters with pose and audio control.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy

Save 4 hours! We auto-setup your workflow! Free!

ComfyUI Node: API Qwen Audio2Text

APIQwenAudio2Text

How to Install ComfyUI_Prompt-All-In-One

API Qwen Audio2Text Description

API Qwen Audio2Text:

API Qwen Audio2Text Input Parameters:

audio

api_key

system_prompt

prompt

API Qwen Audio2Text Output Parameters:

transcribed_text

API Qwen Audio2Text Usage Tips:

API Qwen Audio2Text Common Errors and Solutions:

Invalid API Key

Unsupported Audio Format

Poor Audio Quality

API Qwen Audio2Text Related Nodes