RunComfy

Wan 2.1 | Revolutionary Video Generation

Create incredible videos from text or images with breakthrough AI running on everyday CPUs.

Pose Control LipSync S2V | Expressive Video Generator

Turn images into talking, moving characters with pose and audio control.

FLUX Inpainting | Seamless Image Editing

Effortlessly fill, remove, and refine images, seamlessly integrating new content.

SUPIR | Photo-Realistic Image/Video Upscaler

SUPIR enables photo-realistic image restoration, works with SDXL model, and supports text-prompt enhancement.

ComfyUI > Nodes > civitai-comfy-nodes > Civitai Transcription

ComfyUI Node: Civitai Transcription

Class Name

CivitaiTranscription

Category
Civitai/Audio

Author
civitai (Account age: 1322days) Extension
civitai-comfy-nodes Latest Updated
2026-06-18 Github Stars
0.02K

Github Ask civitai Current Questions Past Questions

Table of Content

Description
CivitaiTranscription:
CivitaiTranscription Input Parameters:
CivitaiTranscription Output Parameters:
CivitaiTranscription Usage Tips:
CivitaiTranscription Common Errors and Solutions:
Related Nodes

How to Install civitai-comfy-nodes

Install this extension via the ComfyUI Manager by searching for civitai-comfy-nodes

1. Click the Manager button in the main menu
2. Select Custom Nodes Manager button
3. Enter civitai-comfy-nodes in the search bar

After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.

Visit ComfyUI Online for ready-to-use ComfyUI environment

Free trial available
16GB VRAM to 80GB VRAM GPU machines
400+ preloaded models/nodes
Freedom to upload custom models/nodes
200+ ready-to-run workflows
100% private workspace with up to 200GB storage
Dedicated Support

Run ComfyUI Online

Civitai Transcription Description

Converts audio to text with language identification, time stamps, and detailed metadata for AI artists.

Civitai Transcription:

CivitaiTranscription is a powerful node designed to convert audio content into text through a transcription process. This node is part of the Civitai Orchestration suite, specifically tailored for handling audio data. It provides a seamless way to extract textual information from audio files, making it an invaluable tool for AI artists who need to transcribe spoken words into written form. The node is capable of identifying the language of the audio, providing time stamps for each segment of the transcription, and delivering detailed metadata about the transcription process. By leveraging this node, you can efficiently transform audio inputs into structured text outputs, facilitating further analysis or creative applications.

Civitai Transcription Input Parameters:

media_url

The media_url parameter specifies the URL of the audio file that you want to transcribe. This parameter is crucial as it directs the node to the specific audio content that needs to be processed. The URL should be accessible and correctly formatted to ensure successful transcription. There are no specific minimum or maximum values, but the URL must point to a valid audio file.

language

The language parameter allows you to specify the language of the audio content. This helps the transcription process to accurately interpret and transcribe the spoken words. If the language is not specified, the node may attempt to auto-detect it, but providing this information can enhance accuracy. There are no predefined options, but it should match the language code of the audio content.

context

The context parameter provides additional context or information that might be relevant to the transcription process. This can include specific terminologies or phrases that are expected in the audio, which can help improve the accuracy of the transcription. There are no specific constraints on this parameter, but it should be relevant to the audio content.

return_time_stamps

The return_time_stamps parameter is a boolean value that determines whether time stamps should be included in the transcription output. When set to true, the node will provide time stamps for each segment of the transcription, which can be useful for aligning text with the audio. The default value is typically false, meaning time stamps are not included unless specified.

Civitai Transcription Output Parameters:

text

The text output parameter contains the transcribed text from the audio file. This is the primary output of the node, providing a written representation of the spoken content. It is essential for any further text-based analysis or processing.

language

The language output parameter indicates the detected language of the audio content. This can be useful for verifying that the transcription process correctly identified the language, especially in multilingual audio files.

time_stamps

The time_stamps output parameter provides a JSON object containing the time stamps for each segment of the transcription. This is particularly useful for applications that require synchronization between the audio and text, such as subtitles or detailed analysis.

elapsed_seconds

The elapsed_seconds output parameter indicates the total time taken to complete the transcription process. This can be useful for performance monitoring and optimization purposes.

workflow_id

The workflow_id output parameter provides a unique identifier for the transcription workflow. This can be useful for tracking and managing multiple transcription tasks.

raw_json

The raw_json output parameter contains the raw JSON data of the transcription process, including all metadata and additional information. This can be useful for debugging or detailed analysis of the transcription process.

Civitai Transcription Usage Tips:

Ensure that the media_url points to a valid and accessible audio file to avoid errors during transcription.
Specify the language parameter if you know the language of the audio content to improve transcription accuracy.
Use the return_time_stamps parameter if you need to synchronize the transcribed text with the audio, such as for creating subtitles.

Civitai Transcription Common Errors and Solutions:

Invalid URL

Explanation: The media_url provided is not valid or accessible.
Solution: Verify that the URL is correct and points to a valid audio file. Ensure that the file is accessible from the network.

Unsupported Language

Explanation: The specified language is not supported by the transcription service.
Solution: Check the language code and ensure it is supported. If unsure, try leaving the language parameter empty for auto-detection.

Transcription Timeout

Explanation: The transcription process took too long and timed out.
Solution: Try reducing the length of the audio file or check the network connection for any issues that might be causing delays.

Civitai Transcription Related Nodes

Go back to the extension to check out more related nodes.

civitai-comfy-nodes

Table of Content

Description
CivitaiTranscription:
CivitaiTranscription Input Parameters:
CivitaiTranscription Output Parameters:
CivitaiTranscription Usage Tips:
CivitaiTranscription Common Errors and Solutions:
Related Nodes

Hunyuan3D 2.1 | Image to 3D Model

Big jump from 2.0: Turn photos into incredible 3D models instantly.

Z Image Turbo | Ultra-Fast Photorealistic Generator

Generate ultra-clear visuals fast with unmatched real-time detail.

Wan 2.2 Low Vram | Kijai Wrapper

Low VRAM. No longer waiting. Kijai wrapper included.

SDXL Turbo | Rapid Text to Image

Experience fast text-to-image synthesis with SDXL Turbo.

Support

Resources

Legal

RunComfy

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Support

Resources

Legal

RunComfy

Save 4 hours! We auto-setup your workflow! Free!

ComfyUI Node: Civitai Transcription

CivitaiTranscription

How to Install civitai-comfy-nodes

Civitai Transcription Description

Civitai Transcription:

Civitai Transcription Input Parameters:

media_url

language

context

return_time_stamps

Civitai Transcription Output Parameters:

text

language

time_stamps

elapsed_seconds

workflow_id

raw_json

Civitai Transcription Usage Tips:

Civitai Transcription Common Errors and Solutions:

Invalid URL

Unsupported Language

Transcription Timeout

Civitai Transcription Related Nodes