
ComfyUI Node: Civitai Transcription

Class Name: CivitaiTranscription
Category: Civitai/Audio
Author: civitai (Account age: 1322 days)
Extension: civitai-comfy-nodes
Last Updated: 2026-06-18
GitHub Stars: 0.02K

How to Install civitai-comfy-nodes

Install this extension via the ComfyUI Manager by searching for civitai-comfy-nodes:
  1. Click the Manager button in the main menu.
  2. Select the Custom Nodes Manager button.
  3. Enter civitai-comfy-nodes in the search bar.
After installation, click the Restart button to restart ComfyUI. Then, manually refresh your browser to clear the cache and access the updated list of nodes.


Civitai Transcription Description

Converts audio to text, with language identification, optional time stamps, and metadata about the transcription run.

Civitai Transcription:

CivitaiTranscription converts audio content into text. It is part of the Civitai Orchestration suite and is specific to audio data. Given the URL of an audio file, the node returns the transcribed text, the detected language, optional time stamps for each segment of the transcription, and metadata about the transcription process. The structured output can then be passed to later steps in a workflow for further analysis or creative use.

Civitai Transcription Input Parameters:

media_url

The media_url parameter specifies the URL of the audio file to transcribe. The URL must be correctly formatted, reachable from the network, and point to a valid audio file; otherwise the transcription cannot start.
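As a rough illustration of what "correctly formatted" can mean in practice, the sketch below checks a URL's shape before it is passed to the node. The helper name looks_like_media_url is hypothetical and is not part of civitai-comfy-nodes. It only inspects the string; it does not confirm that the file exists or that it is audio.

```python
from urllib.parse import urlparse


def looks_like_media_url(url: str) -> bool:
    """Return True if url is an absolute http(s) URL with a host and a file path.

    Hypothetical pre-flight check: it validates the URL's format only and
    makes no network request.
    """
    parsed = urlparse(url.strip())
    has_web_scheme = parsed.scheme in ("http", "https")
    has_host = bool(parsed.netloc)
    has_file_path = parsed.path not in ("", "/")
    return has_web_scheme and has_host and has_file_path


if __name__ == "__main__":
    for candidate in ("https://example.com/audio/clip.mp3", "example.com/clip.mp3"):
        print(candidate, looks_like_media_url(candidate))
```

A URL that passes this check can still fail at transcription time, for example if it requires authentication or returns a non-audio file.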

language

The language parameter specifies the language of the audio content as a language code. If it is left empty, the node may attempt to auto-detect the language, but supplying a known language can improve accuracy. There is no predefined list of options; the value should be the code that matches the language spoken in the audio.

context

The context parameter supplies additional information about the audio, such as terminology or phrases expected in it, which can improve transcription accuracy. There are no specific constraints on this parameter, but its content should be relevant to the audio.

return_time_stamps

The return_time_stamps parameter is a boolean that determines whether time stamps are included in the transcription output. When set to true, the node provides time stamps for each segment of the transcription, which is useful for aligning text with the audio. The default value is typically false, so time stamps are omitted unless requested.
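The four inputs above can be sketched as one node entry in ComfyUI's API-format workflow, where each node is a dict with a class_type and an inputs mapping. This is a minimal sketch under assumptions: the input keys are taken to match the parameter names listed on this page, an empty language string is taken to mean auto-detection, and the helper name build_transcription_node and the node id "1" are made up for illustration.

```python
import json


def build_transcription_node(media_url, language="", context="", return_time_stamps=False):
    """Build one API-format node entry for the CivitaiTranscription class.

    ASSUMPTION: input keys mirror the parameter names documented above;
    the real node may use different names or types.
    """
    if not media_url:
        raise ValueError("media_url is required")
    return {
        "class_type": "CivitaiTranscription",
        "inputs": {
            "media_url": media_url,
            "language": language,  # assumed: empty string requests auto-detection
            "context": context,
            "return_time_stamps": bool(return_time_stamps),
        },
    }


# Node ids are arbitrary string keys in an API-format workflow.
workflow = {
    "1": build_transcription_node(
        "https://example.com/audio/interview.mp3",
        language="en",
        return_time_stamps=True,
    )
}

if __name__ == "__main__":
    print(json.dumps(workflow, indent=2))
```

Keeping the defaults in one helper makes it harder to forget that time stamps are off unless explicitly requested.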

Civitai Transcription Output Parameters:

text

The text output parameter contains the transcribed text from the audio file. It is the node's primary output and the starting point for any further text-based analysis or processing.

language

The language output parameter indicates the detected language of the audio content. This can be useful for verifying that the transcription process correctly identified the language, especially in multilingual audio files.

time_stamps

The time_stamps output parameter provides a JSON object containing the time stamps for each segment of the transcription. This is particularly useful for applications that require synchronization between the audio and text, such as subtitles or detailed analysis.
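For the subtitle use case, the sketch below turns segment time stamps into SRT text. The exact shape of the time_stamps JSON is not documented here, so the code assumes a list of segments with start and end times in seconds plus a text field. That schema, and the function names, are assumptions to adapt once the real output has been inspected.

```python
import json


def to_srt_time(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp, HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def segments_to_srt(time_stamps_json: str) -> str:
    """Convert a JSON list of segments into SRT subtitle text.

    ASSUMED schema (not confirmed for this node):
    [{"start": 0.0, "end": 2.5, "text": "..."}, ...]
    """
    segments = json.loads(time_stamps_json)
    blocks = []
    for index, seg in enumerate(segments, start=1):
        time_range = f"{to_srt_time(seg['start'])} --> {to_srt_time(seg['end'])}"
        blocks.append(f"{index}\n{time_range}\n{seg['text'].strip()}")
    return "\n\n".join(blocks) + "\n"


if __name__ == "__main__":
    sample = '[{"start": 0.0, "end": 2.5, "text": " Hello there. "}]'
    print(segments_to_srt(sample))
```

If the real output nests segments under a key or uses different field names, only the loop in segments_to_srt needs to change.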

elapsed_seconds

The elapsed_seconds output parameter indicates the total time, in seconds, taken to complete the transcription process. This can be useful for performance monitoring and optimization.

workflow_id

The workflow_id output parameter provides a unique identifier for the transcription workflow. This can be useful for tracking and managing multiple transcription tasks.

raw_json

The raw_json output parameter contains the raw JSON data of the transcription process, including all metadata and additional information. This can be useful for debugging or detailed analysis of the transcription process.
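When debugging with raw_json, a useful first step is to see which top-level fields are actually present before relying on any of them. The sketch below does that without assuming a schema. The function name summarize_raw_json is hypothetical, and the sample payload in the usage line is invented for illustration.

```python
import json


def summarize_raw_json(raw_json: str) -> dict:
    """Parse raw_json and report its top-level keys without assuming a schema."""
    try:
        data = json.loads(raw_json)
    except json.JSONDecodeError as exc:
        # Malformed payloads are reported instead of raising.
        return {"ok": False, "error": str(exc), "keys": []}
    keys = sorted(data) if isinstance(data, dict) else []
    return {"ok": True, "error": None, "keys": keys}


if __name__ == "__main__":
    # Invented sample payload; the node's real fields may differ.
    print(summarize_raw_json('{"text": "hi", "language": "en"}'))
```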

Civitai Transcription Usage Tips:

  • Ensure that the media_url points to a valid and accessible audio file to avoid errors during transcription.
  • Specify the language parameter if you know the language of the audio content to improve transcription accuracy.
  • Use the return_time_stamps parameter if you need to synchronize the transcribed text with the audio, such as for creating subtitles.

Civitai Transcription Common Errors and Solutions:

Invalid URL

  • Explanation: The media_url provided is not valid or accessible.
  • Solution: Verify that the URL is correct and points to a valid audio file. Ensure that the file is accessible from the network.

Unsupported Language

  • Explanation: The specified language is not supported by the transcription service.
  • Solution: Check the language code and ensure it is supported. If unsure, try leaving the language parameter empty for auto-detection.

Transcription Timeout

  • Explanation: The transcription process took too long and timed out.
  • Solution: Try reducing the length of the audio file or check the network connection for any issues that might be causing delays.

Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Models, enabling artists to harness the latest AI tools to create incredible art.

Civitai Transcription