RunComfy

ComfyUI Node: FL CosyVoice3 Speaker Instruct2

Class Name

FL_CosyVoice3_SpeakerInstruct2

Category
🔊FL CosyVoice3/Synthesis

Author
filliptm (Account age: 2386 days)

Extension
ComfyUI_FL-CosyVoice3

Latest Updated
2026-03-21

Github Stars
0.11K

Table of Contents

Description
FL_CosyVoice3_SpeakerInstruct2:
FL_CosyVoice3_SpeakerInstruct2 Input Parameters:
FL_CosyVoice3_SpeakerInstruct2 Output Parameters:
FL_CosyVoice3_SpeakerInstruct2 Usage Tips:
FL_CosyVoice3_SpeakerInstruct2 Common Errors and Solutions:
Related Nodes

How to Install ComfyUI_FL-CosyVoice3

Install this extension through the ComfyUI Manager:

1. Click the Manager button in the main menu.
2. Select the Custom Nodes Manager button.
3. Enter ComfyUI_FL-CosyVoice3 in the search bar and install the extension from the results.

After installation, click the Restart button to restart ComfyUI. Then manually refresh your browser to clear the cache and load the updated list of nodes.

FL CosyVoice3 Speaker Instruct2 Description

Synthesizes speech using saved presets for personalized, expressive voice outputs.

FL CosyVoice3 Speaker Instruct2:

FL_CosyVoice3_SpeakerInstruct2 synthesizes speech from two sources: a saved speaker preset supplies the voice timbre, and instruct text controls the style, emotion, and tone. Because the timbre comes from the preset, no live reference audio is needed at synthesis time. The node calls the model's inference_instruct2 method with the saved zero-shot speaker, so the same saved voice can be reused with different style instructions.

FL CosyVoice3 Speaker Instruct2 Input Parameters:

model

A CosyVoice model supplied by the Model Loader node. The model must provide the inference_instruct2 method; if it does not, the node raises an error.

text

The text to synthesize into speech. Multiline input is supported, so longer passages can be entered directly. The default value is "Hello, this is my cloned voice speaking."

instruct_text

Instructions that control the speaking style, emotion, and tone. Multiline input is supported. Examples include "请非常开心地说这句话。" ("Please say this sentence very happily.") and "Please say this in a very soft voice." The default value is "请非常开心地说这句话。" This field cannot be empty or contain only whitespace.

speaker_preset

The speaker preset to use, as saved by the FL CosyVoice3 Save Speaker node. The preset defines the voice timbre and must name a valid saved preset. The value "[none]" indicates that no preset is available; running the node with it raises an error.

speed

The speech speed multiplier. It accepts values from 0.5 to 2.0 in steps of 0.05, with a default of 1.0. Values below 1.0 slow the speech down and values above 1.0 speed it up.

seed

An optional random seed for reproducibility. It accepts integers from -1 to 2147483647, with a default of 42; -1 selects a random seed. With a fixed seed, repeated runs with the same inputs should give the same result.
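The seed convention described above can be sketched in a few lines of Python. This is an illustration only: the helper name resolve_seed is hypothetical, and how the node itself draws a random seed is not documented here.

```python
import random

MAX_SEED = 2147483647  # upper bound given for the seed input


def resolve_seed(seed: int = 42) -> int:
    """Return the seed to use, drawing a random one when seed is -1.

    Hypothetical helper: it mirrors the documented range and default,
    not the node's actual implementation.
    """
    if seed == -1:
        # randint is inclusive on both ends
        return random.randint(0, MAX_SEED)
    if not 0 <= seed <= MAX_SEED:
        raise ValueError(f"seed must be -1 or in [0, {MAX_SEED}], got {seed}")
    return seed
```

Logging the resolved value when -1 is used is one way to reproduce a result that turned out well.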

text_frontend

An optional boolean that enables text normalization. It defaults to True, which normalizes the input text before synthesis. Set it to False when the text contains CMU phonemes or special tags.
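For workflows driven through ComfyUI's API-format JSON, the inputs above might be assembled as in the sketch below. The class name, input names, defaults, and the speed range come from this page; the helper name, the preset name "narrator", and the assumption that node "1" is the model loader are hypothetical.

```python
import json


def build_speaker_instruct2_node(
    model_link,
    speaker_preset,
    text="Hello, this is my cloned voice speaking.",
    instruct_text="请非常开心地说这句话。",
    speed=1.0,
    seed=42,
    text_frontend=True,
):
    """Build one API-format node entry using the documented defaults."""
    if not 0.5 <= speed <= 2.0:
        raise ValueError(f"speed must be between 0.5 and 2.0, got {speed}")
    if not instruct_text.strip():
        raise ValueError("instruct_text cannot be empty.")
    return {
        "class_type": "FL_CosyVoice3_SpeakerInstruct2",
        "inputs": {
            "model": model_link,  # [source node id, output index]
            "text": text,
            "instruct_text": instruct_text,
            "speaker_preset": speaker_preset,
            "speed": speed,
            "seed": seed,
            "text_frontend": text_frontend,
        },
    }


# Assumption: node "1" is the model loader and its first output is the model.
node = build_speaker_instruct2_node(
    model_link=["1", 0],
    speaker_preset="narrator",  # hypothetical preset name
    instruct_text="Please say this in a very soft voice.",
    speed=0.9,
)
print(json.dumps({"2": node}, ensure_ascii=False, indent=2))
```

Validating speed and instruct_text before submitting the workflow surfaces mistakes earlier than waiting for the node to fail at run time.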

FL CosyVoice3 Speaker Instruct2 Output Parameters:

audio

An audio object containing the synthesized speech as a waveform together with its sample rate. It is ready to be connected to other audio nodes in the workflow.
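As a rough sketch, the clip length can be derived from the two fields of the output. This assumes the usual ComfyUI AUDIO layout, a dictionary with a "waveform" tensor shaped [batch, channels, samples] and an integer "sample_rate". The stand-in object and the 24000 Hz rate below are illustrative values, not figures taken from this node.

```python
from types import SimpleNamespace


def audio_duration_seconds(audio: dict) -> float:
    """Duration of an AUDIO dict, assuming the last waveform axis is samples."""
    num_samples = audio["waveform"].shape[-1]
    return num_samples / audio["sample_rate"]


# Stand-in for a tensor: only the .shape attribute is needed for this sketch.
fake_waveform = SimpleNamespace(shape=(1, 1, 48000))
audio = {"waveform": fake_waveform, "sample_rate": 24000}

duration = audio_duration_seconds(audio)
print(duration)  # 48000 samples / 24000 Hz = 2.0 seconds
```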

FL CosyVoice3 Speaker Instruct2 Usage Tips:

Ensure that the speaker_preset is correctly set by using the FL CosyVoice3 Save Speaker node to create and save presets before synthesis.
Use the instruct_text parameter creatively to explore different emotional and stylistic expressions in your synthesized speech.
Adjust the speed parameter to match the desired pacing of your project, keeping in mind that extreme values may affect intelligibility.

FL CosyVoice3 Speaker Instruct2 Common Errors and Solutions:

"inference_instruct2 is not available on this model."

Explanation: The selected model does not support the inference_instruct2 method required by this node.
Solution: Ensure you are using a CosyVoice2 or CosyVoice3 model that includes the inference_instruct2 method.

"Speaker preset file not found: `<path>`"

Explanation: The specified speaker preset file does not exist at the given path.
Solution: Use the FL CosyVoice3 Save Speaker node to create and save the necessary speaker preset file.

"No speaker presets found."

Explanation: The speaker_preset parameter is set to "[none]", indicating no preset is available.
Solution: Create a speaker preset using the FL CosyVoice3 Save Speaker node and specify it in the speaker_preset parameter.

"instruct_text cannot be empty."

Explanation: The instruct_text parameter is empty or contains only whitespace.
Solution: Provide valid style instructions in the instruct_text parameter to guide the synthesis process.
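The four conditions above can be checked before a run. The sketch below is a hypothetical pre-flight helper: the message strings are the ones listed on this page, but the function, the stand-in model class, and the way the preset path is obtained are assumptions rather than the extension's own code.

```python
from pathlib import Path
from typing import List, Optional


def preflight_errors(
    model,
    instruct_text: str,
    speaker_preset: str,
    preset_path: Optional[Path],
) -> List[str]:
    """Return the documented error messages that would apply to these inputs."""
    errors = []
    if not callable(getattr(model, "inference_instruct2", None)):
        errors.append("inference_instruct2 is not available on this model.")
    if speaker_preset == "[none]":
        errors.append("No speaker presets found.")
    elif preset_path is None or not preset_path.exists():
        errors.append(f"Speaker preset file not found: {preset_path}")
    if not instruct_text.strip():
        errors.append("instruct_text cannot be empty.")
    return errors


class FakeModel:
    """Stand-in for a loaded model that exposes inference_instruct2."""

    def inference_instruct2(self, *args, **kwargs):
        return None


# Path(".") always exists, so it stands in for a saved preset file here.
ok = preflight_errors(
    FakeModel(), "Please say this in a very soft voice.", "narrator", Path(".")
)
# A bare object has no inference_instruct2, and the other inputs are invalid too.
bad = preflight_errors(object(), "   ", "[none]", None)
print(ok)
print(bad)
```

Returning a list rather than raising on the first problem lets all issues be reported in one pass.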

FL CosyVoice3 Speaker Instruct2 Related Nodes

Go back to the extension to check out more related nodes.

ComfyUI_FL-CosyVoice3
