logo
RunComfy
ComfyUIPlaygroundPricing
discord logo
ComfyUI>Workflows>ComfyUI PhotoMakerV2 | Create Realistic Photos

ComfyUI PhotoMakerV2 | Create Realistic Photos

Workflow Name: RunComfy/PhotoMakerV2
Workflow ID: 0000...1109
ComfyUI PhotoMakerV2 is a powerful text-to-image generation tool that enables users to create realistic personalized photos efficiently. By inputting identity images and a text prompt, PhotoMakerV2 preserves the likeness of the individuals while allowing flexible control over context, style, and attributes. This latest version offers improved identity fidelity compared to its predecessor. Discover the creative possibilities of generating photorealistic images in different settings, stylizing appearances, and even merging identities.

What is PhotoMakerV2

PhotoMakerV2, an upgrade from PhotoMaker, offers an efficient method for personalized text-to-image generation. It synthesizes realistic photos of individuals using a few input identity images and a text prompt.

Some key features of PhotoMakerV2 include:

  • High efficiency: Quickly generates personalized photos.
  • Excellent identity preservation: Maintains the likeness of input identities.
  • Flexible text control: Allows specifying context, style, attributes, etc., in the prompt.
  • Improved identity fidelity: Enhanced compared to PhotoMaker V1. PhotoMakerV2 generates photorealistic images of a person in various contexts, stylizes appearances, changes attributes like age and gender, merges identities, and modernizes people from old photos or artwork. It unlocks numerous creative possibilities.

How PhotoMakerV2 Works

PhotoMakerV2 encodes one or more input identity images into a "stacked ID embedding," serving as a unified representation encapsulating identity information.

This embedding, combined with a text prompt, feeds into a text-to-image diffusion model. The model then produces an image depicting the embedded identity in the context described by the prompt.

Some key aspects of how it works under the hood:

  • Uses an identity encoder to extract identity information from input face images
  • Improves identity preservation by leveraging an external face recognition model (InsightFace)
  • Encodes multiple identity images into a stacked embedding to capture identity comprehensively
  • Feeds the stacked ID embedding into the diffusion model's cross-attention layers
  • Guides generation with the text prompt while adaptively merging the identity information
  • Trained with an identity-oriented dataset to improve identification capabilities

How to Use ComfyUI PhotoMakerV2

To use PhotoMakerV2 in ComfyUI, primarily interact with the PhotoMakerEncodePlus node. A typical workflow involves:

  1. Load PhotoMakerV2 model using "PhotoMaker Loader Plus" node.
  2. Load one or more identity images using "Prepare Images For CLIP Vision" node.
  3. Load InsightFace model required by PhotoMakerV2 using "PhotoMaker InsightFace Loader" node.
  4. Connect outputs of these nodes to corresponding inputs of "PhotoMaker Encode Plus" node.
  5. In the "PhotoMaker Encode Plus" node, specify the prompt describing the desired image. Use the special trigger word in the prompt where the identity should appear.
  6. Connect output conditioning from "PhotoMaker Encode Plus" to a "KSampler" node to generate the image.

For more information, please visit PhotoMaker Hugging Face and ComfyUI-PhotoMaker-Plus. All credit goes to their contributions.

Want More ComfyUI Workflows?

Z Image | Ultra-Fast Photorealistic Generator

Generate ultra-clear visuals fast with unmatched real-time detail.

AnimateDiff + ControlNet + AutoMask | Comic Style

Effortlessly restyle videos, converting realistic characters into anime while keeping the original backgrounds intact.

SVD (Stable Video Diffusion) + SD | Text to Video

Integrate Stable Diffusion and Stable Video Diffusion to convert text directly into video.

Advanced Live Portrait | Parameter Control

Use customizable parameters to control every feature, from eye blinks to head movements, for natural results.

Vid2Vid Part 1 | Composition and Masking

The ComfyUI Vid2Vid offers two distinct workflows to creating high-quality, professional animations: Vid2Vid Part 1, which enhances your creativity by focusing on the composition and masking of your original video, and Vid2Vid Part 2, which utilizes SDXL Style Transfer to transform the style of your video to match your desired aesthetic. This page specifically covers Vid2Vid Part 1

FLUX Kontext Dev | Intelligent Image Editing

FLUX Kontext Dev | Intelligent Image Editing

Kontext Dev = Controllable + All Graphic Design Needs in One Tool

FLUX ControlNet Depth-V3 & Canny-V3

Achieve better control with FLUX-ControlNet-Depth & FLUX-ControlNet-Canny for FLUX.1 [dev].

AnimateDiff + ControlNet | Cartoon Style

Give your videos a playful twist by transforming them into lively cartoons.

Follow us
  • LinkedIn
  • Facebook
  • Instagram
  • Twitter
Support
  • Discord
  • Email
  • System Status
  • Affiliate
Resources
  • Free ComfyUI Online
  • ComfyUI Guides
  • RunComfy API
  • ComfyUI Tutorials
  • ComfyUI Nodes
  • Learn More
Legal
  • Terms of Service
  • Privacy Policy
  • Cookie Policy
RunComfy
Copyright 2025 RunComfy. All Rights Reserved.

RunComfy is the premier ComfyUI platform, offering ComfyUI online environment and services, along with ComfyUI workflows featuring stunning visuals. RunComfy also provides AI Playground, enabling artists to harness the latest AI tools to create incredible art.