ComfyUI PhotoMakerV2 | Create Realistic Photos

Workflow Name: RunComfy/PhotoMakerV2
Workflow ID: 0000...1109
ComfyUI PhotoMakerV2 is a powerful text-to-image generation tool that enables users to create realistic personalized photos efficiently. By inputting identity images and a text prompt, PhotoMakerV2 preserves the likeness of the individuals while allowing flexible control over context, style, and attributes. This latest version offers improved identity fidelity compared to its predecessor. Discover the creative possibilities of generating photorealistic images in different settings, stylizing appearances, and even merging identities.

What is PhotoMakerV2

PhotoMakerV2, an upgrade from PhotoMaker, offers an efficient method for personalized text-to-image generation. It synthesizes realistic photos of individuals using a few input identity images and a text prompt.

Some key features of PhotoMakerV2 include:

  • High efficiency: Quickly generates personalized photos.
  • Excellent identity preservation: Maintains the likeness of input identities.
  • Flexible text control: Allows specifying context, style, attributes, etc., in the prompt.
  • Improved identity fidelity: Enhanced compared to PhotoMaker V1.

PhotoMakerV2 generates photorealistic images of a person in various contexts, stylizes appearances, changes attributes like age and gender, merges identities, and modernizes people from old photos or artwork. It unlocks numerous creative possibilities.

How PhotoMakerV2 Works

PhotoMakerV2 encodes one or more input identity images into a "stacked ID embedding," serving as a unified representation encapsulating identity information.

This embedding, combined with a text prompt, feeds into a text-to-image diffusion model. The model then produces an image depicting the embedded identity in the context described by the prompt.
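
As a rough illustration of the stacking idea (not PhotoMaker's actual code), the sketch below treats each identity image as an already-encoded vector, stacks the vectors into one sequence, and splices that sequence into the prompt's token embeddings at the trigger position. The function names, the toy 2-D vectors, the "img" trigger, and the "exactly one trigger" rule are all assumptions made for illustration; the real model uses learned encoders and a fusion step that is omitted here.

```python
from typing import List

Vector = List[float]


def stack_id_embeddings(per_image: List[Vector]) -> List[Vector]:
    """Stack per-image identity vectors into one sequence, one row per image."""
    if not per_image:
        raise ValueError("at least one identity embedding is required")
    dim = len(per_image[0])
    if any(len(v) != dim for v in per_image):
        raise ValueError("all identity embeddings must share one dimension")
    # Rows are copied, not averaged, so every input image keeps its own slot.
    return [list(v) for v in per_image]


def splice_identity(
    prompt_tokens: List[str],
    token_embeddings: List[Vector],
    stacked_id: List[Vector],
    trigger: str,
) -> List[Vector]:
    """Replace the trigger token's embedding with the stacked ID rows."""
    if len(prompt_tokens) != len(token_embeddings):
        raise ValueError("need exactly one embedding per prompt token")
    if prompt_tokens.count(trigger) != 1:
        # Assumption of this sketch: the trigger appears exactly once.
        raise ValueError("prompt must contain the trigger word exactly once")
    i = prompt_tokens.index(trigger)
    return token_embeddings[:i] + stacked_id + token_embeddings[i + 1:]


# Toy data: five prompt tokens and three identity images, all hypothetical.
tokens = ["portrait", "of", "a", "woman", "img"]
token_embeddings = [[0.1, 0.0], [0.0, 0.1], [0.2, 0.2], [0.3, 0.1], [0.0, 0.0]]
id_vectors = [[1.0, 0.0], [0.9, 0.1], [0.8, 0.2]]

stacked = stack_id_embeddings(id_vectors)
conditioning = splice_identity(tokens, token_embeddings, stacked, trigger="img")
print(len(conditioning))  # 4 text rows + 3 identity rows
```

Keeping one row per image, instead of averaging, is what lets the downstream model weigh each identity image separately.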

Some key aspects of how it works under the hood:

  • Uses an identity encoder to extract identity information from input face images
  • Improves identity preservation by leveraging an external face recognition model (InsightFace)
  • Encodes multiple identity images into a stacked embedding to capture identity comprehensively
  • Feeds the stacked ID embedding into the diffusion model's cross-attention layers
  • Guides generation with the text prompt while adaptively merging the identity information
  • Trained with an identity-oriented dataset to improve identification capabilities
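
To make the cross-attention bullet more concrete, here is a toy single-head cross-attention in plain Python: each image-side query attends over a conditioning sequence (text rows plus identity rows) and receives a weighted mix of them. This is a generic textbook form, not PhotoMakerV2's implementation. It assumes identity projections (no learned weight matrices) and a single head, which real diffusion models do not.

```python
import math
from typing import List

Vector = List[float]


def softmax(scores: List[float]) -> List[float]:
    """Numerically stable softmax over a list of scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def cross_attention(queries: List[Vector], context: List[Vector]) -> List[Vector]:
    """Scaled dot-product attention where context rows act as keys and values."""
    dim = len(context[0])
    scale = 1.0 / math.sqrt(dim)
    outputs = []
    for q in queries:
        # Similarity of this query to every conditioning row.
        scores = [scale * sum(a * b for a, b in zip(q, k)) for k in context]
        weights = softmax(scores)
        # Output is a convex combination of the conditioning rows.
        mixed = [sum(w * row[j] for w, row in zip(weights, context)) for j in range(dim)]
        outputs.append(mixed)
    return outputs


# Hypothetical conditioning: row 0 stands in for an identity row, row 1 for text.
context = [[1.0, 0.0], [0.0, 1.0]]
attended = cross_attention([[10.0, 0.0]], context)
print(attended)
```

A query aligned with the identity row pulls almost all of its output from that row, which is the mechanism by which conditioning rows steer individual image regions.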

How to Use ComfyUI PhotoMakerV2

To use PhotoMakerV2 in ComfyUI, you primarily interact with the "PhotoMaker Encode Plus" node. A typical workflow involves:

  1. Load the PhotoMakerV2 model using the "PhotoMaker Loader Plus" node.
  2. Load one or more identity images using the "Prepare Images For CLIP Vision" node.
  3. Load the InsightFace model required by PhotoMakerV2 using the "PhotoMaker InsightFace Loader" node.
  4. Connect the outputs of these nodes to the corresponding inputs of the "PhotoMaker Encode Plus" node.
  5. In the "PhotoMaker Encode Plus" node, specify the prompt describing the desired image. Place the special trigger word in the prompt where the identity should appear.
  6. Connect the conditioning output of the "PhotoMaker Encode Plus" node to a "KSampler" node to generate the image.
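
The wiring in the steps above can be sketched as a graph in ComfyUI's API-style layout, where each node is an entry with a class_type and inputs, and a link is a [node_id, output_index] pair. Every class_type string, input name, file placeholder, and the "img" trigger below is an assumption made for illustration, so check the names your installed nodes actually expose. The graph is also deliberately partial: inputs such as the checkpoint model, CLIP, negative conditioning, and latent image are omitted, so it cannot be submitted as-is.

```python
import json
from typing import Dict, List, Tuple


def build_photomaker_graph(prompt: str, trigger: str) -> Dict[str, dict]:
    """Build a partial node graph mirroring the six workflow steps."""
    if trigger not in prompt.split():
        raise ValueError(f"prompt must contain the trigger word {trigger!r}")
    return {
        # Step 1: PhotoMakerV2 model loader (class and input names assumed).
        "1": {"class_type": "PhotoMakerLoaderPlus",
              "inputs": {"photomaker_model_name": "<photomaker-v2 model file>"}},
        # Step 2: identity images (class and input names assumed).
        "2": {"class_type": "PrepImagesForClipVisionFromPath",
              "inputs": {"path": "<folder with identity images>"}},
        # Step 3: InsightFace loader (class and input names assumed).
        "3": {"class_type": "PhotoMakerInsightFaceLoader",
              "inputs": {"provider": "CPU"}},
        # Steps 4 and 5: encode node receives the three outputs plus the prompt.
        "4": {"class_type": "PhotoMakerEncodePlus",
              "inputs": {"photomaker": ["1", 0], "image": ["2", 0],
                         "insightface": ["3", 0], "text": prompt,
                         "trigger_word": trigger}},
        # Step 6: conditioning feeds the sampler (other sampler inputs omitted).
        "5": {"class_type": "KSampler", "inputs": {"positive": ["4", 0]}},
    }


def dangling_links(graph: Dict[str, dict]) -> List[Tuple[str, str, str]]:
    """Return (node_id, input_name, target_id) for links to missing nodes."""
    missing = []
    for node_id, node in graph.items():
        for name, value in node["inputs"].items():
            is_link = (isinstance(value, list) and len(value) == 2
                       and isinstance(value[0], str))
            if is_link and value[0] not in graph:
                missing.append((node_id, name, value[0]))
    return missing


graph = build_photomaker_graph("portrait of a woman img", trigger="img")
print(json.dumps(graph, indent=2))
print(dangling_links(graph))
```

Checking for dangling links before queueing a graph catches the most common wiring mistake, a node reference that no longer exists after editing.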

For more information, please visit the PhotoMaker page on Hugging Face and the ComfyUI-PhotoMaker-Plus project. All credit goes to their authors.

Copyright 2025 RunComfy. All Rights Reserved.
