Doby-Xu/WithAnyone

<div align="center"> <h2>WithAnyone: Towards Controllable and ID-Consistent Image Generation</h2>    <p> <a href="https://arxiv.org/abs/2510.14975"><img src="https://img.shields.io/badge/arXiv-2510.14975-b31b1b.svg" alt="arXiv"/></a> <a href="https://doby-xu.github.io/WithAnyone/"><img src="https://img.shields.io/badge/Project-Page-blue.svg" alt="Project Page"/></a> <a href="https://huggingface.co/WithAnyone/WithAnyone"><img src="https://img.shields.io/badge/HuggingFace-Model-yellow.svg" alt="HuggingFace"/></a> <a href="https://huggingface.co/datasets/WithAnyone/MultiID-Bench"><img src="https://img.shields.io/badge/MultiID-Bench-Green.svg" alt="MultiID-Bench"/></a> <a href="https://huggingface.co/datasets/WithAnyone/MultiID-2M"><img src="https://img.shields.io/badge/MultiID_2M-Dataset-Green.svg" alt="MultiID-2M"/></a> <a href="https://huggingface.co/spaces/WithAnyone/WithAnyone_demo"><img src="https://img.shields.io/badge/Huggingface-Demo-blue.svg" alt="MultiID-2M"/></a> </p> </div>  <p align="center"> <a href="assets/withanyone.gif"> <img src="assets/withanyone.gif" alt="Teaser" width="800"/> </a> </p>

Star us if you find this project useful! ⭐

🎉 Updates

[12/2025] 🔥 Training Codebase is now released! We are also working on training WithAnyone.Z on Z-image, please stay tuned!
[11/2025] 🔥 ComfyUI (community contribution) is now supported!
[10/2025] 🔥 Hugging Face Space Demo is online — give it a try!
[10/2025] 🔥 Model Checkpoints, MultiID-Bench, and are released!

<div style="text-align:center; margin-top:12px;"> <img src="assets/fidelity_vs_copypaste_v200_single.png" alt="Copy-Paste" style="width:70%; max-width:900px; height:auto; display:inline-block;"> </div>  <details> <summary>WithAnyone.K</summary> This is a preliminary version of WithAnyone with FLUX.1 Kontext. It can be used for text-to-image generation with multiple given identities. However, stability and quality are not as good as the base model. Please use it with caution. We are working on improving it. </details> <details> <summary>WithAnyone.Ke</summary> This is a face editing version of WithAnyone with FLUX.1 Kontext, leveraging the editing capabilities of FLUX.1 Kontext. Please use it with `gradio_edit.py` instead of `gradio_app.py`. It is still a preliminary version, and we are working on improving it. </details> <div style="color:#999; font-size:0.95em; margin-top:8px;"> We need to use the ArcFace model for face embedding. It will automatically be downloaded to `./models/`. However, there is an original bug. If you see an error like `assert 'detection' in self.models`, please manually move the model directory: </div> <pre style="color:#888; background:transparent; border:0; padding:0; margin-top:8px;"> mv models/antelopev2/ models/antelopev2_ mv models/antelopev2_/antelopev2/ models/antelopev2/ rm -rf models/antelopev2_, antelopev2.zip </pre> <details> <summary>How the slider works and some tips</summary> The slider actually controlls the weight of SigLIP embedding and ArcFace embedding. The former preserves more mid-level semantic details, while the latter preserves more high-level identity information. </details> <p align="center"> <a href="assets/kontext.jpg"> <img src="assets/kontext.jpg" alt="Face Edit" width="800"/> </a> </p> Run it with:

Doby

Molt Pulse

🎉 Updates

❤ Community Contributions

🕒 Action Items

📑Introduction

⚡️ Quick Start

🏰 Model Zoo

🔧 Requirements

🔧 Model Checkpoints

⚡️ Gradio Demo

💡 Tips for Better Results

⚙️ Batch Inference

Download MultiID-Bench

Run Batch Inference

⚙️ Face Edit with FLUX.1 Kontext

Train

📜 License and Disclaimer

🌹 Acknowledgement

📑 Citation

Ecosystem Role