CharaConsist: Fine-Grained Consistent Character Generation

¹Institute of Information Science, Beijing Jiaotong University
²Institute of Big Data, Fudan University
³Visual Intelligence + X International Joint Laboratory
⁴Alkaid Pte. Ltd.
ICCV 2025
†Corresponding Authors

Abstract

In text-to-image generation, producing a series of consistent content that preserves the same identity is highly valuable for real-world applications. Although a few works have explored training-free methods to enhance the consistency of generated subjects, we observe that they suffer from two problems. First, they fail to maintain consistent background details, which limits their applicability. Second, when the foreground character undergoes large motion variations, inconsistencies in identity and clothing details become evident. To address these problems, we propose CharaConsist, which employs point-tracking attention and adaptive token merge along with decoupled control of the foreground and background. CharaConsist enables fine-grained consistency for both foreground and background, supporting the generation of one character in continuous shots within a fixed scene or in discrete shots across different scenes. Moreover, CharaConsist is the first consistent generation method tailored for text-to-image DiT models. Its ability to maintain fine-grained consistency, combined with the larger capacity of the latest base models, enables it to produce high-quality visual outputs, broadening its applicability to a wider range of real-world scenarios.

Background Maintaining

A rugged man with a beard, wearing a red jacket and black snow pants ...

A young black male scientist, wearing a white lab coat, black-rimmed glasses ...

A muscular man in his 30s, wearing a black tank top and shorts ...

A stunning female cyber-punk android combatant within a circle of light on the ground, white high-tech mechanical body, beautiful face, long brown hair ...

Background Switching

A young woman with short pink hair, wearing a leather jacket and jeans ...

A young man with short hair, wearing a leather jacket and sunglasses ...

A young boy with curly brown hair, wearing a T-shirt and shorts ...

A young white man with short brown hair, wearing a white and blue sports t-shirt and black shorts ...

@inproceedings{CharaConsist,
  title={{CharaConsist}: Fine-Grained Consistent Character Generation},
  author={Wang, Mengyu and Ding, Henghui and Peng, Jianing and Zhao, Yao and Chen, Yunpeng and Wei, Yunchao},
  booktitle={ICCV},
  year={2025}
}