Leveraging Off-the-shelf Diffusion Model for Multi-attribute Fashion Image Manipulation

Cited 4 times in Web of Science; cited 7 times in Scopus
Authors

Kong, Chaerin; Jeon, DongHyeon; Kwon, Ohjoon; Kwak, Nojun

Issue Date
2023-01
Publisher
Institute of Electrical and Electronics Engineers Inc.
Citation
Proceedings - 2023 IEEE Winter Conference on Applications of Computer Vision, WACV 2023, pp.848-857
Abstract
Fashion attribute editing aims to change the semantic attributes of a given fashion image while preserving the irrelevant regions. Previous works typically employ conditional GANs in which the generator explicitly learns the target attributes and directly executes the conversion. These approaches, however, are neither scalable nor generic: they operate on only a few attributes, and a separate generator is required for each dataset or attribute set. Inspired by recent advances in diffusion models, we explore classifier-guided diffusion that leverages an off-the-shelf diffusion model pretrained on general visual semantics such as ImageNet. To achieve a generic editing pipeline, we pose this as a multi-attribute image manipulation task, where the attributes range from item category, fabric, and pattern to collar and neckline. We empirically show that conventional methods fail in our challenging setting, and study an efficient adaptation scheme that uses the recently introduced attention-pooling technique to obtain multi-attribute classifier guidance. Based on this, we present a mask-free fashion attribute editing framework that leverages the classifier logits and the cross-attention map for manipulation. We empirically demonstrate that our framework achieves convincing sample quality and attribute alignment.
ISSN
2472-6737
URI
https://hdl.handle.net/10371/205350
DOI
https://doi.org/10.1109/WACV56688.2023.00091
Related Researcher

  • Graduate School of Convergence Science & Technology
  • Department of Intelligence and Information
Research Area
Feature Selection and Extraction, Object Detection, Object Recognition

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.