Exploiting Spatial Dimensions of Latent in GAN for Real-time Image Editing

김현수

서울대학교 중앙도서관

S-Space 소개

My S-Space

로그인이 필요합니다.

S-Space

Publications

Detailed Information

Exploiting Spatial Dimensions of Latent in GAN for Real-time Image Editing : 실시간 이미지 편집을 위한 GAN 내 잠재의 공간차원 활용

DC Field	Value	Language
dc.contributor.advisor	유승주	-
dc.contributor.author	김현수	-
dc.date.accessioned	2021-11-30T02:38:05Z	-
dc.date.available	2021-11-30T02:38:05Z	-
dc.date.issued	2021-02	-
dc.identifier.other	000000163612	-
dc.identifier.uri	https://hdl.handle.net/10371/175397	-
dc.identifier.uri	https://dcollection.snu.ac.kr/common/orgView/000000163612	ko_KR
dc.description	학위논문 (석사) -- 서울대학교 대학원 : 공과대학 컴퓨터공학부, 2021. 2. 유승주.	-
dc.description.abstract	Generative adversarial networks (GANs) have been successful in synthesizing and manipulating synthetic but realistic images from latent vectors. However, it is still challenging for GANs to manipulate real images, especially in real-time. State-of-the-art GAN-based methods for editing real images suffer from time-consuming operations in projecting real images to latent vectors. Alternatively, an encoder can be trained to embed real images to the latent space instantly, but it loses details drastically. We propose StyleMapGAN, which adopts a novel representation of latent space, called stylemap, incorporating spatial dimension into embedding. Because each spatial location in the stylemap contributes to its corresponding region of the generated images, the real-time projection through the encoder becomes accurate as well as editing real images becomes spatially controllable. Experimental results demonstrate that our method significantly outperforms state-of-the-art models in various image manipulation tasks such as local editing and image interpolation. Especially, detailed comparisons show that our local editing method successfully reflects not only the color and texture but also the shape of a reference image while preserving untargeted regions.	-
dc.description.abstract	적대적 생성 신경망(GAN)은 실존하지 않지만, 실제 존재하는 것 같은 이미지들을 생성하는데 성공적으로 이용되고 있다. 또한 각 이미지를 생성하는 잠재 벡터를 이용해 가짜(실존하지 않는) 이미지들을 편집할 수 있다. 그러나 가짜 이미지가 아닌 실제 이미지를 편집하는 것은 어렵고, 특히 실시간으로는 더욱 어렵다. GAN을 이용해 실제 이미지를 편집하는 최첨단 방법들은 실제 이미지를 잠재 벡터로 투영하는 것이 선행되어야 하는데, 이 부분에 많은 시간이 소요된다. 그 대안으로 인코더를 학습해서 실제 이미지를 잠재 공간으로 즉시 임베딩할 수 있지만, 잠재 벡터를 다시 이미지로 복원 시 많은 디테일들을 잃어버린다. 우리는 새로운 형태의 잠재 공간인 stylemap을 가지는 StyleMapGAN을 제안했는데, stylemap은 기존 벡터 공간에 공간적인 차원을 추가한 잠재 공간이다. Stylemap의 각 위치는 생성된 이미지의 해당 지역에 대응되어, 인코더를 이용해 실시간이면서도 정확한 투영이 가능해질 뿐만 아니라, 실제 이미지의 공간적으로 편집이 가능해진다. 많은 실험 결과들은 우리가 제안한 방법이 다양한 이미지 편집 작업들(예를 들어, 국소적인 편집 및 이미지 보간)에서 기존 최첨단 방법들을 월등히 능가함을 보여준다. 특히 자세한 비교 실험들은 우리가 제안한 국소적인 편집 방법이 효과적임을 보여준다. 타겟하지 않는 부분의 기존 이미지는 잘 유지되고, 타겟하는 부분에서는 참조 이미지의 색과 질감뿐만 아니라 모양까지 잘 가져옴을 볼 수 있다.	-
dc.description.tableofcontents	Abstract i Contents iii List of Tables v List of Figures vii Chapter 1 Introduction 1 Chapter 2 Related Work 3 Chapter 3 StyleMapGAN 5 3.1 Stylemap-based generator 7 3.2 Training procedure and losses 10 3.3 Local editing 10 Chapter 4 Experiments 12 4.1 Experimental Setup 12 4.2 Evaluation metrics 13 4.3 Effects of stylemap resolution 17 4.4 Real Image Projection 19 4.5 Local Editing 19 4.6 Unaligned Transplantation 20 Chapter 5 Discussion and Conclusion 24 References 25 Appendix 31 A Local editing in w+ space 31 B Additional results 33 B.1 Random generation 33 B.2 Image projection & Interpolation 34 B.3 Local editing 34 B.4 Unaligned transplantation 39 B.5 Style mixing 39 B.6 Semantic manipulation 39 B.7 Failure cases 40 C Implementation details 46 D Loss details 49 국문초록 51	-
dc.format.extent	x, 64	-
dc.language.iso	eng	-
dc.publisher	서울대학교 대학원	-
dc.subject	Generative Adversarial Network	-
dc.subject	Image Manipulation	-
dc.subject	Deep Learning	-
dc.subject	적대적 생성 신경망	-
dc.subject	이미지 편집	-
dc.subject	딥러닝	-
dc.subject.ddc	621.39	-
dc.title	Exploiting Spatial Dimensions of Latent in GAN for Real-time Image Editing	-
dc.title.alternative	실시간 이미지 편집을 위한 GAN 내 잠재의 공간차원 활용	-
dc.type	Thesis	-
dc.type	Dissertation	-
dc.contributor.AlternativeAuthor	Hyunsu Kim	-
dc.contributor.department	공과대학 컴퓨터공학부	-
dc.description.degree	Master	-
dc.date.awarded	2021-02	-
dc.identifier.uci	I804:11032-000000163612	-
dc.identifier.holdings	000000000044▲000000000050▲000000163612▲	-

Appears in Collections:

College of Engineering/Engineering Practice School (공과대학/대학원)
- Dept. of Computer Science and Engineering (컴퓨터공학부)
  - Theses (Master's Degree_컴퓨터공학부)

Files in This Item:

000000163612.pdf 3.84 MB

Altmetrics

Item View & Download Count

Show Simple Item Record

Find it @ SNU

트윗하기

SNS Share