Publications

Detailed Information

SAVE: Protagonist Diversification with Structure Agnostic Video Editing

Cited 0 time in Web of Science Cited 0 time in Scopus
Authors

Song, Yeji; Shin, Wonsik; Lee, Junsoo; Kim, Jeesoo; Kwak, Nojun

Issue Date
2025
Publisher
SPRINGER INTERNATIONAL PUBLISHING AG
Citation
COMPUTER VISION - ECCV 2024, PT LXXX, Vol.15138, pp.41-57
Abstract
Driven by the upsurge progress in text-to-image (T2I) generation models, text-to-video (T2V) generation has experienced a significant advance as well. Accordingly, tasks such as modifying the object or changing the style in a video have been possible. However, previous works usually work well on trivial and consistent shapes, and easily collapse on a difficult target that has a largely different body shape from the original one. In this paper, we spot the bias problem in the existing video editing method that restricts the range of choices for the new protagonist and attempt to address this issue using the conventional image-level personalization method. We adopt motion personalization that isolates the motion from a single source video and then modifies the protagonist accordingly. To deal with the natural discrepancy between image and video, we propose a motion word with an inflated textual embedding to properly represent the motion in a source video. We also regulate the motion word to attend to proper motion-related areas by introducing a novel pseudo optical flow, efficiently computed from the pre-calculated attention maps. Finally, we decouple the motion from the appearance of the source video with an additional pseudo word. Extensive experiments demonstrate the editing capability of our method, taking a step toward more diverse and extensive video editing. Our project page: https://ldynx.github.io/SAVE/
ISSN
0302-9743
URI
https://hdl.handle.net/10371/216404
DOI
https://doi.org/10.1007/978-3-031-72989-8_3
Files in This Item:
There are no files associated with this item.
Appears in Collections:

Related Researcher

  • Graduate School of Convergence Science & Technology
  • Department of Intelligence and Information
Research Area Feature Selection and Extraction, Object Detection, Object Recognition

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share