Publications

Detailed Information

Breaking The Limits of Text-conditioned 3D Motion Synthesis with Elaborative Descriptions

Cited 0 time in Web of Science Cited 0 time in Scopus
Authors

Qian, Yijun; Urbanek, Jack; Hauptmann, Alexander G.; Won, Jungdam

Issue Date
2023
Publisher
Institute of Electrical and Electronics Engineers Inc.
Citation
Proceedings of the IEEE International Conference on Computer Vision, pp.2306-2316
Abstract
Given its wide applications, there is increasing focus on generating 3D human motions from textual descriptions. Differing from the majority of previous works, which regard actions as single entities and can only generate short sequences for simple motions, we propose EMS, an elaborative motion synthesis model conditioned on detailed natural language descriptions. It generates natural and smooth motion sequences for long and complicated actions by factorizing them into groups of atomic actions. Meanwhile, it understands atomic-action level attributes (e.g., motion direction, speed, and body parts) and enables users to generate sequences of unseen complex actions from unique sequences of known atomic actions with independent attribute settings and timings applied. We evaluate our method on the KIT Motion-Language and BABEL benchmarks, where it outperforms all previous state-of-the-art with noticeable margins.
ISSN
1550-5499
URI
https://hdl.handle.net/10371/201162
DOI
https://doi.org/10.1109/ICCV51070.2023.00219
Files in This Item:
There are no files associated with this item.
Appears in Collections:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share