Publications

Detailed Information

Real-time head orientation from a monocular camera using deep neural network

DC Field Value Language
dc.contributor.authorAhn, Byungtae-
dc.contributor.authorPark, Jaesik-
dc.contributor.authorKweon, In So-
dc.date.accessioned2024-05-09T04:14:24Z-
dc.date.available2024-05-09T04:14:24Z-
dc.date.created2024-05-09-
dc.date.created2024-05-09-
dc.date.issued2015-
dc.identifier.citationLecture Notes in Computer Science, Vol.9005, pp.82-96-
dc.identifier.issn0302-9743-
dc.identifier.urihttps://hdl.handle.net/10371/201328-
dc.description.abstractWe propose an efficient and accurate head orientation estimation algorithm using a monocular camera. Our approach is leveraged by deep neural network and we exploit the architecture in a data regression manner to learn the mapping function between visual appearance and three dimensional head orientation angles. Therefore, in contrast to classification based approaches, our system outputs continuous head orientation. The algorithm uses convolutional filters trained with a large number of augmented head appearances, thus it is user independent and covers large pose variations. Our key observation is that an input image having 32 × 32 resolution is enough to achieve about 3 degrees of mean square error, which can be used for efficient head orientation applications. Therefore, our architecture takes only 1ms on roughly localized head positions with the aid of GPU. We also propose particle filter based post-processing to enhance stability of the estimation further in video sequences. We compare the performance with the state-of-the-art algorithm which utilizes depth sensor and we validate our head orientation estimator on Internet photos and video.-
dc.language영어-
dc.publisherSpringer Verlag-
dc.titleReal-time head orientation from a monocular camera using deep neural network-
dc.typeArticle-
dc.identifier.doi10.1007/978-3-319-16811-1_6-
dc.citation.journaltitleLecture Notes in Computer Science-
dc.identifier.wosid000362446900006-
dc.identifier.scopusid2-s2.0-84983591340-
dc.citation.endpage96-
dc.citation.startpage82-
dc.citation.volume9005-
dc.description.isOpenAccessN-
dc.contributor.affiliatedAuthorPark, Jaesik-
dc.type.docTypeConference Paper-
dc.description.journalClass1-
Appears in Collections:
Files in This Item:
There are no files associated with this item.

Related Researcher

  • College of Engineering
  • Dept. of Computer Science and Engineering
Research Area Computer Graphics, Computer Vision, Machine Learning, Robotics

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share