Publications

Detailed Information

Stad: Stable Video Depth Estimation

DC Field Value Language
dc.contributor.authorLee, Hyunmin-
dc.contributor.authorPark, Jaesik-
dc.date.accessioned2024-05-09T04:13:01Z-
dc.date.available2024-05-09T04:13:01Z-
dc.date.created2024-05-08-
dc.date.created2024-05-08-
dc.date.issued2021-
dc.identifier.citationProceedings - International Conference on Image Processing, ICIP, Vol.2021-September, pp.3213-3217-
dc.identifier.issn1522-4880-
dc.identifier.urihttps://hdl.handle.net/10371/201302-
dc.description.abstractWe present a method for estimating temporally stable depth video from a sequence of images. We extend the prior work aimed at video depth estimation, Neural-RGBD [1], which proposed to use temporal information by accumulating a depth probability volume over time. We propose three simple yet effective ideas to gain improvement: (1) temporal attention module to select and propagate only the meaningful temporal information, (2) geometric warping operation to warp neighbor features in the way of preserving geometry cues, and (3) scale-invariant loss to relieve the inherent scale ambiguity problem in monocular depth estimation task. We demonstrate the efficiency of proposed ideas by comparing our proposed network STAD with the state-of-the-arts. Moreover, we compare STAD with its per-frame network STAD-frame to show the importance of utilizing temporal information. The experimental results show that STAD significantly improved the baseline accuracy without a large parameter increase.-
dc.language영어-
dc.publisherICIP-
dc.titleStad: Stable Video Depth Estimation-
dc.typeArticle-
dc.identifier.doi10.1109/ICIP42928.2021.9506521-
dc.citation.journaltitleProceedings - International Conference on Image Processing, ICIP-
dc.identifier.wosid000819455103067-
dc.identifier.scopusid2-s2.0-85125593146-
dc.citation.endpage3217-
dc.citation.startpage3213-
dc.citation.volume2021-September-
dc.description.isOpenAccessN-
dc.contributor.affiliatedAuthorPark, Jaesik-
dc.type.docTypeConference Paper-
dc.description.journalClass1-
dc.subject.keywordAuthorComputer Vision-
dc.subject.keywordAuthorDepth Estimation-
dc.subject.keywordAuthorVideo Depth Estimation-
dc.subject.keywordAuthorTemporal Attention-
dc.subject.keywordAuthor3D Geometry-
Appears in Collections:
Files in This Item:
There are no files associated with this item.

Related Researcher

  • College of Engineering
  • Dept. of Computer Science and Engineering
Research Area Computer Graphics, Computer Vision, Machine Learning, Robotics

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share