Publications

Detailed Information

Frame-to-Frame Aggregation of Active Regions in Web Videos for Weakly Supervised Semantic Segmentation

DC Field Value Language
dc.contributor.authorLee, Jungbeom-
dc.contributor.authorKim, Eunji-
dc.contributor.authorLee, Sungmin-
dc.contributor.authorLee, Jangho-
dc.contributor.authorYoon, Sungroh-
dc.date.accessioned2022-10-26T07:23:42Z-
dc.date.available2022-10-26T07:23:42Z-
dc.date.created2022-10-19-
dc.date.issued2019-02-
dc.identifier.citation2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019), pp.6807-6817-
dc.identifier.issn1550-5499-
dc.identifier.urihttps://hdl.handle.net/10371/186972-
dc.description.abstractWhen a deep neural network is trained on data with only image-level labeling, the regions activated in each image tend to identify only a small region of the target object. We propose a method of using videos automatically harvested from the web to identify a larger region of the target object by using temporal information, which is not present in the static image. The temporal variations in a video allow different regions of the target object to be activated. We obtain an activated region in each frame of a video, and then aggregate the regions from successive frames into a single image, using a warping technique based on optical flow. The resulting localization maps cover more of the target object, and can then be used as proxy ground-truth to train a segmentation network. This simple approach outperforms existing methods under the same level of supervision, and even approaches relying on extra annotations. Based on VGG-16 and ResNet 101 backbones, our method achieves the mIoU of 65.0 and 67.4, respectively, on PASCAL VOC 2012 test images, which represents a new state-of-the-art.-
dc.language영어-
dc.publisherIEEE-
dc.titleFrame-to-Frame Aggregation of Active Regions in Web Videos for Weakly Supervised Semantic Segmentation-
dc.typeArticle-
dc.identifier.doi10.1109/ICCV.2019.00691-
dc.citation.journaltitle2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019)-
dc.identifier.wosid000548549201093-
dc.identifier.scopusid2-s2.0-85081911329-
dc.citation.endpage6817-
dc.citation.startpage6807-
dc.description.isOpenAccessN-
dc.contributor.affiliatedAuthorYoon, Sungroh-
dc.type.docTypeProceedings Paper-
dc.description.journalClass1-
Appears in Collections:
Files in This Item:
There are no files associated with this item.

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share