Publications
Detailed Information
Broadcasting Convolutional Network for Visual Relational Reasoning
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Chang, Simyung | - |
dc.contributor.author | Yang, John | - |
dc.contributor.author | Park, SeongUk | - |
dc.contributor.author | Kwak, Nojun | - |
dc.date.accessioned | 2024-08-08T01:32:45Z | - |
dc.date.available | 2024-08-08T01:32:45Z | - |
dc.date.created | 2024-03-14 | - |
dc.date.created | 2024-03-14 | - |
dc.date.issued | 2018 | - |
dc.identifier.citation | IET Computer Vision, Vol.11219, pp.780-796 | - |
dc.identifier.issn | 1751-9632 | - |
dc.identifier.uri | https://hdl.handle.net/10371/206564 | - |
dc.description.abstract | In this paper, we propose the Broadcasting Convolutional Network (BCN) that extracts key object features from the global field of an entire input image and recognizes their relationship with local features. BCN is a simple network module that collects effective spatial features, embeds location information and broadcasts them to the entire feature maps. We further introduce the Multi-Relational Network (multiRN) that improves the existing Relation Network (RN) by utilizing the BCN module. In pixel-based relation reasoning problems, with the help of BCN, multiRN extends the concept of 'pairwise relations' in conventional RNs to 'multiwise relations' by relating each object with multiple objects at once. This yields in O(n) complexity for n objects, which is a vast computational gain from RNs that take O(n(2)). Through experiments, multiRN has achieved a state-of-the-art performance on CLEVR dataset, which proves the usability of BCN on relation reasoning problems. | - |
dc.language | 영어 | - |
dc.publisher | Institution of Engineering and Technology | - |
dc.title | Broadcasting Convolutional Network for Visual Relational Reasoning | - |
dc.type | Article | - |
dc.identifier.doi | 10.1007/978-3-030-01267-0_46 | - |
dc.citation.journaltitle | IET Computer Vision | - |
dc.identifier.wosid | 000612999000046 | - |
dc.identifier.scopusid | 2-s2.0-85055418062 | - |
dc.citation.endpage | 796 | - |
dc.citation.startpage | 780 | - |
dc.citation.volume | 11219 | - |
dc.description.isOpenAccess | Y | - |
dc.contributor.affiliatedAuthor | Kwak, Nojun | - |
dc.type.docType | Proceedings Paper | - |
dc.description.journalClass | 1 | - |
dc.subject.keywordAuthor | Visual relational reasoning | - |
dc.subject.keywordAuthor | BCN | - |
dc.subject.keywordAuthor | Broadcast | - |
dc.subject.keywordAuthor | CLEVR | - |
dc.subject.keywordAuthor | Multi-RN | - |
dc.subject.keywordAuthor | Visuo-spatial features | - |
- Appears in Collections:
- Files in This Item:
- There are no files associated with this item.
Related Researcher
- Graduate School of Convergence Science & Technology
- Department of Intelligence and Information
Item View & Download Count
Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.