Publications

Detailed Information

Learning to Specialize with Knowledge Distillation for Visual Question Answering

DC Field Value Language
dc.contributor.authorMun, Jonghwan-
dc.contributor.authorLee, Kimin-
dc.contributor.authorShin, Jinwoo-
dc.contributor.authorHan, Bohyung-
dc.date.accessioned2022-10-26T07:21:57Z-
dc.date.available2022-10-26T07:21:57Z-
dc.date.created2022-10-21-
dc.date.issued2018-12-
dc.identifier.citationADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018), Vol.31, pp.8081-8091-
dc.identifier.issn1049-5258-
dc.identifier.urihttps://hdl.handle.net/10371/186823-
dc.description.abstractVisual Question Answering (VQA) is a notoriously challenging problem because it involves various heterogeneous tasks defined by questions within a unified framework. Learning specialized models for individual types of tasks is intuitively attracting but surprisingly difficult; it is not straightforward to outperform naive independent ensemble approach. We present a principled algorithm to learn specialized models with knowledge distillation under a multiple choice learning (MCL) framework, where training examples are assigned dynamically to a subset of models for updating network parameters. The assigned and non-assigned models are learned to predict ground-truth answers and imitate their own base models before specialization, respectively. Our approach alleviates the limitation of data deficiency in existing MCL frameworks, and allows each model to learn its own specialized expertise without forgetting general knowledge. The proposed framework is model-agnostic and applicable to any tasks other than VQA, e.g., image classification with a large number of labels but few per-class examples, which is known to be difficult under existing MCL schemes. Our experimental results indeed demonstrate that our method outperforms other baselines for VQA and image classification.-
dc.language영어-
dc.publisherNEURAL INFORMATION PROCESSING SYSTEMS (NIPS)-
dc.titleLearning to Specialize with Knowledge Distillation for Visual Question Answering-
dc.typeArticle-
dc.citation.journaltitleADVANCES IN NEURAL INFORMATION PROCESSING SYSTEMS 31 (NIPS 2018)-
dc.identifier.wosid000461852002061-
dc.identifier.scopusid2-s2.0-85064812500-
dc.citation.endpage8091-
dc.citation.startpage8081-
dc.citation.volume31-
dc.description.isOpenAccessN-
dc.contributor.affiliatedAuthorHan, Bohyung-
dc.type.docTypeProceedings Paper-
dc.description.journalClass1-
Appears in Collections:
Files in This Item:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share