Publications

Detailed Information

Rollback Ensemble With Multiple Local Minima in Fine-Tuning Deep Learning Networks

DC Field Value Language
dc.contributor.authorRo, Youngmin-
dc.contributor.authorChoi, Jongwon-
dc.contributor.authorHeo, Byeongho-
dc.contributor.authorChoi, Jin Young-
dc.date.accessioned2022-10-24T00:24:04Z-
dc.date.available2022-10-24T00:24:04Z-
dc.date.created2022-08-30-
dc.date.created2022-08-30-
dc.date.created2022-08-30-
dc.date.created2022-08-30-
dc.date.issued2022-09-
dc.identifier.citationIEEE Transactions on Neural Networks and Learning Systems, Vol.33 No.9, pp.4648-4660-
dc.identifier.issn2162-237X-
dc.identifier.urihttps://hdl.handle.net/10371/186630-
dc.description.abstractImage retrieval is a challenging problem that requires learning generalized features enough to identify untrained classes, even with very few classwise training samples. In this article, to obtain generalized features further in learning retrieval data sets, we propose a novel fine-tuning method of pretrained deep networks. In the retrieval task, we discovered a phenomenon in which the loss reduction in fine-tuning deep networks is stagnated, even while weights are largely updated. To escape from the stagnated state, we propose a new fine-tuning strategy to roll back some of the weights to the pretrained values. The rollback scheme is observed to drive the learning path to a gentle basin that provides more generalized features than a sharp basin. In addition, we propose a multihead ensemble structure to create synergy among multiple local minima obtained by our rollback scheme. Experimental results show that the proposed learning method significantly improves generalization performance, achieving state-of-the-art performance on the Inshop and SOP data sets.-
dc.language영어-
dc.publisherIEEE Computational Intelligence Society-
dc.titleRollback Ensemble With Multiple Local Minima in Fine-Tuning Deep Learning Networks-
dc.typeArticle-
dc.identifier.doi10.1109/TNNLS.2021.3059669-
dc.citation.journaltitleIEEE Transactions on Neural Networks and Learning Systems-
dc.identifier.wosid000732292700001-
dc.identifier.scopusid2-s2.0-85102302828-
dc.citation.endpage4660-
dc.citation.number9-
dc.citation.startpage4648-
dc.citation.volume33-
dc.description.isOpenAccessN-
dc.contributor.affiliatedAuthorChoi, Jin Young-
dc.type.docTypeArticle-
dc.description.journalClass1-
dc.subject.keywordAuthorTraining-
dc.subject.keywordAuthorTask analysis-
dc.subject.keywordAuthorNeural networks-
dc.subject.keywordAuthorTraining data-
dc.subject.keywordAuthorLearning systems-
dc.subject.keywordAuthorImage retrieval-
dc.subject.keywordAuthorGenerative adversarial networks-
dc.subject.keywordAuthorDeep neural network-
dc.subject.keywordAuthorfine-tuning-
dc.subject.keywordAuthorimage retrieval-
dc.subject.keywordAuthorlearning strategy-
dc.subject.keywordAuthorperson reidentification-
Appears in Collections:
Files in This Item:
There are no files associated with this item.

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share