Structured Energy Network as a Loss Function
DC Field | Value | Language |
---|---|---|
dc.contributor.author | Lee, Jay-Yoon | - |
dc.contributor.author | Patel, Dhruvesh | - |
dc.contributor.author | Goyal, Purujit | - |
dc.contributor.author | Zhao, Wenlong | - |
dc.contributor.author | Xu, Zhiyang | - |
dc.contributor.author | McCallum, Andrew | - |
dc.date.accessioned | 2024-05-03T07:37:42Z | - |
dc.date.available | 2024-05-03T07:37:42Z | - |
dc.date.created | 2024-04-29 | - |
dc.date.issued | 2022 | - |
dc.identifier.citation | Advances in Neural Information Processing Systems, Vol.35 | - |
dc.identifier.issn | 1049-5258 | - |
dc.identifier.uri | https://hdl.handle.net/10371/200916 | - |
dc.description.abstract | Belanger & McCallum (2016) and Gygli et al. (2017) have shown that energy networks can capture arbitrary dependencies amongst the output variables in structured prediction; however, their reliance on gradient-based inference (GBI) makes the inference slow and unstable. In this work, we propose Structured Energy As Loss (SEAL) to take advantage of the expressivity of energy networks without incurring the high inference cost. This is a novel learning framework that uses an energy network as a trainable loss function (loss-net) to train a separate neural network (task-net), which is then used to perform inference through a forward pass. We establish SEAL as a general framework wherein various learning strategies like margin-based, regression, and noise-contrastive could be employed to learn the parameters of loss-net. Through extensive evaluation on multi-label classification, semantic role labeling, and image segmentation, we demonstrate that SEAL provides various useful design choices, is faster at inference than GBI, and leads to significant performance gains over the baselines. | - |
dc.language | English | - |
dc.publisher | Advances in Neural Information Processing Systems | - |
dc.title | Structured Energy Network as a Loss Function | - |
dc.type | Article | - |
dc.citation.journaltitle | Advances in Neural Information Processing Systems | - |
dc.identifier.scopusid | 2-s2.0-85163155923 | - |
dc.citation.volume | 35 | - |
dc.description.isOpenAccess | N | - |
dc.contributor.affiliatedAuthor | Lee, Jay-Yoon | - |
dc.type.docType | Conference Paper | - |
dc.description.journalClass | 1 | - |
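
The abstract contrasts gradient-based inference (GBI) on an energy network with a task-net that predicts in a single forward pass after being trained with the energy as its loss. The toy sketch below illustrates only that contrast and is not the authors' implementation. The quadratic energy, the linear task-net, and every function name are assumptions made for illustration, and the loss-net is held fixed here for brevity, whereas the abstract describes it as trainable.

```python
# Toy sketch (illustrative assumptions throughout, not the paper's code):
# an energy function serves as the loss for a separate task-net.

def energy(x, y, c=0.5):
    """Hypothetical fixed loss-net: low when both outputs match x and agree."""
    return (y[0] - x) ** 2 + (y[1] - x) ** 2 + c * (y[0] - y[1]) ** 2

def energy_grad_y(x, y, c=0.5):
    """Analytic gradient of the energy with respect to the output y."""
    d = y[0] - y[1]
    return [2 * (y[0] - x) + 2 * c * d, 2 * (y[1] - x) - 2 * c * d]

def gradient_based_inference(x, steps=100, lr=0.1):
    """GBI: run iterative gradient descent on y for every single input."""
    y = [0.0, 0.0]
    for _ in range(steps):
        g = energy_grad_y(x, y)
        y = [y[0] - lr * g[0], y[1] - lr * g[1]]
    return y

def task_net(x, params):
    """Hypothetical task-net: one linear unit per output, one forward pass."""
    return [params[0] * x + params[1], params[2] * x + params[3]]

def train_task_net(xs, epochs=500, lr=0.05):
    """Train the task-net by descending the energy of its own predictions."""
    params = [0.0, 0.0, 0.0, 0.0]
    for _ in range(epochs):
        for x in xs:
            y = task_net(x, params)
            g = energy_grad_y(x, y)
            # Chain rule through the linear units: dE/da_i = dE/dy_i * x,
            # dE/db_i = dE/dy_i.
            grads = [g[0] * x, g[0], g[1] * x, g[1]]
            params = [p - lr * gp for p, gp in zip(params, grads)]
    return params

train_xs = [0.0, 0.25, 0.5, 0.75, 1.0]
params = train_task_net(train_xs)

y_forward = task_net(0.6, params)      # inference: a single forward pass
y_gbi = gradient_based_inference(0.6)  # inference: 100 gradient steps
```

In this toy setting both routes reach a low-energy output, but the task-net pays the optimization cost once at training time, while GBI repeats it for every input.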