Publications

Detailed Information

Scalable and Safe Remediation of Defective Actions in Self-Learning Conversational Systems

DC Field Value Language
dc.contributor.authorAhuja, Sarthak-
dc.contributor.authorKachuee, Mohammad-
dc.contributor.authorSheikholeslami, Fateme-
dc.contributor.authorLiu, Weiqing-
dc.contributor.authorDo, Jae Young-
dc.date.accessioned2024-05-09T06:42:03Z-
dc.date.available2024-05-09T06:42:03Z-
dc.date.created2024-05-09-
dc.date.issued2023-07-
dc.identifier.citationAssociation for Computational Linguistics (ACL). Annual Meeting Conference Proceedings, Vol.5, pp.361-367-
dc.identifier.issn0736-587X-
dc.identifier.urihttps://hdl.handle.net/10371/201359-
dc.description.abstractOff-Policy reinforcement learning has been a driving force for the state-of-the-art conversational AIs leading to more natural human-agent interactions and improving the user satisfaction for goal-oriented agents. However, in large-scale commercial settings, it is often challenging to balance between policy improvements and experience continuity on the broad spectrum of applications handled by such system. In the literature, off-policy evaluation and guard-railing on aggregate statistics has been commonly used to address this problem. In this paper, we propose a method for curating and leveraging high-precision samples sourced from historical regression incident reports to validate, safe-guard, and improve policies prior to the online deployment. We conducted extensive experiments using data from a real-world conversational system and actual regression incidents. The proposed method is currently deployed in our production system to protect customers against broken experiences and enable long-term policy improvements.-
dc.language영어-
dc.publisherAssociation for Computational Linguistics (ACL)-
dc.titleScalable and Safe Remediation of Defective Actions in Self-Learning Conversational Systems-
dc.typeArticle-
dc.identifier.doi10.48550/arXiv.2305.10528-
dc.citation.journaltitleAssociation for Computational Linguistics (ACL). Annual Meeting Conference Proceedings-
dc.identifier.scopusid2-s2.0-85174238544-
dc.citation.endpage367-
dc.citation.startpage361-
dc.citation.volume5-
dc.description.isOpenAccessN-
dc.contributor.affiliatedAuthorDo, Jae Young-
dc.type.docTypeConference Paper-
dc.description.journalClass1-
Appears in Collections:
Files in This Item:
There are no files associated with this item.

Related Researcher

  • College of Engineering
  • Department of Electrical and Computer Engineering
Research Area AI 애플리케이션을 위한 알고리즘-시스템 공동 설계, AI-powered Big Data Management, Generative AI, Large Language Model, ML, 고성능 대규모 AI 데이터 분석 및 처리, 모달 AI

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share