Detailed Information

VANT : A Visual Analytics System for Refining Parallel Corpora in Neural Machine Translation

DC Field | Value | Language
dc.contributor.author | Park, Sebeom | -
dc.contributor.author | Lee, Soohyun | -
dc.contributor.author | Kim, Youngtaek | -
dc.contributor.author | Jeon, Hyeon | -
dc.contributor.author | Jung, Seokweon | -
dc.contributor.author | Bok, Jinwook | -
dc.contributor.author | Seo, Jinwook | -
dc.date.accessioned | 2022-10-12T00:26:41Z | -
dc.date.available | 2022-10-12T00:26:41Z | -
dc.date.created | 2022-09-30 | -
dc.date.issued | 2022-04 | -
dc.identifier.citation | 2022 IEEE 15TH PACIFIC VISUALIZATION SYMPOSIUM (PACIFICVIS 2022), pp.181-185 | -
dc.identifier.issn | 2165-8765 | -
dc.identifier.uri | https://hdl.handle.net/10371/185826 | -
dc.description.abstract | The quality of parallel corpora used to train a Neural Machine Translation (NMT) model can critically influence the model's performance. Various approaches for refining parallel corpora have been introduced, but there is still much room for improvement, such as enhancing the efficiency and the quality of refinement. We introduce VANT, a novel visual analytics system for refining parallel corpora used in training an NMT model. Our system helps users to readily detect and filter noisy parallel corpora by (1) aiding the quality estimation of individual sentence pairs within the corpora by providing diverse quality metrics (e.g., cosine similarity, BLEU, length ratio) and (2) allowing users to visually examine and manage the corpora based on the pre-computed metric scores. Our system's effectiveness and usefulness are demonstrated through a qualitative user study with eight participants, including four domain experts with real-world datasets. | -
dc.language | English | -
dc.publisher | IEEE COMPUTER SOC | -
dc.title | VANT : A Visual Analytics System for Refining Parallel Corpora in Neural Machine Translation | -
dc.type | Article | -
dc.identifier.doi | 10.1109/PacificVis53943.2022.00029 | -
dc.citation.journaltitle | 2022 IEEE 15TH PACIFIC VISUALIZATION SYMPOSIUM (PACIFICVIS 2022) | -
dc.identifier.wosid | 000850180500021 | -
dc.identifier.scopusid | 2-s2.0-85132430220 | -
dc.citation.endpage | 185 | -
dc.citation.startpage | 181 | -
dc.description.isOpenAccess | N | -
dc.contributor.affiliatedAuthor | Seo, Jinwook | -
dc.type.docType | Proceedings Paper | -
dc.description.journalClass | 1 | -
Files in This Item:
There are no files associated with this item.

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.