Publications

Detailed Information

kosp2e: Korean Speech to English Translation Corpus

DC Field Value Language
dc.contributor.authorCho, Won Ik-
dc.contributor.authorKim, Seok Min-
dc.contributor.authorCho, Hyunchang-
dc.contributor.authorKim, Nam Soo-
dc.date.accessioned2022-10-26T07:22:26Z-
dc.date.available2022-10-26T07:22:26Z-
dc.date.created2022-10-04-
dc.date.issued2021-08-
dc.identifier.citationProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Vol.6, pp.3705-3709-
dc.identifier.issn1990-9772-
dc.identifier.urihttps://hdl.handle.net/10371/186864-
dc.description.abstractMost speech-to-text (S2T) translation studies use English speech as a source, which makes it difficult for non-English speakers to take advantage of the S2T technologies. For some languages, this problem was tackled through corpus construction, but the farther linguistically from English or the more under-resourced, this deficiency and underrepresentedness becomes more significant. In this paper, we introduce kosp2e (read as 'kospi'), a corpus that allows Korean speech to be translated into English text in an end-to-end manner. We adopt open license speech recognition corpus, translation corpus, and spoken language corpora to make our dataset freely available to the public, and check the performance through the pipeline and training-based approaches. Using pipeline and various end-to-end schemes, we obtain the highest BLEU of 21.3 and 18.0 for each based on the English hypothesis, validating the feasibility of our data. We plan to supplement annotations for other target languages through community contributions in the future.-
dc.language영어-
dc.publisherISCA-
dc.titlekosp2e: Korean Speech to English Translation Corpus-
dc.typeArticle-
dc.identifier.doi10.21437/Interspeech.2021-1040-
dc.citation.journaltitleProceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH-
dc.identifier.wosid000841879503162-
dc.identifier.scopusid2-s2.0-85119265139-
dc.citation.endpage3709-
dc.citation.startpage3705-
dc.citation.volume6-
dc.description.isOpenAccessN-
dc.contributor.affiliatedAuthorKim, Nam Soo-
dc.type.docTypeProceedings Paper-
dc.description.journalClass1-
Appears in Collections:
Files in This Item:
There are no files associated with this item.

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share