Publications

Detailed Information

Detecting Causality by Data Augmentation via Part-of-Speech tagging : 품사 활용 데이터 증강을 통한 인과관계 탐색
De Novo Augmentation approach in NLP

Cited 0 time in Web of Science Cited 0 time in Scopus
Authors

김주현

Advisor
이상학
Issue Date
2023
Publisher
서울대학교 대학원
Keywords
Causal NLPELECTRAAugmentationPOS taggingCausality
Description
학위논문(석사) -- 서울대학교대학원 : 데이터사이언스대학원 데이터사이언스학과, 2023. 2. 이상학.
Abstract
With a deluge of text-based data available, the ability to automatically extract important information from the text data is crucial, especially extracting events from factual text data like news articles. Finding causal relations in texts has been a challenge since it requires methods ranging from defining event ontologies to
developing proper algorithmic approaches. In this paper, I developed a framework which classifies whether a given sentence contains a causal event. As my approach, I exploited an external corpus that has causal
labels to overcome the small size of the original corpus (Causal News Corpus) provided by task organizers.
Further, I employed a data augmentation technique utilizing PartOf-Speech (POS) based on my observation that some parts of speech are more (or less) relevant to causality. My approach especially improved the recall of detecting causal events in
sentences.
Language
eng
URI
https://hdl.handle.net/10371/193606

https://dcollection.snu.ac.kr/common/orgView/000000175589
Files in This Item:
Appears in Collections:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share