Publications
Detailed Information
Detecting Causality by Data Augmentation via Part-of-Speech tagging : 품사 활용 데이터 증강을 통한 인과관계 탐색
De Novo Augmentation approach in NLP
Cited 0 time in
Web of Science
Cited 0 time in Scopus
- Authors
- Advisor
- 이상학
- Issue Date
- 2023
- Publisher
- 서울대학교 대학원
- Keywords
- Causal NLP ; ELECTRA ; Augmentation ; POS tagging ; Causality
- Description
- 학위논문(석사) -- 서울대학교대학원 : 데이터사이언스대학원 데이터사이언스학과, 2023. 2. 이상학.
- Abstract
- With a deluge of text-based data available, the ability to automatically extract important information from the text data is crucial, especially extracting events from factual text data like news articles. Finding causal relations in texts has been a challenge since it requires methods ranging from defining event ontologies to
developing proper algorithmic approaches. In this paper, I developed a framework which classifies whether a given sentence contains a causal event. As my approach, I exploited an external corpus that has causal
labels to overcome the small size of the original corpus (Causal News Corpus) provided by task organizers.
Further, I employed a data augmentation technique utilizing PartOf-Speech (POS) based on my observation that some parts of speech are more (or less) relevant to causality. My approach especially improved the recall of detecting causal events in
sentences.
- Language
- eng
- Files in This Item:
- Appears in Collections:
Item View & Download Count
Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.