A Study on the use of Etymology for Semantic Knowledge Extraction

Authors

Pablo Estrada

Advisor
정교민
Major
College of Natural Sciences, Interdisciplinary Program in Computational Science
Issue Date
2016-08
Publisher
Seoul National University Graduate School
Keywords
graph mining, etymology, computational linguistics, Chinese language
Description
Thesis (Master's) -- Seoul National University Graduate School: Computational Science, August 2016. 정교민.
Abstract
Etymology is the study of the composition of words through their historical roots. It is a rich area of study that dates back millennia and has contributed significantly to our understanding of human cultures and languages. Computational linguistics is a much younger field that grew from the advent of the digital era and has advanced continuously, most recently with the changes brought by Artificial Intelligence and Machine Learning. Computational linguistics has not yet leveraged the knowledge of etymology to its full potential. This work is a step toward making etymology another contributor to the field of computational linguistics.

In this work we propose a framework to capture the complex etymological relationships that exist in the vocabulary of a human language by creating a complex network that associates words with their historical roots. We then use this framework to obtain insights into the semantics of words in the Chinese and Korean languages. We run two tasks, one supervised and one unsupervised, and show that etymology can be effectively used to extract knowledge.
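This record does not describe how the thesis actually builds its network, so the following is only a minimal sketch of the general idea of linking words to their roots. It assumes, purely for illustration, that the roots of a Chinese word are its individual characters. The toy vocabulary and the names `WORD_ROOTS`, `build_root_index`, and `related_words` are hypothetical and are not taken from the thesis.

```python
from collections import defaultdict

# Hypothetical toy vocabulary (an assumption, not data from the thesis):
# each word maps to the characters treated here as its roots.
WORD_ROOTS = {
    "火山": ["火", "山"],  # volcano = fire + mountain
    "火车": ["火", "车"],  # train = fire + vehicle
    "山水": ["山", "水"],  # landscape = mountain + water
}


def build_root_index(word_roots):
    """Build the root -> words side of a bipartite word-root graph."""
    index = defaultdict(set)
    for word, roots in word_roots.items():
        for root in roots:
            index[root].add(word)
    return index


def related_words(word, word_roots, index):
    """Return the words that share at least one root with `word`."""
    related = set()
    for root in word_roots[word]:
        related |= index[root]
    related.discard(word)  # a word is not counted as related to itself
    return related


ROOT_INDEX = build_root_index(WORD_ROOTS)
```

In this sketch, two words are neighbors when they share a root, which is one simple way such a graph could expose semantic links between words.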

We believe that this work helps push etymology onto the main stage of computational linguistics and natural language processing.
Language
English
URI
https://hdl.handle.net/10371/131254