Neural networks for compressing and classifying speaker-independent paralinguistic signals

변석현

서울대학교 중앙도서관

S-Space 소개

My S-Space

로그인이 필요합니다.

S-Space

Publications

Detailed Information

Neural networks for compressing and classifying speaker-independent paralinguistic signals : 준언어적 신호 압축 및 분류를 위한 심층 신경망

DC Field	Value	Language
dc.contributor.advisor	정교민	-
dc.contributor.author	변석현	-
dc.date.accessioned	2019-05-07T03:15:52Z	-
dc.date.available	2019-05-07T03:15:52Z	-
dc.date.issued	2019-02	-
dc.identifier.other	000000155122	-
dc.identifier.uri	https://hdl.handle.net/10371/150757	-
dc.description	학위논문 (석사)-- 서울대학교 대학원 : 공과대학 전기·정보공학부, 2019. 2. 정교민.	-
dc.description.abstract	Recognizing and classifying paralinguistic signals, with its various applications, is an important problem. In general, this task is considered challenging because the sound information from the signals is difficult to distinguish even by humans. Thus, analyzing signals with machine learning techniques is a reasonable approach to understanding signals. Audio features extracted from paralinguistic signals usually consist of highdimensional vectors such as prosody, energy, cepstrum, and other speech-related information. Therefore, when the size of a training corpus is not sufficiently large, it is extremely difficult to apply machine learning methods to analyze these signals due to their high feature dimensions. This paper addresses these limitations by using neural networks' feature learning abilities. First, we use a neural network-based autoencoder to compress the signal to eliminate redundancy within the signal feature, and we show than the compressed signal features are competitive in distinguishing the signal compared to the original methods such as logistic regression, support vector machine, decision trees, and boosted trees.	-
dc.description.abstract	준언어적 신호를 분류를 인식하고 분류하는 일은 그의 다양한 응용성 측면에서 매우 중요한 문제이다. 일반적으로 이 문제가 어려운 이유는 소리 정보가 인간에게도 구별되기 힘들다는 애매모호한 특성 때문이다. 이에 신호를 더 잘 이해하기 위해 기계학습 기법이 고안되는데, 이 때 분석에 사용되는 신호 특징 벡터는 운율, 에너지, 주파수 등 신호에 관련된 정보로 이루어진 고차원 벡터이다. 즉 훈련에 사용되는 데이터의 크기가 작은 경우에는, 특징 벡터의 높은 차원 때문에 적절히 기계학습 모델을 잘 훈련시키기가 어렵게 된다. 이 논문에서는 심층 신경망 모델을 이용하여 이와 같은 문제를 해결하고자 한다. 우선 다양한 압축 기법을 이용하여 특징 벡터 내의 불필요한 정보를 제거하고, 이 압축된 특징들이 전통적 기계학습 분류방법들보다 심층 신경망에 의해 더 잘 분류됨을 실험적으로 보인다.	-
dc.description.tableofcontents	Chapter 1. Introduction ...........................................................1 Chapter 2. Related Work .........................................................3 Chapter 3. Task Description .................................................... 4 Chapter 4. Proposed Framework .............................................5 Chapter 5. Performance Evaluation ......................................... 9 Chapter 6. Discussion ............................................................ 11 Chapter 7. Conclusion ............................................................13 Bibliography ...........................................................................14 Abstract in Korean ................................................................ 17	-
dc.language.iso	eng	-
dc.publisher	서울대학교 대학원	-
dc.subject.ddc	621.3	-
dc.title	Neural networks for compressing and classifying speaker-independent paralinguistic signals	-
dc.title.alternative	준언어적 신호 압축 및 분류를 위한 심층 신경망	-
dc.type	Thesis	-
dc.type	Dissertation	-
dc.contributor.AlternativeAuthor	Seokhyun Byun	-
dc.description.degree	Master	-
dc.contributor.affiliation	공과대학 전기·정보공학부	-
dc.date.awarded	2019-02	-
dc.identifier.uci	I804:11032-000000155122	-
dc.identifier.holdings	000000000026▲000000000039▲000000155122▲	-

Appears in Collections:

College of Engineering/Engineering Practice School (공과대학/대학원)
- Dept. of Electrical and Computer Engineering (전기·정보공학부)
  - Theses (Master's Degree_전기·정보공학부)

Files in This Item:

000000155122.pdf 0.40 MB

Altmetrics

Item View & Download Count

Show Simple Item Record

Find it @ SNU

트윗하기

SNS Share