Phonetic Similarity Detection on Trademark Name using CNN-Siamese Networks

김기정

서울대학교 중앙도서관

S-Space 소개

My S-Space

로그인이 필요합니다.

S-Space

Publications

Detailed Information

Phonetic Similarity Detection on Trademark Name using CNN-Siamese Networks : CNN-Siamese 네트워크를 활용한 문자 상표 발음 유사성 탐지

DC Field	Value	Language
dc.contributor.advisor	조성준	-
dc.contributor.author	김기정	-
dc.date.accessioned	2022-12-29T07:33:37Z	-
dc.date.available	2022-12-29T07:33:37Z	-
dc.date.issued	2022	-
dc.identifier.other	000000172068	-
dc.identifier.uri	https://hdl.handle.net/10371/187646	-
dc.identifier.uri	https://dcollection.snu.ac.kr/common/orgView/000000172068	ko_KR
dc.description	학위논문(석사) -- 서울대학교대학원 : 공과대학 산업공학과, 2022. 8. 조성준.	-
dc.description.abstract	Recently, as the number of registered trademarks has rapidly increased, research to determine trademark similarity based on machine learning has been actively con- ducted. Similarity of trademarks is judged based on shapes, meaning, and pronun- ciation. In the case of pronunciation, there is a limit in judging similarity because the standards for similarity are ambiguous and spellings do not correspond to pro- nunciation in many cases. On the other hand, the performance of converting text into speech has been remarkably improved due to the recent development of speech synthesis technology. In this paper, we propose a deep learning framework that au- tomatically determines the pronunciation similarity of trademarks using speech data converted using speech synthesis technology. First, after synthesizing the trademark text into speech, it is converted into a log Mel spectrogram, and feature learning is performed through a convolutional neural network with a triplet loss. To compare the proposed method with previous studies, the trademark text dataset provided by AIhub was used, and our proposed method showed superior performance than the previous studies.	-
dc.description.abstract	최근 등록되는 상표의 수가 빠르게 증가함에 따라 기계학습을 기반으로 상표 유사성을 판단하려는 연구가 활발히 진행되어 왔다. 상표의 유사성은 도형, 관념, 발음을 기준으 로 판단되는데, 발음의 경우 유사함의 기준이 모호하며 철자가 발음에 대응되지 않는 경우가 많기 때문에 유사성을 판단하는데 한계가 존재한다. 한편, 최근 음성 합성 기술의 발달로 인해 텍스트를 음성으로 변환하는 성능이 눈에 띄게 향상하였다. 본 논문은 음 성합성기술을 활용하여 상표의 발음 유사성을 자동으로 판단하는 딥러닝 프레임워크를 제안한다. 먼저, 상표 텍스트를 음성으로 합성한 뒤, log Mel Spectrogram 으로 변환 하고 합성곱 신경망과 삼중항 손실을 통해 feature 학습을 진행한다. 제안하는 방법과 선행 연구를 비교하기 위해 AIhub 에서 제공하는 상표 텍스트 데이터셋을 활용하였고, 제안하는 방식이 선행 연구를 앞서는 것을 확인하였다.	-
dc.description.tableofcontents	Chapter 1 Introduction 1 Chapter 2 Related Work 5 Chapter 3 Proposed Method 8 3.1 Model Architecture 8 3.2 EvaluationMetric 12 Chapter 4 Datasets 14 4.1 Traindataset 14 4.2 Testdataset 15 4.3 Speechdataset 15 4.4 Preprocessing 15 Chapter 5 Experimental Results 18 5.1 Experiment1: Compare different input type 18 5.2 Experiment 2: Compare signal processing methods 19 5.3 Experiment3:Comparebackbonenetworks 20 5.4 Experiment4:Comparebaselinemodels 21 Chapter 6 Conclusion 23 Bibliography 25 국문초록 28 감사의 글 29	-
dc.format.extent	vi, 29	-
dc.language.iso	eng	-
dc.publisher	서울대학교 대학원	-
dc.subject	CNN	-
dc.subject	SiameseNetwork	-
dc.subject	TrademarkSimilarity	-
dc.subject	TrademarkPronunciation	-
dc.subject	SpeechSynthesis	-
dc.subject.ddc	670.42	-
dc.title	Phonetic Similarity Detection on Trademark Name using CNN-Siamese Networks	-
dc.title.alternative	CNN-Siamese 네트워크를 활용한 문자 상표 발음 유사성 탐지	-
dc.type	Thesis	-
dc.type	Dissertation	-
dc.contributor.AlternativeAuthor	Gi Jung Kim	-
dc.contributor.department	공과대학 산업공학과	-
dc.description.degree	석사	-
dc.date.awarded	2022-08	-
dc.contributor.major	데이터 마이닝	-
dc.identifier.uci	I804:11032-000000172068	-
dc.identifier.holdings	000000000048▲000000000055▲000000172068▲	-

Appears in Collections:

College of Engineering/Engineering Practice School (공과대학/대학원)
- Dept. of Industrial Engineering (산업공학과)
  - Theses (Master's Degree_산업공학과)

Files in This Item:

000000172068.pdf 2.61 MB

Altmetrics

Item View & Download Count

Show Simple Item Record

Find it @ SNU

트윗하기

SNS Share