다변량 시계열 데이터의 시각화를 위한 변수 인덱싱

박봉준

서울대학교 중앙도서관

S-Space 소개

My S-Space

로그인이 필요합니다.

S-Space

Publications

Detailed Information

다변량 시계열 데이터의 시각화를 위한 변수 인덱싱 : Feature Indexing for Visualizing Multivariate Time Series Data

DC Field	Value	Language
dc.contributor.advisor	조성준	-
dc.contributor.author	박봉준	-
dc.date.accessioned	2019-10-18T15:28:37Z	-
dc.date.available	2019-10-18T15:28:37Z	-
dc.date.issued	2019-08	-
dc.identifier.other	000000157200	-
dc.identifier.uri	https://hdl.handle.net/10371/161029	-
dc.identifier.uri	http://dcollection.snu.ac.kr/common/orgView/000000157200	ko_KR
dc.description	학위논문(석사)--서울대학교 대학원 :공과대학 산업공학과,2019. 8. 조성준.	-
dc.description.abstract	제조 공정에서는 수많은 센서로부터 다양한 시계열 데이터가 발생한다. 이렇게 얻는 다변량 시계열 데이터로부터 공정의 성능과 이상을 예측한다. 예측된 공정의 이상은 원인 분석을 통해 적절한 조치로 이어진다. 본 연구는 다변량 시계열 데이터로부터 데이터 분석가가 쉽게 원인 분석을 할 수 있도록 변수 인덱싱을 통한 시각화 전처리하는 방법을 제안한다. 다변량 시계열 데이터를 각각 0-1 스케일링한 뒤, 이들 간의 상관 계수를 링크의 비용으로 하는 완전 그래프로 변환하였다. 완전 그래프의 모든 노드를 지나는 최소 비용 경로를 정수 최적화 수식화하여 구하고 최적해를 기반으로 재배열, 반전을 통해 변수를 인덱싱하였다. 제안한 방법을 제철 공정 데이터에 적용하여 직사각 형태의 단 채널 이미지로 변환한 뒤 무작위로 노이즈를 추가하여 이를 합성곱 신경망과 데이터 분석가를 통해 찾는 실험을 하였다. 전처리 전과 비교하여 합성곱 신경망의 분류 성능과 학습 속도가 향상되었고, 데이터 분석가가 노이즈의 위치를 더욱 정확히 찾을 수 있었다. 본 연구에서 제안한 전처리 방법이 합성곱 신경망의 성능과 인간의 데이터 분석 능력을 향상 시킴을 확인하였다.	-
dc.description.abstract	The manufacturing process produces a variety of time series data from a number of sensors. From this multivariate time series data, the performance and anomalies of the process are predicted. Anomalies of the predicted process lead to appropriate actions through cause analysis. This study proposed a method of preprocessing for visualization by indexing variables so that data analysts can easily analyze causes from multivariate time series data. After each of the multivariate time series data was scaled 0-1, the correlation coefficient between these was converted to a complete graph with the cost of the link. The minimum cost path across all nodes of the complete graph was obtained by integer programming modelling, and variables were indexed by rearranging and reversing based on the optimal solutions. The proposed method was applied to the steel process data to convert it into a single-channel image in a rectangular form and then randomly add noise to find it through a convolutional neural network and data analyst. For preprocessed data, the classification accuracy and learning speed of the convolutional neural network have been improved, and the data analyst has been able to locate the noise more accurately. The preprocessing method proposed in this study has been identified to improve the performance of the convolutional neural network and the ability of humans to analyze data.	-
dc.description.tableofcontents	제 1 장 서론 1 제 2 장 선행연구 3 2.1 다채널 단변량 시계열 합성곱 신경망(Zheng et al. (2014)) 3 2.2 Fault Detection and Classfication 합성곱 신경망(Lee et al. (2014)) 4 제 3 장 제안하는 모델 7 3.1 데이터의 재배열 및 반전을 통한 인덱싱 8 3.1.1 완전 그래프 문제로의 전환 9 3.1.2 정수최적화 10 3.1.3 최적해 적용 12 제 4 장 실험 및 결과 14 4.1 상관 계수 변화 실험 14 4.1.1 랜덤 생성 데이터 변화 14 4.1.2 제철 공정 데이터 변화 15 4.1.3 금융 데이터 변화 17 4.2 합성곱 신경망 적용 실험 18 4.2.1 공정 데이터 효율 예측 19 4.2.2 공정 데이터 노이즈 분류 20 4.3 노이즈 인지 실험 23 제 5 장 결론 및 의의 25 참고문헌 27	-
dc.language.iso	kor	-
dc.publisher	서울대학교 대학원	-
dc.subject	제조 공정	-
dc.subject	다변량 시계열 데이터	-
dc.subject	정수최적화	-
dc.subject	합성곱 신경망	-
dc.subject	전처리	-
dc.subject.ddc	670.42	-
dc.title	다변량 시계열 데이터의 시각화를 위한 변수 인덱싱	-
dc.title.alternative	Feature Indexing for Visualizing Multivariate Time Series Data	-
dc.type	Thesis	-
dc.type	Dissertation	-
dc.contributor.AlternativeAuthor	Bongjoon Park	-
dc.contributor.department	공과대학 산업공학과	-
dc.description.degree	Master	-
dc.date.awarded	2019-08	-
dc.contributor.major	데이터마이닝	-
dc.identifier.uci	I804:11032-000000157200	-
dc.identifier.holdings	000000000040▲000000000041▲000000157200▲	-

Appears in Collections:

College of Engineering/Engineering Practice School (공과대학/대학원)
- Dept. of Industrial Engineering (산업공학과)
  - Theses (Master's Degree_산업공학과)

Files in This Item:

000000157200.pdf 6.29 MB

Altmetrics

Item View & Download Count

Show Simple Item Record

Find it @ SNU

트윗하기

SNS Share