Performance Improvement of a Black Box Model via RL-based Spatial Transfromation

김재명

서울대학교 중앙도서관

S-Space 소개

My S-Space

로그인이 필요합니다.

S-Space

Publications

Detailed Information

Performance Improvement of a Black Box Model via RL-based Spatial Transfromation : 강화학습 기반의 공간변환을 통한 블랙박스 모델의 성능 향상

Cited 0 time in Web of Science Cited 0 time in Scopus

Export

Authors: 김재명

Advisor: 이정우

Issue Date: 2020

Publisher: 서울대학교 대학원

Description: 학위논문(석사)--서울대학교 대학원 :공과대학 전기·정보공학부,2020. 2. 이정우.

Abstract: In recent years, deep neural networks (DNN) have become a highly active area of research, and shown remarkable achievements on a variety of computer vision tasks. DNNs, however, are known to often make overconfident yet incorrect predictions on out-of-distribution samples, which can be a major obstacle to real-world deployments because the training dataset is always limited compared to diverse real-world samples. Thus, it is fundamental to provide guarantees of robustness to the distribution shift between training and test time when we construct DNN models in practice. Moreover, in many cases, the deep learning models are deployed as black boxes and the performance has been already optimized for a training dataset, thus changing the black box itself can lead to performance degradation. We here study the robustness to the geometric transformations in a specific condition where the black-box image classifier is given. We propose an additional learner, REinforcement Spatial Transform learner (REST), that transforms the warped input data into samples regarded as in-distribution by the black-box models. Our work aims to improve the robustness by adding a REST module in front of any black boxes and training only the REST module without retraining the original black box model in an end-to-end manner, i.e. we try to convert the real-world data into training distribution which the performance of the black-box model is best suited for. We use a confidence score that is obtained from the black-box model to determine whether the transformed input is drawn from in-distribution. We empirically show that our method has an advantage in generalization to geometric transformations and sample efficiency.
인공신경망은 많은 발전을 이루고 있으며, 그 중에서 컴퓨터 비젼 분야에서도 급격한 발전을 이루고 있다. 하지만 인공신경망은 학습데이터 분포 밖의 샘플들에 대해서는 잘못된 예측을 하면서도 확신을 갖는 경향을 가진다. 현실 사회에는 학습데이터 외에 다양한 샘플들이 존재하고, 인공신명망이 실제 시스템으로 작동할 때 이러한 샘플들에 대해 큰 오류를 범하기 쉽다. 그러므로, 인공신경 모델을 구축할 때 학습 데이터 분포와 테스트 데이터 분포간의 모델 성능 차이가 최소가 되도록 보장하는 것이 중요하다. 또한 많은 경우에 학습된 딥러닝 모델은 사회에 배치될 시 블랙박스 시스템으로 여겨지며 그 시스템은 학습 데이터로부터 최적화된 모델이기 때문에, 블랙박스에 수정을 가하게 되면 시스템의 성능 저하가 나타날 수 있다. 우리는 블랙박스 이미지 분류 모델이 주어졌을 때 이미지의 공간변환에도 성능을 잘 나타내는 방법을 제안한다. 우리는 REST 라고 명명한 추가 모델을 제안하는데, 이는 공간이 왜곡된 인풋 데이터를 변형시켜 블랙박스 모델의 학습데이터 분포로 여겨지는 이미지를 생성한다. 우리는 REST 모델을 블랙박스 모델 앞단에 배치시키므로써 out-of-distribution 샘플들에 대해서 성능이 향상되는 것을 목표로 한다. 우리가 제안하는 방법은 블랙박스 모델을 변형시키지 않은 채로 REST 모델을 end-to-end 로 학습을 시키는 장점을 가지고 있다. 우리는 confidence score라는 것을 사용하는데, 이는 블랙박스 모델이 인풋 샘플에 대한 예측을 얼마나 확신하는지를 나타내는 지표로 사용된다. 우리는 제안된 방법이 일반화 효과와 적은 샘플로도 학습시킬 수 있다는 장점이 있음을 실험적으로 보여준다.

Language: eng

URI: http://dcollection.snu.ac.kr/common/orgView/000000160623

Files in This Item:

000000160623.pdf 2.82 MB

Appears in Collections:

College of Engineering/Engineering Practice School (공과대학/대학원)
- Dept. of Electrical and Computer Engineering (전기·정보공학부)
  - Theses (Master's Degree_전기·정보공학부)

Altmetrics

Item View & Download Count

Show Full Item Record

Find it @ SNU

트윗하기

SNS Share