Energy-Based Probabilistic Models for Epistemic Uncertainty Quantification

Abstract: Probability is a principled measure of uncertainty. Particularly, the probability of input data p(x) in machine learning serves as a useful measure of epistemic uncertainty. The accurate estimation of the data probability can benefit multiple epistemic uncertainty quantification applications, such as detecting anomalous samples, avoiding over-confident predictions, and obtaining informative unlabeled samples. However, recent deep learning approaches for estimating the input data probability, often referred to as deep generative modeling, have shown limited uncertainty quantification capabilities, despite their impressive performance in synthesizing realistic samples. The failure of deep generative modeling has spurred skepticism on whether the modeling the data probability is a valid approach for capturing uncertainties.

In this thesis, we demonstrate that modeling the probability of data is essential for uncertainty quantification.
We focus on autoencoders and Gaussian processes, popular algorithms for anomaly detection and decision-making under uncertainty, and show that incorporating generative modeling improves their performance. Both algorithms are non-generative and do not include input data probability in their formulation, leading to critical failure modes due to not correctly reflecting the data probability distribution. We address these failure modes by introducing novel probabilistic formulations with generative modeling for autoencoders and Gaussian processes. The proposed methods are based on the energy-based model framework, which defines a probability distribution using an unnormalized scalar function called energy, where we introduce novel designs of energy functions built from autoencoders and Gaussian processes.

While introducing novel interpretations of autoencoders and Gaussian processes as generative models, we introduce additional contributions regarding energy-based modeling. First, we present a novel training algorithm for energy-based generative models that leverages the low-dimensional structure of data. The proposed algorithm can effectively suppress spuriously high likelihood in generative models and the resulting energy-based models show strong anomaly detection performance. Second, we also investigate the connection between adversarial attack and energy-based model formulation. We propose a generative adversarial attack algorithm for out-of-distribution detectors where the attack is formulated as sampling from an energy-based distribution.

Quantifying epistemic uncertainty is essential for robots and artificial intelligence agents to interact with the world safely and effectively. This paper demonstrated that the problem of quantifying epistemic uncertainty can be solved through probabilistic models. The energy-based probabilistic model techniques and algorithms discussed in this paper are expected to be widely applied to various applications in robotics and artificial intelligence systems.
이 논문에서는 데이터의 확률 분포를 추정하는 것이 불확실성 정량화에 중요한 접근 방법임을 논한다. 우리는 이상 검출과 불확실성 하에서의 의사결정 문제에서 널리 사용되는 알고리즘인 오토인코더와 가우시안 프로세스에 초점을 맞추어, 생성 모형의 접근 방법을 도입하면 이들의 성능을 향상시킬 수 있다는 것을 보인다. 오토인코더와 가우시안 프로세스 모두 데이터의 확률 분포를 명시적으로 학습하지 않는다. 따라서 데이터 확률 분포를 정확히 반영하지 못함으로 인해 심각한 실패를 일으킬 수 있다. 우리는 이러한 실패 사례를 보고하고, 이를 해결하기 위해 오토인코더와 가우시안 프로세스를 위한 새로운 생성 모형 기반 형식화를 도입한다. 제안된 방법들에서는 에너지 함수를 이용하여 확률 분포를 표현하는 에너지 기반 모형 방법론에 기반하여, 오토인코더와 가우시안 프로세스를 에너지 기반 확률 모형으로 새롭게 해석한다.
추가적으로, 본 논문에서는 에너지 기반 모형 기법에 관한 추가적인 아이디어들을 제시한다. 첫째, 데이터의 저차원 구조를 활용하는 에너지 기반 생성 모형의 새로운 학습 알고리즘을 제시한다. 제안된 알고리즘은 생성 모형이 학습 데이터가 아닌 곳에 높은 우도를 부여하는 현상을 효과적으로 억제할 수 있으며, 결과적으로 이러한 에너지 기반 모형들은 강력한 이상 탐지 성능을 보여준다. 둘째, 적대적 공격과 에너지 기반 모형 기법 사이의 연결을 탐색한다. 이러한 연결관계에 기반하여 이상치 탐지기를 위한 새로운 적대적인 생성 공격 알고리즘을 제안한다. 제안된 알고리즘은 에너지 기반 분포에서 샘플링하는 과정을 통해 적대적 공격을 수행한다.
인식론적 불확실성 정량화는 로봇 및 인공지능 에이전트가 세상과 안전하고 효과적으로 상호작용하는 데에 필수적인 요소이다. 이 논문에서는 확률 모형을 통해 인식론적 불확실성 정량화 문제를 해결할 수 있다는 것을 보였으며, 이 논문에서 논의된 에너지 기반 확률모형 기법과 알고리즘들은 로보틱스 및 인공지능 시스템의 다양한 응용에 널리 적용될 것으로 기대된다.

Language: eng

URI: https://hdl.handle.net/10371/196290

https://dcollection.snu.ac.kr/common/orgView/000000179295

Files in This Item:

000000179295.pdf 7.84 MB

Appears in Collections:

College of Engineering/Engineering Practice School (공과대학/대학원)
- Dept. of Mechanical Aerospace Engineering (기계항공공학부)
  - Theses (Ph.D. / Sc.D._기계항공공학부)

Altmetrics

Item View & Download Count

Show Full Item Record

Find it @ SNU

트윗하기

SNS Share