Nonconvex penalized matrix completion methods for causal inference in panel data

Abstract: Low-rank matrix completion is a widely used approach for imputing missing entries of a matrix. The nuclear norm penalty, which shrinks the singular values of a matrix, is often employed due to its computational convenience. However, it introduces bias in the estimation. To address this issue, nonconvex penalties such as SCAD are utilized, which provide sparse and unbiased estimators.

In this thesis, we study the nonconvex penalized matrix completion methods for estimating causal effects in panel data with time-dependent treatment adoption. We first derive an upper bound for the estimation error of our proposed estimator for the potential control outcomes, which improves upon existing methods that rely on the nuclear norm penalty. Remarkably, this upper bound matches the one obtained by the oracle estimator, under an additional condition on the magnitudes of true singular values. Furthermore, we establish the asymptotic normality of the corresponding estimator for the treatment effect, which exhibits a smaller asymptotic variance compared to an existing method. We perform numerical studies to assess the recovery of the potential control outcomes and the estimation of the average treatment effect. Simulations validate our theoretical results, and experiments using real data further demonstrate the promising performance of our proposed method.
Low-rank 행렬 완성은 행렬의 누락된 성분을 채우기 위해 널리 사용되는 방법이다. 행렬의 특이값 크기를 축소하는 nuclear norm 벌점화는 계산 편의성 때문에 일반적으로 사용되나, 추정에 편향을 발생시킨다. 이 문제를 해결하기 위해 SCAD와 같은 비볼록 벌점화가 사용되며, 이는 성기고 편향되지 않은 추정량을 제공한다.

본 학위 논문은 시간 의존적 처리 (치료) 채택 구조를 갖는 패널 자료에서 인과 효과를 추정하기 위해 비볼록 벌점화 행렬 완성 방법을 연구한다. 우리는 먼저 nuclear norm 벌점화에 의존하는 기존 방법을 개선하는 잠재 제어 행렬에 대해 제안된 추정량의 추정 오류에 대한 상한을 도출한다. 놀랍게도, 이 상한은 참 특이값의 크기에 대한 추가 조건이 주어졌을 때 오라클 추정량이 얻는 상한과 일치한다. 또한 치료 효과에 대한 추정량의 점근적 정규성과, 이 추정량이 기존 방법에 비해 더 작은 점근 분산을 갖는다는 것을 증명한다. 우리는 잠재 제어 행렬의 복구와 평균 치료 효과의 추정을 평가하기 위해 수치 연구를 수행한다. 시뮬레이션은 우리의 이론적 결과를 검증하고, 실제 자료를 사용한 실험은 제안 방법의 유망한 성능을 입증한다.

Language: eng

URI: https://hdl.handle.net/10371/197343

https://dcollection.snu.ac.kr/common/orgView/000000179257

Files in This Item:

000000179257.pdf 4.90 MB

Appears in Collections:

College of Natural Sciences (자연과학대학)
- Dept. of Statistics (통계학과)
  - Theses (Ph.D. / Sc.D._통계학과)

Altmetrics

Item View & Download Count

Show Full Item Record

Find it @ SNU

트윗하기

SNS Share