Publications

Detailed Information

Automated outlier detection and estimation of missing data

Cited 1 time in Web of Science Cited 1 time in Scopus
Authors

Rhyu, Jinwook; Bozinovski, Dragana; Dubs, Alexis B.; Mohan, Naresh; Bende, Elizabeth M. Cummings; Maloney, Andrew J.; Nieves, Miriam; Sangerman, Jose; Lu, Amos E.; Hong, Moo Sun; Artamonova, Anastasia; Ou, Rui Wen; Barone, Paul W.; Leung, James C.; Wolfrum, Jacqueline M.; Sinskey, Anthony J.; Springs, Stacy L.; Braatz, Richard D.

Issue Date
2024-01
Publisher
Pergamon Press Ltd.
Citation
Computers & Chemical Engineering, Vol.180, p. 108448
Abstract
The majority of algorithms used for data imputation are based on latent variable methods. The presence of outliers in process data, however, misleads the latent relations among variables, resulting in an inaccurate estimation of missing values. This article proposes an approach for automatically detecting outliers using T2 and Q contributions and estimating missing data using various general-purpose algorithms while reducing the impact of outliers. The software is validated using biomanufacturing data from the production of a monoclonal antibody produced by Chinese hamster ovary cells in a perfusion bioreactor for five missingness cases including missing completely at random, sensor drop-out, multi-rate, patterned, and censoring. Based on the normalized root mean squared error and the three proposed metrics corresponding to feasibility, plausibility, and rapidity, respectively, matrix completion methods are the most effective, except for the censoring case in which probabilistic principal component analysis-based methods are the most effective.
ISSN
0098-1354
URI
https://hdl.handle.net/10371/199884
DOI
https://doi.org/10.1016/j.compchemeng.2023.108448
Files in This Item:
There are no files associated with this item.
Appears in Collections:

Altmetrics

Item View & Download Count

  • mendeley

Items in S-Space are protected by copyright, with all rights reserved, unless otherwise indicated.

Share