Title
A Nuisance-Free Inference Procedure Accounting For The Unknown Missingness With Application To Electronic Health Records
Abstract
We study how to conduct statistical inference in a regression model where the outcome variable is prone to missing values and the missingness mechanism is unknown. The model we consider might be a traditional setting or a modern high-dimensional setting where the sparsity assumption is usually imposed and the regularization technique is popularly used. Motivated by the fact that the missingness mechanism, albeit usually treated as a nuisance, is difficult to specify correctly, we adopt the conditional likelihood approach so that the nuisance can be completely ignored throughout our procedure. We establish the asymptotic theory of the proposed estimator and develop an easy-to-implement algorithm via some data manipulation strategy. In particular, under the high-dimensional setting where regularization is needed, we propose a data perturbation method for the post-selection inference. The proposed methodology is especially appealing when the true missingness mechanism tends to be missing not at random, e.g., patient reported outcomes or real world data such as electronic health records. The performance of the proposed method is evaluated by comprehensive simulation experiments as well as a study of the albumin level in the MIMIC-III database.
Year
DOI
Venue
2020
10.3390/e22101154
ENTROPY
Keywords
DocType
Volume
nuisance, post-selection inference, missingness mechanism, regularization, asymptotic theory, unconventional likelihood
Journal
22
Issue
ISSN
Citations 
10
1099-4300
0
PageRank 
References 
Authors
0.34
0
2
Name
Order
Citations
PageRank
jiwei zhao131.81
Chi Chen200.34