Title: | Multiple Imputation with Sequential Penalized Regression |
---|---|
Description: | Generates multivariate imputations using sequential regression with L2 penalty. For more details see Zahid and Heumann (2018) <doi:10.1177/0962280218755574>. |
Authors: | Faisal Maqbool Zahid |
Maintainer: | Faisal Maqbool Zahid <[email protected]> |
License: | GPL-2 |
Version: | 1.0.0 |
Built: | 2024-12-26 06:39:46 UTC |
Source: | CRAN |
data1 artificially generated dataframe
with n=100 and p=50. Missing values using MAR (missing at random) mechanism are artificially generated in 10 covariates.
data(data1)
data(data1)
An object of class data.frame
with 100 rows and 51 columns.
data(data1)
data(data1)
data2 artificially generated dataframe
with n=100 and p=200. Missing values using MAR (missing at random) mechanism are artificially generated in 10 covariates.
data(data2)
data(data2)
An object of class data.frame
with 100 rows and 201 columns.
data(data2)
data(data2)
Generates Multivariate Imputations using sequential regression with L2 penalization.
mispr(x, x.select = FALSE, pen = FALSE, maxit = 5, m = 5, track = FALSE, init.method = "random", L2.fix = NULL, cv = TRUE, maxL2 = 2^10)
mispr(x, x.select = FALSE, pen = FALSE, maxit = 5, m = 5, track = FALSE, init.method = "random", L2.fix = NULL, cv = TRUE, maxL2 = 2^10)
x |
A data frame or a matrix containing the incomplete data. Missing
values are coded as |
x.select |
A Boolean flag. If |
pen |
A Boolean flag. If |
maxit |
A scalar giving the number of iterations. The default is 5. |
m |
Number of multiple imputations. The default is |
track |
A Boolean flag. If |
init.method |
Method for initialization of missing values.
|
L2.fix |
Fixed value of ridge penalty (optional) to use for each
imputation model. For default i.e., |
cv |
A Boolean flag. If |
maxL2 |
The maximum value of the tuning parameter for L2 penalization to be used for optimizing the cross-validated likelihood. Default value is $2^10$. |
Generates multiple imputations for incomplete multivariate data by fitting a sequence of regression models using L2 penalty iteratively. Missing data can occur in one or more variables of the data. In each step of the iteration, ridge regression is fitted according to the distributional form of the missing variable taken as a response. All other variables are taken as predictors. If some predictors are incomplete, the most #'recently generated imputations are used to complete the predictors before using them as a predictor.
a list containing the number of imputed datasets, number of iterations used to obtain imputed data, list of multiply imputed datasets, and summary of missing values.
Faisal Maqbool Zahid [email protected].
Zahid, F. M., and Heumann, C. (2018). Multiple imputation with sequential penalized regression. Statistical Methods in Medical Research, 0962280218755574.
data(data1) # Select a subset of data1 x=data1[ , 1:10] res1 = mispr(x) # to get 3 multiply imputed datasets res2 = mispr(x, m=3)
data(data1) # Select a subset of data1 x=data1[ , 1:10] res1 = mispr(x) # to get 3 multiply imputed datasets res2 = mispr(x, m=3)