Title: | Ridge-Type Penalized Estimation of a Potpourri of Models |
---|---|
Description: | The name of the package is derived from the French, 'pour' ridge, and provides functionality for ridge-type estimation of a potpourri of models. Currently, this estimation concerns that of various Gaussian graphical models from different study designs. Among others it considers the regular Gaussian graphical model and a mixture of such models. The porridge-package implements the estimation of the former either from i) data with replicated observations by penalized loglikelihood maximization using the regular ridge penalty on the parameters (van Wieringen, Chen, 2021) or ii) from non-replicated data by means of either a ridge estimator with multiple shrinkage targets (as presented in van Wieringen et al. 2020, <doi:10.1016/j.jmva.2020.104621>) or the generalized ridge estimator that allows for both the inclusion of quantitative and qualitative prior information on the precision matrix via element-wise penalization and shrinkage (van Wieringen, 2019, <doi:10.1080/10618600.2019.1604374>). Additionally, the porridge-package facilitates the ridge penalized estimation of a mixture of Gaussian graphical models (Aflakparast et al., 2018). On another note, the package also includes functionality for ridge-type estimation of the generalized linear model (as presented in van Wieringen, Binder, 2022, <doi:10.1080/10618600.2022.2035231>). |
Authors: | Wessel N. van Wieringen [aut, cre] , Mehran Aflakparast [ctb] (part of the R-code of the mixture functionality) |
Maintainer: | Wessel N. van Wieringen <[email protected]> |
License: | GPL (>= 2) |
Version: | 0.3.3 |
Built: | 2024-12-31 08:08:24 UTC |
Source: | CRAN |
The following functions facilitate the ridge-type penalized estimation of various models. Currently, it includes:
Generalized ridge estimation of the precision matrix of a Gaussian graphical model (van Wieringen, 2019) through the function ridgePgen
. This function is complemented by the functions ridgePgen.kCV
, ridgePgen.kCV.banded
, ridgePgen.kCV.groups
,
optPenaltyPgen.kCVauto.banded
and optPenaltyPgen.kCVauto.groups
for penalty parameters selection through K-fold cross-validation assuming a particularly structured precision matrix.
Multi-targeted ridge estimation of the precision matrix of a Gaussian graphical model (van Wieringen et al., 2020) through the functions ridgePmultiT
. This function is complemented by the functions optPenaltyPmultiT.kCVauto
for penalty parameters selection through K-fold cross-validation.
Gaussian graphical model estimation from data with replicates in ridge penalized fashion (van Wieringen, Chen, 2021) (ridgePrep
and ridgePrepEdiag
). The two functions
optPenaltyPrep.kCVauto
and optPenaltyPrepEdiag.kCVauto
implement the corresponding K-fold cross-validation procedures for an optimal choice of the penalty parameter.
Ridge penalized estimation of a mixture of Gaussian graphical models: ridgeGGMmixture
and its penalty selection via K-fold cross-validation
optPenaltyGGMmixture.kCVauto
.
Targeted and multi-targeted ridge estimation of the regression parameter of the generalized linear model (van Wieringen, Binder, 2022; van Wieringen, 2021; Lettink et al., 2022) through the functions ridgeGLM
and ridgeGLMmultiT
. This function is complemented by the functions optPenaltyGLM.kCVauto
and optPenaltyGLMmultiT.kCVauto
for penalty parameters selection through K-fold cross-validation, and the ridgeGLMdof
-function for the evaluation of the fitted model's degrees of freedom.
Future versions aim to include more ridge-type functionality.
In part the porridge
-package extends/builds upon the rags2ridges
-packages, in which some or all functionality of the porridge
-package may be absorped at some point in the future.
Wessel N. van Wieringen <[email protected]>
Aflakparast, M., de Gunst, M.C.M., van Wieringen, W.N. (2018), "Reconstruction of molecular network evolution from cross-sectional omics data", Biometrical Journal, 60(3), 547-563.
Lettink, A., Chinapaw, M.J.M., van Wieringen, W.N. (2022), "Two-dimensional fused targeted ridge regression for health indicator prediction from accelerometer data", submitted.
Peeters, C.F.W., Bilgrau, A.E., and van Wieringen, W.N. (2021), "rags2ridges
: Ridge Estimation of Precision Matrices from High-Dimensional Data", R package version 2.2.5. https://CRAN.R-project.org/package=rags2ridges.
van Wieringen, W.N. (2019), "The generalized ridge estimator of the inverse covariance matrix", Journal of Computational and Graphical Statistics, 28(4), 932-942.
van Wieringen, W.N. (2021), "Lecture notes on ridge regression", Arxiv preprint, arXiv:1509.09169.
van Wieringen W.N., Chen, Y. (2021), "Penalized estimation of the Gaussian graphical model from data with replicates", Statistics in Medicine, 40(19), 4279-4293.
van Wieringen, W.N., Stam, K.A., Peeters, C.F.W., van de Wiel, M.A. (2020), "Updating of the Gaussian graphical model through targeted penalized estimation", Journal of Multivariate Analysis, 178, Article 104621.
van Wieringen, W.N. Binder, H. (2022), "Sequential learning of regression models by penalized estimation", Journal of Computational and Graphical Statistics, accepted.
The porridge
-package.
The function produces an unscaled penalty parameter matrix to be used in the generalized ridge regression estimator.
genRidgePenaltyMat(pr, pc=pr, type="2dimA")
genRidgePenaltyMat(pr, pc=pr, type="2dimA")
pr |
A positive |
pc |
A positive |
type |
A |
Various ridge penalty matrices are implemented.
The type="common"
-option supports the ‘homogeneity’ ridge penalization proposed by Anatolyev (2020). The ridge penalty matrix for a
-dimensional regression parameter
is such that:
This penalty matrix encourages shrinkage of the elements of to a common effect value.
The type="fused1dim"
-option facilitates the 1-dimensional fused ridge estimation of Goeman (2008). The ridge penalty matrix for a
-dimensional regression parameter
is such that:
This penalty matrix aims to shrink contiguous (as defined by their index) elements of towards each other.
The type="fused2dimA"
- and type="fused2dimD"
-options facilitate 2-dimensional ridge estimation as proposed by Lettink et al. (2022). It assumes the regression parameter is endowed with a 2-dimensional layout. The columns of this layout have been stacked to form . The 2-dimensional fused ridge estimation shrinks elements of
that are neighbors in the 2-dimensional layout towards each other. The two options use different notions of neighbors. If
type="fused2dimA"
, the ridge penalty matrix for a
-dimensional regression parameter
is such that:
where and
are the row and column dimension, respectively, of the 2-dimensional layout. This penalty matrix intends to shrink the elements of
along the axes of the 2-dimensional layout. If
type="fused2dimD"
, the ridge penalty matrix for a
-dimensional regression parameter
is such that:
This penalty matrix shrinks the elements of along the diagonally to the axes of the 2-dimensional layout. The penalty matrices
generated by
type="fused2dimA"
- and type="fused2dimD"
-options may be combined.
The function returns a non-negative definite matrix
.
W.N. van Wieringen.
Anatolyev, S. (2020), "A ridge to homogeneity for linear models", Journal of Statistical Computation and Simulation, 90(13), 2455-2472.
Goeman, J.J. (2008), "Autocorrelated logistic ridge regression for prediction based on proteomics spectra", Statistical Applications in Genetics and Molecular Biology, 7(2).
Lettink, A, Chinapaw, M.J.M., van Wieringen, W.N. (2022), "Two-dimensional fused targeted ridge regression for health indicator prediction from accelerometer data", submitted.
ridgeGLM
# generate unscaled general penalty parameter matrix Dfused <- genRidgePenaltyMat(10, type="fused1dim")
# generate unscaled general penalty parameter matrix Dfused <- genRidgePenaltyMat(10, type="fused1dim")
Function that evaluates the targeted ridge estimator of the regression parameter of generalized linear models.
makeFoldsGLMcv(fold, Y, stratified=TRUE, model="linear")
makeFoldsGLMcv(fold, Y, stratified=TRUE, model="linear")
fold |
An |
Y |
A |
stratified |
A |
model |
A |
A list
of length fold
. Each list item is a fold.
W.N. van Wieringen.
# set the sample size n <- 50 # set the true parameter betas <- (c(0:100) - 50) / 20 # generate covariate data X <- matrix(rnorm(length(betas)*n), nrow=n) # sample the response probs <- exp(tcrossprod(betas, X)[1,]) / (1 + exp(tcrossprod(betas, X)[1,])) Y <- numeric() for (i in 1:n){ Y <- c(Y, sample(c(0,1), 1, prob=c(1-probs[i], probs[i]))) } # generate folds folds <- makeFoldsGLMcv(10, Y, model="logistic")
# set the sample size n <- 50 # set the true parameter betas <- (c(0:100) - 50) / 20 # generate covariate data X <- matrix(rnorm(length(betas)*n), nrow=n) # sample the response probs <- exp(tcrossprod(betas, X)[1,]) / (1 + exp(tcrossprod(betas, X)[1,])) Y <- numeric() for (i in 1:n){ Y <- c(Y, sample(c(0,1), 1, prob=c(1-probs[i], probs[i]))) } # generate folds folds <- makeFoldsGLMcv(10, Y, model="logistic")
Function that performs an automatic search for the optimal penalty parameter for the ridgeGGMmixture
call by employing Brent's
method to the calculation of a cross-validated (negative) log-likelihood score.
optPenaltyGGMmixture.kCVauto(Y, K, lambdaMin, lambdaMax, lambdaInit=(lambdaMin+lambdaMax)/2, fold=nrow(Y), target, iWeights=matrix(sample(seq(0+1/nrow(Y), 1-1/nrow(Y), by=1/(2*nrow(Y))), nrow(Y)*K, replace=TRUE), nrow=nrow(Y), ncol=K), nInit=100, minSuccDiff=10^(-10), minMixProp=0.01)
optPenaltyGGMmixture.kCVauto(Y, K, lambdaMin, lambdaMax, lambdaInit=(lambdaMin+lambdaMax)/2, fold=nrow(Y), target, iWeights=matrix(sample(seq(0+1/nrow(Y), 1-1/nrow(Y), by=1/(2*nrow(Y))), nrow(Y)*K, replace=TRUE), nrow=nrow(Y), ncol=K), nInit=100, minSuccDiff=10^(-10), minMixProp=0.01)
Y |
Data |
K |
A |
lambdaMin |
A |
lambdaMax |
A |
lambdaInit |
A |
fold |
A |
target |
A semi-positive definite target |
iWeights |
Sample-specific positive component weight |
nInit |
A |
minSuccDiff |
A |
minMixProp |
Smallest mixing probability tolerated. |
The function returns a positive numeric
, the cross-validated optimal penalty parameter.
The elements of iWeights
may be larger than one as they are rescaled internally to sum to one.
W.N. van Wieringen, M. Aflakparast.
Aflakparast, M., de Gunst, M.C.M., van Wieringen, W.N. (2018), "Reconstruction of molecular network evolution from cross-sectional omics data", Biometrical Journal, 60(3), 547-563.
ridgeGGMmixture
# define mixing proportions pis <- c(0.2, 0.3, 0.4) # set dimension and sample size p <- 5 n <- 100 # define population covariance matrices diags <- list(rep(1, p), rep(0.5, p-1), rep(0.25, p-2), rep(0.1, p-3)) Omega <- as.matrix(Matrix::bandSparse(p, k =-c(0:3), diag=c(diags), symm=TRUE)) Sigma1 <- solve(Omega) Omega <- matrix(0.3, p, p) diag(Omega) <- 1 Sigma2 <- solve(Omega) Sigma3 <- cov(matrix(rnorm(p*n), ncol=p)) # mean vectors mean1 <- rep(0,p) mean2 <- rexp(p) mean3 <- rnorm(p) # draw data data from GGM mixture Z <- sort(sample(c(1:3), n, prob=pis, replace=TRUE)) Y <- rbind(mvtnorm::rmvnorm(sum(Z==1), mean=mean1, sigma=Sigma1), mvtnorm::rmvnorm(sum(Z==2), mean=mean2, sigma=Sigma2), mvtnorm::rmvnorm(sum(Z==3), mean=mean3, sigma=Sigma3)) # find optimal penalty parameter ### optLambda <- optPenaltyGGMmixture.kCVauto(Y, K=3, ### 0.00001, 100, ### 10, fold=5, ### target=0*Sigma1) # ridge penalized estimation of the GGM mixture ### ridgeGGMmixFit <- ridgeGGMmixture(Y, 3, optLambda, target=0*Sigma1)
# define mixing proportions pis <- c(0.2, 0.3, 0.4) # set dimension and sample size p <- 5 n <- 100 # define population covariance matrices diags <- list(rep(1, p), rep(0.5, p-1), rep(0.25, p-2), rep(0.1, p-3)) Omega <- as.matrix(Matrix::bandSparse(p, k =-c(0:3), diag=c(diags), symm=TRUE)) Sigma1 <- solve(Omega) Omega <- matrix(0.3, p, p) diag(Omega) <- 1 Sigma2 <- solve(Omega) Sigma3 <- cov(matrix(rnorm(p*n), ncol=p)) # mean vectors mean1 <- rep(0,p) mean2 <- rexp(p) mean3 <- rnorm(p) # draw data data from GGM mixture Z <- sort(sample(c(1:3), n, prob=pis, replace=TRUE)) Y <- rbind(mvtnorm::rmvnorm(sum(Z==1), mean=mean1, sigma=Sigma1), mvtnorm::rmvnorm(sum(Z==2), mean=mean2, sigma=Sigma2), mvtnorm::rmvnorm(sum(Z==3), mean=mean3, sigma=Sigma3)) # find optimal penalty parameter ### optLambda <- optPenaltyGGMmixture.kCVauto(Y, K=3, ### 0.00001, 100, ### 10, fold=5, ### target=0*Sigma1) # ridge penalized estimation of the GGM mixture ### ridgeGGMmixFit <- ridgeGGMmixture(Y, 3, optLambda, target=0*Sigma1)
Function finds the optimal penalty parameter of the targeted ridge regression estimator of the generalized linear model parameter. The optimum is defined as the minimizer of the cross-validated loss associated with the estimator.
optPenaltyGLM.kCVauto(Y, X, U=matrix(ncol=0, nrow=length(Y)), lambdaInit, lambdaGinit=0, Dg=matrix(0, ncol=ncol(X), nrow=ncol(X)), model="linear", target=rep(0, ncol(X)), folds=makeFoldsGLMcv(min(10, length(X)), Y, model=model), loss="loglik", lambdaMin=10^(-5), lambdaGmin=10^(-5), minSuccDiff=10^(-5), maxIter=100, implementation="org")
optPenaltyGLM.kCVauto(Y, X, U=matrix(ncol=0, nrow=length(Y)), lambdaInit, lambdaGinit=0, Dg=matrix(0, ncol=ncol(X), nrow=ncol(X)), model="linear", target=rep(0, ncol(X)), folds=makeFoldsGLMcv(min(10, length(X)), Y, model=model), loss="loglik", lambdaMin=10^(-5), lambdaGmin=10^(-5), minSuccDiff=10^(-5), maxIter=100, implementation="org")
Y |
A |
X |
The design |
U |
The design |
lambdaInit |
A |
lambdaGinit |
A |
Dg |
A non-negative definite |
model |
A |
target |
A |
folds |
A |
loss |
A |
lambdaMin |
A positive |
lambdaGmin |
A positive |
minSuccDiff |
A |
maxIter |
A |
implementation |
A |
The function returns a all-positive numeric
, the cross-validated optimal penalty parameters. The average loglikelihood over the left-out samples is used as the cross-validation criterion. If model="linear"
, also the average sum-of-squares over the left-out samples is offered as cross-validation criterion.
The joint selection of penalty parameters and
through the optimization of the cross-validated loss may lead to a locally optimal choice. This is due to the fact that the penalties are to some extent communicating vessels. Both shrink towards the same target, only in slightly (dependending on the specifics of the generalized penalty matrix
) different ways. As such, the shrinkage achieved by one penalty may be partially compensated for by the other. This may hamper the algorithm in its search for the global optimizers.
Moreover, the penalized IRLS (Iterative Reweighted Least Squares) algorithm for the evaluation of the generalized ridge logistic regression estimator and implemented in the ridgeGLM
-function may fail to converge for small penalty parameter values in combination with a nonzero shrinkage target. This phenomenon propogates to the optPenaltyGLM.kCVauto
-function.
W.N. van Wieringen.
van Wieringen, W.N. Binder, H. (2022), "Sequential learning of regression models by penalized estimation", accepted.
Lettink, A., Chinapaw, M.J.M., van Wieringen, W.N. et al. (2022), "Two-dimensional fused targeted ridge regression for health indicator prediction from accelerometer data", submitted.
# set the sample size n <- 50 # set the true parameter betas <- (c(0:100) - 50) / 20 # generate covariate data X <- matrix(rnorm(length(betas)*n), nrow=n) # sample the response probs <- exp(tcrossprod(betas, X)[1,]) / (1 + exp(tcrossprod(betas, X)[1,])) Y <- numeric() for (i in 1:n){ Y <- c(Y, sample(c(0,1), 1, prob=c(1-probs[i], probs[i]))) } # tune the penalty parameter optLambda <- optPenaltyGLM.kCVauto(Y, X, lambdaInit=1, fold=5, target=betas/2, model="logistic", minSuccDiff=10^(-3)) # estimate the logistic regression parameter bHat <- ridgeGLM(Y, X, lambda=optLambda, target=betas/2, model="logistic")
# set the sample size n <- 50 # set the true parameter betas <- (c(0:100) - 50) / 20 # generate covariate data X <- matrix(rnorm(length(betas)*n), nrow=n) # sample the response probs <- exp(tcrossprod(betas, X)[1,]) / (1 + exp(tcrossprod(betas, X)[1,])) Y <- numeric() for (i in 1:n){ Y <- c(Y, sample(c(0,1), 1, prob=c(1-probs[i], probs[i]))) } # tune the penalty parameter optLambda <- optPenaltyGLM.kCVauto(Y, X, lambdaInit=1, fold=5, target=betas/2, model="logistic", minSuccDiff=10^(-3)) # estimate the logistic regression parameter bHat <- ridgeGLM(Y, X, lambda=optLambda, target=betas/2, model="logistic")
Function finds the optimal penalty parameter of the targeted ridge regression estimator of the generalized linear model parameter. The optimum is defined as the minimizer of the cross-validated loss associated with the estimator.
optPenaltyGLMmultiT.kCVauto(Y, X, lambdaInit, model="linear", targetMat, folds=makeFoldsGLMcv(min(10, length(X)), Y, model=model), loss="loglik", lambdaMin=10^(-5), minSuccDiff=10^(-5), maxIter=100)
optPenaltyGLMmultiT.kCVauto(Y, X, lambdaInit, model="linear", targetMat, folds=makeFoldsGLMcv(min(10, length(X)), Y, model=model), loss="loglik", lambdaMin=10^(-5), minSuccDiff=10^(-5), maxIter=100)
Y |
A |
X |
The design |
lambdaInit |
A |
model |
A |
targetMat |
A |
folds |
A |
loss |
A |
lambdaMin |
A positive |
minSuccDiff |
A |
maxIter |
A |
The function returns an all-positive numeric
, the cross-validated optimal penalty parameters. The average loglikelihood over the left-out samples is used as the cross-validation criterion. If model="linear"
, also the average sum-of-squares over the left-out samples is offered as cross-validation criterion.
W.N. van Wieringen.
van Wieringen, W.N. Binder, H. (2022), "Sequential learning of regression models by penalized estimation", accepted.
# set the sample size n <- 50 # set the true parameter betas <- (c(0:100) - 50) / 20 # generate covariate data X <- matrix(rnorm(length(betas)*n), nrow=n) # sample the response probs <- exp(tcrossprod(betas, X)[1,]) / (1 + exp(tcrossprod(betas, X)[1,])) Y <- numeric() for (i in 1:n){ Y <- c(Y, sample(c(0,1), 1, prob=c(1-probs[i], probs[i]))) } # create targets targets <- cbind(betas/2, rep(0, length(betas))) # tune the penalty parameter ### optLambdas <- optPenaltyGLMmultiT.kCVauto(Y, X, c(50,0.1), fold=5, ### targetMat=targets, model="logistic", ### minSuccDiff=10^(-3)) # estimate the logistic regression parameter ### bHat <- ridgeGLMmultiT(Y, X, lambdas=optLambdas, targetMat=targets, model="logistic")
# set the sample size n <- 50 # set the true parameter betas <- (c(0:100) - 50) / 20 # generate covariate data X <- matrix(rnorm(length(betas)*n), nrow=n) # sample the response probs <- exp(tcrossprod(betas, X)[1,]) / (1 + exp(tcrossprod(betas, X)[1,])) Y <- numeric() for (i in 1:n){ Y <- c(Y, sample(c(0,1), 1, prob=c(1-probs[i], probs[i]))) } # create targets targets <- cbind(betas/2, rep(0, length(betas))) # tune the penalty parameter ### optLambdas <- optPenaltyGLMmultiT.kCVauto(Y, X, c(50,0.1), fold=5, ### targetMat=targets, model="logistic", ### minSuccDiff=10^(-3)) # estimate the logistic regression parameter ### bHat <- ridgeGLMmultiT(Y, X, lambdas=optLambdas, targetMat=targets, model="logistic")
Function that determines the optimal penalty parameters through maximization of the k-fold cross-validated log-likelihood score, with a penalization that encourages banded precisions.
optPenaltyPgen.kCVauto.banded(Y, lambdaMin, lambdaMax, lambdaInit=(lambdaMin + lambdaMax)/2, fold=nrow(Y), target, zeros=matrix(nrow=0, ncol=2), penalize.diag=TRUE, nInit=100, minSuccDiff=10^(-5))
optPenaltyPgen.kCVauto.banded(Y, lambdaMin, lambdaMax, lambdaInit=(lambdaMin + lambdaMax)/2, fold=nrow(Y), target, zeros=matrix(nrow=0, ncol=2), penalize.diag=TRUE, nInit=100, minSuccDiff=10^(-5))
Y |
Data |
lambdaMin |
A |
lambdaMax |
A |
lambdaInit |
A |
fold |
A |
target |
A semi-positive definite target |
zeros |
A two-column |
penalize.diag |
A |
nInit |
A |
minSuccDiff |
A |
The penalty matrix is parametrized as follows. The elements of
are
for
.
The function returns a numeric
containing the cross-validated optimal positive penalty parameters.
W.N. van Wieringen.
van Wieringen, W.N. (2019), "The generalized ridge estimator of the inverse covariance matrix", Journal of Computational and Graphical Statistics, 28(4), 932-942.
# set dimension and sample size p <- 10 n <- 10 # penalty parameter matrix lambda <- matrix(1, p, p) diag(lambda) <- 0.1 # generate precision matrix Omega <- matrix(0.4, p, p) diag(Omega) <- 1 Sigma <- solve(Omega) # data Y <- mvtnorm::rmvnorm(n, mean=rep(0,p), sigma=Sigma) S <- cov(Y) # find optimal penalty parameters through cross-validation lambdaOpt <- optPenaltyPgen.kCVauto.banded(Y, 10^(-10), 10^(10), target=matrix(0, p, p), penalize.diag=FALSE, nInit=100, minSuccDiff=10^(-5)) # format the penalty matrix lambdaOptMat <- matrix(NA, p, p) for (j1 in 1:p){ for (j2 in 1:p){ lambdaOptMat[j1, j2] <- lambdaOpt * (abs(j1-j2)+1) } } # generalized ridge precision estimate Phat <- ridgePgen(S, lambdaOptMat, matrix(0, p, p))
# set dimension and sample size p <- 10 n <- 10 # penalty parameter matrix lambda <- matrix(1, p, p) diag(lambda) <- 0.1 # generate precision matrix Omega <- matrix(0.4, p, p) diag(Omega) <- 1 Sigma <- solve(Omega) # data Y <- mvtnorm::rmvnorm(n, mean=rep(0,p), sigma=Sigma) S <- cov(Y) # find optimal penalty parameters through cross-validation lambdaOpt <- optPenaltyPgen.kCVauto.banded(Y, 10^(-10), 10^(10), target=matrix(0, p, p), penalize.diag=FALSE, nInit=100, minSuccDiff=10^(-5)) # format the penalty matrix lambdaOptMat <- matrix(NA, p, p) for (j1 in 1:p){ for (j2 in 1:p){ lambdaOptMat[j1, j2] <- lambdaOpt * (abs(j1-j2)+1) } } # generalized ridge precision estimate Phat <- ridgePgen(S, lambdaOptMat, matrix(0, p, p))
Function that determines the optimal penalty parameters through maximization of the k-fold cross-validated log-likelihood score, assuming that variates are grouped and penalized group-wise.
optPenaltyPgen.kCVauto.groups(Y, lambdaMin, lambdaMax, lambdaInit=(lambdaMin + lambdaMax)/2, fold=nrow(Y), groups, target, zeros=matrix(nrow=0, ncol=2), penalize.diag=TRUE, nInit=100, minSuccDiff=10^(-5))
optPenaltyPgen.kCVauto.groups(Y, lambdaMin, lambdaMax, lambdaInit=(lambdaMin + lambdaMax)/2, fold=nrow(Y), groups, target, zeros=matrix(nrow=0, ncol=2), penalize.diag=TRUE, nInit=100, minSuccDiff=10^(-5))
Y |
Data |
lambdaMin |
A |
lambdaMax |
A |
lambdaInit |
A |
fold |
A |
groups |
A |
target |
A semi-positive definite target |
zeros |
A two-column |
penalize.diag |
A |
nInit |
A |
minSuccDiff |
A |
The penalty matrix is parametrized as follows. The elements of
are
for
if
and
belong to groups
and
, respectively, where
and
are the corresponding group-specific penalty parameters.
The function returns a numeric
containing the cross-validated optimal positive penalty parameters.
W.N. van Wieringen.
van Wieringen, W.N. (2019), "The generalized ridge estimator of the inverse covariance matrix", Journal of Computational and Graphical Statistics, 28(4), 932-942.
ridgePgen
# set dimension and sample size p <- 10 n <- 10 # penalty parameter matrix lambda <- matrix(1, p, p) diag(lambda) <- 0.1 # generate precision matrix Omega <- matrix(0.4, p, p) diag(Omega) <- 1 Sigma <- solve(Omega) # data Y <- mvtnorm::rmvnorm(n, mean=rep(0,p), sigma=Sigma) S <- cov(Y) # find optimal penalty parameters through cross-validation lambdaOpt <- optPenaltyPgen.kCVauto.groups(Y, rep(10^(-10), 2), rep(10^(10), 2), groups=c(rep(0, p/2), rep(1, p/2)), target=matrix(0, p, p), penalize.diag=FALSE, nInit=100, minSuccDiff=10^(-5)) # format the penalty matrix lambdaOptVec <- c(rep(lambdaOpt[1], p/2), rep(lambdaOpt[2], p/2)) lambdaOptMat <- outer(lambdaOptVec, lambdaOptVec, "+") # generalized ridge precision estimate Phat <- ridgePgen(S, lambdaOptMat, matrix(0, p, p))
# set dimension and sample size p <- 10 n <- 10 # penalty parameter matrix lambda <- matrix(1, p, p) diag(lambda) <- 0.1 # generate precision matrix Omega <- matrix(0.4, p, p) diag(Omega) <- 1 Sigma <- solve(Omega) # data Y <- mvtnorm::rmvnorm(n, mean=rep(0,p), sigma=Sigma) S <- cov(Y) # find optimal penalty parameters through cross-validation lambdaOpt <- optPenaltyPgen.kCVauto.groups(Y, rep(10^(-10), 2), rep(10^(10), 2), groups=c(rep(0, p/2), rep(1, p/2)), target=matrix(0, p, p), penalize.diag=FALSE, nInit=100, minSuccDiff=10^(-5)) # format the penalty matrix lambdaOptVec <- c(rep(lambdaOpt[1], p/2), rep(lambdaOpt[2], p/2)) lambdaOptMat <- outer(lambdaOptVec, lambdaOptVec, "+") # generalized ridge precision estimate Phat <- ridgePgen(S, lambdaOptMat, matrix(0, p, p))
Function that determines the optimal penalty parameters through maximization of the k-fold cross-validated log-likelihood score, assuming that variates are grouped and penalized group-wise.
optPenaltyPmultiT.kCVauto(Y, lambdaMin, lambdaMax, lambdaInit=(lambdaMin+lambdaMax)/2, fold=nrow(Y), targetList)
optPenaltyPmultiT.kCVauto(Y, lambdaMin, lambdaMax, lambdaInit=(lambdaMin+lambdaMax)/2, fold=nrow(Y), targetList)
Y |
Data |
lambdaMin |
A |
lambdaMax |
A |
lambdaInit |
A |
fold |
A |
targetList |
A list of semi-positive definite target matrices towards which the precision matrix is potentially shrunken. |
The function returns a numeric
containing the cross-validated optimal positive penalty parameters.
W.N. van Wieringen.
van Wieringen, W.N., Stam, K.A., Peeters, C.F.W., van de Wiel, M.A. (2020), "Updating of the Gaussian graphical model through targeted penalized estimation", Journal of Multivariate Analysis, 178, Article 104621.
ridgePmultiT
# set dimension and sample size p <- 10 n <- 10 # specify vector of penalty parameters lambda <- c(2, 1) # generate precision matrix T1 <- matrix(0.7, p, p) diag(T1) <- 1 T2 <- diag(rep(2, p)) # generate precision matrix Omega <- matrix(0.4, p, p) diag(Omega) <- 2 Sigma <- solve(Omega) # data Y <- mvtnorm::rmvnorm(n, mean=rep(0,p), sigma=Sigma) S <- cov(Y) # find optimal penalty parameters through cross-validation lambdaOpt <- optPenaltyPmultiT.kCVauto(Y, rep(10^(-10), 2), rep(10^(10), 2), rep(1, 2), targetList=list(T1=T1, T2=T2)) # unpenalized diagonal estimate Phat <- ridgePmultiT(S, lambdaOpt, list(T1=T1, T2=T2))
# set dimension and sample size p <- 10 n <- 10 # specify vector of penalty parameters lambda <- c(2, 1) # generate precision matrix T1 <- matrix(0.7, p, p) diag(T1) <- 1 T2 <- diag(rep(2, p)) # generate precision matrix Omega <- matrix(0.4, p, p) diag(Omega) <- 2 Sigma <- solve(Omega) # data Y <- mvtnorm::rmvnorm(n, mean=rep(0,p), sigma=Sigma) S <- cov(Y) # find optimal penalty parameters through cross-validation lambdaOpt <- optPenaltyPmultiT.kCVauto(Y, rep(10^(-10), 2), rep(10^(10), 2), rep(1, 2), targetList=list(T1=T1, T2=T2)) # unpenalized diagonal estimate Phat <- ridgePmultiT(S, lambdaOpt, list(T1=T1, T2=T2))
Function that performs an automatic search of the optimal penalty parameter for the ridgePrep
call by employing either the Nelder-Mead or quasi-Newton
method to calculate the cross-validated (negative) log-likelihood score.
optPenaltyPrep.kCVauto(Y, ids, lambdaInit, fold=nrow(Y), CVcrit, splitting="stratified", targetZ=matrix(0, ncol(Y), ncol(Y)), targetE=matrix(0, ncol(Y), ncol(Y)), nInit=100, minSuccDiff=10^(-10))
optPenaltyPrep.kCVauto(Y, ids, lambdaInit, fold=nrow(Y), CVcrit, splitting="stratified", targetZ=matrix(0, ncol(Y), ncol(Y)), targetE=matrix(0, ncol(Y), ncol(Y)), nInit=100, minSuccDiff=10^(-10))
Y |
Data |
ids |
A |
lambdaInit |
A |
fold |
A |
CVcrit |
A |
splitting |
A |
targetZ |
A semi-positive definite target |
targetE |
A semi-positive definite target |
nInit |
A |
minSuccDiff |
A |
The function returns an all-positive numeric
, the cross-validated optimal penalty parameters.
W.N. van Wieringen.
van Wieringen, W.N., Chen, Y. (2021), "Penalized estimation of the Gaussian graphical model from data with replicates", Statistics in Medicine, 40(19), 4279-4293.
ridgePrep
# set parameters p <- 10 Se <- diag(runif(p)) Sz <- matrix(3, p, p) diag(Sz) <- 4 # draw data n <- 100 ids <- numeric() Y <- numeric() for (i in 1:n){ Ki <- sample(2:5, 1) Zi <- mvtnorm::rmvnorm(1, sigma=Sz) for (k in 1:Ki){ Y <- rbind(Y, Zi + mvtnorm::rmvnorm(1, sigma=Se)) ids <- c(ids, i) } } # find optimal penalty parameters ### optLambdas <- optPenaltyPrep.kCVauto(Y, ids, ### lambdaInit=c(1,1), ### fold=nrow(Y), ### CVcrit="LL") # estimate the precision matrices ### Ps <- ridgePrep(Y, ids, optLambdas[1], optLambdas[2])
# set parameters p <- 10 Se <- diag(runif(p)) Sz <- matrix(3, p, p) diag(Sz) <- 4 # draw data n <- 100 ids <- numeric() Y <- numeric() for (i in 1:n){ Ki <- sample(2:5, 1) Zi <- mvtnorm::rmvnorm(1, sigma=Sz) for (k in 1:Ki){ Y <- rbind(Y, Zi + mvtnorm::rmvnorm(1, sigma=Se)) ids <- c(ids, i) } } # find optimal penalty parameters ### optLambdas <- optPenaltyPrep.kCVauto(Y, ids, ### lambdaInit=c(1,1), ### fold=nrow(Y), ### CVcrit="LL") # estimate the precision matrices ### Ps <- ridgePrep(Y, ids, optLambdas[1], optLambdas[2])
Function that performs an automatic search of the optimal penalty parameter for the ridgePrepEdiag
call by employing either the Nelder-Mead or quasi-Newton
method to calculate of the cross-validated (negative) log-likelihood score.
optPenaltyPrepEdiag.kCVauto(Y, ids, lambdaInit, fold=nrow(Y), CVcrit, splitting="stratified", targetZ=matrix(0, ncol(Y), ncol(Y)), nInit=100, minSuccDiff=10^(-10))
optPenaltyPrepEdiag.kCVauto(Y, ids, lambdaInit, fold=nrow(Y), CVcrit, splitting="stratified", targetZ=matrix(0, ncol(Y), ncol(Y)), nInit=100, minSuccDiff=10^(-10))
Y |
Data |
ids |
A |
lambdaInit |
A |
fold |
A |
CVcrit |
A |
splitting |
A |
targetZ |
A semi-positive definite target |
nInit |
A |
minSuccDiff |
A |
The function returns an all-positive numeric
, the cross-validated optimal penalty parameters.
W.N. van Wieringen.
van Wieringen, W.N., Chen, Y. (2021), "Penalized estimation of the Gaussian graphical model from data with replicates", Statistics in Medicine, 40(19), 4279-4293.
ridgePrepEdiag
# set parameters p <- 10 Se <- diag(runif(p)) Sz <- matrix(3, p, p) diag(Sz) <- 4 # draw data n <- 100 ids <- numeric() Y <- numeric() for (i in 1:n){ Ki <- sample(2:5, 1) Zi <- mvtnorm::rmvnorm(1, sigma=Sz) for (k in 1:Ki){ Y <- rbind(Y, Zi + mvtnorm::rmvnorm(1, sigma=Se)) ids <- c(ids, i) } } # find optimal penalty parameters ### optLambdas <- optPenaltyPrepEdiag.kCVauto(Y, ids, ### lambdaInit=c(1,1), ### fold=nrow(Y), ### CVcrit="LL") # estimate the precision matrices ### Ps <- ridgePrepEdiag(Y, ids, optLambdas[1], optLambdas[2])
# set parameters p <- 10 Se <- diag(runif(p)) Sz <- matrix(3, p, p) diag(Sz) <- 4 # draw data n <- 100 ids <- numeric() Y <- numeric() for (i in 1:n){ Ki <- sample(2:5, 1) Zi <- mvtnorm::rmvnorm(1, sigma=Sz) for (k in 1:Ki){ Y <- rbind(Y, Zi + mvtnorm::rmvnorm(1, sigma=Se)) ids <- c(ids, i) } } # find optimal penalty parameters ### optLambdas <- optPenaltyPrepEdiag.kCVauto(Y, ids, ### lambdaInit=c(1,1), ### fold=nrow(Y), ### CVcrit="LL") # estimate the precision matrices ### Ps <- ridgePrepEdiag(Y, ids, optLambdas[1], optLambdas[2])
Function that estimates a mixture of GGMs (Gaussian graphical models) through a ridge penalized EM (Expectation-Maximization) algorithm as described in Aflakparast et al. (2018).
ridgeGGMmixture(Y, K, lambda, target, iWeights=matrix(sample(seq(0+1/nrow(Y), 1-1/nrow(Y), by=1/(2*nrow(Y))), nrow(Y)*K, replace=TRUE), nrow=nrow(Y), ncol=K), nInit=100, minSuccDiff=10^(-10), minMixProp=0.01)
ridgeGGMmixture(Y, K, lambda, target, iWeights=matrix(sample(seq(0+1/nrow(Y), 1-1/nrow(Y), by=1/(2*nrow(Y))), nrow(Y)*K, replace=TRUE), nrow=nrow(Y), ncol=K), nInit=100, minSuccDiff=10^(-10), minMixProp=0.01)
Y |
Data |
K |
A |
lambda |
A positive |
target |
A semi-positive definite target |
iWeights |
Sample-specific positive component weight |
nInit |
A |
minSuccDiff |
A |
minMixProp |
Smallest mixing probability tolerated. |
The data are assumed to follow a mixture of Gaussian graphical models:
where is the probability that the
-th sample stems from the
-the component. The model parameters are estimated by ridge penalized likelihood maximization:
where is the penalty parameter and
is the shrinkage target of the
-th component's precision matrix. This function yields the maximizer of this penalized loglikelihood, which is found by means of a penalized EM algorithm.
The function returns a regularized inverse covariance list
-object with slots:
mu |
A |
P |
A |
pi |
A |
weights |
A |
penLL |
A |
The elements of iWeights
may be larger than one as they are rescaled internally to sum to one.
W.N. van Wieringen, M. Aflakparast.
Aflakparast, M., de Gunst, M.C.M., van Wieringen, W.N. (2018), "Reconstruction of molecular network evolution from cross-sectional omics data", Biometrical Journal, 60(3), 547-563.
optPenaltyGGMmixture.kCVauto
# define mixing proportions pis <- c(0.2, 0.3, 0.4) # set dimension and sample size p <- 5 n <- 100 # define population covariance matrices diags <- list(rep(1, p), rep(0.5, p-1), rep(0.25, p-2), rep(0.1, p-3)) Omega <- as.matrix(Matrix::bandSparse(p, k=-c(0:3), diag=c(diags), symm=TRUE)) Sigma1 <- solve(Omega) Omega <- matrix(0.3, p, p) diag(Omega) <- 1 Sigma2 <- solve(Omega) Sigma3 <- cov(matrix(rnorm(p*n), ncol=p)) # mean vectors mean1 <- rep(0,p) mean2 <- rexp(p) mean3 <- rnorm(p) # draw data data from GGM mixture Z <- sort(sample(c(1:3), n, prob=pis, replace=TRUE)) Y <- rbind(mvtnorm::rmvnorm(sum(Z==1), mean=mean1, sigma=Sigma1), mvtnorm::rmvnorm(sum(Z==2), mean=mean2, sigma=Sigma2), mvtnorm::rmvnorm(sum(Z==3), mean=mean3, sigma=Sigma3)) # find optimal penalty parameter optLambda <- optPenaltyGGMmixture.kCVauto(Y, K=3, 0.00001, 100, 10, fold=5, target=0*Sigma1) # ridge penalized estimation of the GGM mixture ridgeGGMmixFit <- ridgeGGMmixture(Y, 3, 1, target=0*Sigma1)
# define mixing proportions pis <- c(0.2, 0.3, 0.4) # set dimension and sample size p <- 5 n <- 100 # define population covariance matrices diags <- list(rep(1, p), rep(0.5, p-1), rep(0.25, p-2), rep(0.1, p-3)) Omega <- as.matrix(Matrix::bandSparse(p, k=-c(0:3), diag=c(diags), symm=TRUE)) Sigma1 <- solve(Omega) Omega <- matrix(0.3, p, p) diag(Omega) <- 1 Sigma2 <- solve(Omega) Sigma3 <- cov(matrix(rnorm(p*n), ncol=p)) # mean vectors mean1 <- rep(0,p) mean2 <- rexp(p) mean3 <- rnorm(p) # draw data data from GGM mixture Z <- sort(sample(c(1:3), n, prob=pis, replace=TRUE)) Y <- rbind(mvtnorm::rmvnorm(sum(Z==1), mean=mean1, sigma=Sigma1), mvtnorm::rmvnorm(sum(Z==2), mean=mean2, sigma=Sigma2), mvtnorm::rmvnorm(sum(Z==3), mean=mean3, sigma=Sigma3)) # find optimal penalty parameter optLambda <- optPenaltyGGMmixture.kCVauto(Y, K=3, 0.00001, 100, 10, fold=5, target=0*Sigma1) # ridge penalized estimation of the GGM mixture ridgeGGMmixFit <- ridgeGGMmixture(Y, 3, 1, target=0*Sigma1)
Function that evaluates the targeted ridge estimator of the regression parameter of generalized linear models.
ridgeGLM(Y, X, U=matrix(ncol=0, nrow=length(Y)), lambda, lambdaG=0, Dg=matrix(0, ncol=ncol(X), nrow=ncol(X)), target=rep(0, ncol(X)), model="linear", minSuccDiff=10^(-10), maxIter=100)
ridgeGLM(Y, X, U=matrix(ncol=0, nrow=length(Y)), lambda, lambdaG=0, Dg=matrix(0, ncol=ncol(X), nrow=ncol(X)), target=rep(0, ncol(X)), model="linear", minSuccDiff=10^(-10), maxIter=100)
Y |
A |
X |
The design |
U |
The design |
lambda |
A positive |
lambdaG |
A positive |
Dg |
A non-negative definite |
target |
A |
model |
A |
minSuccDiff |
A |
maxIter |
A |
This function finds the maximizer of the following penalized loglikelihood: , with loglikelihood
, response
, design matrices
and
, regression parameters
and
, penalty parameter
, shrinkage target
, and generalized ridge penalty matrix
. For more details, see van Wieringen, Binder (2020) and Lettink et al. (2022).
A numeric
, the generalized ridge estimate of the regression parameter. If a nonempty is supplied, the first few elements are the unpenalized effect estimates of the covariates that comprise this design matrix.
The penalized IRLS (Iterative Reweighted Least Squares) algorithm for the evaluation of the generalized ridge logistic regression estimator may fail to converge for small penalty parameter values in combination with a nonzero shrinkage target.
W.N. van Wieringen.
van Wieringen, W.N. Binder, H. (2022), "Sequential learning of regression models by penalized estimation", submitted.
Lettink, A., Chinapaw, M.J.M., van Wieringen, W.N. (2022), "Two-dimensional fused targeted ridge regression for health indicator prediction from accelerometer data", submitted.
# set the sample size n <- 50 # set the true parameter betas <- (c(0:100) - 50) / 20 # generate covariate data X <- matrix(rnorm(length(betas)*n), nrow=n) # sample the response probs <- exp(tcrossprod(betas, X)[1,]) / (1 + exp(tcrossprod(betas, X)[1,])) Y <- numeric() for (i in 1:n){ Y <- c(Y, sample(c(0,1), 1, prob=c(1-probs[i], probs[i]))) } # set the penalty parameter lambda <- 3 # estimate the logistic regression parameter bHat <- ridgeGLM(Y, X, lambda=lambda, target=betas/2, model="logistic")
# set the sample size n <- 50 # set the true parameter betas <- (c(0:100) - 50) / 20 # generate covariate data X <- matrix(rnorm(length(betas)*n), nrow=n) # sample the response probs <- exp(tcrossprod(betas, X)[1,]) / (1 + exp(tcrossprod(betas, X)[1,])) Y <- numeric() for (i in 1:n){ Y <- c(Y, sample(c(0,1), 1, prob=c(1-probs[i], probs[i]))) } # set the penalty parameter lambda <- 3 # estimate the logistic regression parameter bHat <- ridgeGLM(Y, X, lambda=lambda, target=betas/2, model="logistic")
Function that evaluates the degrees of freedom of the generalized ridge estimator of the regression parameter of generalized linear models.
ridgeGLMdof(X, U=matrix(ncol=0, nrow=nrow(X)), lambda, lambdaG, Dg=matrix(0, ncol=ncol(X), nrow=ncol(X)), model="linear", linPred=rep(0,nrow(X)))
ridgeGLMdof(X, U=matrix(ncol=0, nrow=nrow(X)), lambda, lambdaG, Dg=matrix(0, ncol=ncol(X), nrow=ncol(X)), model="linear", linPred=rep(0,nrow(X)))
X |
The design |
U |
The design |
lambda |
A positive |
lambdaG |
A positive |
Dg |
A non-negative definite |
model |
A |
linPred |
A |
The degrees of freedom of the regular ridge regression estimator is usually defined the trace of the ridge hat matrix: . That of the regular ridge logistic regression estimator is defined analoguously by Park, Hastie (2008). Lettink et al. (2022) translates these definitions to the generalized ridge (logistic) regression case.
A numeric
, the degrees of freedom consumed by the (generalized) ridge (logistic) regression estimator.
W.N. van Wieringen.
Park, M. Y., & Hastie, T. (2008). Penalized logistic regression for detecting gene interactions. Biostatistics, 9(1), 30-50.
Lettink, A., Chinapaw, M.J.M., van Wieringen, W.N. (2022), "Two-dimensional fused targeted ridge regression for health indicator prediction from accelerometer data", submitted.
# set the sample size n <- 50 # set the true parameter betas <- (c(0:100) - 50) / 20 # generate covariate data X <- matrix(rnorm(length(betas)*n), nrow=n) # set the penalty parameter lambda <- 3 # estimate the logistic regression parameter dofs <- ridgeGLMdof(X, lambda=lambda, lambdaG=0, model="logistic", linPred=tcrossprod(X, t(betas)))
# set the sample size n <- 50 # set the true parameter betas <- (c(0:100) - 50) / 20 # generate covariate data X <- matrix(rnorm(length(betas)*n), nrow=n) # set the penalty parameter lambda <- 3 # estimate the logistic regression parameter dofs <- ridgeGLMdof(X, lambda=lambda, lambdaG=0, model="logistic", linPred=tcrossprod(X, t(betas)))
Function that evaluates the multi-targeted ridge estimator of the regression parameter of generalized linear models.
ridgeGLMmultiT(Y, X, U=matrix(ncol=0, nrow=length(Y)), lambdas, targetMat, model="linear", minSuccDiff=10^(-10), maxIter=100)
ridgeGLMmultiT(Y, X, U=matrix(ncol=0, nrow=length(Y)), lambdas, targetMat, model="linear", minSuccDiff=10^(-10), maxIter=100)
Y |
A |
X |
The design |
U |
The design |
lambdas |
An all-positive |
targetMat |
A |
model |
A |
minSuccDiff |
A |
maxIter |
A |
This function finds the maximizer of the following penalized loglikelihood: , with loglikelihood
, response
, design matrix
, regression parameter
, penalty parameter
, and the
-th shrinkage target
. For more details, see van Wieringen, Binder (2020).
The ridge estimate of the regression parameter.
W.N. van Wieringen.
van Wieringen, W.N. Binder, H. (2020), "Online learning of regression models from a sequence of datasets by penalized estimation", submitted.
# set the sample size n <- 50 # set the true parameter betas <- (c(0:100) - 50) / 20 # generate covariate data X <- matrix(rnorm(length(betas)*n), nrow=n) # sample the response probs <- exp(tcrossprod(betas, X)[1,]) / (1 + exp(tcrossprod(betas, X)[1,])) Y <- numeric() for (i in 1:n){ Y <- c(Y, sample(c(0,1), 1, prob=c(1-probs[i], probs[i]))) } # set the penalty parameter lambdas <- c(1,3) # estimate the logistic regression parameter # bHat <- ridgeGLMmultiT(Y, X, lambdas, model="logistic", # targetMat=cbind(betas/2, rnorm(length(betas))))
# set the sample size n <- 50 # set the true parameter betas <- (c(0:100) - 50) / 20 # generate covariate data X <- matrix(rnorm(length(betas)*n), nrow=n) # sample the response probs <- exp(tcrossprod(betas, X)[1,]) / (1 + exp(tcrossprod(betas, X)[1,])) Y <- numeric() for (i in 1:n){ Y <- c(Y, sample(c(0,1), 1, prob=c(1-probs[i], probs[i]))) } # set the penalty parameter lambdas <- c(1,3) # estimate the logistic regression parameter # bHat <- ridgeGLMmultiT(Y, X, lambdas, model="logistic", # targetMat=cbind(betas/2, rnorm(length(betas))))
Function that evaluates the generalized ridge estimator of the inverse covariance matrix with element-wise penalization and shrinkage.
ridgePgen(S, lambda, target, nInit=100, minSuccDiff=10^(-10))
ridgePgen(S, lambda, target, nInit=100, minSuccDiff=10^(-10))
S |
Sample covariance |
lambda |
A symmetric |
target |
A semi-positive definite target |
nInit |
A |
minSuccDiff |
A |
This function generalizes the ridgeP
-function in the sense that, besides element-wise shrinkage, it allows for element-wise penalization in the estimation of the precision matrix of a zero-mean multivariate normal distribution. Hence, it assumes that the data stem from . The estimator maximizes the following penalized loglikelihood:
where the sample covariance matrix,
a symmetric, positive matrix of penalty parameters, the
-operator represents the Hadamard or element-wise multipication, and
the precision matrix' shrinkage target. For more details see van Wieringen (2019).
The function returns a regularized inverse covariance matrix
.
W.N. van Wieringen.
van Wieringen, W.N. (2019), "The generalized ridge estimator of the inverse covariance matrix", Journal of Computational and Graphical Statistics, 28(4), 932-942.
# set dimension and sample size p <- 10 n <- 10 # penalty parameter matrix lambda <- matrix(1, p, p) diag(lambda) <- 0.1 # generate precision matrix Omega <- matrix(0.4, p, p) diag(Omega) <- 1 Sigma <- solve(Omega) # data Y <- mvtnorm::rmvnorm(n, mean=rep(0,p), sigma=Sigma) S <- cov(Y) # unpenalized diagonal estimate Phat <- ridgePgen(S, lambda, 0*S)
# set dimension and sample size p <- 10 n <- 10 # penalty parameter matrix lambda <- matrix(1, p, p) diag(lambda) <- 0.1 # generate precision matrix Omega <- matrix(0.4, p, p) diag(Omega) <- 1 Sigma <- solve(Omega) # data Y <- mvtnorm::rmvnorm(n, mean=rep(0,p), sigma=Sigma) S <- cov(Y) # unpenalized diagonal estimate Phat <- ridgePgen(S, lambda, 0*S)
Function that calculates of the k-fold cross-validated negative (!) loglikelihood of the generalized ridge precision estimator.
ridgePgen.kCV(lambda, Y, fold=nrow(Y), target, nInit=100, minSuccDiff=10^(-5))
ridgePgen.kCV(lambda, Y, fold=nrow(Y), target, nInit=100, minSuccDiff=10^(-5))
lambda |
A symmetric |
Y |
Data |
fold |
A |
target |
A semi-positive definite target |
nInit |
A |
minSuccDiff |
A |
The function returns a numeric
containing the cross-validated negative loglikelihood.
W.N. van Wieringen.
van Wieringen, W.N. (2019), "The generalized ridge estimator of the inverse covariance matrix", Journal of Computational and Graphical Statistics, 28(4), 932-942.
# set dimension and sample size p <- 10 n <- 10 # penalty parameter matrix lambda <- matrix(1, p, p) diag(lambda) <- 0.1 # generate precision matrix Omega <- matrix(0.4, p, p) diag(Omega) <- 1 Sigma <- solve(Omega) # data Y <- mvtnorm::rmvnorm(n, mean=rep(0,p), sigma=Sigma) S <- cov(Y) # find optimal penalty parameters through cross-validation lambdaOpt <- optPenaltyPgen.kCVauto.banded(Y, 10^(-10), 10^(10), target=matrix(0, p, p), penalize.diag=FALSE, nInit=100, minSuccDiff=10^(-5)) # format the penalty matrix lambdaOptMat <- matrix(NA, p, p) for (j1 in 1:p){ for (j2 in 1:p){ lambdaOptMat[j1, j2] <- lambdaOpt * (abs(j1-j2)+1) } } # generalized ridge precision estimate Phat <- ridgePgen(S, lambdaOptMat, matrix(0, p, p))
# set dimension and sample size p <- 10 n <- 10 # penalty parameter matrix lambda <- matrix(1, p, p) diag(lambda) <- 0.1 # generate precision matrix Omega <- matrix(0.4, p, p) diag(Omega) <- 1 Sigma <- solve(Omega) # data Y <- mvtnorm::rmvnorm(n, mean=rep(0,p), sigma=Sigma) S <- cov(Y) # find optimal penalty parameters through cross-validation lambdaOpt <- optPenaltyPgen.kCVauto.banded(Y, 10^(-10), 10^(10), target=matrix(0, p, p), penalize.diag=FALSE, nInit=100, minSuccDiff=10^(-5)) # format the penalty matrix lambdaOptMat <- matrix(NA, p, p) for (j1 in 1:p){ for (j2 in 1:p){ lambdaOptMat[j1, j2] <- lambdaOpt * (abs(j1-j2)+1) } } # generalized ridge precision estimate Phat <- ridgePgen(S, lambdaOptMat, matrix(0, p, p))
Function that calculates of the k-fold cross-validated negative (!) loglikelihood of the generalized ridge precision estimator, with a penalization that encourages a banded precision matrix.
ridgePgen.kCV.banded(lambda, Y, fold=nrow(Y), target, zeros=matrix(nrow=0, ncol=2), penalize.diag=TRUE, nInit=100, minSuccDiff=10^(-5))
ridgePgen.kCV.banded(lambda, Y, fold=nrow(Y), target, zeros=matrix(nrow=0, ncol=2), penalize.diag=TRUE, nInit=100, minSuccDiff=10^(-5))
lambda |
A |
Y |
Data |
fold |
A |
target |
A semi-positive definite target |
zeros |
A two-column |
penalize.diag |
A |
nInit |
A |
minSuccDiff |
A |
The penalty matrix is parametrized as follows. The elements of
are
for
.
The function returns a numeric
containing the cross-validated negative loglikelihood.
W.N. van Wieringen.
van Wieringen, W.N. (2019), "The generalized ridge estimator of the inverse covariance matrix", Journal of Computational and Graphical Statistics, 28(4), 932-942.
# set dimension and sample size p <- 10 n <- 10 # penalty parameter matrix lambda <- matrix(1, p, p) diag(lambda) <- 0.1 # generate precision matrix Omega <- matrix(0.4, p, p) diag(Omega) <- 1 Sigma <- solve(Omega) # data Y <- mvtnorm::rmvnorm(n, mean=rep(0,p), sigma=Sigma) S <- cov(Y) # find optimal penalty parameters through cross-validation lambdaOpt <- optPenaltyPgen.kCVauto.banded(Y, 10^(-10), 10^(10), target=matrix(0, p, p), penalize.diag=FALSE, nInit=100, minSuccDiff=10^(-5)) # format the penalty matrix lambdaOptMat <- matrix(NA, p, p) for (j1 in 1:p){ for (j2 in 1:p){ lambdaOptMat[j1, j2] <- lambdaOpt * (abs(j1-j2)+1) } } # generalized ridge precision estimate Phat <- ridgePgen(S, lambdaOptMat, matrix(0, p, p))
# set dimension and sample size p <- 10 n <- 10 # penalty parameter matrix lambda <- matrix(1, p, p) diag(lambda) <- 0.1 # generate precision matrix Omega <- matrix(0.4, p, p) diag(Omega) <- 1 Sigma <- solve(Omega) # data Y <- mvtnorm::rmvnorm(n, mean=rep(0,p), sigma=Sigma) S <- cov(Y) # find optimal penalty parameters through cross-validation lambdaOpt <- optPenaltyPgen.kCVauto.banded(Y, 10^(-10), 10^(10), target=matrix(0, p, p), penalize.diag=FALSE, nInit=100, minSuccDiff=10^(-5)) # format the penalty matrix lambdaOptMat <- matrix(NA, p, p) for (j1 in 1:p){ for (j2 in 1:p){ lambdaOptMat[j1, j2] <- lambdaOpt * (abs(j1-j2)+1) } } # generalized ridge precision estimate Phat <- ridgePgen(S, lambdaOptMat, matrix(0, p, p))
Function that calculates of the k-fold cross-validated negative (!) loglikelihood of the generalized ridge precision estimator, assuming that variates are grouped and penalized group-wise.
ridgePgen.kCV.groups(lambdaGrps, Y, fold=nrow(Y), groups, target, zeros=matrix(nrow=0, ncol=2), penalize.diag=TRUE, nInit=100, minSuccDiff=10^(-5))
ridgePgen.kCV.groups(lambdaGrps, Y, fold=nrow(Y), groups, target, zeros=matrix(nrow=0, ncol=2), penalize.diag=TRUE, nInit=100, minSuccDiff=10^(-5))
lambdaGrps |
A |
Y |
Data |
fold |
A |
groups |
A |
target |
A semi-positive definite target |
zeros |
A two-column |
penalize.diag |
A |
nInit |
A |
minSuccDiff |
A |
The penalty matrix is parametrized as follows. The elements of
are
for
if
and
belong to groups
and
, respectively, where
and
are the corresponding group-specific penalty parameters.
The function returns a numeric
containing the cross-validated negative loglikelihood.
W.N. van Wieringen.
van Wieringen, W.N. (2019), "The generalized ridge estimator of the inverse covariance matrix", Journal of Computational and Graphical Statistics, 28(4), 932-942.
ridgePgen
# set dimension and sample size p <- 10 n <- 10 # penalty parameter matrix lambda <- matrix(1, p, p) diag(lambda) <- 0.1 # generate precision matrix Omega <- matrix(0.4, p, p) diag(Omega) <- 1 Sigma <- solve(Omega) # data Y <- mvtnorm::rmvnorm(n, mean=rep(0,p), sigma=Sigma) S <- cov(Y) # find optimal penalty parameters through cross-validation lambdaOpt <- optPenaltyPgen.kCVauto.groups(Y, rep(10^(-10), 2), rep(10^(10), 2), groups=c(rep(0, p/2), rep(1, p/2)), target=matrix(0, p, p), penalize.diag=FALSE, nInit=100, minSuccDiff=10^(-5)) # format the penalty matrix lambdaOptVec <- c(rep(lambdaOpt[1], p/2), rep(lambdaOpt[2], p/2)) lambdaOptMat <- outer(lambdaOptVec, lambdaOptVec, "+") # generalized ridge precision estimate Phat <- ridgePgen(S, lambdaOptMat, matrix(0, p, p))
# set dimension and sample size p <- 10 n <- 10 # penalty parameter matrix lambda <- matrix(1, p, p) diag(lambda) <- 0.1 # generate precision matrix Omega <- matrix(0.4, p, p) diag(Omega) <- 1 Sigma <- solve(Omega) # data Y <- mvtnorm::rmvnorm(n, mean=rep(0,p), sigma=Sigma) S <- cov(Y) # find optimal penalty parameters through cross-validation lambdaOpt <- optPenaltyPgen.kCVauto.groups(Y, rep(10^(-10), 2), rep(10^(10), 2), groups=c(rep(0, p/2), rep(1, p/2)), target=matrix(0, p, p), penalize.diag=FALSE, nInit=100, minSuccDiff=10^(-5)) # format the penalty matrix lambdaOptVec <- c(rep(lambdaOpt[1], p/2), rep(lambdaOpt[2], p/2)) lambdaOptMat <- outer(lambdaOptVec, lambdaOptVec, "+") # generalized ridge precision estimate Phat <- ridgePgen(S, lambdaOptMat, matrix(0, p, p))
Function that evaluates the ridge estimator of the inverse covariance matrix with multi-target shrinkage.
ridgePmultiT(S, lambda, targetList)
ridgePmultiT(S, lambda, targetList)
S |
Sample covariance |
lambda |
A |
targetList |
A list of semi-positive definite target matrices towards which the precision matrix is potentially shrunken. |
This function generalizes the ridgeP
-function in the sense that multiple shrinkage targets can be provided in the estimation of the precision matrix of a zero-mean multivariate normal distribution. Hence, it assumes that the data stem from . The estimator maximizes the following penalized loglikelihood:
where the sample covariance matrix,
the penalty parameters of each target matrix, and the
the precision matrix' shrinkage targets. For more details see van Wieringen et al. (2020).
The function returns a regularized inverse covariance matrix
.
W.N. van Wieringen.
van Wieringen, W.N., Stam, K.A., Peeters, C.F.W., van de Wiel, M.A. (2020), "Updating of the Gaussian graphical model through targeted penalized estimation", Journal of Multivariate Analysis, 178, Article 104621.
# set dimension and sample size p <- 10 n <- 10 # specify vector of penalty parameters lambda <- c(2, 1) # generate precision matrix T1 <- matrix(0.7, p, p) diag(T1) <- 1 T2 <- diag(rep(2, p)) # generate precision matrix Omega <- matrix(0.4, p, p) diag(Omega) <- 2 Sigma <- solve(Omega) # data Y <- mvtnorm::rmvnorm(n, mean=rep(0,p), sigma=Sigma) S <- cov(Y) # unpenalized diagonal estimate Phat <- ridgePmultiT(S, lambda, list(T1=T1, T2=T2))
# set dimension and sample size p <- 10 n <- 10 # specify vector of penalty parameters lambda <- c(2, 1) # generate precision matrix T1 <- matrix(0.7, p, p) diag(T1) <- 1 T2 <- diag(rep(2, p)) # generate precision matrix Omega <- matrix(0.4, p, p) diag(Omega) <- 2 Sigma <- solve(Omega) # data Y <- mvtnorm::rmvnorm(n, mean=rep(0,p), sigma=Sigma) S <- cov(Y) # unpenalized diagonal estimate Phat <- ridgePmultiT(S, lambda, list(T1=T1, T2=T2))
Estimation of the precision matrix from data with replicates through a ridge penalized EM (Expectation-Maximization) algorithm. It assumes a simple 'signal+noise' model, both random variables are assumed to be drawn from a multivariate normal distribution with their own unstructured precision matrix. These precision matrices are estimated.
ridgePrep(Y, ids, lambdaZ, lambdaE, targetZ=matrix(0, ncol(Y), ncol(Y)), targetE=matrix(0, ncol(Y), ncol(Y)), nInit=100, minSuccDiff=10^(-10))
ridgePrep(Y, ids, lambdaZ, lambdaE, targetZ=matrix(0, ncol(Y), ncol(Y)), targetE=matrix(0, ncol(Y), ncol(Y)), nInit=100, minSuccDiff=10^(-10))
Y |
Data |
ids |
A |
lambdaZ |
A positive |
lambdaE |
A positive |
targetZ |
A semi-positive definite target |
targetE |
A semi-positive definite target |
nInit |
A |
minSuccDiff |
A |
Data are assumed to originate from a design with replicates. Each observation with
(
) the
-th replicate of the
-th sample, is described by a ‘signal+noise’ model:
, where
and
represent the signal and noise, respectively. Each observation
follows a multivariate normal law of the form
, which results from the distributional assumptions of the signal and the noise,
and
, and their independence. The model parameters are estimated by means of a penalized EM algorithm that maximizes the loglikelihood augmented with the penalty
, in which
and
are the shrinkage targets of the signal and noise precision matrices, respectively. For more details see van Wieringen and Chen (2019).
The function returns the regularized inverse covariance list
-object with slots:
Pz |
The estimated signal precision matrix. |
Pz |
The estimated error precision matrix. |
penLL |
The penalized loglikelihood of the estimated model. |
W.N. van Wieringen.
van Wieringen, W.N., Chen, Y. (2021), "Penalized estimation of the Gaussian graphical model from data with replicates", Statistics in Medicine, 40(19), 4279-4293.
optPenaltyPrep.kCVauto
# set parameters p <- 10 Se <- diag(runif(p)) Sz <- matrix(3, p, p) diag(Sz) <- 4 # draw data n <- 100 ids <- numeric() Y <- numeric() for (i in 1:n){ Ki <- sample(2:5, 1) Zi <- mvtnorm::rmvnorm(1, sigma=Sz) for (k in 1:Ki){ Y <- rbind(Y, Zi + mvtnorm::rmvnorm(1, sigma=Se)) ids <- c(ids, i) } } # estimate Ps <- ridgePrep(Y, ids, 1, 1)
# set parameters p <- 10 Se <- diag(runif(p)) Sz <- matrix(3, p, p) diag(Sz) <- 4 # draw data n <- 100 ids <- numeric() Y <- numeric() for (i in 1:n){ Ki <- sample(2:5, 1) Zi <- mvtnorm::rmvnorm(1, sigma=Sz) for (k in 1:Ki){ Y <- rbind(Y, Zi + mvtnorm::rmvnorm(1, sigma=Se)) ids <- c(ids, i) } } # estimate Ps <- ridgePrep(Y, ids, 1, 1)
Estimation of precision matrices from data with replicates through a ridge penalized EM (Expectation-Maximization) algorithm. It assumes a simple 'signal+noise' model, both random variables are assumed to be drawn from a multivariate normal distribution with their own precision matrix. The signal precision matrix is unstructured, while the former is diagonal. These precision matrices are estimated.
ridgePrepEdiag(Y, ids, lambdaZ, targetZ=matrix(0, ncol(Y), ncol(Y)), nInit=100, minSuccDiff=10^(-10))
ridgePrepEdiag(Y, ids, lambdaZ, targetZ=matrix(0, ncol(Y), ncol(Y)), nInit=100, minSuccDiff=10^(-10))
Y |
Data |
ids |
A |
lambdaZ |
A positive |
targetZ |
A semi-positive definite target |
nInit |
A |
minSuccDiff |
A |
Data are assumed to originate from a design with replicates. Each observation with
(
) the
-th replicate of the
-th sample, is described by a ‘signal+noise’ model:
, where
and
represent the signal and noise, respectively. Each observation
follows a multivariate normal law of the form
, which results from the distributional assumptions of the signal and the noise,
and
with
diagonal, and their independence. The model parameters are estimated by means of a penalized EM algorithm that maximizes the loglikelihood augmented with the penalty
, in which
is the shrinkage target of the signal precision matrix. For more details see van Wieringen and Chen (2019).
The function returns the regularized inverse covariance list
-object with slots:
Pz |
The estimated signal precision matrix. |
Pe |
The estimated error precision matrix. |
penLL |
The penalized loglikelihood of the estimated model. |
W.N. van Wieringen.
van Wieringen, W.N., Chen, Y. (2021), "Penalized estimation of the Gaussian graphical model from data with replicates", Statistics in Medicine, 40(19), 4279-4293.
optPenaltyPrepEdiag.kCVauto
# set parameters p <- 10 Se <- diag(runif(p)) Sz <- matrix(3, p, p) diag(Sz) <- 4 # draw data n <- 100 ids <- numeric() Y <- numeric() for (i in 1:n){ Ki <- sample(2:5, 1) Zi <- mvtnorm::rmvnorm(1, sigma=Sz) for (k in 1:Ki){ Y <- rbind(Y, Zi + mvtnorm::rmvnorm(1, sigma=Se)) ids <- c(ids, i) } } # estimate Ps <- ridgePrepEdiag(Y, ids, 1)
# set parameters p <- 10 Se <- diag(runif(p)) Sz <- matrix(3, p, p) diag(Sz) <- 4 # draw data n <- 100 ids <- numeric() Y <- numeric() for (i in 1:n){ Ki <- sample(2:5, 1) Zi <- mvtnorm::rmvnorm(1, sigma=Sz) for (k in 1:Ki){ Y <- rbind(Y, Zi + mvtnorm::rmvnorm(1, sigma=Se)) ids <- c(ids, i) } } # estimate Ps <- ridgePrepEdiag(Y, ids, 1)