Title: | L1 (Lasso and Fused Lasso) and L2 (Ridge) Penalized Estimation in GLMs and in the Cox Model |
---|---|
Description: | Fitting possibly high dimensional penalized regression models. The penalty structure can be any combination of an L1 penalty (lasso and fused lasso), an L2 penalty (ridge) and a positivity constraint on the regression coefficients. The supported regression models are linear, logistic and Poisson regression and the Cox Proportional Hazards model. Cross-validation routines allow optimization of the tuning parameters. |
Authors: | Jelle Goeman, Rosa Meijer, Nimisha Chaturvedi, Matthew Lueder |
Maintainer: | Jelle Goeman <[email protected]> |
License: | GPL (>= 2) |
Version: | 0.9-52 |
Built: | 2024-12-29 08:37:16 UTC |
Source: | CRAN |
Stores one or more survival stepfunctions of the Kaplan-Meier and Breslow type.
Breslow objects are generally created by the penalized
function and related functions.
time
:Object of class "vector". The time points at which the function starts, jumps or ends.
curves
:Object of class "matrix". The values of the curve at each time point. Note that the function is left-continuous. Curves are stored as a matrix of dimension (\# curves) x (\# timepoints).
(breslow): Coerces the object to a matrix (as as.matrix) and selects rows and/or columns.
(breslow): Selects a subset of the curves.
(breslow): Returns the "curves" slot together with the "time" slot in a data.frame
.
(breslow): Coerces the object to a list.
(breslow): Returns the "curves" slot with the "time" slot as column names.
(breslow): Straightforward plotting (all curves in one plot).
(breslow): Summarizes the dimensions of the object.
(breslow): Takes a second argument (time
). Extracts the value of the survival curve(s) at any time point.
Jelle Goeman: [email protected]
Cross-validating generalized linear models with L1 (lasso or fused lasso) and/or L2 (ridge) penalties, using likelihood cross-validation.
cvl (response, penalized, unpenalized, lambda1 = 0, lambda2= 0, positive = FALSE, fusedl = FALSE, data, model = c("cox", "logistic", "linear", "poisson"), startbeta, startgamma, fold, epsilon = 1e-10, maxiter, standardize = FALSE, trace = TRUE, approximate = FALSE) optL1 (response, penalized, unpenalized, minlambda1, maxlambda1, base1, lambda2 = 0, fusedl = FALSE, positive = FALSE, data, model = c("cox", "logistic", "linear", "poisson"), startbeta, startgamma, fold, epsilon = 1e-10, maxiter = Inf, standardize = FALSE, tol = .Machine$double.eps^0.25, trace = TRUE) optL2 (response, penalized, unpenalized, lambda1 = 0, minlambda2, maxlambda2, base2, fusedl = FALSE ,positive = FALSE, data, model = c("cox", "logistic", "linear", "poisson"), startbeta, startgamma, fold, epsilon = 1e-10, maxiter, standardize = FALSE, tol = .Machine$double.eps^0.25, trace = TRUE, approximate = FALSE) profL1 (response, penalized, unpenalized, minlambda1, maxlambda1, base1, lambda2 = 0, fusedl = FALSE,positive = FALSE, data, model = c("cox", "logistic", "linear", "poisson"), startbeta, startgamma, fold, epsilon = 1e-10, maxiter = Inf, standardize = FALSE, steps = 100, minsteps = steps/3, log = FALSE, save.predictions = FALSE, trace = TRUE, plot = FALSE) profL2 (response, penalized, unpenalized, lambda1 = 0, minlambda2, maxlambda2, base2, fusedl = FALSE,positive = FALSE, data, model = c("cox", "logistic", "linear", "poisson"), startbeta, startgamma, fold, epsilon = 1e-10, maxiter, standardize = FALSE, steps = 100, minsteps = steps/2, log = TRUE, save.predictions = FALSE, trace = TRUE, plot = FALSE, approximate = FALSE)
cvl (response, penalized, unpenalized, lambda1 = 0, lambda2= 0, positive = FALSE, fusedl = FALSE, data, model = c("cox", "logistic", "linear", "poisson"), startbeta, startgamma, fold, epsilon = 1e-10, maxiter, standardize = FALSE, trace = TRUE, approximate = FALSE) optL1 (response, penalized, unpenalized, minlambda1, maxlambda1, base1, lambda2 = 0, fusedl = FALSE, positive = FALSE, data, model = c("cox", "logistic", "linear", "poisson"), startbeta, startgamma, fold, epsilon = 1e-10, maxiter = Inf, standardize = FALSE, tol = .Machine$double.eps^0.25, trace = TRUE) optL2 (response, penalized, unpenalized, lambda1 = 0, minlambda2, maxlambda2, base2, fusedl = FALSE ,positive = FALSE, data, model = c("cox", "logistic", "linear", "poisson"), startbeta, startgamma, fold, epsilon = 1e-10, maxiter, standardize = FALSE, tol = .Machine$double.eps^0.25, trace = TRUE, approximate = FALSE) profL1 (response, penalized, unpenalized, minlambda1, maxlambda1, base1, lambda2 = 0, fusedl = FALSE,positive = FALSE, data, model = c("cox", "logistic", "linear", "poisson"), startbeta, startgamma, fold, epsilon = 1e-10, maxiter = Inf, standardize = FALSE, steps = 100, minsteps = steps/3, log = FALSE, save.predictions = FALSE, trace = TRUE, plot = FALSE) profL2 (response, penalized, unpenalized, lambda1 = 0, minlambda2, maxlambda2, base2, fusedl = FALSE,positive = FALSE, data, model = c("cox", "logistic", "linear", "poisson"), startbeta, startgamma, fold, epsilon = 1e-10, maxiter, standardize = FALSE, steps = 100, minsteps = steps/2, log = TRUE, save.predictions = FALSE, trace = TRUE, plot = FALSE, approximate = FALSE)
response |
The response variable (vector). This should be a numeric vector for linear regression, a |
penalized |
The penalized covariates. These may be specified either as a matrix or as a (one-sided) |
unpenalized |
Additional unpenalized covariates. Specified as under |
lambda1 , lambda2
|
The fixed values of the tuning parameters for L1 and L2 penalization. Each must be either a single positive numbers or a vector with length equal to the number of covariates in |
minlambda1 , minlambda2 , maxlambda1 , maxlambda2
|
The values of the tuning parameters for L1 or L2 penalization between which the cross-validated likelihood is to be profiled or optimized. For fused lasso penalty |
base1 , base2
|
An optional vector of length equal to the number of covariates in penalized. If supplied, profiling or optimization is performed between |
fusedl |
If |
positive |
If |
data |
A |
model |
The model to be used. If missing, the model will be guessed from the |
startbeta |
Starting values for the regression coefficients of the penalized covariates. These starting values will be used only for the first values of |
startgamma |
Starting values for the regression coefficients of the unpenalized covariates. These starting values will be used only for the first values of |
fold |
The fold for cross-validation. May be supplied as a single number (between 2 and n) giving the number of folds, or, alternatively, as a length |
epsilon |
The convergence criterion. As in |
maxiter |
The maximum number of iterations allowed in each fitting of the model. Set by default at 25 when only an L2 penalty is present, infinite otherwise. |
standardize |
If |
steps |
The maximum number of steps between |
minsteps |
The minimum number of steps between |
log |
If |
tol |
The tolerance of the Brent algorithm used for minimization. See also |
save.predictions |
Controls whether or not to save cross-validated predictions for all values of lambda. |
trace |
If |
approximate |
If |
plot |
If |
All five functions return a list with the following named elements:
lambda
:For optL1
and optL2
lambda
gives the optimal value of the tuning parameters found. For profL1
and profL2
lambda
is the vector of values of the tuning parameter for which the cross-validated likelihood has been calculated. Absent in the output of cvl
.
cvl
:The value(s) of the cross-validated likelihood. For optL1
, optL2
this is the cross-validated likelihood at the optimal value of the tuning parameter.
fold
:Returns the precise allocation of the subjects into the cross-validation folds. Note that the same allocation is used for all cross-validated likelihood calculations in each call to optL1
, optL2
, profL1
, profL2
.
predictions
:The cross-validated predictions for the left-out samples. The precise format of the cross-validated predictions depends on the type of generalized linear model (see breslow
for survival models. The functions profL1
and profL2
return a list here (only if save.predictions = TRUE
), whereas optL1
, optL2
return the predictions for the optimal value of the tuning parameter only.
fullfit
:The fitted model on the full data. The functions profL1
and profL2
return a list of penfit
objects here, whereas optL1
, optL2
return the full data fit (a single penfit
object) for the optimal value of the tuning parameter only.
A named list. See details.
The optL1
and optL2
functions use Brent's algorithm for minimization without derivatives (see also optimize
). There is a risk that these functions converge to a local instead of to a global optimum. This is especially the case for optL1
, as the cross-validated likelihood as a function of lambda1
quite often has local optima. It is recommended to use optL1
in combination with profL1
to check whether optL1
has converged to the right optimum.
See also the notes under penalized
.
Jelle Goeman: [email protected]
Goeman J.J. (2010). L-1 Penalized Estimation in the Cox Proportional Hazards Model. Biometrical Journal 52 (1) 70-84.
# More examples in the package vignette: # type vignette("penalized") data(nki70) attach(nki70) # Finding an optimal cross-validated likelihood opt <- optL1(Surv(time, event), penalized = nki70[,8:77], fold = 5) coefficients(opt$fullfit) plot(opt$predictions) # Plotting the profile of the cross-validated likelihood prof <- profL1(Surv(time, event), penalized = nki70[,8:77], fold = opt$fold, steps=10) plot(prof$lambda, prof$cvl, type="l") plotpath(prof$fullfit)
# More examples in the package vignette: # type vignette("penalized") data(nki70) attach(nki70) # Finding an optimal cross-validated likelihood opt <- optL1(Surv(time, event), penalized = nki70[,8:77], fold = 5) coefficients(opt$fullfit) plot(opt$predictions) # Plotting the profile of the cross-validated likelihood prof <- profL1(Surv(time, event), penalized = nki70[,8:77], fold = opt$fold, steps=10) plot(prof$lambda, prof$cvl, type="l") plotpath(prof$fullfit)
A data.frame
of 144 lymph node positive breast cancer patients on metastasis-free survival, 5 clinical risk factors, and gene expression measurements of 70 genes found to be prognostic for metastasis-free survival in an earlier study. The included variables are
Metastasis-free follow-up time (months).
Event indicator. 1 = metastasis or death; 0 = censored.
Diameter of the tumor (two levels).
Number of affected lymph nodes (two levels).
Estrogen receptor status (two levels).
Grade of the tumor (three ordered levels).
Patient age at diagnosis (years).
Gene expression measurements of 70 prognostic genes.
data(nki70)
data(nki70)
A data.frame
.
The 70 gene signature was trained on lymph node negative patients only.
M.J. van de Vijver, Y.D. He, L.J. van 't Veer, H. Dai, A.A.M. Hart, D.W. Voskuil, G.J. Schreiber, J.L. Peterse, C. Roberts, M.J. Marton, M. Parrish, D. Atsma, A. Witteveen, A. Glas, L. Delahaye, T. van der Velde, H. Bartelink, S. Rodenhuis, E.T. Rutgers, S.H. Friend, and R. Bernards (2002). A gene-expression signature as a predictor of survival in breast cancer. New England Journal of Medicine 347 (25), 1999–2009.
Fitting generalized linear models with L1 (lasso and fused lasso) and/or L2 (ridge) penalties, or a combination of the two.
penalized (response, penalized, unpenalized, lambda1=0, lambda2=0, positive = FALSE, data, fusedl=FALSE, model = c("cox", "logistic", "linear", "poisson"), startbeta, startgamma, steps =1, epsilon = 1e-10, maxiter, standardize = FALSE, trace = TRUE)
penalized (response, penalized, unpenalized, lambda1=0, lambda2=0, positive = FALSE, data, fusedl=FALSE, model = c("cox", "logistic", "linear", "poisson"), startbeta, startgamma, steps =1, epsilon = 1e-10, maxiter, standardize = FALSE, trace = TRUE)
response |
The response variable (vector). This should be a numeric vector for linear regression, a |
penalized |
The penalized covariates. These may be specified either as a matrix or as a (one-sided) |
unpenalized |
Additional unpenalized covariates. Specified as under |
lambda1 , lambda2
|
The tuning parameters for L1 and L2 penalization. Each must be either a single positive numbers or a vector with length equal to the number of covariates in |
positive |
If |
data |
A |
fusedl |
If |
model |
The model to be used. If missing, the model will be guessed from the |
startbeta |
Starting values for the regression coefficients of the penalized covariates. |
startgamma |
Starting values for the regression coefficients of the unpenalized covariates. |
steps |
If greater than 1, the algorithm will fit the model for a range of |
epsilon |
The convergence criterion. As in |
maxiter |
The maximum number of iterations allowed. Set by default at 25 when only an L2 penalty is present, infinite otherwise. |
standardize |
If |
trace |
If |
The penalized
function fits regression models for a given combination of L1 and L2 penalty parameters.
penalized
returns a penfit
object when steps = 1
or a list of such objects if steps > 1
.
The response
argument of the function also accepts formula input as in lm
and related functions. In that case, the right hand side of the response
formula is used as the penalized
argument or, if that is already given, as the unpenalized
argument. For example, the input penalized(y~x)
is equivalent to penalized(y, ~x)
and penalized(y~x, ~z)
to penalized(y, ~z, ~x)
.
In case of tied survival times, the function uses Breslow's version of the partial likelihood.
Jelle Goeman: [email protected]
Goeman J.J. (2010). L-1 Penalized Estimation in the Cox Proportional Hazards Model. Biometrical Journal 52 (1) 70-84.
penfit
for the penfit
object returned, plotpath
for plotting the solution path, and cvl
for cross-validation and
optimizing the tuning parameters.
# More examples in the package vignette: # type vignette("penalized") data(nki70) # A single lasso fit predicting survival pen <- penalized(Surv(time, event), penalized = nki70[,8:77], unpenalized = ~ER+Age+Diam+N+Grade, data = nki70, lambda1 = 10) show(pen) coefficients(pen) coefficients(pen, "penalized") basehaz(pen) # A single lasso fit using the clinical risk factors pen <- penalized(Surv(time, event), penalized = ~ER+Age+Diam+N+Grade, data = nki70, lambda1=10, standardize=TRUE) # using steps pen <- penalized(Surv(time, event), penalized = nki70[,8:77], data = nki70, lambda1 = 1,steps = 20) plotpath(pen) # A fused lasso fit predicting survival pen <- penalized(Surv(time, event), penalized = nki70[,8:77], data = nki70, lambda1 = 1, lambda2 = 2, fusedl = TRUE) plot(coefficients(pen, "all"),type="l",xlab = "probes",ylab = "coefficient value") plot(predict(pen,penalized=nki70[,8:77]))
# More examples in the package vignette: # type vignette("penalized") data(nki70) # A single lasso fit predicting survival pen <- penalized(Surv(time, event), penalized = nki70[,8:77], unpenalized = ~ER+Age+Diam+N+Grade, data = nki70, lambda1 = 10) show(pen) coefficients(pen) coefficients(pen, "penalized") basehaz(pen) # A single lasso fit using the clinical risk factors pen <- penalized(Surv(time, event), penalized = ~ER+Age+Diam+N+Grade, data = nki70, lambda1=10, standardize=TRUE) # using steps pen <- penalized(Surv(time, event), penalized = nki70[,8:77], data = nki70, lambda1 = 1,steps = 20) plotpath(pen) # A fused lasso fit predicting survival pen <- penalized(Surv(time, event), penalized = nki70[,8:77], data = nki70, lambda1 = 1, lambda2 = 2, fusedl = TRUE) plot(coefficients(pen, "all"),type="l",xlab = "probes",ylab = "coefficient value") plot(predict(pen,penalized=nki70[,8:77]))
Contrast functions for factors that are appropriate for penalized regression.
contr.none(n, contrasts) contr.diff(n, contrasts = TRUE)
contr.none(n, contrasts) contr.diff(n, contrasts = TRUE)
n |
A vector of levels for a factor, or the number of levels. |
contrasts |
A logical indicating whether contrasts should be computed. This
argument is ignored in |
These functions are used for creating contrast matrices for use in fitting penalized analysis of variance and regression models. The columns of the resulting matrices contain contrasts which can be used for coding a factor with n levels. The returned value contains the computed contrasts.
contr.none
returns an identity matrix except when the number of levels is 2,
in which case it returns a single contrast. contr.none
ensures that all levels
of an unordered factor are treated symmetrically in a penalized regression model.
contr.diff
returns a lower triangular matrix of ones if contrasts=FALSE
and the same matrix without its first column if contrasts=TRUE
. This makes sure
that penalization is done on the difference between successive levels of an ordered factor.
It is not appropriate for unordered factors.
contr.diff
returns a matrix with n
rows and k
columns, with
k=n-1
if contrasts is TRUE
and k=n
if contrasts is FALSE
.
contr.none
returns a matrix with n
rows and n
columns, except when n=2
when it returns a matrix with 2 rows and one column.
Jelle Goeman: [email protected]
penalized
, contr.treatment
, contr.poly
, contr.helmert
,
contr.SAS
, contr.sum
.
# Three levels levels <- LETTERS[1:3] contr.none(levels) contr.diff(levels) # Two levels levels <- LETTERS[1:2] contr.none(levels)
# Three levels levels <- LETTERS[1:3] contr.none(levels) contr.diff(levels) # Two levels levels <- LETTERS[1:2] contr.none(levels)
Stores the result of a call to penalized
.
penalized
:Object of class "vector". Regression coefficients for the penalized covariates.
unpenalized
:Object of class "vector". Regression coefficients for the unpenalized covariates.
residuals
:Object of class "vector". Unstandardized residuals for each subject in the fitted model. Martingale residuals are given for the cox model.
fitted
:Object of class "vector". Fitted values (means) for each subject in the fitted model. In the cox model, this slot holds the relative risks.
lin.pred
:Object of class "vector". Linear predictors for each subject in the fitted model.
loglik
:Object of class "numeric". Log likelihood of the fitted model. For the Cox model, reports the full likelihood rather than the partial likelihood.
penalty
:Object of class "vector". L1 and L2 penalties of the fitted model.
iterations
:Object of class "numeric". Number of iterations used in the fitting process.
converged
:Object of class "logical". Whether the fitting process was judged to be converged.
model
:Object of class "character". The name of the generalized linear model used.
lambda1
:Object of class "vector". The lambda1 parameter(s) used.
lambda2
:Object of class "vector". The lambda2 parameter(s) used.
nuisance
:Object of class "list". The maximum likelihood estimates of any nuisance parameters in the model.
weights
:Object of class "vector". The weights of the covariates used for standardization.
formula
:Object of class "list". A named list containing the unpenalized and penalized formula objects, if present.
(penfit): Returns the baseline hazard (a data.frame
) if a cox model was fitted, NULL
otherwise. An additional argument center
(default (TRUE
) can be used to give the survival curve at the covariate mean (center = TRUE
) rather than at zero.
(penfit): Returns the baseline survival curve (a breslow
object) if a cox model was fitted, NULL
otherwise. An additional argument center
(default (TRUE
) can be used to give the survival curve at the covariate mean (center = TRUE
) rather than at zero.
(penfit): Returns the regression coefficients. Accepts a second argument "which", that takes values "nonzero" (the default), "all", "penalized" or "unpenalized" for extracting only the non-zero, the penalized or the unpenalized regression coefficients. A third argument "standardize" (default FALSE) can be used to let the method return the regression coefficients for the standardized covariates.
(penfit): synonym for coef
above.
(penfit): Returns the fitted values for each subject (i.e. the predicted means). In the Cox model, this method returns the relative risks for each individual.
(penfit): synonym for fitted
above.
(penfit): Returns the linear predictors for each subject.
(penfit): Returns the log likelihood of the fitted model.
(penfit): Returns the L1 and L2 penalties of the fitted model.
(penfit): Returns the residuals.
(penfit): Summarizes the fitted model.
(penfit): Returns the weights used for standardization.
(penfit): Calculates predictions for new subjects. See predict
.
Jelle Goeman: [email protected]
Plotting a LASSO path fitted using penalized
with steps > 1
.
plotpath(object, labelsize = 0.6, standardize = FALSE, ...)
plotpath(object, labelsize = 0.6, standardize = FALSE, ...)
object |
A |
labelsize |
Sets the size of the variable labels in the plot. Set to zero for no variable labels. |
standardize |
If |
... |
Any other arguments will be forwarded to the plot function. |
Jelle Goeman: [email protected]
data(nki70) # Fit the model with the steps argument and plot pen <- penalized(Surv(time, event), penalized = nki70[,8:77], data = nki70, lambda1=1, steps = 20) plotpath(pen)
data(nki70) # Fit the model with the steps argument and plot pen <- penalized(Surv(time, event), penalized = nki70[,8:77], data = nki70, lambda1=1, steps = 20) plotpath(pen)
Predicting a response for new subjects based on a fitted penalized regression model.
## S4 method for signature 'penfit' predict(object, penalized, unpenalized, data)
## S4 method for signature 'penfit' predict(object, penalized, unpenalized, data)
object |
The fitted model (a |
penalized |
The matrix of penalized covariates for the new subjects. |
unpenalized |
The unpenalized covariates for the new subjects. |
data |
A |
The user need only supply those terms from the original call that are different relative to the original call that produced the penfit
object. In particular, if penalized and/or unpenalized was specified in matrix form, a matrix must be given with the new subjects' data. The columns of these matrices must be exactly the same as in the matrices supplied in the original call that produced the penfit
object. If either penalized or unpenalized was given as a formula
in the original call, the user of predict
must supply a new data
argument. As with matrices, the new data
argument must have a similar make-up as the data
argument in the original call that produced the penfit
object. In particular, any factors in data
must have the same levels.
The predictions, either as a vector
(logistic and Poisson models), a matrix
(linear model), or a breslow
object (Cox model).
data(nki70) pen <- penalized(Surv(time, event), penalized = nki70[1:50,8:77], unpenalized = ~ER+Age+Diam+N+Grade, data = nki70[1:50,], lambda1 = 10) predict(pen, nki70[51:52,8:77], data = nki70[51:52,])
data(nki70) pen <- penalized(Surv(time, event), penalized = nki70[1:50,8:77], unpenalized = ~ER+Age+Diam+N+Grade, data = nki70[1:50,], lambda1 = 10) predict(pen, nki70[51:52,8:77], data = nki70[51:52,])