Title: | Fitting Bayesian Poisson Regression |
---|---|
Description: | Posterior sampling and inference for Bayesian Poisson regression models. The model specification makes use of Gaussian (or conditionally Gaussian) prior distributions on the regression coefficients. Details on the algorithm are found in D'Angelo and Canale (2023) <doi:10.1080/10618600.2022.2123337>. |
Authors: | Laura D'Angelo |
Maintainer: | Laura D'Angelo <[email protected]> |
License: | GPL (>= 2) |
Version: | 1.0.8 |
Built: | 2024-12-13 06:49:01 UTC |
Source: | CRAN |
This function is a method for class poisreg
. It prints convergence diagnostics and accuracy statistics of the MCMC output.
mcmc_diagnostics(object)
mcmc_diagnostics(object)
object |
object of class " |
The printed output of mcmc_diagnostics
summarizes some common convergence diagnostics for Markov chains.
The first part recaps the total length, burn-in and thinning used for the simulation.
The second part is a table with diagnostic statistics about each chain of the regression parameters. The first column is the effective sample size computed after removing the burn-in and thinning. The last two columns report the value and observed p-value of the Geweke test of equality of the first and last part of the chain.
The last part is printed only if multiple chains are computed. In this case, it reports the Gelman-Rubin statistics to test convergence to the same stationary distribution. Values much larger than 1 suggest lack of convergence to a common distribution.
mcmc_diagnostics
returns a list with elements:
chain_length
: total length of the MCMC chains.
len_burnin
: the length of the burn-in used to compute the estimates.
thin
: the thinning frequency used (from object
).
effSize
: effective sample size of each parameter chain after removing burn-in and thinning. See effectiveSize
.
geweke
: Geweke diagnostics of convergence of the chains (value of the test and p-value). See geweke.diag
gelman_rubin
: if nchains > 1
, Gelman-Rubin diagnostics of convergence. See gelman.diag
.
summary.poisreg
, plot.poisreg
,
merge_sim
, effectiveSize
, geweke.diag
, gelman.diag
# For examples see example(sample_bpr)
# For examples see example(sample_bpr)
This function is a method for class poisreg
. Merge multiple MCMC chains into a unique chain when sampling with nchains > 1
is used.
merge_sim(object)
merge_sim(object)
object |
object of class " |
The function returns an object of class poisreg
with a single element $sim
.
The returned chains (elements of sim
) are obtained by appending the simulated values of each independent chain,
under the assumption that they all have reached the same stationary distribution.
library(MASS) # load the data set head(epil) # Simulate multiple chains by setting nchains > 1 fit4 = sample_bpr( y ~ lbase*trt + lage + V4, data = epil, iter = 1000, nchains = 4, thin = 2) # fit4 contains 4 elements with simulation ($sim, $sim2, $sim3, $sim4) mcmc_diagnostics(fit4) # the Gelman-Rubin diagnostics confirms convergence of the 4 # independent chains to the same stationary distribution fit4b = merge_sim(fit4) str(fit4b$sim) # fit 4b contains only one element $sim, of length 1500 # (which is the result of concatenating the 4 simulations, after removing the first 25% # iterations as burn-in and keeping one iteration every two).
library(MASS) # load the data set head(epil) # Simulate multiple chains by setting nchains > 1 fit4 = sample_bpr( y ~ lbase*trt + lage + V4, data = epil, iter = 1000, nchains = 4, thin = 2) # fit4 contains 4 elements with simulation ($sim, $sim2, $sim3, $sim4) mcmc_diagnostics(fit4) # the Gelman-Rubin diagnostics confirms convergence of the 4 # independent chains to the same stationary distribution fit4b = merge_sim(fit4) str(fit4b$sim) # fit 4b contains only one element $sim, of length 1500 # (which is the result of concatenating the 4 simulations, after removing the first 25% # iterations as burn-in and keeping one iteration every two).
Plot Trace and Distribution of Regression Parameters
## S3 method for class 'poisreg' plot(x, ...)
## S3 method for class 'poisreg' plot(x, ...)
x |
object of class " |
... |
further arguments passed to or from other methods. |
The function calls plot.mcmc
on the matrix of sampled regression coefficients, and returns the trace of the sampled outputs and a density estimate for each variable in the chain.
This function is a method for class posterior_check
. Plot diagnostic statistics for graphical posterior predictive checks.
## S3 method for class 'posterior_check' plot(x, ...)
## S3 method for class 'posterior_check' plot(x, ...)
x |
object of class " |
... |
other parameters to be passed through to plotting functions. See Details. |
It is possible to generate additional plots that compare the posterior predictive distribution of a statistics with the observed value.
This is done through the parameter stats
: it is a list with elements the function names of the statistics one wants to compare.
Default is stats = list("mean")
, other possible values are, e.g., "median", "sd", "max" etc.
The function outputs (at least) three plots for graphical posterior predictive check.
The first plot compares the empirical cumulative distribution function (ECDF) with the cumulative distribution function obtained
with samples from the posterior predictive distribution (median and point-wise 95% credible bands).
The second plot compares the distribution of the observed sample with the predictive distribution obtained using the maximum a posteriori (MAP)
estimates of the regression parameters.
The third plot compares the predictive distribution of a statistic (default is the mean) with the observed value of the same statistics,
displayed with a red line.
library(MASS) # load the data set head(epil) fit = sample_bpr( y ~ lbase*trt + lage + V4, data = epil, iter = 1000) plot(posterior_predictive(fit), stats = c("mean", "sd", "max")) # plots for posterior predictive check
library(MASS) # load the data set head(epil) fit = sample_bpr( y ~ lbase*trt + lage + V4, data = epil, iter = 1000) plot(posterior_predictive(fit), stats = c("mean", "sd", "max")) # plots for posterior predictive check
This function is a method for class poisreg
. Compute the posterior predictive distribution and summary statistics for
posterior check of the model;
optionally, it also computes
the predictive distribution with new values of the explanatory variables.
posterior_predictive(object, new_X = NULL)
posterior_predictive(object, new_X = NULL)
object |
object of class " |
new_X |
(optional) a data frame in which to look for variables with which to predict. |
The call to this function returns an object of S3 class posterior_check
. The object is a list with the following elements:
data
: the component from object
(list with covariates X
and response variable y
).
y_pred
: matrix of dimension [n, iter]
(with n
sample size), each column is a draw from the posterior predictive distribution.
y_MAP_pred
: vector of length n
containing a draw from the posterior distribution obtained using the maximum a posteriori estimates (MAP) of the parameters.
diagnostics
: list containing 2 elements: CPO
, i.e. the Conditional Predictive Ordinate (Gelfand et al. 1992); and LPML
, i.e.
the logarithm of the pseudo-marginal likelihood (Ibrahim et al. 2014).
newdata
: if the matrix new_X
of new values of the covariates is provided, list of three elements:
new_X
: the provided matrix of explanatory variables;
y_newdata
: a matrix of dimension [nrow(new_X), iter]
, each column is a draw from the posterior predictive distribution using new_X
;
y_MAP_newdata
: vector of length nrow(new_X)
containing a draw from the posterior distribution obtained using the MAP estimate of the parameters,
computed on the new data new_X
.
perc_burnin
: the component from object
.
Gelfand, A., Dey, D. and Chang, H. (1992), Model determination using predictive distributions with implementation via sampling-based-methods (with discussion),
in ‘Bayesian Statistics 4’, University Press.
Ibrahim, J. G., Chen, M.H. and Sinha, D. (2014), Bayesian Survival Analysis, American Cancer Society.
The function generates draws from the posterior distribution of the coefficients of Poisson regression models. The method allows for Gaussian and horseshoe (Carvalho et al, 2010) prior distributions, and relies on a Metropolis-Hastings or importance sampler algorithm.
sample_bpr( formula = NULL, data = NULL, iter, burnin = NULL, prior = list(type = "gaussian", b = NULL, B = NULL, tau = NULL), pars = list(method = "MH", max_dist = 50, max_r = NULL, max_dist_burnin = 1e+06), state = NULL, thin = 1, verbose = TRUE, seed = NULL, nchains = 1, perc_burnin = 0.25 )
sample_bpr( formula = NULL, data = NULL, iter, burnin = NULL, prior = list(type = "gaussian", b = NULL, B = NULL, tau = NULL), pars = list(method = "MH", max_dist = 50, max_r = NULL, max_dist_burnin = 1e+06), state = NULL, thin = 1, verbose = TRUE, seed = NULL, nchains = 1, perc_burnin = 0.25 )
formula |
an object of class "formula": a symbolic description of the model to be fitted. |
data |
data frame or matrix containing the variables in the model. |
iter |
number of algorithm iterations. |
burnin |
(optional) a positive integer specifying the length of the burn-in.
If a value > 1 is provided, the first |
prior |
a named list of parameters to select prior type and parameters, with arguments:
|
pars |
a named list of parameters to select algorithm type and tuning parameters, with arguments:
|
state |
optional vector providing the starting points of the chains. |
thin |
a positive integer specifying the period for saving samples. The default is 1. |
verbose |
logical (default = TRUE) indicating whether to print messages on the progress of the algorithm and possible convergence issues. |
seed |
(optional) positive integer: the seed of random number generator. |
nchains |
(optional) positive integer specifying the number of Markov chains. The default is 1. |
perc_burnin |
(default = 0.25) percentage of the chain to be discarded to perform inference. If both burnin and perc_burnin are specified, the most conservative burn-in is considered. |
This function fits a Bayesian Poisson regression model with Gaussian prior distributions on the regression coefficients:
where is a size
vector of counts and
is a
matrix of coefficients; and
has a Gaussian distribution (possibly conditionally on some parameters).
Specifically, the function allows for informative Gaussian prior distribution on the parameters,
i.e. , and for a horseshoe prior distribution (Carvalho et al, 2010).
The horseshoe prior is a scale mixture of normals, which is typically used in high-dimension settings to induce sparsity and
regularization of the coefficients.
The implemented Metropolis-Hastings and importance sampler exploit as proposal density a multivariate Gaussian approximation of the posterior distribution. Such proposal is based on the convergence of the negative binomial distribution to the Poisson distribution and on the Polya-gamma data augmentation of Polson et al. (2013).
The output of the sampling is an object of class poisreg
and admits class-specific methods to perform inference.
The function summary.poisreg
can be used to obtain or print a summary of the results and of the algorithm diagnostics.
The function mcmc_diagnostics
can be used to obtain or print convergence diagnostics for the sampled chains.
The function plot.poisreg
prints the trace of the sampled values and a density estimate of the regression coefficients.
See plot.mcmc
.
The function posterior_predictive
can be used to compute the posterior predictive distributions to check the model.
See also the related function plot.ppc
.
An object of S3 class poisreg
containing the results of the sampling. poisreg
is a list containing at least the following elements:
sim
: list of the results of the sampling. It contains the following elements:
beta
: mcmc
object of posterior draws of the regression coefficients.
r
: the sequence of adaptive tuning parameters used in each iteration.
time
: the total amount of time to perform the simulation.
formula
: the formula
object used.
data
: list with elements the matrix of covariates X
and response variable y
.
state
: the starting points of the chain.
burnin
: length of the used burn-in.
prior
: whether a Gaussian or horseshoe prior was used.
prior_pars
: prior parameters.
thin
: thinning frequency passed to the thin
parameter.
nchains
: number of chains. If nchains
was chosen >1, the output list will also include additional
numbered sim
elements, one for each sampled chain.
perc_burnin
: percentage of the chain used as burn-in.
Carvalho, C., Polson, N., & Scott, J. (2010). The horseshoe estimator for sparse signals. Biometrika, 97(2), 465-480.
van der Pas, S., Szabo, B. and van der Vaart, A. (2017), Adaptive posterior contractionrates for the horseshoe, Electronic Journal of Statistics, 11(2), 3196-3225.
summary.poisreg
, mcmc_diagnostics
, plot.poisreg
,
merge_sim
, posterior_predictive
require(MASS) # load the data set head(epil) fit = sample_bpr( y ~ lbase*trt + lage + V4, data = epil, iter = 1000) summary(fit) # summary of posterior inference mcmc_diagnostics(fit) # summary of MCMC convergence diagnostics plot(fit) ## Examples with different options # Select prior parameters and set tuning parameter fit2 = sample_bpr( y ~ lbase*trt + lage + V4, data = epil, iter = 1000, prior = list( type = "gaussian", b = rep(0, 6), B = diag(6) * 3 ), pars = list( max_dist = 10 )) # Simulate multiple chains and merge outputs after checking convergence fit3 = sample_bpr( y ~ lbase*trt + lage + V4, data = epil, iter = 1000, nchains = 4, thin = 2) # fit3 now contains additional elements ($sim2, $sim3, $sim4) mcmc_diagnostics(fit3) # the Gelman-Rubin diagnostics confirms convergence of the 4 # independent chains to the same stationary distribution fit3b = merge_sim(fit3) str(fit3b$sim) # fit 3b contains only one MCMC chain of length 1500 # (after thinning and burn-in) ## introduce more variables and use regularization epil2 <- epil[epil$period == 1, ] epil2["period"] <- rep(0, 59); epil2["y"] <- epil2["base"] epil["time"] <- 1; epil2["time"] <- 4 epil2 <- rbind(epil, epil2) epil2$pred <- unclass(epil2$trt) * (epil2$period > 0) epil2$subject <- factor(epil2$subject) epil3 <- aggregate(epil2, list(epil2$subject, epil2$period > 0), function(x) if(is.numeric(x)) sum(x) else x[1]) epil3$pred <- factor(epil3$pred, labels = c("base", "placebo", "drug")) contrasts(epil3$pred) <- structure(contr.sdif(3), dimnames = list(NULL, c("placebo-base", "drug-placebo"))) fit4 = sample_bpr(y ~ pred + factor(subject), data = epil3, pars = list(max_dist = 0.3), prior = list(type = "horseshoe", tau = 2), iter = 3000, burnin = 1000) summary(fit4) mcmc_diagnostics(fit4) plot(posterior_predictive(fit4), stats = c("mean", "sd", "max"))
require(MASS) # load the data set head(epil) fit = sample_bpr( y ~ lbase*trt + lage + V4, data = epil, iter = 1000) summary(fit) # summary of posterior inference mcmc_diagnostics(fit) # summary of MCMC convergence diagnostics plot(fit) ## Examples with different options # Select prior parameters and set tuning parameter fit2 = sample_bpr( y ~ lbase*trt + lage + V4, data = epil, iter = 1000, prior = list( type = "gaussian", b = rep(0, 6), B = diag(6) * 3 ), pars = list( max_dist = 10 )) # Simulate multiple chains and merge outputs after checking convergence fit3 = sample_bpr( y ~ lbase*trt + lage + V4, data = epil, iter = 1000, nchains = 4, thin = 2) # fit3 now contains additional elements ($sim2, $sim3, $sim4) mcmc_diagnostics(fit3) # the Gelman-Rubin diagnostics confirms convergence of the 4 # independent chains to the same stationary distribution fit3b = merge_sim(fit3) str(fit3b$sim) # fit 3b contains only one MCMC chain of length 1500 # (after thinning and burn-in) ## introduce more variables and use regularization epil2 <- epil[epil$period == 1, ] epil2["period"] <- rep(0, 59); epil2["y"] <- epil2["base"] epil["time"] <- 1; epil2["time"] <- 4 epil2 <- rbind(epil, epil2) epil2$pred <- unclass(epil2$trt) * (epil2$period > 0) epil2$subject <- factor(epil2$subject) epil3 <- aggregate(epil2, list(epil2$subject, epil2$period > 0), function(x) if(is.numeric(x)) sum(x) else x[1]) epil3$pred <- factor(epil3$pred, labels = c("base", "placebo", "drug")) contrasts(epil3$pred) <- structure(contr.sdif(3), dimnames = list(NULL, c("placebo-base", "drug-placebo"))) fit4 = sample_bpr(y ~ pred + factor(subject), data = epil3, pars = list(max_dist = 0.3), prior = list(type = "horseshoe", tau = 2), iter = 3000, burnin = 1000) summary(fit4) mcmc_diagnostics(fit4) plot(posterior_predictive(fit4), stats = c("mean", "sd", "max"))
This function is a method for class poisreg
. It prints summary statistics and returns posterior estimates of regression quantities.
## S3 method for class 'poisreg' summary(object, ...) ## S3 method for class 'poisreg' print(x, ...)
## S3 method for class 'poisreg' summary(object, ...) ## S3 method for class 'poisreg' print(x, ...)
object |
object of class " |
... |
further arguments passed to or from other methods. |
x |
object of class " |
The printed output of summary.poisreg
summarizes the main quantities of the fit.
The first component Call
recaps the type of prior and algorithm used.
Coefficients
is a table of estimated quantities for the regression parameters. The first three columns report the estimated posterior mean,
standard errors and medians. The last two columns correspond to the lower and upper bounds of the 0.95 credible intervals.
If the credible interval does not include zero, a star is printed in correspondence of each parameter
(similarly to the 'significance stars' of summary.lm
).
All the estimates are computed discarding the first part of the chain as burn-in (more details are printed in the Algorithm
section).
Algorithm
briefly summarizes the main diagnostics of convergence and efficiency of the algorithm.
It prints the number of iterations actually used to obtain the estimates, after removing the burn-in and thinning.
If a Metropolis-Hastings algorithm is used, the summary reports the acceptance rate,
which is the most commonly used indicator to tune the performance of the algorithm, along with the mean effective sample size
(averaged over all parameters).
If the importance sampler is used, the summary only reports the effective sample size, which is computed as
(where
is the sequence of weights) and is a measure of the efficiency of the sampler.
summary.poisreg
returns a list with elements:
formula
: the component from object
.
data
: list with elements the matrix of covariates X
and response variable y
.
prior
: prior$type
from object
.
prior_pars
: prior parameters from object
.
coefficients
: the matrix of coefficient estimantes, standard errors and 95% credible intervals.
psi2
: if a horseshoe prior is selected, the estimate of the local shrinkage parameter.
len_burnin
: the length of the burn-in used to compute the estimates.
effSize
: the mean effective sample size of the chains used to compute the estimates.
# For examples see example(sample_bpr)
# For examples see example(sample_bpr)