Title: | Spatially Varying Change Points |
---|---|
Description: | Implements a spatially varying change point model with unique intercepts, slopes, variance intercepts and slopes, and change points at each location. Inference is within the Bayesian setting using Markov chain Monte Carlo (MCMC). The response variable can be modeled as Gaussian (no nugget), probit or Tobit link and the five spatially varying parameter are modeled jointly using a multivariate conditional autoregressive (MCAR) prior. The MCAR is a unique process that allows for a dissimilarity metric to dictate the local spatial dependencies. Full details of the package can be found in the accompanying vignette. Furthermore, the details of the package can be found in the corresponding paper on arXiv by Berchuck et al (2018): "A spatially varying change points model for monitoring glaucoma progression using visual field data", <arXiv:1811.11038>. |
Authors: | Samuel I. Berchuck [aut, cre] |
Maintainer: | Samuel I. Berchuck <[email protected]> |
License: | GPL (>= 2) |
Version: | 1.3 |
Built: | 2024-12-04 07:32:24 UTC |
Source: | CRAN |
Calculates diagnostic metrics using output from the spCP
model.
diagnostics( obj, diags = c("dic", "dinf", "waic"), keepDeviance = FALSE, keepPPD = FALSE )
diagnostics( obj, diags = c("dic", "dinf", "waic"), keepDeviance = FALSE, keepPPD = FALSE )
obj |
A |
diags |
A vector of character strings indicating the diagnostics to compute. Options include: Deviance Information Criterion ("dic"), d-infinity ("dinf") and Watanabe-Akaike information criterion ("waic"). At least one option must be included. Note: The probit model cannot compute the DIC or WAIC diagnostics due to computational issues with computing the multivariate normal CDF. |
keepDeviance |
A logical indicating whether the posterior deviance distribution is returned (default = FALSE). |
keepPPD |
A logical indicating whether the posterior predictive distribution at each observed location is returned (default = FALSE). |
To assess model fit, DIC, d-infinity and WAIC are used. DIC is based on the deviance statistic and penalizes for the complexity of a model with an effective number of parameters estimate pD (Spiegelhalter et al 2002). The d-infinity posterior predictive measure is an alternative diagnostic tool to DIC, where d-infinity=P+G. The G term decreases as goodness of fit increases, and P, the penalty term, inflates as the model becomes over-fit, so small values of both of these terms and, thus, small values of d-infinity are desirable (Gelfand and Ghosh 1998). WAIC is invariant to parametrization and is asymptotically equal to Bayesian cross-validation (Watanabe 2010). WAIC = -2 * (lppd - p_waic_2). Where lppd is the log pointwise predictive density and p_waic_2 is the estimated effective number of parameters based on the variance estimator from Vehtari et al. 2016. (p_waic_1 is the mean estimator).
diagnostics
returns a list containing the diagnostics requested and
possibly the deviance and/or posterior predictive distribution objects.
Samuel I. Berchuck
Gelfand, A. E., & Ghosh, S. K. (1998). Model choice: a minimum posterior predictive loss approach. Biometrika, 1-11.
Spiegelhalter, D. J., Best, N. G., Carlin, B. P., & Van Der Linde, A. (2002). Bayesian measures of model complexity and fit. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 64(4), 583-639.
Vehtari, A., Gelman, A., & Gabry, J. (2016). Practical Bayesian model evaluation using leave-one-out cross-validation and WAIC. Statistics and Computing, 1-20.
Watanabe, S. (2010). Asymptotic equivalence of Bayes cross validation and widely applicable information criterion in singular learning theory. Journal of Machine Learning Research, 11(Dec), 3571-3594.
These Garway-Heath angles are used as the dissimilarity metric when implementing the boundary detection model for a longitudinal series of visual fields.
data(GarwayHeath)
data(GarwayHeath)
A vector with length 54, where each entry represents the angle (in degrees) that the underlying retinal nerve fiber enters the optic nerve head. The measure ranges from 0-360, where 0 is designated at the 9-o’clock position (right eye) and angles are counted counter clockwise. These angles are estimates for the Humphrey Field Analyzer-II (Carl Zeiss Meditec Inc., Dublin, CA). The 26th and 35th entries are missing as they correspond to a natural blind spot.
Garway-Heath, et al. (2000). Ophthalmology 107:10:1809–1815. (PubMed)
Binary adjacency matrix for the Humphrey Field Analyzer-II (Carl Zeiss Meditec Inc., Dublin, CA)
data(HFAII_Queen)
data(HFAII_Queen)
This adjacency matrix is formatted using queen neighbor criteria, meaning two locations on the visual field are only considered neighbors if they share an edge or corner. The adjacency matrix is a 54 x 54 dimensional binary object with zeros on the diagonal and the column and row sums are equal to the number of neighbors.
Binary adjacency matrix for the Humphrey Field Analyzer-II (Carl Zeiss Meditec Inc., Dublin, CA)
data(HFAII_QueenHF)
data(HFAII_QueenHF)
This adjacency matrix is formatted using queen neighbor criteria, meaning two locations on the visual field are only considered neighbors if they share an edge or corner. An additional criterion is included that locations are not considered neighbors if they fall within different hemispheres on the visual field. The adjacency matrix is a 54 x 54 dimensional binary object with zeros on the diagonal and the column and row sums are equal to the number of neighbors.
Binary adjacency matrix for the Humphrey Field Analyzer-II (Carl Zeiss Meditec Inc., Dublin, CA)
data(HFAII_Rook)
data(HFAII_Rook)
This adjacency matrix is formatted using rook neighbor criteria, meaning two locations on the visual field are only considered neighbors if they share an edge. The adjacency matrix is a 54 x 54 dimensional binary object with zeros on the diagonal and the column and row sums are equal to the number of neighbors.
Plots estimated visual field sensitivities using change point model.
PlotCP( object, data, location = "Location", time = "Time", dls = "DLS", line = TRUE, ci = TRUE, lwd = 1, lty = 1, col = 2, ci.lwd = 1, ci.lty = 2, ci.col = 2, cp.line = FALSE, cp.ci = FALSE, cp.lwd = 1, cp.lty = 1, cp.col = 4, cp.ci.lwd = 1, cp.ci.lty = 2, cp.ci.col = 4, main = "Estimated visual field sensitivity\nusing change points", xlab = "Years from baseline visit", ylab = "Sensitivity (dB)" )
PlotCP( object, data, location = "Location", time = "Time", dls = "DLS", line = TRUE, ci = TRUE, lwd = 1, lty = 1, col = 2, ci.lwd = 1, ci.lty = 2, ci.col = 2, cp.line = FALSE, cp.ci = FALSE, cp.lwd = 1, cp.lty = 1, cp.col = 4, cp.ci.lwd = 1, cp.ci.lty = 2, cp.ci.col = 4, main = "Estimated visual field sensitivity\nusing change points", xlab = "Years from baseline visit", ylab = "Sensitivity (dB)" )
object |
a spCP regression object. |
data |
a dataframe containing the raw sensitivities. |
location |
a character string indicating the column name in data of the variable of locations on the visual field. |
time |
a character string indicating the column name in data of the variable of visual field testing times. |
dls |
a character string indicating the column name in data of the variable of the raw visual field sensitivities. |
line |
logical, determines if there are regression lines printed (default = TRUE). |
ci |
logical, determines if there are confidence intervals printed (default = TRUE). |
lwd |
integer, specifies the width of the regression line (default = 1). |
lty |
integer, specifies the type of regression line (default = 1). |
col |
color for the regression line, either character string corresponding to a color or a integer (default = "red"). |
ci.lwd |
integer, specifies the width of the confidence intervals (default = 1). |
ci.lty |
integer, specifies the type of confidence intervals (default = 2). |
ci.col |
color for the confidence intervals for the regression line, either character string corresponding to a color or a integer (default = "red"). |
cp.line |
logical, determines if there the change point is printed (default = FALSE). |
cp.ci |
logical, determines if there the change point confidence intervals are printed (default = FALSE). |
cp.lwd |
integer, specifies the width of the change point line (default = 1). |
cp.lty |
integer, specifies the type of change point line (default = 1). |
cp.col |
color for the optional change point, either character string corresponding to a color or a integer (default = "blue"). |
cp.ci.lwd |
integer, specifies the width of the change point confidence intervals (default = 1). |
cp.ci.lty |
integer, specifies the type of change point confidence intervals (default = 2). |
cp.ci.col |
color for the confidence intervals of the optional change point, either character string corresponding to a color or a integer (default = "blue"). |
main |
an overall title for the plot. |
xlab |
a title for the x axis. |
ylab |
a title for the y axis. |
PlotCP
is used in the application of glaucoma progression.
The function is capable of plotting the observed DLS values across the visual field,
along with the estimated mean process (with 95 percent credible intervals) and the
estimated mean posterior change point location (with 95 percent credible intervals).
Samuel I. Berchuck
Predicts future observations from the spCP
model.
## S3 method for class 'spCP' predict(object, NewTimes, ...)
## S3 method for class 'spCP' predict(object, NewTimes, ...)
object |
a |
NewTimes |
a numeric vector including desired time(s) points for prediction. |
... |
other arguments. |
predict.spCP
uses Bayesian krigging to obtain posterior samples
from future time points.
predict.spCP
returns a list containing the following objects.
Y
A list
containing a matrix of predictions for each future time
point. Each matrix has one column for each location and contains posterior
samples obtained by Bayesian krigging.
Samuel I. Berchuck
spCP
is a Markov chain Monte Carlo (MCMC) sampler for a spatially varying change point
model with spatially varying slopes, intercepts, and unique variances at each spatial-temporal
location. The model is implemented using a Bayesian hierarchical framework.
spCP( Y, DM, W, Time, Starting = NULL, Hypers = NULL, Tuning = NULL, MCMC = NULL, Family = "tobit", Weights = "continuous", Distance = "circumference", Rho = 0.99, ScaleY = 10, ScaleDM = 100, Seed = 54 )
spCP( Y, DM, W, Time, Starting = NULL, Hypers = NULL, Tuning = NULL, MCMC = NULL, Family = "tobit", Weights = "continuous", Distance = "circumference", Rho = 0.99, ScaleY = 10, ScaleDM = 100, Seed = 54 )
Y |
An |
DM |
An |
W |
An |
Time |
A |
Starting |
Either When |
Hypers |
Either When
|
Tuning |
Either When |
MCMC |
Either
|
Family |
Character string indicating the distribution of the observed data. Options
include: |
Weights |
Character string indicating the type of weight used. Options include:
|
Distance |
Character string indicating the distance metric for computing the
dissimilarity metric. Options include: |
Rho |
A scalar in |
ScaleY |
A positive scalar used for scaling the observed data, |
ScaleDM |
A positive scalar used for scaling the dissimilarity metric distances,
|
Seed |
An integer value used to set the seed for the random number generator (default = 54). |
Details of the underlying statistical model proposed by proposed by Berchuck et al. 2018. are forthcoming.
spCP
returns a list containing the following objects
beta0
NKeep x M
matrix
of posterior samples for beta0
.
The s-th column contains posterior samples from the the s-th location.
beta1
NKeep x M
matrix
of posterior samples for beta1
.
The s-th column contains posterior samples from the the s-th location.
lambda0
NKeep x M
matrix
of posterior samples for lambda0
.
The s-th column contains posterior samples from the the s-th location.
lambda1
NKeep x M
matrix
of posterior samples for lambda1
.
The s-th column contains posterior samples from the the s-th location.
eta
NKeep x M
matrix
of posterior samples for eta
.
The s-th column contains posterior samples from the the s-th location.
theta
NKeep x M
matrix
of posterior samples for theta
.
The s-th column contains posterior samples from the the s-th location.
delta
NKeep x 5
matrix
of posterior samples for delta
.
The columns have names that describe the samples within them.
sigma
NKeep x 15
matrix
of posterior samples for Sigma
. The
columns have names that describe the samples within them. The row is listed first, e.g.,
Sigma32
refers to the entry in row 3
, column 2
.
alpha
NKeep x 1
matrix
of posterior samples for Alpha
.
metropolis
(3 * M + 1) x 3
matrix
of metropolis
acceptance rates, updated tuners, and original tuners that result from the pilot
adaptation. The first M
correspond to the Lambda0Vec
parameters,
the next M
correspond to the Lambda1Vec
, the next M
correspond
to the EtaVec
parameters and the last row give the Alpha
values.
runtime
A character
string giving the runtime of the MCMC sampler.
datobj
A list
of data objects that are used in future spCP
functions
and should be ignored by the user.
dataug
A list
of data augmentation objects that are used in future
spCP
functions and should be ignored by the user.
Samuel I. Berchuck [email protected]
Reference for Berchuck et al. 2018 is forthcoming.
A dataset containing 9 visual field series from a patient of the Vein Pulsation Study Trial in Glaucoma and the Lions Eye Institute trial registry, Perth, Western Australia.
data(VFSeries)
data(VFSeries)
A data frame with 486 rows and 4 variables:
The visual field test visit number, (1, 2, ... , 9).
The observed outcome variable, differential light sensitivity (DLS).
The time of the visual field test (in days from baseline).
The location on the visual field of a Humphrey Field Analyzer-II (Carl Zeiss Meditec Inc., Dublin, CA) (1, 2, ... , 54).
https://anzctr.org.au/Trial/Registration/TrialReview.aspx?ACTRN=12608000274370