Title: | Confidence Envelopes for Model Selection Criteria Based on Minimum ZIC |
---|---|
Description: | Narrow down the number of models to look at in model selection using the confidence envelopes based on the minimum ZIC (Generalized Information Criteria) values for regression and time series data. Functions involve the computation of multivariate normal-probabilities with covariance matrices based on minimum ZIC inverting the CDF of the minimum ZIC. It involves both the computation of singular and non-singular probabilities as described in Genz (1992) <[https:doi.org/10.2307/1390838]https:doi.org/10.2307/1390838>. |
Authors: | I.M.L. Nadeesha Jayaweera [aut, cre] , A. Alex Trindade [ctb, aut] |
Maintainer: | I.M.L. Nadeesha Jayaweera <[email protected]> |
License: | GPL-2 |
Version: | 1.0.1 |
Built: | 2024-10-24 07:01:30 UTC |
Source: | CRAN |
Concrete strength is very important in civil engineering and is a highly nonlinear function of age and ingredients. This dataset contains 1030 instances and there are 8 features relevant to concrete strength.
Concrete
Concrete
A data frame with 1030 rows and 8 covariate variables and 1 response variable
https://archive.ics.uci.edu/ml/datasets/Concrete+Compressive+Strength
data(Concrete)
data(Concrete)
Narrow down the number of models to look at in model selection using the confidence envelope based on the minimum ZIC values for regression data. Here, we compute the ZIC values ("AIC", "BIC", or "AICc") for regression data, confidence envelope for the minimum ZIC values for the given confidence limit, and rank the best models which lie in the confidence envelope.
RankReg(data,alphaval=0.95, model_ZIC="AIC")
RankReg(data,alphaval=0.95, model_ZIC="AIC")
data |
a matrix of |
alphaval |
confidence limit of the confidence envelope (Default is 0.95). |
model_ZIC |
type of the information criterion, it can be "AIC", "BIC", or "AICc" (Default is the "AIC"). |
This program involves the computation of multivariate normal-probabilities with covariance matrices based on minimum ZIC inverting the CDF of the minimum ZIC. It involves both the computation of singular and nonsingular probabilities. The methodology is described in Genz (1992).
Let be the ZIC value for the
fitted model. Compute the cdf values of the minimum ZIC,
numerically and then obtain the
confidence envelope:
See details:
Jayaweera I.M.L.N, Trindade A.A., “How Certain are You in Your Minimum AIC and BIC Values?", Sankhya A (2023+)
A list containing at least the following components.
Ranked_Models |
A set of top ranked models which lie in the confidence envelop |
Confidence_Envelope |
gives the confidence envelope |
Confidence_Limit |
the confidence limit, |
Total_Models |
number of total fitted models. |
Genz, A. (1992). Numerical computation of multivariate normal probabilities. Journal of computational and graphical statistics, 1(2), 141-149.
library("ConfZIC") data(Concrete) x=Concrete Y=x[,9] #dependent variable #independent variables X1=x[,1];X2=x[,2];X3=x[,3];X4=x[,4]; X5=x[,5];X6=x[,6];X7=x[,7];X8=x[,8]; mydata=cbind(Y,X1,X2,X3,X4,X5,X6,X7,X8) #data matrix RankReg(mydata,0.95,"BIC")
library("ConfZIC") data(Concrete) x=Concrete Y=x[,9] #dependent variable #independent variables X1=x[,1];X2=x[,2];X3=x[,3];X4=x[,4]; X5=x[,5];X6=x[,6];X7=x[,7];X8=x[,8]; mydata=cbind(Y,X1,X2,X3,X4,X5,X6,X7,X8) #data matrix RankReg(mydata,0.95,"BIC")
Narrow down the number of models to look at in model selection using the confidence envelope based on the minimum ZIC values for time series data. Here, we compute the ZIC values ("AIC", "BIC", or "AICc") for time-series data, confidence envelope for the minimum ZIC values for the given confidence limit, and rank the top models which lie in the confidence envelope.
RankTS(x,max.p,max.q,alphaval=0.95,model_ZIC="AIC")
RankTS(x,max.p,max.q,alphaval=0.95,model_ZIC="AIC")
x |
a vector of time series data (should be included with the maximum of 1000 data points). |
max.p |
maximum value for AR coefficient. |
max.q |
maximum value for MA coefficient. |
alphaval |
confidence limit |
model_ZIC |
type of the information criterion, it can be "AIC", "BIC", or "AICc" (Default is the "AIC"). |
This program involves the computation of multivariate normal-probabilities with covariance matrices based on minimum ZIC inverting the CDF of the minimum ZIC. It involves both the computation of singular and non-singular probabilities. The methodology is described in Genz (1992).
Let be the ZIC value for the
fitted model. Compute the cdf values of the minimum ZIC,
numerically and then obtain the
confidence envelope:
See details:
Jayaweera I.M.L.N, Trindade A.A., “How Certain are You in Your Minimum AIC and BIC Values?", Sankhya A (2023+)
a list of ranked models which lies in the confidence envelope,
Ranked_Models |
A set of top ranked time series models which lie in the confidence envelope |
Confidence_Envelope |
gives the confidence envelope |
Confidence_Limit |
the confidence limit, |
Total_Models |
number of total fitted models. |
Genz, A. (1992). Numerical computation of multivariate normal probabilities. Journal of computational and graphical statistics, 1(2), 141-149.
library("ConfZIC") data(Sunspots) x=Sunspots RankTS(x,max.p=13,max.q=13,0.95,"AICc")
library("ConfZIC") data(Sunspots) x=Sunspots RankTS(x,max.p=13,max.q=13,0.95,"AICc")
Test whether two ZIC values differ significantly based on minimum ZIC for regression data.
regZIC.test(model1,model2,model_ZIC="AIC",data,alpha=0.05)
regZIC.test(model1,model2,model_ZIC="AIC",data,alpha=0.05)
model1 |
an object of class “lm". |
model2 |
an object of class “lm". |
model_ZIC |
type of the information criterion, it can be "AIC", "BIC", or "AICc" (Default is the "AIC"). |
data |
a matrix of |
alpha |
significance level |
Consider the hypothesis: Under the null hypothesis that the two expected discrepancies are equal.
is calculated empirically.
p-value with significance status.
Linhart, H. (1988). A test whether two AIC's differ significantly. South African Statistical Journal, 22(2), 153-161.
library(ConfZIC) data(Concrete) x=Concrete Y=x[,9] #dependent variable #independent variables X1=x[,1];X2=x[,2];X3=x[,3];X4=x[,4]; X5=x[,5];X6=x[,6];X7=x[,7];X8=x[,8]; mydata=cbind(Y,X1,X2,X3,X4,X5,X6,X7,X8) #data matrix model1=lm(Y~X1); model2=lm(Y~X1+X2) regZIC.test(model1,model2,model_ZIC="BIC",data=mydata,alpha=0.05)
library(ConfZIC) data(Concrete) x=Concrete Y=x[,9] #dependent variable #independent variables X1=x[,1];X2=x[,2];X3=x[,3];X4=x[,4]; X5=x[,5];X6=x[,6];X7=x[,7];X8=x[,8]; mydata=cbind(Y,X1,X2,X3,X4,X5,X6,X7,X8) #data matrix model1=lm(Y~X1); model2=lm(Y~X1+X2) regZIC.test(model1,model2,model_ZIC="BIC",data=mydata,alpha=0.05)
Number of sunspots, 1770 to 1869
Sunspots
Sunspots
Number of sunspots, 1770 to 1869
Brockwell, P. J., & Davis, R. A. (Eds.). (2002). Introduction to time series and forecasting. New York, NY: Springer New York.
data(Sunspots)
data(Sunspots)
Test whether two ZIC values differ significantly based on minimum ZIC for time series data.
tsZIC.test(x,model1,model2,model_ZIC="AIC",alpha=0.05)
tsZIC.test(x,model1,model2,model_ZIC="AIC",alpha=0.05)
x |
time series data (maximum of 1000 data points). |
model1 |
AR and MA coefficients of Model 1. |
model2 |
AR and MA coefficients of Model 2. |
model_ZIC |
type of the information criterion, it can be "AIC", "BIC", or "AICc" (Default is the "AIC"). |
alpha |
significance level |
Consider the hypothesis: Under the null hypothesis that the two expected discrepancies are equal.
is calculated empirically.
p-value with significance status.
Linhart, H. (1988). A test whether two AIC's differ significantly. South African Statistical Journal, 22(2), 153-161.
library(ConfZIC) data(Sunspots) x=Sunspots model1=try(arima(x,order=c(1,0,1),method="ML",include.mean=FALSE),silent = TRUE) model2=try(arima(x,order=c(1,0,0),method="ML",include.mean=FALSE),silent = TRUE) tsZIC.test(x,model1,model2,model_ZIC="AIC",alpha=0.05)
library(ConfZIC) data(Sunspots) x=Sunspots model1=try(arima(x,order=c(1,0,1),method="ML",include.mean=FALSE),silent = TRUE) model2=try(arima(x,order=c(1,0,0),method="ML",include.mean=FALSE),silent = TRUE) tsZIC.test(x,model1,model2,model_ZIC="AIC",alpha=0.05)