Title: | Robust Bayesian Small Area Estimation |
---|---|
Description: | Functions for Robust Bayesian Small Area Estimation. |
Authors: | Malay Ghosh, Jiyoun Myung, Fernando Moura |
Maintainer: | Jiyoun Myung <[email protected]> |
License: | GPL-3 |
Version: | 0.1.0 |
Built: | 2024-12-24 06:45:19 UTC |
Source: | CRAN |
The package provides a function robustsae
for full non-subjective Bayesian analysis for the general area level small area models. This considers small area modeling of both the population means and the population variances. This is possible due to the availability of additional data purported to estimate the error variances. Also, in order to induce some robustness of the procedure, t-prior for the random effects is used. When the data set includes true values for interest parameter, this function returns the comparison criteria.
Package: | robustsae |
Type: | Package |
Version: | 1.0 |
Date: | 2016-12-05 |
License: | GPL-3 |
This package provides function for full Bayesian analysis of small area models.
Malay Ghosh, Jiyoun Myung, Fernando Moura
Maintainer: Jiyoun Myung <[email protected]>
Chip, S., and Green berg, E. (1995). Understanding the Metropolis-Hastings Algorithm. The American Statistician, 49, 327-335.
Rao, J. N. K. (2003) Small Area Estimation. John Wiley and Sons.
You, Y. and Chapman, B. (2006) Small Area Estimation Using Area Level Models and Estimated Sampling Variances. Survey Methodology, 32: 97-103.
Malay Ghosh, Jiyoun Myung, and Fernando Moura. (submitted) Robust Bayesian Small Area Estimation.
The data set is selected by a 10% random sampling of households in each area from a test demographic census completed in one municipality in Brazil consisting of 140 enumeration districts. This data set includes two centered auxiliary covariates, sampling means, sampling variances and true means for all areas. The contained information is available only at the area level.
data("BZdata")
data("BZdata")
A data frame with 140 observations on the following 6 variables.
ni
:sample size for each district.
X1
:respective small area population means of the educational attainment of the head of household, centered auxiliary covariate.
X2
:respective average number of rooms in households, centered auxiliary covariate
S2
:respective sampling variances income of head of the household.
y
:respective average mean income of head of the household.
truemean
:respective true mean income of head of the household.
Survey and satellite data for corn and soy beans in 12 Iowa counties, obtained from the 1978 June Enumerative Survey of the U.S. Department of Agriculture and from land observatory satellites (LANDSAT) during the 1978 growing season.
data("corndata")
data("corndata")
A data frame with 8 observations on the following 6 variables.
County
:county names.
ni
:sample size for each county.
Xi
:mean of reported hectares of corn from the survey, direct survey estimate.
Z1i
:mean of pixels of corn for each, from satellite data.
Z2i
:mean of pixels of soy bean for each county, from satellite data.
Si
:square root of sample variance of reported hectares of corn from the survey.
While the original dataset includes survey and satellite data for corn in 12 Iowa counties, this dataset contains only 8 counties' information where sample sizes are greater than 1.
- Battesse, G.E., Harter, R.M. and Fuller, W.A. (1988). An Error-Components Model for Prediction of County Crop Areas Using Survey and Satellite Data. Journal of the American Statistical Association, 83, 28-36.
- You, Y. and Chapman, B. (2006) Small Area Estimation Using Area Level Models and Estimated Sampling Variances. Survey Methodology, 32, 97-103.
This function provides full Bayesian Analysis for specific area-level small area models when data are provided for modeling both the mean and the variance.
robustsae(formula, S2, ni, nsim = 1000, burnin = 500, data, truemean)
robustsae(formula, S2, ni, nsim = 1000, burnin = 500, data, truemean)
formula |
a symbolic description of the model to be fitted. The details of model specification are given under Details. |
S2 |
a vector contain the sampling variances which are given for estimating the true variances. |
ni |
a vector containing the sample sizes for each area. |
nsim |
user-specified number of MCMC draws. See German (2006). |
burnin |
the number of burning iterations for the sampler. See German (2006). |
data |
an optional data frame containing the variables named in |
truemean |
true mean values for each area. |
Let denotes interest parameter for each area i,
the available area-specific auxiliary data,
the regression coefficients and
the number of small areas. A typical area level model is given by
Assume that the random effects and the sampling errors
are to be independently distributed with the
and the
. To foster robustness in small area estimation procedures, student t distribution is used for the random effects. Also, due to the availability of additional data purported to estimate the error variances, this considers modeling of both the means and the variances.
The robust Bayesian small area estimation model is
,
where is degrees of freedom parameter.
For a full Bayesian analysis, this function uses the modified Jeffrey' prior which is the product of the general Jeffrey' prior and
where
is chosen as 1:
The estimates of interest parameters are obtained by Rao-Balackwellization with Gibbs sampling with Metropolis-Hastings algorithm.
The function returns a object of class "robustsae"
containing the following components:
mean |
Rao-Balackwellization estimates of theta's |
variance |
Rao-Balackwellization estimates of v's |
Criteria |
a list containing the following comparison criteria : Returns NA if
|
Malay Ghosh, Jiyoun Myung, Fernando Moura
Rao, J. N. K. (2003) Small Area Estimation. John Wiley and Sons.
Chip, S., and Green berg, E. (1995). Understanding the Metropolis-Hastings Algorithm. The American Statistician, 49, 327-335.
# If there is truemean data, # load data set data(BZdata) attach(BZdata) result <- robustsae(y ~ X1 + X2, S2, ni = BZdata$ni, nsim = 1000, burnin = 500, data = BZdata, truemean = truemean) result detach(BZdata) # If there is no truemean data, #load data set data(corndata) attach(corndata) result2 <- robustsae(Xi ~ Z1i, Si^2, ni=corndata$ni, data = corndata) # no truemean result2$mean result2$variance detach(corndata)
# If there is truemean data, # load data set data(BZdata) attach(BZdata) result <- robustsae(y ~ X1 + X2, S2, ni = BZdata$ni, nsim = 1000, burnin = 500, data = BZdata, truemean = truemean) result detach(BZdata) # If there is no truemean data, #load data set data(corndata) attach(corndata) result2 <- robustsae(Xi ~ Z1i, Si^2, ni=corndata$ni, data = corndata) # no truemean result2$mean result2$variance detach(corndata)