Package 'CompositionalSR' reference manual

Title:	Spatial Regression Models with Compositional Data
Description:	Spatial and non-spatial regression models with compositional responses (and compositional predictors) using the alpha--transformation. Relevant papers include: Tsagris M. and Pantazis Y. (2026), <doi:10.48550/arXiv.2510.12663>, Tsagris M. (2015), <https://soche.cl/chjs/volumes/06/02/Tsagris(2015).pdf>, Tsagris M.T., Preston S. and Wood A.T.A. (2011), <doi:10.48550/arXiv.1106.1451>.
Authors:	Michail Tsagris [aut, cre]
Maintainer:	Michail Tsagris <[email protected]>
License:	GPL (>= 2)
Version:	1.4
Built:	2026-07-13 06:52:35 UTC
Source:	https://github.com/cran/CompositionalSR

Spatial Regression Models with Compositional Data

Description

Spatial regression models with compositional responses using the $\alpha$ -transformation. The models included are the $\alpha$ -regression (not spatial), the $\alpha$ -regression (not spatial) with compositional predictors, the $\alpha$ -spatially lagged X ( $\alpha$ -SLX) model, the geographically weighted $\alpha$ -regression (GW $\alpha$ R) model and the $\alpha$ -eigenvector spatial filtering ( $\alpha$ -ESF) model.
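For orientation, the $\alpha$ -transformation that all of these models rely on can be written as below. This is a sketch of the definition given in Tsagris, Preston and Wood (2011); the symbols $D$ (number of components), $\mathbf{u}_\alpha$ and $\mathbf{H}$ (a $(D-1) \times D$ Helmert sub-matrix) are notation introduced here for illustration and do not appear elsewhere in this manual.

```latex
\[
\mathbf{u}_\alpha(\mathbf{x}) =
\left( \frac{x_1^\alpha}{\sum_{j=1}^{D} x_j^\alpha}, \ldots,
       \frac{x_D^\alpha}{\sum_{j=1}^{D} x_j^\alpha} \right),
\qquad
\mathbf{z}_\alpha(\mathbf{x}) =
\mathbf{H} \, \frac{D \, \mathbf{u}_\alpha(\mathbf{x}) - \mathbf{1}_D}{\alpha}.
\]
```

As $\alpha \to 0$ the vector $\mathbf{z}_\alpha(\mathbf{x})$ tends to the isometric log-ratio transformation of $\mathbf{x}$, which is why $\alpha = 0$ is treated as that special case in the argument descriptions.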

Details

Package:	CompositionalSR
Type:	Package
Version:	1.4
Date:	2026-04-14
License:	GPL-2

Maintainers

Michail Tsagris <[email protected]>

Author(s)

Michail Tsagris [email protected].

References

Tsagris M. and Pantazis Y. (2026). The $\alpha$ -regression for compositional data: a unified framework for standard, temporal and spatial regression models including compositional predictors. https://arxiv.org/pdf/2510.12663

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Tsagris M.T., Preston S. and Wood A.T.A. (2011). A data-based power transformation for compositional data. In Proceedings of the 4th Compositional Data Analysis Workshop, Girona, Spain. https://arxiv.org/pdf/1106.1451.pdf

Tsagris M., Papadovasilakis Z., Lakiotaki K. and Tsamardinos I. (2022). The $\gamma$ -OMP Algorithm for Feature Selection With Application to Gene Expression Data. IEEE/ACM Transactions on Computational Biology and Bioinformatics, 19(2), 1214–1224.

Compositional regression with compositional predictors using the $\alpha$ -transformation

Description

Compositional regression with compositional predictors using the $\alpha$ -transformation.

Usage

alfa.pcreg(y, x, a, k, xnew = NULL, yb = NULL)

Arguments

y

A matrix with the compositional responses. Zero values are allowed.

x

A matrix with the compositional predictors. Zero values are allowed.

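a

The value of the power transformation; it has to be between -1 and 1. If zero values are present it has to be greater than 0.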

k

How many principal components to compute?

xnew

If you have new data use it, otherwise leave it NULL.

yb

If you have already transformed the data using the $\alpha$ -transformation with the same $\alpha$ as given in the argument "a", put it here. Otherwise leave it NULL.

Details

The $\alpha$ -transformation is applied to both the compositional responses and predictors. Then, principal component analysis is performed on the $\alpha$ -transformed predictors and the projected scores are used as predictors. The same value of $\alpha$ is used for both the responses and the predictors.
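The predictor side of this pipeline can be sketched in base R as follows. This is an illustration under stated assumptions, not the code used by alfa.pcreg(): the helpers helmert_sub() and alfa_transform() are hypothetical names, the data are simulated, and whether the package centres the scores in the same way as prcomp() is not stated in this manual.

```r
# Sketch only: base-R stand-ins, not the internals of alfa.pcreg().

# Helmert sub-matrix: (D - 1) x D, orthonormal rows, each orthogonal to 1_D.
helmert_sub <- function(D) {
  h <- contr.helmert(D)                       # D x (D - 1), columns sum to zero
  t(sweep(h, 2, sqrt(colSums(h^2)), "/"))     # scale columns to unit length
}

# alpha-transformation for a != 0 (hypothetical helper).
alfa_transform <- function(x, a) {
  x <- as.matrix(x)
  D <- ncol(x)
  u <- x^a / rowSums(x^a)                     # power-transformed compositions
  ((D * u - 1) / a) %*% t(helmert_sub(D))     # n x (D - 1)
}

set.seed(1)
x <- matrix(rgamma(50 * 4, shape = 2), ncol = 4)
x <- x / rowSums(x)                           # 50 simulated compositions, 4 parts

xa <- alfa_transform(x, a = 0.5)              # transformed predictors, 50 x 3
pca <- prcomp(xa, center = TRUE, scale. = FALSE)
k <- 2
scores <- pca$x[, 1:k, drop = FALSE]          # first k scores replace x as predictors
```

With a 4-part composition the transformed predictors have 3 columns, so k can be at most 3 in this setting.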

Value

A list including:

runtime

The time required by the regression.

be

The beta coefficients.

dev

The sum of the squared residuals, as produced by the function minpack.lm::nls.lm().

est

The fitted values for xnew if xnew is not NULL.

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris [email protected].

References

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Mardia K.V., Kent J.T., and Bibby J.M. (1979). Multivariate analysis. Academic press.

Examples

data(fadn)
y <- fadn[, 3:7]
x <- fadn[, 8:11]
x <- x / rowSums(x)
mod <- alfa.pcreg(y, x, a = 0.2, k = 3)

Computation of the contiguity matrix W

Description

Computation of the contiguity matrix W.

Usage

contiguity(coords, k = 10)

Arguments

coords

A matrix with the coordinates of the locations. The first column is the latitude and the second is the longitude.

k

The number of nearest neighbours to consider for the contiguity matrix.

Value

The contiguity matrix W. A square matrix with row standardised values (the elements of each row sum to 1).
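A row-standardised k-nearest-neighbour matrix of this kind can be sketched in base R as below. This is only an illustration: the function name knn_contiguity() is hypothetical, the coordinates are made up, and the use of plain Euclidean distance on the raw coordinates is an assumption (the package may measure distance differently).

```r
# Sketch only: Euclidean distance on raw coordinates is an assumption.
knn_contiguity <- function(coords, k = 10) {
  coords <- as.matrix(coords)
  n <- nrow(coords)
  d <- as.matrix(dist(coords))          # n x n pairwise distances
  diag(d) <- Inf                        # a location is not its own neighbour
  W <- matrix(0, n, n)
  for (i in seq_len(n)) {
    nn <- order(d[i, ])[seq_len(k)]     # indices of the k nearest neighbours
    W[i, nn] <- 1 / k                   # row standardisation: each row sums to 1
  }
  W
}

set.seed(1)
coords <- cbind(runif(30, 35, 41), runif(30, 20, 26))  # made-up latitude/longitude pairs
W <- knn_contiguity(coords, k = 5)
```

Note that a matrix built this way is generally not symmetric, since location j can be among the nearest neighbours of i without the reverse holding.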

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris [email protected].

Examples

data(fadn)
W <- contiguity(fadn[, 1:2])

FADN dataset

Description

A matrix with 11 columns. The first two are the locations (latitude and longitude), the next five contain the compositional data (percentages of cultivated area of five crops), Y1.1: cereals, Y2.1: cotton, Y3.1: tree crops, Y4.1: other annual crops and pasture and Y5.1: grapes and wine. The next four columns contain the covariates, G1: Human Influence Index, G2: soil pH, G3: topsoil organic carbon content and G7: erosion.

Usage

fadn

Format

A matrix with 168 rows and 11 columns.

Source

Clark and Dixon (2021), available at https://github.com/nick3703/Chicago-Data.

References

Clark, N. J. and P. M. Dixon (2021). A class of spatially correlated self-exciting statistical models. Spatial Statistics, 43, 1–18.

Examples

data(fadn)
y <- fadn[, 3:7]
x <- fadn[, 8:11]
mod <- alfa.reg(y, x, a = 0.1)

ICE plot for the $\alpha$ -ESF model

Description

ICE plot for the $\alpha$ -ESF model.

Usage

ice.aesf(be, gama, x, X.esf, ind = 1, frac = 0.25, pos = 0.5)

Arguments

be

A numerical matrix with the estimated $\beta$ coefficients of the $\alpha$ -ESF model.

gama

A numerical matrix with the estimated $\gamma$ coefficients of the $\alpha$ -ESF model.

x

A numerical matrix with the predictor variables.

X.esf

A matrix with the values of the eigenvectors computed.

ind

The index of the variable to select.

frac

Fraction of observations to use. The default value is 0.25.

pos

This is a number between 0 and 1 and is used to place the legend in the appropriate place.

Details

This function implements the Individual Conditional Expectation plots of Goldstein et al. (2015). See the references for more details.
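The idea behind an ICE plot can be sketched generically in base R: for each observation, the selected variable is varied over a grid while the remaining predictors are held at their observed values, and the model prediction is recorded. In the sketch below a plain linear model on simulated data stands in for the fitted model; this is an assumption made for illustration and it does not reproduce the compositional predictions or the smoothing used by ice.aesf().

```r
# Sketch only: a linear model stands in for the fitted compositional model.
set.seed(1)
n <- 40
dat <- data.frame(x1 = runif(n), x2 = runif(n))
dat$y <- 1 + 2 * dat$x1 - dat$x2 + rnorm(n, sd = 0.1)
fit <- lm(y ~ x1 + x2, data = dat)

# Grid of values for the selected variable (here x1).
grid <- seq(min(dat$x1), max(dat$x1), length.out = 25)

# One curve per observation: vary x1 over the grid, keep x2 as observed.
ice <- sapply(seq_len(n), function(i) {
  newd <- dat[rep(i, length(grid)), c("x1", "x2")]
  newd$x1 <- grid
  predict(fit, newdata = newd)
})                                       # 25 x n matrix of predictions
```

The curves can then be drawn with, for example, matplot(grid, ice, type = "l"). For a compositional response there would be one such set of curves per component.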

Value

A graph with several curves, one for each component. The horizontal axis contains the selected variable, whereas the vertical axis contains the locally smoothed predicted compositional lines.

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris [email protected].

References

https://christophm.github.io/interpretable-ml-book/ice.html

Goldstein, A., Kapelner, A., Bleich, J. and Pitkin, E. (2015). Peeking inside the black box: Visualizing statistical learning with plots of individual conditional expectation. Journal of Computational and Graphical Statistics 24(1): 44-65.

Examples

data(fadn)
coords <- fadn[, 1:2]
y <- fadn[, 3:7]
x <- fadn[, 8, drop = FALSE]
mod <- alfa.esf(y, x, a = 0.1, coords = coords)

ICE plot for the $\alpha$ -regression

Description

ICE plot for the $\alpha$ -regression.

Usage

ice.areg(be, x, ind = 1, frac = 0.25, pos = 0.5)

Arguments

be

A numerical matrix with the estimated $\alpha$ -regression coefficients.

x

A numerical matrix with the predictor variables.

ind

The index of the variable to select.

frac

Fraction of observations to use. The default value is 0.25.

pos

This is a number between 0 and 1 and is used to place the legend in the appropriate place.

Details

This function implements the Individual Conditional Expectation plots of Goldstein et al. (2015). See the references for more details.

Value

A graph with several curves, one for each component. The horizontal axis contains the selected variable, whereas the vertical axis contains the locally smoothed predicted compositional lines.

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris [email protected].

References

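https://christophm.github.io/interpretable-ml-book/ice.html

Goldstein, A., Kapelner, A., Bleich, J. and Pitkin, E. (2015). Peeking inside the black box: Visualizing statistical learning with plots of individual conditional expectation. Journal of Computational and Graphical Statistics 24(1): 44-65.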

Examples

data(fadn)
y <- fadn[, 3:7]
x <- fadn[, 8, drop = FALSE]
mod <- alfa.reg(y, x, 0.2)
ice <- ice.areg(mod$be, x, ind = 1)

K-fold cross-validation for the $\alpha$ -regression

Description

K-fold cross-validation for the $\alpha$ -regression.

Usage

cv.alfareg(y, x, a = seq(0.1, 1, by = 0.1), nfolds = 10,
folds = NULL, nc = 1, seed = NULL)

Arguments

y

A matrix with compositional data. Zero values are allowed.

x

A matrix with the continuous predictor variables or a data frame including categorical predictor variables.

a

A vector with the values of the power transformation; each value has to be between -1 and 1. If zero values are present in the data, every value of $\alpha$ has to be greater than 0. If $\alpha=0$ the isometric log-ratio transformation is applied.

nfolds

The number of folds to split the data.

folds

If you have the list with the folds supply it here. You can also leave it NULL and it will create folds.

nc

The number of cores to use. If you have a multicore computer it is advisable to use more than 1, as this makes the procedure faster. This is advisable only if you have many observations and/or many variables; otherwise it will slow down the process.

seed

You can specify your own seed number here or leave it NULL.

Details

Tuning the value of $\alpha$ in the $\alpha$ -regression takes place using K-fold cross-validation.
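Two ingredients of such a procedure, the fold assignment and the Kullback-Leibler divergence used to score predictions, can be sketched in base R as below. The helper names kl_div() and make_folds() are hypothetical, the small matrices are made-up values, and the exact form of the divergence and of the fold construction inside cv.alfareg() may differ from this sketch.

```r
# Sketch only: the divergence and folds used by cv.alfareg() may differ.

# Average Kullback-Leibler divergence from fitted to observed compositions.
kl_div <- function(y, est) {
  term <- y * log(y / est)
  term[y == 0] <- 0                     # convention: 0 * log(0) = 0
  mean(rowSums(term))                   # average over observations
}

# Random assignment of n observations to nfolds folds of near-equal size.
make_folds <- function(n, nfolds = 10, seed = NULL) {
  if (!is.null(seed)) set.seed(seed)
  split(sample(n), rep(seq_len(nfolds), length.out = n))
}

y   <- rbind(c(0.20, 0.30, 0.50), c(0.00, 0.40, 0.60))   # observed (with a zero)
est <- rbind(c(0.25, 0.25, 0.50), c(0.10, 0.40, 0.50))   # made-up fitted values
kl <- kl_div(y, est)

folds <- make_folds(20, nfolds = 5, seed = 1)
```

In a full cross-validation loop, the model would be fitted on all folds but one for each candidate $\alpha$ , the held-out compositions predicted, and the divergence averaged over folds; the $\alpha$ with the smallest average would be selected.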

Value

A list including:

runtime

The runtime required by the cross-validation.

perf

A vector with the average Kullback-Leibler divergence, for every value of $\alpha$ .

opt

A vector with the minimum Kullback-Leibler divergence and the optimal value of $\alpha$ .

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris [email protected].

References

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Examples

data(fadn)
y <- fadn[, 3:7]
x <- fadn[, 8]
mod <- cv.alfareg(y, x, a = c(0.5, 1))

K-fold cross-validation for the $\alpha$ -regression with compositional predictors

Description

K-fold cross-validation for the $\alpha$ -regression with compositional predictors.

Usage

cv.alfapcreg(y, x, a = seq(0.1, 1, by = 0.1), k = dim(x)[2] - 2,
nfolds = 10, folds = NULL, seed = NULL)

Arguments

y

A matrix with compositional response data. Zero values are allowed.

x

A matrix with the compositional predictor variables. Zero values are allowed.

a

A numerical vector with the values of the power transformation, it has to be between -1 and 1. If zero values are present it has to be greater than 0. If $\alpha=0$ the isometric log-ratio transformation is applied.

k

A number with the maximum number of principal components to consider. Use at most the default value, dim(x)[2] - 2.

nfolds

The number of folds to split the data.

folds

If you have the list with the folds supply it here. You can also leave it NULL and it will create folds.

seed

You can specify your own seed number here or leave it NULL.

Details

The value of $\alpha$ and k, the number of principal components in the $\alpha$ -regression with compositional predictors, are tuned using classical K-fold cross-validation.

Value

A list including:

runtime

The runtime required by the cross-validation.

perf

A matrix with the average Kullback-Leibler divergence, for every value of $\alpha$ and k.

kl

The minimum average value of the Kullback-Leibler divergence.

opt_a

The optimal value of $\alpha$ .

opt_k

The optimal value of k, the number of principal components.

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris [email protected].

References

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Examples

data(fadn)
y <- fadn[, 3:7]
x <- fadn[, 8:11]
x <- x / rowSums(x)
mod <- cv.alfapcreg(y, x, a = c(0.5, 1))

Spatial K-fold cross-validation for the GW $\alpha$ R model

Description

Spatial K-fold cross-validation for the GW $\alpha$ R model.

Usage

cv.gwar(y, x, a = c(0.1, 0.25, 0.5, 0.75, 1), coords, h,
nfolds = 10, size = 1000, folds = NULL)

Arguments

y

A matrix with compositional data. Zero values are allowed.

x

A matrix with the continuous predictor variables or a data frame including categorical predictor variables.

a

A vector with the values of the power transformation; each value has to be between -1 and 1. If zero values are present in the data, every value of $\alpha$ has to be greater than 0. If $\alpha=0$ the isometric log-ratio transformation is applied.

coords

A matrix with the coordinates of the locations. The first column is the latitude and the second is the longitude.

h

A vector with bandwidth values.

nfolds

The number of folds to split the data.

size

A numeric value of the specified range by which blocks are created and training/testing data are separated. This distance should be in metres. If you have big regions you should consider increasing this number. For more information see the package blockCV.

folds

If you have the list with the folds supply it here. You can also leave it NULL and it will create folds.

Details

A spatial K-fold cross-validation protocol (10 folds by default) is applied to choose the optimal values of $\alpha$ and h.

Value

A list including:

runtime

The runtime required by the cross-validation.

perf

A vector with the average Kullback-Leibler divergence, for every value of $\alpha$ .

opt

A vector with the minimum Kullback-Leibler divergence and the optimal values of $\alpha$ and h.

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris [email protected].

References

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Examples

data(fadn)
coords <- fadn[, 1:2]
y <- fadn[, 3:7]
x <- fadn[, 8]
mod <- gwar(y, x, a = 1, coords, h = 0.001)

Marginal effects for the $\alpha$ -ESF model

Description

Marginal effects for the $\alpha$ -ESF model.

Usage

me.aesf(be, gama, mu, x, X.esf)

Arguments

be

A matrix with the beta regression coefficients of the $\alpha$ -ESF model.

gama

A matrix with the gamma regression coefficients of the $\alpha$ -ESF model.

mu

The fitted values of the $\alpha$ -ESF model.

x

A matrix with the continuous predictor variables or a data frame. Categorical predictor variables are not suited here.

X.esf

A matrix with the eigenvectors.

Details

The marginal effects of the $\alpha$ -ESF model are computed.

Value

A list including:

me

An array with the marginal effects of each component for each predictor variable.

ame

The average marginal effects of each component for each predictor variable.

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris [email protected].

References

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Examples

data(fadn)
coords <- fadn[1:50, 1:2]
y <- fadn[1:50, 3:7]
x <- fadn[1:50, 8]
mod <- alfa.esf(y, x, a = 0.1, coords = coords, xnew = x, coordsnew = coords)
me <- me.aesf(mod$be, mod$gama, mod$est, x, mod$X.esf)

Marginal effects for the $\alpha$ -regression model

Description

Marginal effects for the $\alpha$ -regression model.

Usage

me.ar(be, mu, x, cov_be = NULL)

Arguments

be

A matrix with the beta regression coefficients of the $\alpha$ -regression model.

mu

The fitted values of the $\alpha$ -regression.

x

A matrix with the continuous predictor variables or a data frame. Categorical predictor variables are not suited here.

cov_be

The covariance matrix of the beta regression coefficients. If you pass this argument, then the standard error of the average marginal effects will be returned.

Details

The marginal effects of the $\alpha$ -regression model are computed.

Value

A list including:

me

An array with the marginal effects of each component for each predictor variable.

ame

The average marginal effects of each component for each predictor variable.

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris [email protected].

References

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Examples

data(fadn)
y <- fadn[, 3:7]
x <- fadn[, 8]
mod <- alfa.reg(y, x, 0.2, xnew = x)
me <- me.ar(mod$be, mod$est, x)

Marginal effects for the $\alpha$ -SAR model

Description

Marginal effects for the $\alpha$ -SAR model.

Usage

me.asar(be, rho, mu, x, coords, k, cov_theta = NULL)

Arguments

be

A matrix with the beta coefficients of the $\alpha$ -SAR model.

rho

The spatial auto-regressive coefficient $\rho$ of the $\alpha$ -SAR model.

mu

The fitted values of the $\alpha$ -SAR model.

x

A matrix with the continuous predictor variables or a data frame. Categorical predictor variables are not suited here.

coords

A matrix with the coordinates of the locations. The first column is the latitude and the second is the longitude.

k

The number of nearest neighbours to consider for the contiguity matrix.

cov_theta

The covariance matrix of the beta and gamma regression coefficients. If you pass this argument, then the standard error of the average marginal effects will be returned.

Details

The marginal effects of the $\alpha$ -SAR model are computed.

Value

A list including:

me.dir

An array with the direct marginal effects of each component for each predictor variable.

me.indir

An array with the indirect marginal effects of each component for each predictor variable.

me.total

An array with the total marginal effects of each component for each predictor variable.

ame.dir

An array with the average direct marginal effects of each component for each predictor variable.

ame.indir

An array with the average indirect marginal effects of each component for each predictor variable.

ame.total

An array with the average total marginal effects of each component for each predictor variable.

se.amedir

An array with the standard errors of the average direct marginal effects of each component for each predictor variable. This is returned if you supply the covariance matrix cov_theta.

se.ameindir

An array with the standard errors of the average indirect marginal effects of each component for each predictor variable. This is returned if you supply the covariance matrix cov_theta.

se.ametotal

An array with the standard errors of the average total marginal effects of each component for each predictor variable. This is returned if you supply the covariance matrix cov_theta.

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris [email protected].

References

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Examples

data(fadn)
coords <- fadn[, 1:2]
y <- fadn[, 3:7]
x <- fadn[, 8]
mod <- alfa.sar(y, x, a = 0.5, coords, k = 8)
me <- me.asar(mod$be, mod$rho, mod$est, x, coords, k = 8)

Marginal effects for the $\alpha$ -SLX model

Description

Marginal effects for the $\alpha$ -SLX model.

Usage

me.aslx(be, gama, mu, x, coords, k = 10, cov_theta = NULL)

Arguments

be

A matrix with the beta coefficients of the $\alpha$ -SLX model.

gama

A matrix with the gamma coefficients of the $\alpha$ -SLX model.

mu

The fitted values of the $\alpha$ -SLX model.

x

A matrix with the continuous predictor variables or a data frame. Categorical predictor variables are not suited here.

coords

A matrix with the coordinates of the locations. The first column is the latitude and the second is the longitude.

k

The number of nearest neighbours to consider for the contiguity matrix.

cov_theta

The covariance matrix of the beta and gamma regression coefficients. If you pass this argument, then the standard error of the average marginal effects will be returned.

Details

The marginal effects of the $\alpha$ -SLX model are computed.

Value

A list including:

me.dir

An array with the direct marginal effects of each component for each predictor variable.

me.indir

An array with the indirect marginal effects of each component for each predictor variable.

me.total

An array with the total marginal effects of each component for each predictor variable.

ame.dir

An array with the average direct marginal effects of each component for each predictor variable.

ame.indir

An array with the average indirect marginal effects of each component for each predictor variable.

ame.total

An array with the average total marginal effects of each component for each predictor variable.

se.amedir

An array with the standard errors of the average direct marginal effects of each component for each predictor variable. This is returned if you supply the covariance matrix cov_theta.

se.ameindir

An array with the standard errors of the average indirect marginal effects of each component for each predictor variable. This is returned if you supply the covariance matrix cov_theta.

se.ametotal

An array with the standard errors of the average total marginal effects of each component for each predictor variable. This is returned if you supply the covariance matrix cov_theta.

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris [email protected].

References

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Examples

data(fadn)
coords <- fadn[, 1:2]
y <- fadn[, 3:7]
x <- fadn[, 8]
mod <- alfa.slx(y, x, a = 0.5, coords, k = 10, xnew = x, coordsnew = coords)
me <- me.aslx(mod$be, mod$gama, mod$est, x, coords, k = 10)

Marginal effects for the GW $\alpha$ R model

Description

Marginal effects for the GW $\alpha$ R model.

Usage

me.gwar(be, mu, x)

Arguments

be

A matrix with the beta regression coefficients of the GW $\alpha$ R model.

mu

The fitted values of the GW $\alpha$ R model.

x

A matrix with the continuous predictor variables or a data frame. Categorical predictor variables are not suited here.

Details

The location-specific marginal effects for the GW $\alpha$ R model are computed.

Value

A list including:

me

An array with the location-specific marginal effects of each component for each predictor variable.

ame

The average location-specific marginal effects of each component for each predictor variable.

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris [email protected].

References

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Examples

data(fadn)
coords <- fadn[, 1:2]
y <- fadn[, 3:7]
x <- fadn[, 8]
mod <- gwar(y, x, a = 1, coords, h = 0.001)
me <- me.gwar(mod$be, mod$est, x)

Prediction with the GW $\alpha$ R model

Description

Prediction with the GW $\alpha$ R model.

Usage

gwar.pred(y, x, a, coords, h, xnew, coordsnew)

Arguments

y

A matrix with the compositional data.

x

A matrix with the continuous predictor variables or a data frame including categorical predictor variables.

a

A vector with values for the power transformation, it has to be between -1 and 1.

coords

A matrix with the coordinates of the locations. The first column is the latitude and the second is the longitude.

h

A vector with bandwidth values.

xnew

The new data.

coordsnew

A matrix with the coordinates of the new locations. The first column is the latitude and the second is the longitude.

Details

The $\alpha$ -transformation is applied to the compositional data first and then the GW $\alpha$ R model is applied and predictions are given for each observation.

Value

A list including:

runtime

The time required by the regression.

est

A list with the fitted values, for each combination of $\alpha$ and h.

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris [email protected].

References

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Examples

data(fadn)
coords <- fadn[-c(1:10), 1:2]
y <- fadn[-c(1:10), 3:7]
x <- fadn[-c(1:10), 8]
xnew <- fadn[1:10, 8]
coordsnew <- fadn[1:10, 1:2]
mod <- gwar.pred(y, x, a = c(0.25, 0.5, 1), coords,
h = c(0.002, 0.006), xnew = xnew, coordsnew = coordsnew)

Regression with compositional data using the $\alpha$ -transformation

Description

Regression with compositional data using the $\alpha$ -transformation.

Usage

areg(y, x, a, covb = FALSE, xnew = NULL, yb = NULL)
alfa.reg(y, x, a, covb = FALSE, xnew = NULL, yb = NULL)
alfa.reg2(y, x, a, xnew = NULL, ncores = 1)
alfa.reg3(y, x, a = c(-1, 1), xnew = NULL)

Arguments

y

A matrix with the compositional data.

x

A matrix with the continuous predictor variables or a data frame including categorical predictor variables.

a

The value of the power transformation; it has to be between -1 and 1. If zero values are present it has to be greater than 0. If $\alpha=0$ the isometric log-ratio transformation is applied and the solution exists in closed form, since it is the classical multivariate regression. For alfa.reg2() this should be a vector of $\alpha$ values; the function calls alfa.reg() repeatedly, once for each value. For alfa.reg3() it should be a vector with two values, the endpoints of the interval of $\alpha$ . Using the optimize() function, alfa.reg3() searches this interval for the value of $\alpha$ that minimizes the Kullback-Leibler divergence between the observed and fitted compositions. This is an alternative to choosing the value of $\alpha$ by cross-validation with cv.alfareg().

The function areg() is faster as it passes the Jacobian matrix to the nls.lm() function.

covb

Do you want the covariance matrix of the regression coefficients to be returned? If TRUE, this will slow down the process, as it is computed numerically.

xnew

If you have new data use it, otherwise leave it NULL.

ncores

The number of cores to use for parallel computations.

yb

If you have already transformed the data using the $\alpha$ -transformation with the same $\alpha$ as given in the argument "a", put it here. Otherwise leave it NULL.

This is intended to be used in the function cv.alfareg() in order to speed up the process. The time difference in that function is small for small samples, but it becomes larger with a few thousand observations and/or more components.

Details

The $\alpha$ -transformation is applied to the compositional data first and then multivariate regression is applied. This involves numerical optimisation. The alfa.reg2() function accepts a vector with many values of $\alpha$ , while the alfa.reg3() function searches for the value of $\alpha$ that minimizes the Kullback-Leibler divergence between the observed and the fitted compositional values. The functions are highly optimized.
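The special role of $\alpha=0$ can be checked numerically: for very small $\alpha$ the $\alpha$ -transformed data are close to the isometric log-ratio coordinates. The base-R sketch below illustrates this under stated assumptions; the helper names helmert_sub(), alfa_transform() and ilr() are hypothetical, the Helmert sub-matrix is built from contr.helmert() (so it may differ in sign or row order from the one used by the package), and the two compositions are made-up values.

```r
# Sketch only: hypothetical helpers, not the internals of alfa.reg().

# Helmert sub-matrix: (D - 1) x D with orthonormal rows orthogonal to 1_D.
helmert_sub <- function(D) {
  h <- contr.helmert(D)
  t(sweep(h, 2, sqrt(colSums(h^2)), "/"))
}

# alpha-transformation for a != 0.
alfa_transform <- function(x, a) {
  D <- ncol(x)
  u <- x^a / rowSums(x^a)
  ((D * u - 1) / a) %*% t(helmert_sub(D))
}

# Isometric log-ratio transformation with the same basis.
ilr <- function(x) {
  lx <- log(x)
  (lx - rowMeans(lx)) %*% t(helmert_sub(ncol(x)))   # centred logs, then projected
}

x <- rbind(c(0.1, 0.2, 0.7), c(0.3, 0.3, 0.4))      # two made-up compositions

# Largest absolute difference between the two transformations for tiny alpha.
gap <- max(abs(alfa_transform(x, 1e-6) - ilr(x)))
```

At the other end of the range, $\alpha=1$ gives a linear transformation of the composition itself, so the data are analysed without taking logarithms.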

Value

For the alfa.reg() function a list including:

runtime

The time required by the regression.

be

The beta coefficients.

covbe

The covariance matrix if covb was set to TRUE, otherwise NULL.

dev

The sum of the squared residuals, as produced by the function minpack.lm::nls.lm().

est

The fitted values for xnew if xnew is not NULL.

For the alfa.reg2() function a list with the time required by all regressions and the regression coefficients and the fitted values for each value of $\alpha$ .

For the alfa.reg3() function a list with the previous elements plus an output "alfa", the optimal value of $\alpha$ .

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris [email protected].

References

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Mardia K.V., Kent J.T., and Bibby J.M. (1979). Multivariate analysis. Academic press.

Aitchison J. (1986). The statistical analysis of compositional data. Chapman & Hall.

Examples

data(fadn)
y <- fadn[, 3:7]
x <- fadn[, 8]
mod <- alfa.reg(y, x, 0.2)

Robust regression with compositional data using the $\alpha$ -transformation

Description

Robust regression with compositional data using the $\alpha$ -transformation.

Usage

rob.alfareg(y, x, a, loss = "welsh", xnew = NULL, yb = NULL)

Arguments

y

A matrix with the compositional data.

x

A matrix with the continuous predictor variables or a data frame including categorical predictor variables.

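a

The value of the power transformation; it has to be between -1 and 1. If zero values are present it has to be greater than 0.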

loss

The loss function to use. One of these available options, "barron", "bisquare", "welsh", "optimal", "hampel", "ggw", or "lqq". For more information see the package gslnls.

xnew

If you have new data use it, otherwise leave it NULL.

yb

If you have already transformed the data using the $\alpha$ -transformation with the same $\alpha$ as given in the argument "a", put it here. Otherwise leave it NULL.

Details

The $\alpha$ -transformation is applied to the compositional data first and then robust multivariate regression is applied. This involves numerical optimisation.
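
To give a feel for what a robust loss does, the following sketch compares the ordinary squared-error loss with the textbook form of the Welsch loss (the option spelled "welsh" above). The function names and the tuning constant c = 2.985 are assumptions made for illustration; the exact parameterisation used by gslnls may differ.

```r
# Textbook Welsch loss and the implied observation weight (assumed form).
welsch_rho <- function(r, c = 2.985) (c^2 / 2) * (1 - exp(-(r / c)^2))
welsch_weight <- function(r, c = 2.985) exp(-(r / c)^2)

r <- c(0, 0.5, 2, 10)   # illustrative residuals, the last one being an outlier
cbind(residual = r,
      squared  = r^2 / 2,          # grows without bound
      welsch   = welsch_rho(r),    # bounded above by c^2 / 2
      weight   = welsch_weight(r)) # outliers receive weights close to 0
```

Small residuals are treated almost like in least squares, whereas large residuals contribute a bounded amount to the objective.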

Value

A list including:

runtime

The time required by the regression.

be

The beta coefficients.

est

The fitted values for xnew if xnew is not NULL.

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris.

References

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Mardia K.V., Kent J.T., and Bibby J.M. (1979). Multivariate analysis. Academic press.

Aitchison J. (1986). The statistical analysis of compositional data. Chapman & Hall.

Examples

data(fadn)
y <- fadn[, 3:7]
x <- fadn[, 8]
mod <- rob.alfareg(y, x, 0.2)

Spatial K-fold cross-validation for the $\alpha$ -ESF model

Description

Spatial K-fold cross-validation for the $\alpha$ -ESF model

Usage

cv.alfaesf(y, x, a = seq(0.1, 1, by = 0.1), coords, model = "exp",
nfolds = 10, size = 1000, folds = NULL)

Arguments

y

A matrix with compositional data. Zero values are allowed.

x

A matrix with the continuous predictor variables or a data frame including categorical predictor variables.

a

The value of the power transformation, it has to be between -1 and 1. If zero values are present it has to be greater than 0. If $\alpha=0$ the isometric log-ratio transformation is applied.

coords

A matrix with the coordinates of the locations. The first column is the latitude and the second is the longitude.

model

Type of kernel to model spatial dependence. The currently available options are "exp" for the exponential kernel, "gau" for the Gaussian kernel, and "sph" for the spherical kernel. For more information check the package "spmoran".

nfolds

The number of folds to split the data.

size

folds

If you have the list with the folds supply it here. You can also leave it NULL and it will create folds.

Details

The spatial K-fold cross-validation protocol (10 folds by default) is applied to choose the optimal value of $\alpha$ .
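
The performance criterion reported by the cross-validation is the Kullback-Leibler divergence between observed and fitted compositions. A minimal base R sketch of such a criterion is given here; the function name kl_sketch() is hypothetical, and the averaging over observations and the treatment of zeros are assumptions rather than a description of the package internals.

```r
# Average Kullback-Leibler divergence from fitted (est) to observed (y) compositions.
kl_sketch <- function(y, est) {
  y <- as.matrix(y)
  est <- as.matrix(est)
  term <- y * log(y / est)
  term[y == 0] <- 0          # convention: 0 * log(0) = 0
  mean(rowSums(term))
}

# Toy observed and fitted compositions (illustrative values only).
y <- matrix(c(0.2, 0.3, 0.5,
              0.0, 0.4, 0.6), ncol = 3, byrow = TRUE)
est <- matrix(c(0.25, 0.25, 0.5,
                0.10, 0.40, 0.5), ncol = 3, byrow = TRUE)
kl_sketch(y, est)   # positive when the fit is imperfect
kl_sketch(y, y)     # zero for a perfect fit
```

Because zeros in the observed compositions contribute nothing to the sum, a criterion of this kind remains usable when zero values are present.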

Value

A list including:

runtime

The runtime required by the cross-validation.

perf

A vector with the average Kullback-Leibler divergence, for every value of $\alpha$ .

opt

A vector with the minimum Kullback-Leibler divergence, and the optimal value of $\alpha$ .

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris.

References

Tsagris M. and Pantazis Y. (2026). The $\alpha$ –regression for compositional data: a unified framework for standard, spatially-lagged, spatial autoregressive and geographically-weighted regression models. https://arxiv.org/pdf/2510.12663

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Examples

data(fadn)
coords <- fadn[1:50, 1:2]
y <- fadn[1:50, 3:7]
x <- fadn[1:50, 8]
mod <- cv.alfaesf(y, x, a = c(0.1, 0.5), coords, nfolds = 5)

Spatial K-fold cross-validation for the $\alpha$ -SAR model

Description

Spatial K-fold cross-validation for the $\alpha$ -SAR model

Usage

cv.alfasar(y, x, a = seq(0.1, 1, by = 0.1), coords, k = 2:15,
nfolds = 10, size = 1000, folds = NULL)

Arguments

y

A matrix with compositional data. Zero values are allowed.

x

A matrix with the continuous predictor variables or a data frame including categorical predictor variables.

a

The value of the power transformation, it has to be between -1 and 1. If zero values are present it has to be greater than 0. If $\alpha=0$ the isometric log-ratio transformation is applied.

coords

A matrix with the coordinates of the locations. The first column is the latitude and the second is the longitude.

k

A vector with the nearest neighbours to consider for the contiguity matrix.

nfolds

The number of folds to split the data.

size

folds

If you have the list with the folds supply it here. You can also leave it NULL and it will create folds.

Details

The spatial K-fold cross-validation protocol (10 folds by default) is applied to choose the optimal values of $\alpha$ and k.
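
Choosing the pair of $\alpha$ and k amounts to locating the minimum of the average divergence over a two-dimensional grid. The sketch below uses a made-up performance surface purely for illustration; the matrix perf here is hypothetical and is not the perf element returned by the function.

```r
a <- seq(0.1, 1, by = 0.1)   # candidate values of alpha
k <- 2:15                    # candidate numbers of nearest neighbours

# Hypothetical average divergences, one row per alpha and one column per k.
perf <- outer(a, k, function(a, k) (a - 0.5)^2 + 0.001 * (k - 6)^2)

# Row and column position of the smallest entry.
best <- which(perf == min(perf), arr.ind = TRUE)[1, ]
opt <- c(kl = min(perf), alpha = a[best[1]], k = k[best[2]])
opt
```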

Value

A list including:

runtime

The runtime required by the cross-validation.

perf

A vector with the average Kullback-Leibler divergence, for every value of $\alpha$ .

opt

A vector with the minimum Kullback-Leibler divergence, the optimal value of $\alpha$ and k.

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris.

References

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Examples

data(fadn)
coords <- fadn[1:50, 1:2]
y <- fadn[1:50, 3:7]
x <- fadn[1:50, 8]

Spatial K-fold cross-validation for the $\alpha$ -SLX model

Description

Spatial K-fold cross-validation for the $\alpha$ -SLX model

Usage

cv.alfaslx(y, x, a = seq(0.1, 1, by = 0.1), coords, k = 2:15,
nfolds = 10, size = 1000, folds = NULL)

Arguments

y

A matrix with compositional data. Zero values are allowed.

x

A matrix with the continuous predictor variables or a data frame including categorical predictor variables.

a

The value of the power transformation, it has to be between -1 and 1. If zero values are present it has to be greater than 0. If $\alpha=0$ the isometric log-ratio transformation is applied.

coords

A matrix with the coordinates of the locations. The first column is the latitude and the second is the longitude.

k

A vector with the nearest neighbours to consider for the contiguity matrix.

nfolds

The number of folds to split the data.

size

folds

If you have the list with the folds supply it here. You can also leave it NULL and it will create folds.

Details

The spatial K-fold cross-validation protocol (10 folds by default) is applied to choose the optimal values of $\alpha$ and k.

Value

A list including:

runtime

The runtime required by the cross-validation.

perf

A vector with the average Kullback-Leibler divergence, for every value of $\alpha$ .

opt

A vector with the minimum Kullback-Leibler divergence, the optimal value of $\alpha$ and k.

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris.

References

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Examples

data(fadn)
coords <- fadn[1:60, 1:2]
y <- fadn[1:60, 3:7]
x <- fadn[1:60, 8]
mod <- cv.alfaslx(y, x, a = 0.5, coords, k = 2)

Spatial k-folds

Description

Spatial k-folds.

Usage

spat.folds(coords, nfolds = 10, size = 1000)

Arguments

coords

A matrix with the coordinates of the locations. The first column is the latitude and the second is the longitude.

nfolds

The number of spatial folds to create.

size

Details

Folds of data are created based on their coordinates. For more information see the package blockCV.

Value

A list with nfolds elements. Each element is a list with two elements: the first contains the indices of the training set and the second contains the indices of the test set.
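
The structure of such a list can be mimicked in base R. The sketch below is a deliberate simplification that cuts the locations into contiguous strips along the second coordinate; it is not the blockCV procedure, and the function name strip_folds_sketch() and the element names train and test are hypothetical.

```r
strip_folds_sketch <- function(coords, nfolds = 5) {
  # Assign each location to one of nfolds contiguous strips along column 2.
  id <- cut(rank(coords[, 2], ties.method = "first"),
            breaks = nfolds, labels = FALSE)
  lapply(seq_len(nfolds), function(j) {
    list(train = which(id != j),   # indices used for fitting
         test  = which(id == j))   # indices held out for prediction
  })
}

set.seed(1)
# Simulated coordinates, used only to demonstrate the structure.
coords <- cbind(runif(40, 35, 41), runif(40, 20, 28))
folds <- strip_folds_sketch(coords, nfolds = 5)
length(folds)
lengths(lapply(folds, `[[`, "test"))
```

Every location appears in exactly one test set, and neighbouring locations tend to be held out together, which is the point of a spatial split.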

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris.

Examples

data(fadn)
coords <- fadn[1:100, 1:2]
folds <- spat.folds(coords, nfolds = 5)

The $\alpha$ -ESF model

Description

The $\alpha$ -ESF model.

Usage

alfa.esf(y, x, a, coords, model = "exp", xnew = NULL, coordsnew, yb = NULL)

Arguments

y

A matrix with the compositional data.

x

A matrix with the continuous predictor variables or a data frame including categorical predictor variables.

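a

The value of the power transformation, it has to be between -1 and 1. If zero values are present it has to be greater than 0.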

coords

A matrix with the coordinates of the locations. The first column is the latitude and the second is the longitude.

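model

Type of kernel to model spatial dependence. The currently available options are "exp" for the exponential kernel, "gau" for the Gaussian kernel, and "sph" for the spherical kernel. For more information check the package "spmoran".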

xnew

If you have new data use it, otherwise leave it NULL.

coordsnew

A matrix with the coordinates of the new locations. The first column is the latitude and the second is the longitude. If you do not have new data to make predictions leave this NULL.

yb

If you have already transformed the data using the $\alpha$ -transformation with the same $\alpha$ as given in the argument "a", put it here. Otherwise leave it NULL.

Details

The $\alpha$ -transformation is applied to the compositional data first. Then the eigenvectors of the kernelized distance matrix are computed and the appropriate number is selected to be included as predictors. The selection takes place using the $\gamma$ -OMP algorithm (Tsagris et al., 2022).
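
The eigenvector step can be sketched in base R as follows. This is an illustration under stated assumptions: the range parameter r (taken here as the median pairwise distance) and the choice to keep every eigenvector with a positive eigenvalue are hypothetical, the construction used by spmoran may differ in its details, and the $\gamma$ -OMP selection is not shown.

```r
set.seed(1)
coords <- cbind(runif(30), runif(30))   # simulated locations
d <- as.matrix(dist(coords))            # pairwise Euclidean distances

r <- median(d[upper.tri(d)])            # hypothetical range parameter
C <- exp(-d / r)                        # exponential kernel ("exp")

n <- nrow(C)
M <- diag(n) - matrix(1 / n, n, n)      # centring matrix
eig <- eigen(M %*% C %*% M, symmetric = TRUE)

keep <- eig$values > 1e-8               # eigenvectors with positive eigenvalues
E <- eig$vectors[, keep, drop = FALSE]  # candidate spatial predictors
dim(E)
```

The retained eigenvectors are mutually orthogonal and sum to zero, so they can be appended to the design matrix as additional predictors describing spatial patterns.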

Value

A list including:

runtime

The time required by the regression.

be

The beta coefficients.

gama

The gamma coefficients of the eigenvectors.

ESF

A vector with the indices of the eigenvectors used.

X.esf

A matrix with the values of the eigenvectors used.

dev

The sum of the squared residuals, as produced by the function minpack.lm::nls.lm().

est

The fitted values for xnew if xnew and coordsnew are not NULL.

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris.

References

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Examples

data(fadn)
coords <- fadn[1:50, 1:2]
y <- fadn[1:50, 3:7]
x <- fadn[1:50, 8]
mod <- alfa.esf(y, x, a = 0.1, coords = coords)

The $\alpha$ -regression using Newton-Raphson

Description

The $\alpha$ -regression using Newton-Raphson.

Usage

alfareg.nr(y, x, alpha = 1, beta_init = NULL, max_iter = 100,
tol = 1e-6, line_search = TRUE, hess.eps = 1e-5)

Arguments

y

A matrix with the compositional data.

x

A matrix with the continuous predictor variables or a data frame including categorical predictor variables.

alpha

The value of the power transformation, it has to be between -1 and 1. If zero values are present it has to be greater than 0.

beta_init

A vector of initial parameters (optional). This is then transformed into a matrix.

max_iter

The maximum number of iterations for the Newton-Raphson algorithm.

tol

The tolerance value to terminate the Newton-Raphson algorithm.

line_search

Do you want to perform line search? The default value is TRUE.

hess.eps

This is the infinitesimal change to compute the Hessian matrix numerically.

Details

The $\alpha$ -transformation is applied to the compositional data first and then multivariate regression is applied. This involves numerical optimisation.
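
The roles of max_iter, tol, line_search and hess.eps can be illustrated with a generic Newton-Raphson minimiser applied to a toy objective. The function newton_sketch() and the toy objective are hypothetical; the code only mirrors the argument names of the Usage section and makes no claim about how alfareg.nr() is implemented.

```r
newton_sketch <- function(f, grad, b, max_iter = 100, tol = 1e-6,
                          line_search = TRUE, hess.eps = 1e-5) {
  p <- length(b)
  for (it in seq_len(max_iter)) {
    g <- grad(b)
    # Numerical Hessian: central differences of the gradient.
    H <- matrix(0, p, p)
    for (j in seq_len(p)) {
      e <- numeric(p)
      e[j] <- hess.eps
      H[, j] <- (grad(b + e) - grad(b - e)) / (2 * hess.eps)
    }
    H <- (H + t(H)) / 2                 # enforce symmetry
    step <- solve(H, g)                 # Newton direction
    lam <- 1
    if (line_search) {
      # Step halving until the objective no longer increases.
      while (f(b - lam * step) > f(b) && lam > 1e-8) lam <- lam / 2
    }
    b_new <- b - lam * step
    converged <- sum(abs(b_new - b)) < tol
    b <- b_new
    if (converged) break
  }
  list(be = b, iters = it, objective = f(b))
}

# Toy convex objective with minimum at (0, -2).
f <- function(b) exp(b[1]) - b[1] + (b[2] + 2)^2
grad <- function(b) c(exp(b[1]) - 1, 2 * (b[2] + 2))
res <- newton_sketch(f, grad, b = c(2, 0))
res$be
```

A smaller tol or a larger max_iter trades running time for accuracy, while hess.eps controls the accuracy of the finite-difference Hessian.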

Value

A list including:

runtime

The time required by the regression.

iters

The number of iterations required by the Newton-Raphson algorithm.

be

The beta coefficients.

objective

The sum of the squared residuals.

est

The predicted values if xnew is not NULL.

covb

The covariance matrix of the beta coefficients, or NULL if it is singular.

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris.

References

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Mardia K.V., Kent J.T., and Bibby J.M. (1979). Multivariate analysis. Academic press.

Aitchison J. (1986). The statistical analysis of compositional data. Chapman & Hall.

Examples

data(fadn)
y <- fadn[, 3:7]
x <- fadn[, 8]
mod <- alfareg.nr(y, x, alpha = 0.2)
mod2 <- alfa.reg(y, x, 0.2)

The $\alpha$ -SAR model

Description

The $\alpha$ -SAR model.

Usage

alfa.sar(y, x, a, coords, k = 10, covb = FALSE, xnew = NULL, coordsnew, yb = NULL)

Arguments

y

A matrix with the compositional data.

x

A matrix with the continuous predictor variables or a data frame including categorical predictor variables.

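a

The value of the power transformation, it has to be between -1 and 1. If zero values are present it has to be greater than 0.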

coords

A matrix with the coordinates of the locations. The first column is the latitude and the second is the longitude.

k

The number of nearest neighbours to consider for the contiguity matrix.

covb

Do you want the covariance matrix of the spatial autoregressive parameter and the regression coefficients to be returned? If TRUE, this will slow down the process, as it is computed numerically.

xnew

If you have new data use it, otherwise leave it NULL.

coordsnew

A matrix with the coordinates of the new locations. The first column is the latitude and the second is the longitude. If you do not have new data to make predictions leave this NULL.

yb

If you have already transformed the data using the $\alpha$ -transformation with the same $\alpha$ as given in the argument "a", put it here. Otherwise leave it NULL.

Details

The $\alpha$ -transformation is applied to the compositional data first and then the spatial autoregressive (SAR) model is applied. The function first performs a grid search to locate a good value of $\rho$ , which is then used as the starting value for the numerical optimisation.
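
The grid-then-refine idea can be sketched with a one-dimensional toy objective. The objective function obj(), the grid spacing and the use of optimize() for the refinement are all assumptions made for illustration and do not describe the package's optimiser.

```r
# Hypothetical stand-in for the objective as a function of rho.
obj <- function(rho) (rho - 0.37)^2 + 1

# Step 1: coarse grid search over admissible values of rho.
grid <- seq(-0.9, 0.9, by = 0.1)
vals <- sapply(grid, obj)
rho0 <- grid[which.min(vals)]            # starting value from the grid

# Step 2: local refinement around the starting value.
fit <- optimize(obj, interval = c(rho0 - 0.1, rho0 + 0.1), tol = 1e-8)
c(start = rho0, rho = fit$minimum)
```

Starting the refinement from the best grid point reduces the risk of the optimiser settling in a poor region of the parameter space.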

Value

A list including:

runtime

The time required by the regression.

be

The beta coefficients.

covbe

The covariance matrix if covb was set to TRUE, otherwise NULL.

dev

The sum of the squared residuals, as produced by the function minpack.lm::nls.lm().

est

The fitted values for xnew if xnew and coordsnew are not NULL.

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris.

References

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Examples

data(fadn)
coords <- fadn[1:50, 1:2]
y <- fadn[1:50, 3:7]
x <- fadn[1:50, 8]

The $\alpha$ -SLX model

Description

The $\alpha$ -SLX model.

Usage

alfa.slx(y, x, a, coords, k = 10, covb = FALSE, xnew = NULL, coordsnew, yb = NULL)
alfa.slx2(y, x, a, coords, k = 2:15, xnew = NULL, coordsnew, yb = NULL)

Arguments

y

A matrix with the compositional data.

x

A matrix with the continuous predictor variables or a data frame including categorical predictor variables.

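a

The value of the power transformation, it has to be between -1 and 1. If zero values are present it has to be greater than 0.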

coords

A matrix with the coordinates of the locations. The first column is the latitude and the second is the longitude.

k

The number of nearest neighbours to consider for the contiguity matrix. For the alfa.slx2() this should be a vector.

covb

Do you want the covariance matrix of the regression coefficients to be returned? If TRUE, this will slow down the process, as it is computed numerically.

xnew

If you have new data use it, otherwise leave it NULL.

coordsnew

A matrix with the coordinates of the new locations. The first column is the latitude and the second is the longitude. If you do not have new data to make predictions leave this NULL.

yb

If you have already transformed the data using the $\alpha$ -transformation with the same $\alpha$ as given in the argument "a", put it here. Otherwise leave it NULL.

Details

The $\alpha$ -transformation is applied to the compositional data first and then the spatially lagged X (SLX) model is applied.
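
In the usual SLX formulation the design matrix is augmented with spatially lagged covariates, that is, neighbourhood averages of the predictors. The sketch below builds a k-nearest-neighbour contiguity matrix and the lagged predictors in base R; the function name knn_w_sketch(), the Euclidean distance and the row standardisation are assumptions and may not match the package's own contiguity matrix.

```r
knn_w_sketch <- function(coords, k) {
  d <- as.matrix(dist(coords))
  diag(d) <- Inf                               # a location is not its own neighbour
  n <- nrow(d)
  W <- matrix(0, n, n)
  for (i in seq_len(n)) {
    W[i, order(d[i, ])[seq_len(k)]] <- 1 / k   # row-standardised weights
  }
  W
}

set.seed(1)
coords <- cbind(runif(20), runif(20))          # simulated locations
x <- cbind(x1 = rnorm(20), x2 = rnorm(20))     # simulated predictors

W <- knn_w_sketch(coords, k = 4)
x_slx <- cbind(x, W %*% x)   # own covariates followed by their spatial lags
dim(x_slx)
```

Under this formulation the beta coefficients act on the original predictors and the gamma coefficients act on the lagged ones.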

Value

For the alfa.slx() a list including:

runtime

The time required by the regression.

be

The beta coefficients.

gama

The gamma coefficients.

covbe

The covariance matrix if covb was set to TRUE, otherwise NULL.

dev

The sum of the squared residuals, as produced by the function minpack.lm::nls.lm().

est

The fitted values for xnew if xnew and coordsnew are not NULL.

For the alfa.slx2() a list including:

runtime

The time required by the regression.

be

A list with the beta coefficients for each value of k.

gama

A list with the gamma coefficients.

dev

A vector with the sum of the squared residuals, as produced by the function minpack.lm::nls.lm(). The positions of the vector are the ones defined by the argument k that is a vector.

est

A list with the fitted values for the xnew and coordsnew, for each value of k.

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris.

References

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Examples

data(fadn)
coords <- fadn[, 1:2]
y <- fadn[, 3:7]
x <- fadn[, 8]
mod <- alfa.slx(y, x, a = 0.5, coords, k = 10)

The gradient vector of the $\alpha$ -regression model at each observation

Description

The gradient vector of the $\alpha$ -regression model at each observation.

Usage

ar.grads(y, x, a, be)

Arguments

y

A matrix with the compositional data.

x

A matrix with the continuous predictor variables or a data frame including categorical predictor variables.

a

The value of the power transformation, it has to be between -1 and 1. If zero values are present it has to be greater than 0.

be

The regression coefficients of the $\alpha$ -regression model.

Details

The gradient vector of the $\alpha$ -regression model is computed at each observation.
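
The reason for inspecting the column sums of such a matrix can be seen with an ordinary least squares analogy: at the estimated coefficients the per-observation gradients add up to zero. The sketch uses simulated data and the squared-error objective of a linear model, not the $\alpha$ -regression objective.

```r
set.seed(1)
x <- cbind(1, rnorm(50), rnorm(50))              # design matrix with intercept
y <- drop(x %*% c(1, 2, -1)) + rnorm(50)         # simulated response

be <- lm.fit(x, y)$coefficients                  # least squares estimate
res <- y - drop(x %*% be)                        # residuals

# Row i holds the gradient of the i-th squared residual with respect to be.
grads <- -2 * res * x
colSums(grads)                                   # numerically zero at the optimum
```

Column sums far from zero would suggest that the supplied coefficients are not a stationary point of the objective.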

Value

A matrix with the gradient vector computed at each observation.

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris.

References

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Examples

data(fadn)
coords <- fadn[, 1:2]
y <- fadn[, 3:7]
x <- fadn[, 8:10]
mod <- alfa.reg(y, x, 0.5)
grads <- ar.grads(y, x, a = 0.5, mod$be)
colSums(grads)

The gradient vector of the $\alpha$ -SAR model at each observation

Description

The gradient vector of the $\alpha$ -SAR model at each observation.

Usage

asar.grads(y, x, a, rho, be, coords, k)

Arguments

y

A matrix with the compositional data.

x

A matrix with the continuous predictor variables or a data frame including categorical predictor variables.

a

The value of the power transformation, it has to be between -1 and 1. If zero values are present it has to be greater than 0.

rho

The spatial autocorrelation parameter $\rho$ .

be

The regression coefficients of the $\alpha$ -SAR model.

coords

A matrix with the coordinates of the locations. The first column is the latitude and the second is the longitude.

k

The number of nearest neighbours to consider for the contiguity matrix.

Details

The gradient vector of the $\alpha$ -SAR model is computed at each observation.

Value

A matrix with the gradient vector computed at each observation.

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris.

References

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Examples

data(fadn)
coords <- fadn[, 1:2]
y <- fadn[, 3:7]
x <- fadn[, 8:10]

be <- matrix( c( 12.72191991, 0.04300266, -1.78301001, -3.02074120, -23.54785921,
0.06771573, 2.71969599, 1.89312564, 5.38640736, 0.05179626, -1.21336879, 0.40175088,
-1.98258721, 0.06815682, -0.64458883, 0.95470802 ), ncol = 4 )

The gradient vector of the $\alpha$ -SLX model at each observation

Description

The gradient vector of the $\alpha$ -SLX model at each observation.

Usage

aslx.grads(y, x, a, be, gama, coords, k = 10)

Arguments

y

A matrix with the compositional data.

x

A matrix with the continuous predictor variables or a data frame including categorical predictor variables.

a

The value of the power transformation, it has to be between -1 and 1. If zero values are present it has to be greater than 0.

be

The regression coefficients of the $\alpha$ -SLX model.

gama

The gamma coefficients of the $\alpha$ -SLX model.

coords

A matrix with the coordinates of the locations. The first column is the latitude and the second is the longitude.

k

The number of nearest neighbours to consider for the contiguity matrix.

Details

The gradient vector of the $\alpha$ -SLX model is computed at each observation.

Value

A matrix with the gradient vector computed at each observation.

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris.

References

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Examples

data(fadn)
coords <- fadn[, 1:2]
y <- fadn[, 3:7]
x <- fadn[, 8]
mod <- alfa.slx(y, x, a = 0.5, coords, k = 10)
grads <- aslx.grads(y, x, a = 0.5, mod$be, mod$gama, coords, k = 10)
colSums(grads)

The GW $\alpha$ R model

Description

The GW $\alpha$ R model.

Usage

gwar(y, x, a, coords, h, yb = NULL, nc = 1)

Arguments

y

A matrix with the compositional data.

x

A matrix with the continuous predictor variables or a data frame including categorical predictor variables.

a

The value of the power transformation, it has to be between -1 and 1.

coords

A matrix with the coordinates of the locations. The first column is the latitude and the second is the longitude.

h

The bandwidth value.

yb

If you have already transformed the data using the $\alpha$ -transformation with the same $\alpha$ as given in the argument "a", put it here. Otherwise leave it NULL.

nc

Details

The $\alpha$ -transformation is applied to the compositional data first and then the GW $\alpha$ R model is applied.
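
The role of the bandwidth h can be illustrated with a plain geographically weighted least squares fit on simulated data. The Gaussian kernel used for the weights is an assumed form, and the local fits use a univariate linear model rather than the $\alpha$ -regression, so this is only a sketch of the weighting principle.

```r
set.seed(1)
coords <- cbind(runif(25), runif(25))             # simulated locations
x <- cbind(1, rnorm(25))                          # intercept and one predictor
y <- drop(x %*% c(0.5, 1.5)) + rnorm(25, sd = 0.1)

h <- 0.3                                          # bandwidth
d <- as.matrix(dist(coords))
w <- exp(-d^2 / (2 * h^2))                        # Gaussian kernel weights (assumed form)

# One weighted least squares fit per location, using the weights in row i.
be <- t(sapply(seq_len(nrow(x)), function(i) {
  lm.wfit(x, y, w[i, ])$coefficients
}))
dim(be)                                           # one row of coefficients per location
```

A small h makes each local fit depend almost exclusively on the closest locations, whereas a large h gives nearly equal weights and approaches a single global fit.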

Value

A list including:

runtime

The time required by the regression.

be

The beta coefficients.

est

The fitted values.

Author(s)

Michail Tsagris.

R implementation and documentation: Michail Tsagris.

References

Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2): 47-57. https://arxiv.org/pdf/1508.01913v1.pdf

Examples

data(fadn)
coords <- fadn[, 1:2]
y <- fadn[, 3:7]
x <- fadn[, 8]
mod <- gwar(y, x, a = 1, coords, h = 0.001)
