Title: | Nonparametric Analysis of Multivariate Data in Factorial Designs |
---|---|
Description: | Analysis of multivariate data with two-way completely randomized factorial design. The analysis is based on fully nonparametric, rank-based methods and uses test statistics based on the Dempster's ANOVA, Wilk's Lambda, Lawley-Hotelling and Bartlett-Nanda-Pillai criteria. The multivariate response is allowed to be ordinal, quantitative, binary or a mixture of the different variable types. The package offers two functions performing the analysis, one for small and the other for large sample sizes. The underlying methodology is largely described in Bathke and Harrar (2016) <doi:10.1007/978-3-319-39065-9_7> and in Munzel and Brunner (2000) <doi:10.1016/S0378-3758(99)00212-8>. |
Authors: | Maximilian Kiefel and Arne C. Bathke |
Maintainer: | Maximilian Kiefel <[email protected]> |
License: | GPL-2 | GPL-3 |
Version: | 0.2.1 |
Built: | 2024-11-27 06:36:31 UTC |
Source: | CRAN |
Analysis of multivariate data with two-way completely randomized factorial design - version for large samples. The analysis is based on fully nonparametric, rank-based methods and uses an F-approximation for 'Dempster's ANOVA' and a chisquare-approximation for the criteria called 'Wilks Lambda', 'Lawley-Hotelling' and 'Bartlett-Nanda-Pillai'. These approximations are given by the asymtotic distribution of these statistics under true null-hypothesis. In contrast to the normal-approximated test (as used in the small sample version) it is designed for data with large samples (see details) while the number of factorial levels is allowed to be small. The multivariate response is allowed to be ordinal, quantitative, binary or a mixture of the different variable types. The test statistics are constructed using nonparametric relative effect estimators.
nparml(formula, data)
nparml(formula, data)
formula |
an object of class "formula" with two explanatory variables (factors), see examples. |
data |
an object of class "data.frame" containing the variables in the formula |
The data is analysed for main effects and interaction effect of the explanatory factors. In each case the null hypothesis "no effect" is testet. In order to obtain reliable results the considered data should include at least 7 observations per factor level combination. This method is only implemented for complete data sets without missing values.
Returns a list of data frames providing the values of the test statistics, p-values, degrees of freedom, factor levels, and groupsize per factor level combination.
Kiefel M., Bathke A.C. (2020) Rank-Based Analysis of Multivariate Data in Factorial Designs and Its Implementation in R In: Nonparametric Statistics (285-294) Springer Proceedings in Mathematics & Statistics Springer International Publishing, Cham
Bathke A.C., Harrar S.W. (2016) Rank-Based Inference for Multivariate Data in Factorial Designs. In: Liu R., McKean J. (eds) Robust Rank-Based and Nonparametric Methods. Springer Proceedings in Mathematics & Statistics, vol 168. Springer, Cham
Harrar S.W., Bathke A.C. (2012) A modified two-factor multivariate analysis of variance: asymptotics and small sample approximations (and erratum). In: Annals of the Institute of Statistical Mathematics, 64(1&5):135-165&1087, 2012.
Brunner E., Dette H., Munk A. (1997) Box-Type Approximations in Nonparametric Factorial Designs In: Journal of the American Statistical Association, 92(440):1494-1502
data(pseudostudy1) nparml(resp1|resp2|resp3~treatment*age, pseudostudy1)
data(pseudostudy1) nparml(resp1|resp2|resp3~treatment*age, pseudostudy1)
Analysis of multivariate data with two-way completely randomized factorial design - version for small samples. The analysis is based on fully nonparametric, rank-based methods and uses a N(0,1)-approximation for test statistics based on 'Dempster's ANOVA', 'Wilk's Lambda', 'Lawley-Hotelling' and 'Bartlett-Nanda-Pillai' criteria. This approximation is established by the asymptotic distribution of these four statistics under true null-hypothesis if one of the explanatory factors has a large number of levels. The multivariate response is allowed to be ordinal, quantitative, binary or a mixture of the different variable types. The test statistics are constructed using nonparametric relative effect estimators.
nparms(formula,data)
nparms(formula,data)
formula |
an object of class "formula" with two explanatory variables (factors), see examples. |
data |
an object of class "data.frame" containing the variables in the formula |
This method is only implemented for complete data sets without missing values. The data is analysed for main effects and interaction effect of the explanatory factors. In each case the null hypothesis "no effect" is testet. The explanatory factor that has the higher number of levels is automatically designated as factor "A". The covariance matrix estimation requires at least 4 observations (observation vectors) per factor level combination. As the estimation is very time-consuming for large groups it is performed wih a random selection of observations when a group exceeds a size of 6 observation vectors.
Returns a list of data frames providing the values of the test statistics, p-values, degrees of freedom, factor levels, and groupsize per factor level combination.
Kiefel M., Bathke A.C. (2020) Rank-Based Analysis of Multivariate Data in Factorial Designs and Its Implementation in R In: Nonparametric Statistics (285-294) Springer Proceedings in Mathematics & Statistics Springer International Publishing, Cham
Bathke A.C., Harrar S.W. (2016) Rank-Based Inference for Multivariate Data in Factorial Designs. In: Liu R., McKean J. (eds) Robust Rank-Based and Nonparametric Methods. Springer Proceedings in Mathematics & Statistics, vol 168. Springer, Cham
Harrar S.W., Bathke A.C. (2012) A modified two-factor multivariate analysis of variance: asymptotics and small sample approximations (and erratum). In: Annals of the Institute of Statistical Mathematics, 64(1&5):135-165&1087, 2012.
Brunner E., Dette H., Munk A. (1997) Box-Type Approximations in Nonparametric Factorial Designs In: Journal of the American Statistical Association, 92(440):1494-1502
data(pseudostudy2) nparms(resp1|resp2|resp3~treatment*age, pseudostudy2)
data(pseudostudy2) nparms(resp1|resp2|resp3~treatment*age, pseudostudy2)
Example data set in a two-way layout without effects (in terms of disribution) by factors "treatment" and "age". In constrast to the 'pseudostudy2' data set the number of factorial levels is low while the number of observations per factor level combination is large.
data(pseudostudy1)
data(pseudostudy1)
pseudostudy consists of 56 cases (rows) and 5 variables where 'treatment'('age') is considered to be the explanatory factor 'A'('B') and the variables resp1, resp2. resp3 form the response vector.
data(pseudostudy1)
data(pseudostudy1)
Example data set in a two-way layout without effects (in terms of disribution) by factors "treatment" and "age".
data(pseudostudy2)
data(pseudostudy2)
pseudostudy consists of 107 cases (rows) and 5 variables where 'treatment'('age') is considered to be the explanatory factor 'A'('B') and the variables resp1, resp2. resp3 form the response vector.
data(pseudostudy2)
data(pseudostudy2)