Title: | Maximum Likelihood Estimation and Likelihood Ratio Test Functions for Separable Variance-Covariance Structures |
---|---|
Description: | Maximum likelihood estimation of the parameters of matrix and 3rd-order tensor normal distributions with unstructured factor variance covariance matrices, two procedures, and for unbiased modified likelihood ratio testing of simple and double separability for variance-covariance structures, two procedures. References: Dutilleul P. (1999) <doi:10.1080/00949659908811970>, Manceur AM, Dutilleul P. (2013) <doi:10.1016/j.cam.2012.09.017>, and Manceur AM, Dutilleul P. (2013) <doi:10.1016/j.spl.2012.10.020>. |
Authors: | Ameur Manceur [aut], Timothy Schwinghamer [aut, cre], Pierre Dutilleul [aut, cph] |
Maintainer: | Timothy Schwinghamer <[email protected]> |
License: | MIT + file LICENSE |
Version: | 0.3.2 |
Built: | 2024-11-01 11:39:03 UTC |
Source: | CRAN |
An i.i.d. random sample of size 7 from a 2 x 3 matrix normal distribution, for a small numerical example of the use of the functions mle2d_svc and lrt2d_svc from the sEparaTe package
data2d
data2d
A frame (excluding the headings) with 42 lines of data and 4 variables:
an integer ranging from 1 to 7, the size of an i.i.d. random sample from a 2 x 3 matrix normal distribution
an integer ranging from 1 to 2 , the number of rows of the matrix normal distribution
an integer ranging from 1 to 3 , the number of columns of the matrix normal distribution
the sample data for the observed variable
An i.i.d. random sample of size 13 from a 2 x 3 x 2 tensor normal distribution, for a small numerical example of the use of the functions mle3d_svc and lrt3d_svc from the sEparaTe package
data3d
data3d
A frame (excluding the headings) with 156 lines of data and 5 variables:
an integer ranging from 1 to 13, the size of an i.i.d. random sample from a 2 x 3 x 2 tensor matrix normal distribution
an integer ranging from 1 to 2, the number of rows of the 3rd-order tensor normal distribution
an integer ranging from 1 to 3, the number of columns of the 3rd-order tensor normal distribution
an integer ranging from 1 to 2, the number of edges of the 3rd-order tensor normal distribution
the sample data for the observed variable
A likelihood ratio test (LRT) for simple separability of a variance-covariance matrix, modified to be unbiased in finite samples. The modification is a penalty-based homothetic transformation of the LRT statistic. The penalty value is optimized for a given mean model, which is left unstructured here. In the required function, the Id1 and Id2 variables correspond to the row and column subscripts, respectively; “value2d” refers to the observed variable.
lrt2d_svc( value2d, Id1, Id2, subject, data_2d, eps, maxiter, startmat, sign.level, n.simul )
lrt2d_svc( value2d, Id1, Id2, subject, data_2d, eps, maxiter, startmat, sign.level, n.simul )
value2d |
from the formula value2d ~ Id1 + Id2 |
Id1 |
from the formula value2d ~ Id1 + Id2 |
Id2 |
from the formula value2d ~ Id1 + Id2 |
subject |
the replicate, also called the subject or individual, the first column in the matrix (2d) data file |
data_2d |
the name of the matrix data |
eps |
the threshold in the stopping criterion for the iterative mle algorithm (estimation) |
maxiter |
the maximum number of iterations for the mle algorithm (estimation) |
startmat |
the value of the second factor variance-covariance matrix used for initialization, i.e., to start the mle algorithm (estimation) and obtain the initial estimate of the first factor variance-covariance matrix |
sign.level |
the significance level, or rejection rate in the testing of the null hypothesis of simple separability for a variance-covariance structure, when the unbiased modified LRT is used, i.e., the critical value in the chi-square test is derived by simulations from the sampling distribution of the LRT statistic |
n.simul |
the number of simulations used to build the sampling distribution of the LRT statistic under the null hypothesis, using the same characteristics as the i.i.d. random sample from a matrix normal distribution |
“Convergence”, TRUE or FALSE
“chi.df”, the theoretical number of degrees of freedom of the asymptotic chi-square distribution that would apply to the unmodified LRT statistic for simple separability of a variance-covariance structure
“Lambda”, the observed value of the unmodified LRT statistic
“critical.value”, the critical value at the specified significance level for the chi-square distribution with “chi.df” degrees of freedom
“Decision.lambda” will indicate whether or not the null hypothesis of separability was rejected, based on the theoretical LRT statistic
“Simulation.critical.value”, the critical value at the specified significance level that is derived from the sampling distribution of the unbiased modified LRT statistic
“Decision.lambda.simulation”, the decision (acceptance/rejection) regarding the null hypothesis of simple separability, made using the theoretical (biased unmodified) LRT
“Penalty”, the optimized penalty value used in the homothetic transformation between the biased unmodified and unbiased modified LRT statistics
“U1hat”, the estimated variance-covariance matrix for the rows
“Standardized_U1hat”, the standardized estimated variance-covariance matrix for the rows; the standardization is performed by dividing each entry of U1hat by entry(1, 1) of U1hat
“U2hat”, the estimated variance-covariance matrix for the columns
“Standardized_U2hat”, the standardized estimated variance-covariance matrix for the columns; the standardization is performed by multiplying each entry of U2hat by entry(1, 1) of U1hat
“Shat”, the sample variance-covariance matrix computed from the vectorized data matrices
Manceur AM, Dutilleul P. 2013. Unbiased modified likelihood ratio tests for simple and double separability of a variance-covariance structure. Statistics and Probability Letters 83: 631-636.
output <- lrt2d_svc(data2d$value2d, data2d$Id1, data2d$Id2, data2d$K, data_2d = data2d, n.simul = 100) output
output <- lrt2d_svc(data2d$value2d, data2d$Id1, data2d$Id2, data2d$K, data_2d = data2d, n.simul = 100) output
A likelihood ratio test (LRT) for double separability of a variance-covariance structure, modified to be unbiased in finite samples. The modification is a penalty-based homothetic transformation of the LRT statistic. The penalty value is optimized for a given mean model, which is left unstructured here. In the required function, the Id3, Id4 and Id5 variables correspond to the row, column and edge subscripts, respectively; “value3d” refers to the observed variable.
lrt3d_svc( value3d, Id3, Id4, Id5, subject, data_3d, eps, maxiter, startmatU2, startmatU3, sign.level, n.simul )
lrt3d_svc( value3d, Id3, Id4, Id5, subject, data_3d, eps, maxiter, startmatU2, startmatU3, sign.level, n.simul )
value3d |
from the formula value3d ~ Id3 + Id4 + Id5 |
Id3 |
from the formula value3d ~ Id3 + Id4 + Id5 |
Id4 |
from the formula value3d ~ Id3 + Id4 + Id5 |
Id5 |
from the formula value3d ~ Id3 + Id4 + Id5 |
subject |
the replicate, also called individual |
data_3d |
the name of the tensor data |
eps |
the threshold in the stopping criterion for the iterative mle algorithm (estimation) |
maxiter |
the maximum number of iterations for the mle algorithm (estimation) |
startmatU2 |
the value of the second factor variance-covariance matrix used for initialization |
startmatU3 |
the value of the third factor variance-covariance matrix used for initialization, i.e., startmatU3 together with startmatU2 are used to start the mle algorithm (estimation) and obtain the initial estimate of the first factor variance-covariance matrix U1 |
sign.level |
the significance level, or rejection rate in the testing of the null hypothesis of simple separability for a variance-covariance structure, when the unbiased modified LRT is used, i.e., the critical value in the chi-square test is derived by simulations from the sampling distribution of the LRT statistic |
n.simul |
the number of simulations used to build the sampling distribution of the LRT statistic under the null hypothesis, using the same characteristics as the i.i.d. random sample from a tensor normal distribution |
“Convergence”, TRUE or FALSE
“chi.df”, the theoretical number of degrees of freedom of the asymptotic chi-square distribution that would apply to the unmodified LRT statistic for double separability of a variance-covariance structure “Lambda”, the observed value of the unmodified LRT statistic
“critical.value”, the critical value at the specified significance level for the chi-square distribution with “chi.df” degrees of freedom
“Decision.lambda”, the decision (acceptance/rejection) regarding the null hypothesis of double separability, made using the theoretical (biased unmodified) LRT
“Simulation.critical.value”, the critical value at the specified significance level that is derived from the sampling distribution of the unbiased modified LRT statistic
“Decision.lambda.simulation”, the decision (acceptance/rejection) regarding the null hypothesis of double separability, made using the unbiased modified LRT
“Penalty”, the optimized penalty value used in the homothetic transformation between the biased unmodified and unbiased modified LRT statistics
“U1hat”, the estimated variance-covariance matrix for the rows
“Standardized_U1hat”, the standardized estimated variance-covariance matrix for the rows; the standardization is performed by dividing each entry of U1hat by entry(1, 1) of U1hat
“U2hat”, the estimated variance-covariance matrix for the columns
“Standardized_U2hat”, the standardized estimated variance-covariance matrix for the columns; the standardization is performed by multiplying each entry of U2hat by entry(1, 1) of U1hat
“U3hat”, the estimated variance-covariance matrix for the edges
“Shat”, the sample variance-covariance matrix computed from the vectorized data tensors
Manceur AM, Dutilleul P. 2013. Unbiased modified likelihood ratio tests for simple and double separability of a variance-covariance structure. Statistics and Probability Letters 83: 631-636.
output <- lrt3d_svc(data3d$value3d, data3d$Id3, data3d$Id4, data3d$Id5, data3d$K, data_3d = data3d, n.simul = 100) output
output <- lrt3d_svc(data3d$value3d, data3d$Id3, data3d$Id4, data3d$Id5, data3d$K, data_3d = data3d, n.simul = 100) output
Maximum likelihood estimation for the parameters of a matrix normal distribution X, which is characterized by a simply separable variance-covariance structure. In the general case, which is the case considered here, two unstructured factor variance-covariance matrices determine the covariability of random matrix entries, depending on the row (one factor matrix) and the column (the other factor matrix) where two X-entries are. In the required function, the Id1 and Id2 variables correspond to the row and column subscripts, respectively; “value2d” indicates the observed variable.
mle2d_svc(value2d, Id1, Id2, subject, data_2d, eps, maxiter, startmat)
mle2d_svc(value2d, Id1, Id2, subject, data_2d, eps, maxiter, startmat)
value2d |
from the formula value2d ~ Id1 + Id2 |
Id1 |
from the formula value2d ~ Id1 + Id2 |
Id2 |
from the formula value2d ~ Id1 + Id2 |
subject |
the replicate, also called individual |
data_2d |
the name of the matrix data |
eps |
the threshold in the stopping criterion for the iterative mle algorithm |
maxiter |
the maximum number of iterations for the iterative mle algorithm |
startmat |
the value of the second factor variance-covariance matrix used for initialization, i.e., to start the algorithm and obtain the initial estimate of the first factor variance-covariance matrix |
“Convergence”, TRUE or FALSE
“Iter”, will indicate the number of iterations needed for the mle algorithm to converge
“Xmeanhat”, the estimated mean matrix (i.e., the sample mean)
“First”, the row subscript, or the second column in the data file
“U1hat”, the estimated variance-covariance matrix for the rows
“Standardized.U1hat”, the standardized estimated variance-covariance matrix for the rows; the standardization is performed by dividing each entry of U1hat by entry(1, 1) of U1hat
“Second”, the column subscript, or the third column in the data file
“U2hat”, the estimated variance-covariance matrix for the columns
“Standardized.U2hat”, the standardized estimated variance-covariance matrix for the columns; the standardization is performed by multiplying each entry of U2hat by entry(1, 1) of U1hat
“Shat”, is the sample variance-covariance matrix computed from of the vectorized data matrices
Dutilleul P. 1990. Apport en analyse spectrale d'un periodogramme modifie et modelisation des series chronologiques avec repetitions en vue de leur comparaison en frequence. D.Sc. Dissertation, Universite catholique de Louvain, Departement de mathematique.
Dutilleul P. 1999. The mle algorithm for the matrix normal distribution. Journal of Statistical Computation and Simulation 64: 105-123.
output <- mle2d_svc(data2d$value2d, data2d$Id1, data2d$Id2, data2d$K, data_2d = data2d) output
output <- mle2d_svc(data2d$value2d, data2d$Id1, data2d$Id2, data2d$K, data_2d = data2d) output
Maximum likelihood estimation for the parameters of a 3rd-order tensor normal distribution X, which is characterized by a doubly separable variance-covariance structure. In the general case, which is the case considered here, three unstructured factor variance-covariance matrices determine the covariability of random tensor entries, depending on the row (one factor matrix), the column (another factor matrix) and the edge (remaining factor matrix) where two X-entries are. In the required function, the Id3, Id4 and Id5 variables correspond to the row, column and edge subscripts, respectively; “value3d” indicates the observed variable.
mle3d_svc( value3d, Id3, Id4, Id5, subject, data_3d, eps, maxiter, startmatU2, startmatU3 )
mle3d_svc( value3d, Id3, Id4, Id5, subject, data_3d, eps, maxiter, startmatU2, startmatU3 )
value3d |
from the formula value3d ~ Id3 + Id4 + Id5 |
Id3 |
from the formula value3d ~ Id3 + Id4 + Id5 |
Id4 |
from the formula value3d ~ Id3 + Id4 + Id5 |
Id5 |
from the formula value3d ~ Id3 + Id4 + Id5 |
subject |
the replicate, also called individual |
data_3d |
the name of the tensor data |
eps |
the threshold in the stopping criterion for the iterative mle algorithm |
maxiter |
the maximum number of iterations for the iterative mle algorithm |
startmatU2 |
the value of the second factor variance covariance matrix used for initialization |
startmatU3 |
the value of the third factor variance covariance matrix used for initialization, i.e., startmatU3 together with startmatU2 are used to start the algorithm and obtain the initial estimate of the first factor variance covariance matrix U1 |
“Convergence”, TRUE or FALSE
“Iter”, the number of iterations needed for the mle algorithm to converge
“Xmeanhat”, the estimated mean tensor (i.e., the sample mean)
“First”, the row subscript, or the second column in the data file
“U1hat”, the estimated variance-covariance matrix for the rows
“Standardized.U1hat”, the standardized estimated variance-covariance matrix for the rows; the standardization is performed by dividing each entry of U1hat by entry(1, 1) of U1hat
“Second”, the column subscript, or the third column in the data file
“U2hat”, the estimated variance-covariance matrix for the columns
“Standardized.U2hat”, the standardized estimated variance-covariance matrix for the columns; the standardization is performed by multiplying each entry of U2hat by entry(1, 1) of U1hat
“Third”, the edge subscript, or the fourth column in the data file
“U3hat”, the estimated variance-covariance matrix for the edges
“Shat”, the sample variance-covariance matrix computed from the vectorized data tensors
Manceur AM, Dutilleul P. 2013. Maximum likelihood estimation for the tensor normal distribution: Algorithm, minimum sample size, and empirical bias and dispersion. Journal of Computational and Applied Mathematics 239: 37-49.
output <- mle3d_svc(data3d$value3d, data3d$Id3, data3d$Id4, data3d$Id5, data3d$K, data_3d = data3d) output
output <- mle3d_svc(data3d$value3d, data3d$Id3, data3d$Id4, data3d$Id5, data3d$K, data_3d = data3d) output
A package for maximum likelihood estimation (MLE) of the parameters of matrix and 3rd-order tensor normal distributions with unstructured factor variance-covariance matrices (two procedures), and for unbiased modified likelihood ratio testing (LRT) of simple and double separability for variance-covariance structures (two procedures).
mle2d_svc, for maximum likelihood estimation of the parameters of a matrix normal distribution
mle3d_svc, for maximum likelihood estimation of the parameters of a 3rd-order tensor normal distribution
lrt2d_svc, for the unbiased modified likelihood ratio test of simple separability for a variance-covariance structure
lrt3d_svc, for the unbiased modified likelihood ratio test of double separability for a variance-covariance structure
data2d, a two-dimensional data set
data3d, a three-dimensional data set
Dutilleul P. 1999. The mle algorithm for the matrix normal distribution. Journal of Statistical Computation and Simulation 64: 105-123.
Manceur AM, Dutilleul P. 2013. Maximum likelihood estimation for the tensor normal distribution: Algorithm, minimum sample size, and empirical bias and dispersion. Journal of Computational and Applied Mathematics 239: 37-49.
Manceur AM, Dutilleul P. 2013. Unbiased modified likelihood ratio tests for simple and double separability of a variance covariance structure. Statistics and Probability Letters 83: 631-636.