Package 'oglmx' reference manual

Title:	Estimation of Ordered Generalized Linear Models
Description:	Ordered models such as ordered probit and ordered logit presume that the error variance is constant across observations. In the case that this assumption does not hold estimates of marginal effects are typically biased (Weiss (1997)). This package allows for generalization of ordered probit and ordered logit models by allowing the user to specify a model for the variance. Furthermore, the package includes functions to calculate the marginal effects. Wrapper functions to estimate the standard limited dependent variable models are also included.
Authors:	Nathan Carroll
Maintainer:	Nathan Carroll <[email protected]>
License:	GPL-2
Version:	3.0.0.0
Built:	2025-03-10 06:08:35 UTC
Source:	CRAN

Estimation of Ordered Generalized Linear Models Package for estimation of ordered generalized linear models.

Description

Ordered models such as ordered probit and ordered logit presume that the error variance is constant across observations. In the case that this assumption does not hold estimates of marginal effects are typically biased (Weiss (1997)). This package allows for generalization of ordered probit and ordered logit models by allowing the user to specify a model for the variance. Furthermore, the package includes functions to calculate the marginal effects. Wrapper functions to estimate the standard limited dependent variable models are also included.

Details

Package:	oglmx
Type:	Package
Title:	Estimation of Ordered Generalized Linear Models
Version:	3.0.0.0
Date:	2018-05-05
Author:	Nathan Carroll
Maintainer:	Nathan Carroll <[email protected]>
Description:	Ordered models such as ordered probit and ordered logit presume that the error variance is constant across observations. In the case that this assumption does not hold estimates of marginal effects are typically biased (Weiss (1997)). This package allows for generalization of ordered probit and ordered logit models by allowing the user to specify a model for the variance. Furthermore, the package includes functions to calculate the marginal effects. Wrapper functions to estimate the standard limited dependent variable models are also included.
License:	GPL-2
Depends:	maxLik
Imports:	stats
Suggests:	glmx, lmtest
NeedsCompilation:	no
Packaged:	2018-05-05 10:44:24 UTC; Nathan
Repository:	CRAN
Date/Publication:	2018-05-05 11:24:43 UTC

Index of help topics:

AIC.oglmx               Calculate Akaike Information Criterion
D_continuous.margin.mean_mean
                        Calculate derivatives of marginal effects for
                        continuous variables.
D_discrete.margin_meanonly.mean
                        Calculate derivatives of marginal effects for
                        binary variables.
McFaddensR2.oglmx       Calculate McFadden's R-Squared.
Probability             Various functions not intended for user.
continuous.margin.mean
                        Calculate marginal effects for continuous
                        variables.
discrete.margin_meanonly
                        Calculate marginal effects for binary
                        variables.
formula.oglmx           Obtain model formula for an 'oglmx' object.
getEtas                 Construct ingredients for probability
                        calculation.
logLik.oglmx            Extract log likelihood value
logit.reg               Fit Logit Model.
margins.oglmx           Calculate marginal effects for 'oglmx' objects.
oglmx                   Fit Ordered Generalized Linear Model.
oglmx-package           Estimation of Ordered Generalized Linear Models
                        Package for estimation of ordered generalized
                        linear models.
ologit.reg              Fit an ordered Logit model.
oprobit.reg             Fit Ordered Probit Model.
probit.reg              Fit Probit Model.
scoreMean               Calculate derivatives of loglikelihood
summary.oglmx           Summarizing Ordered Discrete Outcome Model Fits
vcov.oglmx              Calculate Variance-Covariance Matrix for an
                        oglmx Object

Further information is available in the following vignettes:

`oglmxVignette`	oglmx: A Package for Estimation of Ordered Generalized Linear Models. (source, pdf)

Author(s)

Nathan Carroll

Maintainer: Nathan Carroll <[email protected]>

Calculate Akaike Information Criterion

Description

Calculates the Akaike Information Criterion for objects of class oglmx. Calculate using the formula $-2*loglikelihood + k*npar$ where $npar$ represents the number of parameters in the model and $k$ is the cost of additional parameters, equal to 2 for the AIC, it is $k=\log(n)$ with $n$ the number of observations for the BIC.

Usage

  ## S3 method for class 'oglmx'
AIC(object, ..., k = 2)
## S3 method for class 'oglmx'
AIC(object, ..., k = 2)

Arguments

`object`	object of class `oglmx`
`...`	additional arguments. Currently ignored.
`k`	the penalty per parameter to be used.

Details

When comparing models by maximium likelihood estimation the smaller the value of the AIC the better.

Value

A numeric value with the AIC.

`paramvec`	Coefficients related to variables for which marginal effects are desired.
`etas`	Inputs to link functions.
`link`	specifies the link function for the estimated model.
`std.dev`	The calculated standard deviation of the error terms.
`gstd.dev`	The calculated derivative of the standard deviation of the error terms.

`whichMargins`	Numeric vector indicating indexes of parameters in the relevant matrix for which margins are desired.
`whichXest`	Logical vector indicating the variables in X for which the relevant parameters were estimated.
`X`	Data matrix containing variables in mean equation.
`paramvec`	Coefficients related to variables for which marginal effects are desired.
`etas`	Inputs to link functions.
`link`	specifies the link function for the estimated model.
`std.dev`	The calculated standard deviation of the error terms.
`Z`	Data matrix containing variables in variance equation.
`whichZest`	Logical vector indicating the variables in Z for which the relevant parameters were estimated.
`gstd.dev`	The calculated derivative of the standard deviation of the error terms.
`hstd.dev`	The calculated second derivative of the standard deviation of the error terms.
`estThresh`	Logical vector indicating which threshold parameters were estimated.
`outcomematrix`	A matrix that indicates the outcome variable.

`whichVars`	Numeric vector stating indexes of variables that are binary and marginal effects are desired.
`whichXest`	Logical vector indicating the variables in X for which the relevant parameters were estimated.
`X`	Data matrix containing variables in mean equation.
`fouretas`	Inputs to link functions.
`link`	specifies the link function for the estimated model.
`std.dev`	The calculated standard deviation of the error terms.
`Z`	Data matrix containing variables in variance equation.
`whichZest`	Logical vector indicating the variables in Z for which the relevant parameters were estimated.
`gstd.dev`	The calculated derivative of the standard deviation of the error terms.
`estThresh`	Logical vector indicating which threshold parameters were estimated.
`outcomematrix`	A matrix that indicates the outcome variable.
`ZDinputs`	Values of inputs to function that gives standard deviation when binary variable is equal to 0 and 1.
`StdDevs`	Values of standard deviation when binary variable is equal to 0 and 1.
`gsdmodel`	Expression used to calculate derivative of standard deviation.
`BothEqLocs`	Dataframe describing locations of binary variables that are in both the mean and variance equations.

`beta`	Coefficients for the mean equation.
`X`	Variable values for the mean equation.
`whichVars`	Numeric vector stating indexes of variables that are binary and marginal effects are desired.
`etas`	Inputs to link functions.
`link`	specifies the link function for the estimated model.
`std.dev`	The calculated standard deviation of the error terms.
`delta`	Coefficients for the variance equation.
`Z`	Variable values for the variance equation.
`sdmodel`	Expression used to calculate standard deviation.
`BothEqLocs`	Dataframe describing locations of binary variables that are in both the mean and variance equations.

`x`	object of class `oglmx`.
`...`	additional arguments, currently ignored.

`thresholds`	Numeric matrix of dimension (number of observations * 2). Columns refer to the right and left threshold corresponding to the desired outcome.
`xb`, `xb_matrix`	Numeric vector/matrix of expected values of the latent variable.
`std.dev`, `sd_matrix`	Numeric vector/matrix of standard deviations of the error term given variables.

`eta_1`	Numeric vector/matrix corresponding to the right threshold.
`eta_0`	Numeric vector/matrix corresponding to the left threshold.

`Env`, `inputenv`	environment, typically constructed by the `oglmx.fit` function, that contains all relevant information for the optimisation process.
`Parameters`, `start`	numeric vector of length equal to the number of estimated parameters.
`formula1`, `formula2`	items of class `formula`.
`whichparameter`	logical
`gfunc`	expression, function used to model the variance
`threshvec`, `thresholdvector`	numeric vectors of threshold values
`outcomematrix`	numeric matrix with binary variables indicating the outcome for each observation
`eta_1`, `eta_0`	input values for the link function
`link`	string value indicating which link function is to be used

`formula`	an object of class `formula`: a symbolic description of the model used to explain the mean of the latent variable. The response variable should be a numeric vector or factor variable with two values.
`data`	a data frame containing the variables in the model.
`start`	either `NULL` or a numeric vector specifying start values for each of the estimated parameters, passed to the maximisation routine.
`weights`	either `NULL` or a numeric vector of length equal to the number of rows in the data frame. Used to apply weighted maximum likelihood estimation.
`beta`	`NULL` or numeric vector. Used to prespecify elements of the parameter vector for the equation of the mean of the latent variable. Vector should be of length one or of length equal to the number of explanatory variables in the mean equation. If of length one the value is presumed to correspond to the constant. If of length greater than one then `NA` should be entered for elements of the vector to be estimated.
`analhessian`	logical. Indicates whether the analytic Hessian should be calculated and used, default is TRUE, if set to FALSE a finite-difference approximation of the Hessian is used.
`na.action`	a function which indicates what should happen when the data contain NAs. The default is set by the `na.action` setting of `options`, and is `na.fail` if that is unset. The factory-fresh default is `na.omit`. Another possible value is `NULL`, no action. Value `na.exclude` can be useful.
`savemodelframe`	logical. Indicates whether the model frame(s) should be saved for future use. Default is `FALSE`. Should be set to `TRUE` if intending to estimate Average Marginal Effects.
`robust`	logical. If set to `TRUE` the outer product or BHHH estimate of the meat in the sandwich of the variance-covariance matrix is calculated. If calculated standard errors will be calculated using the sandwich estimator by default when calling `summary`.

`object`	object of class `oglmx` or `summary.oglmx`.
`...`	additional arguments, currently ignored.

`object`	object of class "`oglmx`".
`Vars`	vector specifying variables for which marginal effects are desired.
`outcomes`	either character string "`All`", the default option, or a numeric vector indicating the outcomes for which the marginal effect is desired.
`atmeans`	logical. If `TRUE` then the marginal effects are calculated at the means of the variables in the equations for the mean and variance of the latent variable.
`AME`	logical. If `TRUE` the marginal effects are averaged across observations.
`ascontinuous`	logical. If `TRUE` binary variables are treated as if continuous to calculate marginal effects.
`location`	`NULL`, a numeric vector, or a list containing two numeric vectors. Allows the user to specify the values of the explanatory variables at which the marginal effect is to be calculated. For a homoskedastic model the input should be a numeric vector of length equal to the number of variables in the model matrix. For a heterskedastic model the input should be a list, the first element should be a vector of length equal to the number of variables in the mean equation and the second is a vector of length equal to the number of variables in the variance equation.
`...`	additional arguments to `print` method. Currently ignored.
`x`	object of class `margins.oglmx`.

`formulaMEAN`	an object of class `formula`: a symbolic description of the model used to explain the mean of the latent variable. The response variable should be a numeric vector or factor variable such that the numerical assignments for the levels of the factor have ordinal meaning.
`formulaSD`	either `NULL` or an object of class `formula`: a symbolic description of the model used to explain the variance of the latent variable.
`data`	a data frame containing the variables in the model.
`start`	either `NULL` or a numeric vector specifying start values for each of the estimated parameters, passed to the maximisation routine.
`weights`	either `NULL` or a numeric vector of length equal to the number of rows in the data frame. Used to apply weighted maximum likelihood estimation.
`link`	specifies a link function for the model to be estimated, accepted values are "`probit`", "`logit`", "`cauchit`", "`loglog`" and "`cloglog`"
`constantMEAN`	logical. Should an intercept be included in the model of the mean of the latent variable? Can be overwritten and set to `FALSE` using the formulaMEAN argument by writing `0 +` as the first element of the equation.
`constantSD`	logical. Should an intercept be included in the model of the variance of the latent variable? Can be overwritten and set to `FALSE` using the formulaSD argument by writing `0 +` as the first element of the equation.
`beta`	`NULL` or numeric vector. Used to prespecify elements of the parameter vector for the equation of the mean of the latent variable. Vector should be of length one or of length equal to the number of explanatory variables in the mean equation. If of length one the value is presumed to correspond to the constant if a constant is included or the first element of the parameter vector. If of length greater than one then `NA` should be entered for elements of the vector to be estimated.
`delta`	`NULL` or numeric vector. Used to prespecify elements of the parameter vector for the equation of the variance of the latent variable. Vector should be of length one or of length equal to the number of explanatory variables in the variance equation. If of length one the value is presumed to correspond to the constant if a constant is included or the first element of the parameter vector. If of length greater than one then `NA` should be entered for elements of the vector to be estimated.
`threshparam`	`NULL` or numeric vector. Used to prespecify the threshold parameters of the model. Vector should be of length equal to the number of outcomes minus one. `NA` should be entered for threshold parameters to be estimated by the model.
`analhessian`	logical. Indicates whether the analytic Hessian should be calculated and used, default is TRUE, if set to FALSE a finite-difference approximation of the Hessian is used.
`sdmodel`	object of mode “`expression`”. The expression defines function that transforms the linear model for the standard deviation into the standard deviation. The expression should be written as a function of variable `z`. The default value is `expression(exp(z))`.
`SameModelMEANSD`	logical. Indicates whether the matrix used to model the mean of the latent variable is identical to that used to model the variance. If `formulaSD=NULL` and `SameModelMEANSD=TRUE` a model with heteroskedasticity is estimated. If `SameModelMEANSD=FALSE` and `formulaSD==formulaMEAN` value is overridden. Used to reduce memory requirements when models are identical.
`na.action`	a function which indicates what should happen when the data contain NAs. The default is set by the `na.action` setting of `options`, and is `na.fail` if that is unset. The factory-fresh default is `na.omit`. Another possible value is `NULL`, no action. Value `na.exclude` can be useful.
`savemodelframe`	logical. Indicates whether the model frame(s) should be saved for future use. Default is `FALSE`. Should be set to `TRUE` if intending to estimate Average Marginal Effects.
`Force`	logical. If set to `FALSE` (the default) the function stops if the response variable has more than twenty categories. Should be changed to `TRUE` if a model with more than twenty categories is desired.
`robust`	logical. If set to `TRUE` the outer product or BHHH estimate of the meat in the sandwich of the variance-covariance matrix is calculated. If calculated standard errors will be calculated using the sandwich estimator by default when calling `summary`.
`outcomeMatrix`, `X`, `Z`	`X` is a data matrix for the right hand side of the mean equation, `outcomeMatrix` is a matrix that indicates the outcome variable and `Z` is a data matrix for the variance equation.
`w`	`w` specifies a vector of weights for the `oglmx.fit` function.
`optmeth`	`optmeth` specifies a method for the maximisation of the likelihood, currently "maxLik" is the only available option.

`link`	link function used in the estimated model.
`sdmodel`	Expression for the model for the standard deviation, default is exp(z).
`call`	the call used to generate the results.
`factorvars`	vector listing factor variables included in the model
`Outcomes`	numeric vector listing the values of the different outcomes.
`NoVarModData`	dataframe. Contains data required to estimate the no information model used in calculation of McFadden's R-squared measure.
`NOutcomes`	the number of distinct outcomes in the response variable.
`Hetero`	logical. If `TRUE` indicates that the estimated model includes a model for the variance of the error term, i.e. heteroskedasticity.
`formula`	two element list. Each element is an object of type `formula` related to the mean and standard deviation equation respectively.
`modelframes`	If `savemodelframe` set to `FALSE` then returns `NULL`, otherwise returns a list with two elements, the model frames for the mean and variance equations.
`BothEq`	Omitted in the case of a homoskedastic model. Dataframe listing variables that are contained in both the mean and variance equations.
`varMeans`	a list containing two numeric vectors. The vectors list the mean values of the variables in the mean and variance equation respectively. Stored for use in a call of `margins.oglmx` to obtain marginal effects at means.
`varBinary`	a list containing two numeric vectors. The vectors indicate whether the variables in the mean and variance equations are binary indicators. Stored for use in a call of `margins.oglmx` to obtain marginal effects at means.
`loglikelihood`	log-likelihood for the estimated model. Includes as attributes the log-likelihood for the constant only model and the number of observations.
`coefficients`	vector of estimated parameters.
`gradient`	numeric vector, the value of the gradient of the log-likelihood function at the obtained parameter vector. Should be approximately equal to zero.
`no.iterations`	number of iterations of maximisation algorithm.
`returnCode`	code returned by the `maxLik` optimisation routine. For details of meaning see `maxNR`.
`hessian`	hessian matrix of the log-likelihood function evaluated at the obtained parameter vector.
`allparams`	a list containing three numeric vectors, the vectors contain the parameters from the mean equation, the variance equation and the threshold parameters respectively. Includes the prespecified and estimated parameters together.
`Est.Parameters`	list containing three logical vectors. Indicates which parameters in the parameter vectors were estimated.
`BHHHhessian`	Omitted if `robust = FALSE` and weights were not included. The BHHH variance-covariance estimate.

`eta_1`	numeric vector or matrix. Refers to the input to the link function to calculate the probability at the right threshold of the outcome.
`eta_0`	numeric vector or matrix. Refers to the input to the link function to calculate the probability at the left threshold of the outcome.
`std.dev`	numeric vector or matrix. The standard deviation of the error term for the observations given the data and parameters.
`prob`	numeric vector or matrix. Probability of the outcome given the parameters and data.
`link`	character, indicates link function for the estimated model.
`estThresh`	numeric vector indicating which of the threshold values are estimated.
`outcomematrix`	numeric matrix indicating the outcome for each observation.
`gstd.dev`	numeric vector or matrix. The first derivative of standard deviation of the error term for the observations given the data and parameters.
`hstd.dev`	numeric vector or matrix. The second derivative of standard deviation of the error term for the observations given the data and parameters.

`object`	an object of class "oglmx"
`tol`	argument passed to qr.solve, defines the tolerance for detecting linear dependencies in the hessian matrix to be inverted.
`...`	additional arguments, currently ignored.
`x`	object of class `summary.oglmx`.

`regtype`	character string describing the type of model estimated.
`loglikelihood`	log-likelihood for the estimated model.
`estimate`	matrix with four columns and number of rows equal to the number of estimated parameters. Columns of the matrix correspond to estimated coefficients, standard errors, t-statistics and (two-sided) p-values.
`estimateDisplay`	the same data as in `estimate` but separated into a list with elements for each type of parameter estimate. The first element is for parameters in the mean equation, second element for parameters in the variance equation and the final element is for threshold parameters.
`no.iterations`	number of iterations used in function that maximises the log-likelihood.
`McFaddensR2`	McFadden's $R^2$ aka Pseudo- $R^2$ . Calculated as: $R^2=1-\log{L_{fit}}/\log{L_0}$ where $\log{L_{fit}}$ is the log-likelihood for the fitted model and $\log{L_0}$ is the log-likelihood from an intercept only model that estimates the probability of each alternative to be the sample average.
`AIC`	Akaike Information Criterion, calculated as: $AIC=2k-2\log{L_{fit}}$ where $k$ is the number of estimated parameters.
`coefficients`	named vector of estimated parameters.

Package 'oglmx'

Help Index

Estimation of Ordered Generalized Linear Models Package for estimation of ordered generalized linear models.

Description

Details

Author(s)

Calculate Akaike Information Criterion

Description

Usage

Arguments

Details

Value

Author(s)

See Also

Calculate marginal effects for continuous variables.

Description

Usage

Arguments

Value

Author(s)

See Also

Calculate derivatives of marginal effects for continuous variables.

Description

Usage

Arguments

Value

Author(s)

See Also

Calculate derivatives of marginal effects for binary variables.

Description

Usage

Arguments

Value

Author(s)

See Also

Calculate marginal effects for binary variables.

Description

Usage

Arguments

Value

Author(s)

See Also

Obtain model formula for an oglmx object.

Description

Usage

Arguments

Value

Author(s)

See Also

Construct ingredients for probability calculation.

Description

Usage

Arguments

Value

Author(s)

See Also

Various functions not intended for user.

Description

Usage

Arguments

Author(s)

See Also

Fit Logit Model.

Description

Usage

Arguments

Value

Author(s)

See Also

Extract log likelihood value

Description

Usage

Arguments

Value

Author(s)

See Also

Calculate marginal effects for oglmx objects.

Description

Usage

Arguments

Obtain model formula for an `oglmx` object.

Calculate marginal effects for `oglmx` objects.