Package 'cond' reference manual

Title:	Approximate Conditional Inference for Logistic and Loglinear Models
Description:	Higher order likelihood-based inference for logistic and loglinear models.
Authors:	S original by Alessandra R. Brazzale <[email protected]>. R port by Alessandra R. Brazzale <[email protected]>, following earlier work by Douglas Bates.
Maintainer:	Alessandra R. Brazzale <[email protected]>
License:	GPL (>= 2) \| file LICENCE
Version:	1.2-3.1
Built:	2025-02-11 06:37:19 UTC
Source:	CRAN

Approximate conditional inference for logistic and loglinear models

Description

Higher order likelihood-based inference for logistic and loglinear models

Details

Package:	cond
Version:	1.2-0
Date:	2009-10-03
Depends:	R (>= 2.6.0), statmod, survival
Suggests:	csampling, marg, nlreg
License:	GPL (>= 2)
URL:	http://www.r-project.org, http://statwww.epfl.ch/AA/
LazyLoad:	yes
LazyData:	yes

Index:

Functions:
=========
cond                    Approximate Conditional Inference - Generic
                        Function
cond.glm                Approximate Conditional Inference for Logistic
                        and Loglinear Models
cond.object             Approximate Conditional Inference Object
family.cond             Use family() on a "cond" object
family.summary.cond     Use family() on a "summary.cond" object
plot.cond               Generate Plots for an Approximate Conditional
                        Inference Object
print.summary.cond      Use print() on a "summary.cond" object
summary.cond            Summary Method for Objects of Class "cond"


Datasets:
========
aids                    AIDS Symptoms and AZT Use Data
airway                  Airway Data
babies                  Crying Babies Data
dormicum                Dormicum Data
fraudulent              Fraudulent Automobile Insurance Claims Data
fungal                  Fungal Infections Treatment Data
rabbits                 Rabbits Data
urine                   Urine Data

Further information is available in the following vignettes:

`Rnews-paper`	hoa: An R Package Bundle for Higher Order Likelihood Inference (source, pdf)

Author(s)

S original by Alessandra R. Brazzale <[email protected]>. R port by Alessandra R. Brazzale <[email protected]>, following earlier work by Douglas Bates.

Maintainer: Alessandra R. Brazzale <[email protected]>

AIDS Symptoms and AZT Use Data

Description

The aids data frame has 4 rows and 4 columns.

On February 15, 1991, the New York Times published the results of a study on the presence of AIDS symptoms and AZT use. The data were cross-classified according to the race of the patients.

Usage

data(aids)data(aids)

Format

This data frame contains the following columns:

yes: the number of patients with AIDS symptoms;
no: the number of patients without AIDS symptoms;
azt: an indicator variable for AZT use;
race: an indicator variable for the race (w=white, b=black).

Source

The data were obtained from the New York Times (2/15/91).

Examples

data(aids)
summary(aids)
data(aids)
summary(aids)

Airway Data

Description

The airway data frame has 35 rows and 6 columns.

Study to compare two devices (tracheal tube and laryngeal mask) used to secure airway in patients undergoing surgery. The response variable is the presence of a sore throat. Further information on age, sex, use of a lubricant, and duration of the surgery is available.

Usage

data(airway)data(airway)

Format

This data frame contains the following columns:

response: an indicator variable for sore throat (1=yes, 0=no);
type: the type of airway used (1=tracheal tube, 0=laryngeal mask);
age: the age of the patient (in years);
sex: an indicator variable for sex (1=male, 0=female);
lubricant: an indicator variable for lubricant use (1=yes, 0=no);
duration: the duration of the surgery (in minutes).

Source

The data were obtained from

“Binary Data” by D. Collet in Encyclopedia of Biostatistics (1998).

Examples

data(airway)
summary(airway)
par(mfrow=c(1,2))
plot(age ~ response, data = airway)
plot(duration ~ response, data = airway)
data(airway)
summary(airway)
par(mfrow=c(1,2))
plot(age ~ response, data = airway)
plot(duration ~ response, data = airway)

Crying Babies Data

Description

The babies data frame has 36 rows and 4 columns.

Matched pairs of binary observations concerning the crying of babies. The babies were observed on 18 days and on each day one child was lulled. Interest focuses on the treatment effect “lulling”.

Usage

data(babies)data(babies)

Format

This data frame contains the following columns:

r1: number of children not crying on one day;
r2: number of children crying on one day;
lull: indicator variable for the treatment;
day: factor variable for the days.

Source

The data were obtained from

Cox, D. R. (1970) Analysis of Binary Data (page 61). London: Chapman \& Hall.

References

Davison, A. C. (1988) Approximate conditional inference in generalized linear models. J. R. Statist. Soc. B, 50, 445–461.

Examples

data(babies)
coplot(r2/(r1+r2) ~ day | lull, data = babies)
##
babies.glm <- glm(formula = cbind(r1, r2) ~ day + lull - 1, 
                  family = binomial, data = babies)
babies.cond <- cond(object = babies.glm, offset = lullyes)
babies.cond
##
## If one wishes to avoid the generalized linear model fit:
babies.cond <- cond.glm(formula = cbind(r1, r2) ~ day + lull - 1, 
                        family = binomial, data = babies, offset = lullyes)
babies.cond
data(babies)
coplot(r2/(r1+r2) ~ day | lull, data = babies)
##
babies.glm <- glm(formula = cbind(r1, r2) ~ day + lull - 1, 
                  family = binomial, data = babies)
babies.cond <- cond(object = babies.glm, offset = lullyes)
babies.cond
##
## If one wishes to avoid the generalized linear model fit:
babies.cond <- cond.glm(formula = cbind(r1, r2) ~ day + lull - 1, 
                        family = binomial, data = babies, offset = lullyes)
babies.cond

Approximate Conditional Inference - Generic Function

Description

Performs approximate conditional inference.

Usage

cond(object, offset, ...)
cond(object, offset, ...)

Arguments

`object`	a fitted model object. Families supported are binomial and Poisson with canonical link function (class `glm`), and regression-scale models (class `rsm`).
`offset`	the covariate occurring in the model formula whose coefficient represents the parameter of interest. May be numerical or a two-level factor. In case of a two-level factor, it must be coded by contrasts and not appear as two dummy variables in the model. Can also be a call to a mathematical function (such as `exp`, `sin`, ...) or to a mathematical operator (`\^`, `/`, ...) applied to a numerical variable. The call must always agree with the label used to identify the corresponding parameter in the fitted model object passed through the `object` argument. Beware that the label includes the identity function `I()` if an arithmetic operator was used. Other function types (e.g. `factor`) and interactions are not admitted.
`...`	absorbs any additional arguments. See `cond.glm` and `cond.rsm` for details.

Details

This function is generic (see methods); method functions can be written to handle specific classes of data. Classes which already have methods for this function include: glm and rsm.

Value

The returned value is an approximate conditional inference object. Classes already supported are cond and marg depending on whether the fitted model object passed through the object argument has class glm or rsm. See cond.object or marg.object for more details.

References

Brazzale, A. R. (2000) Practical Small-Sample Parametric Inference. Ph.D. Thesis N. 2230, Department of Mathematics, Swiss Federal Institute of Technology Lausanne. Chapter 6.

Examples

## Urine Data 
data(urine)
urine.glm <- glm(r ~ gravity + ph + osmo + conduct + urea + log(calc), 
                 family = binomial, data = urine)
##
## function call as offset variable 
labels(coef(urine.glm))
cond(urine.glm, log(calc))
##
## large estimate of regression coefficient
urine.glm <- glm(r ~ gravity + ph + osmo + conduct + urea + calc, 
                 family = binomial, data = urine)
coef(urine.glm)
urine.glm <- glm(r ~ I(gravity * 100) + ph + osmo + conduct + urea + calc, 
                 family = binomial, data = urine)
coef(urine.glm)
urine.cond <- cond(urine.glm, I(gravity * 100))
plot(urine.cond, which = 4)

## House Price Data
## Not run: 
data(houses)
houses.rsm <- rsm(price ~ ., family = student(5), data = houses)
##
## parameter of interest: scale parameter
houses.marg <- cond(houses.rsm, scale)
plot(houses.marg, which = 2)

## End(Not run)
## Urine Data 
data(urine)
urine.glm <- glm(r ~ gravity + ph + osmo + conduct + urea + log(calc), 
                 family = binomial, data = urine)
##
## function call as offset variable 
labels(coef(urine.glm))
cond(urine.glm, log(calc))
##
## large estimate of regression coefficient
urine.glm <- glm(r ~ gravity + ph + osmo + conduct + urea + calc, 
                 family = binomial, data = urine)
coef(urine.glm)
urine.glm <- glm(r ~ I(gravity * 100) + ph + osmo + conduct + urea + calc, 
                 family = binomial, data = urine)
coef(urine.glm)
urine.cond <- cond(urine.glm, I(gravity * 100))
plot(urine.cond, which = 4)

## House Price Data
## Not run: 
data(houses)
houses.rsm <- rsm(price ~ ., family = student(5), data = houses)
##
## parameter of interest: scale parameter
houses.marg <- cond(houses.rsm, scale)
plot(houses.marg, which = 2)

## End(Not run)

Approximate Conditional Inference for Logistic and Loglinear Models

Description

Performs approximate conditional inference on a scalar parameter of interest in logistic and loglinear models. The output is stored in an object of class cond.

Usage

## S3 method for class 'glm'
cond(object, offset, formula = NULL, family = NULL, 
         data = sys.frame(sys.parent()), pts = 20, 
         n = max(100, 2*pts), tms = 0.6, from = NULL, to = NULL, 
         control = glm.control(...), trace = FALSE, ...)
## S3 method for class 'glm'
cond(object, offset, formula = NULL, family = NULL, 
         data = sys.frame(sys.parent()), pts = 20, 
         n = max(100, 2*pts), tms = 0.6, from = NULL, to = NULL, 
         control = glm.control(...), trace = FALSE, ...)

Arguments

`object`	a `glm` object. Families supported are binomial and Poisson with canonical link function.
`offset`	the covariate occurring in the model formula whose coefficient represents the parameter of interest. May be numerical or a two-level factor. In case of a two-level factor, it must be coded by contrasts and not appear as two dummy variables in the model. Can also be a call to a mathematical function (such as `exp`, `sin`, ...) or to a mathematical operator (`^`, `/`, ...) applied to a numerical variable. The call must always agree with the label used to identify the corresponding parameter in the `glm` object passed through the `object` argument or defined by `formula` and `family`. Beware that the label includes the identity function `I()` if an arithmetic operator was used. Other function types (e.g. `factor`) and interactions are not admitted.
`formula`	a formula expression (only if no `glm` object is defined).
`family`	a `family` object defining the variance function (only if no `glm` object is defined). Families supported are binomial and Poisson with canonical link function.
`data`	an optional data frame in which to interpret the variables occurring in the formula (only if no `glm` object is defined).
`pts`	number of output points (minimum 10) that are calculated exactly. The default is 20.
`n`	approximate number of output points (minimum 50) produced by the spline interpolation. The default is the maximum between 100 and twice `pts`.
`tms`	defines the range MLE +/- `tms` * s.e. where cubic spline interpolation is replaced by polynomial interpolation. The default is 0.6.
`from`	starting value of the sequence that contains the values of the parameter of interest for which output points are calculated exactly. The default is MLE - 3.5 * s.e.
`to`	ending value of the sequence that contains the values of the parameter of interest for which output points are calculated exactly. The default is MLE + 3.5 * s.e.
`control`	a list of iteration and algorithmic constants that controls the GLM fit. See \ `glm.control` for their names and default values.
`trace`	if `TRUE`, iteration numbers will be printed.
`...`	additional arguments, such as `subset` etc., used by the `glm` fitting routine if the `glm` object is defined through `formula` and `family`. See `glm` for their definition and use. The arguments `weights`, `offset` and `contrasts` are not admitted. The returned value is an object of class `cond`; see `cond.object` for details.

Details

This function is a method for the generic function cond for class glm. It can be invoked by calling cond for an object of the appropriate class, or directly by calling cond.glm regardless of the class of the object. cond.glm has also to be used if the glm object is not provided throught the object argument but specified by formula and family.

The function cond.glm implements several small sample asymptotic methods for approximate conditional inference in logistic and loglinear models. Approximations for both the conditional log likelihood function and conditional tail area probabilities are available (see cond.object for details). Attention is restricted to a scalar parameter of interest. The associated covariate can be either numerical or a two-level factor.

Approximate conditional inference is performed by either updating a fitted generalized linear model or defining the model formula and family. All approximations are calculated exactly for pts equally spaced points ranging from from to to. A cubic spline interpolation is used to extend them over the whole interval of interest, except for the range of values defined by MLE +/- tms * s.e. where the spline interpolation is replaced by a higher order polynomial interpolation. This is done in order to avoid numerical instabilities which are likely to occur for values of the parameter of interest close to the MLE. Results are stored in an object of class cond. Method functions like print, summary and plot can be used to examine the output or represent it graphically. Components can be extracted using coef, formula and family.

Main references for the methods considered are the papers by Pierce and Peters (1992) and Davison (1988). More details on the implementation are given in Brazzale (1999, 2000).

Value

The returned value is an object of class cond; see cond.object for details.

Note

In rare occasions, cond.glm dumps because of non-convergence of the function glm which is used to refit the model for a fixed value of the parameter of interest. This happens for instance if this value is too extreme. The arguments from and to may then be used to limit the default range of MLE +/- 3.5 * s.e. A further possibility is to fine-tuning the constants (number of iterations, convergence threshold) that control the GLM fit through the control argument.

cond.glm may also dump if the estimate of the parameter of interest is large (tipically > 400) in absolute value. This may be avoided by reparametrizing the model.

The output of cond.glm may be unreliable if part of the data have a degenerate distribution. For example take the fungal infections treatment data contained in the fungal data frame. Of the five $2\times 2$ contingency tables, two (the first and the third) are degenerate. As they make no contribution to the exact conditional likelihood, they should be omitted from the approximate conditional fit.

References

Brazzale, A. R. (1999) Approximate conditional inference for logistic and loglinear models. J. Comput. Graph. Statist., 8, 1999, 653–661.

Brazzale, A. R. (2000) Practical Small-Sample Parametric Inference. Ph.D. Thesis N. 2230, Department of Mathematics, Swiss Federal Institute of Technology Lausanne.

Davison, A. C. (1988) Approximate conditional inference in generalized linear models. J. R. Statist. Soc. B, 50, 445–461.

Pierce, D. A. and Peters, D. (1992) Practical use of higher order asymptotics for multiparameter exponential families (with Discussion). J. R. Statist. Soc. B, 54, 701–737.

Examples

## Crying Babies Data
data(babies)
babies.glm <- glm(formula = cbind(r1, r2) ~ day + lull - 1, 
                  family = binomial, data = babies)
babies.cond <- cond(object = babies.glm, offset = lullyes)
babies.cond
##
## If one wishes to avoid the generalized linear model fit:
babies.cond <- cond.glm(formula = cbind(r1, r2) ~ day + lull - 1, 
                        family = binomial, data = babies, offset = lullyes)
babies.cond

## Urine Data 
## (function call as offset variable) 
data(urine)
urine.glm <- glm(r ~ gravity + ph + osmo + conduct + urea + log(calc), 
                 family = binomial, data = urine)
labels(coef(urine.glm))
urine.cond <- cond(urine.glm, log(calc))
##
## (large estimate of regression coefficient)
urine.glm <- glm(r ~ gravity + ph + osmo + conduct + urea + calc, 
                 family = binomial, data = urine)
coef(urine.glm)
urine.glm <- glm(r ~ I(gravity * 100) + ph + osmo + conduct + urea + calc, 
                 family = binomial, data = urine)
coef(urine.glm)
urine.cond <- cond(urine.glm, I(gravity * 100))

## Fungal Infections Treatment Data (numerical instabilities around the
##                                   MLE)
## (full data analysis)
data(fungal)
fungal.glm <- glm(cbind(success, failure) ~ center + group - 1, 
                  family = binomial, data = fungal, 
                  control = glm.control(maxit = 50, epsilon = 1e-005))
fungal.cond <- cond(fungal.glm, groupT)
plot(fungal.cond, which = 2)
## (partial data analysis)
fungal.glm <- glm(cbind(success, failure) ~ center + group - 1, 
                  family = binomial, data = fungal, subset = -c(1,2,5,6), 
                  control = glm.control(maxit = 50, epsilon = 1e-005))
fungal.cond <- cond(fungal.glm, groupT)
plot(fungal.cond, which = 2)
## (Tables 1 and 3 are omitted).
## Crying Babies Data
data(babies)
babies.glm <- glm(formula = cbind(r1, r2) ~ day + lull - 1, 
                  family = binomial, data = babies)
babies.cond <- cond(object = babies.glm, offset = lullyes)
babies.cond
##
## If one wishes to avoid the generalized linear model fit:
babies.cond <- cond.glm(formula = cbind(r1, r2) ~ day + lull - 1, 
                        family = binomial, data = babies, offset = lullyes)
babies.cond

## Urine Data 
## (function call as offset variable) 
data(urine)
urine.glm <- glm(r ~ gravity + ph + osmo + conduct + urea + log(calc), 
                 family = binomial, data = urine)
labels(coef(urine.glm))
urine.cond <- cond(urine.glm, log(calc))
##
## (large estimate of regression coefficient)
urine.glm <- glm(r ~ gravity + ph + osmo + conduct + urea + calc, 
                 family = binomial, data = urine)
coef(urine.glm)
urine.glm <- glm(r ~ I(gravity * 100) + ph + osmo + conduct + urea + calc, 
                 family = binomial, data = urine)
coef(urine.glm)
urine.cond <- cond(urine.glm, I(gravity * 100))

## Fungal Infections Treatment Data (numerical instabilities around the
##                                   MLE)
## (full data analysis)
data(fungal)
fungal.glm <- glm(cbind(success, failure) ~ center + group - 1, 
                  family = binomial, data = fungal, 
                  control = glm.control(maxit = 50, epsilon = 1e-005))
fungal.cond <- cond(fungal.glm, groupT)
plot(fungal.cond, which = 2)
## (partial data analysis)
fungal.glm <- glm(cbind(success, failure) ~ center + group - 1, 
                  family = binomial, data = fungal, subset = -c(1,2,5,6), 
                  control = glm.control(maxit = 50, epsilon = 1e-005))
fungal.cond <- cond(fungal.glm, groupT)
plot(fungal.cond, which = 2)
## (Tables 1 and 3 are omitted).

Approximate Conditional Inference Object

Description

Class of objects returned when performing approximate conditional inference for logistic and loglinear models.

Arguments

Objects of class cond are implemented as a list. The following components are included:

`workspace`	a list whose elements are the spline interpolations of several first order and higher order statistics. They are used to implement the following likelihood quantities: - the profile and modified profile log likelihoods; - the Wald pivots from the unconditional and conditional MLEs; - the profile and modified likelihood roots (the latter one with a suitable continuity correction); - the Lugannani-Rice tail area approximation (with suitable continuity correction); - the correction term used in the higher order statistics; - the information and nuisance parameter aspects. Method functions work mainly on this part of the object. In order to avoid errors in the calculation of confidence intervals and tail probabilities, this part of the object should not be modified.
`coefficients`	a $2\times 2$ matrix containing the unconditional and approximate conditional MLEs and their standard errors.
`call`	function call that created the `cond` object.
`formula`	the model formula.
`family`	the variance function.
`offset`	the covariate occurring in the model formula whose coefficient represents the parameter of interest.
`diagnostics`	diagnostics related to the decomposition of the higher order adjustments into an information and a nuisance parameters term. A value larger than 0.2 in absolute value is an index that higher order methods are needed. See Pierce and Peters (1992) for details.
`n.approx`	number of output points that have been calculated exactly.
`omitted.val`	range of values omitted in the spline interpolation of some of the higher order statistics considered. The aim is to avoid numerical instabilities around the maximum likelihood estimate.
`is.scalar`	a logical value indicating whether there are any nuisance parameters. If `FALSE` there are none.

Main references for the methods considered are the papers by Pierce and Peters (1992) and Davison (1988). More details on the implementation and the methods considered are given in Brazzale (1999, 2000).

Generation

This class of objects is returned from calls to the function cond.glm.

Methods

The class cond has methods for summary, plot, print, coef and family, amongst others.

References

Brazzale, A. R. (1999) Approximate conditional inference for logistic and loglinear models. J. Comput. Graph. Statist., 8, 653–661.

Brazzale, A. R. (2000) Practical Small-Sample Parametric Inference, Ph.D. Thesis N. 2230, Department of Mathematics, Swiss Federal Institute of Technology Lausanne.

Davison, A. C. (1988) Approximate conditional inference in generalized linear models. J. R. Statist. Soc. B, 50, 445–461.

Pierce, D. A. and Peters, D. (1992) Practical use of higher order asymptotics for multiparameter exponential families (with Discussion). J. R. Statist. Soc. B, 54, 701–737.

Dormicum Data

Description

The dormicum data frame has 37 rows and 3 columns.

37 children in a pediatric intensive care unit were treated with varying doses and for varying duration with the drug Dormicum. The response variable is 1 if withdrawal symptoms were exhibited and 0 otherwise.

Usage

data(dormicum)data(dormicum)

Format

This data frame contains the following columns:

symp: indicator of the presence of withdrawal symptoms;
dose: the drug dose in mg/kg;
days: the number of days treated.

Source

The data were supplied by Spadille Biostatistik, Denmark.

References

Mehta, C. R., Patel, N. T. and Senchaudhuri, P. (2000) Efficient Monte Carlo methods for conditional logistic regression. J. Amer. Statist. Ass., 95, 99–108.

Examples

data(dormicum)
par(mfrow = c(1,2))
plot(dose ~ symp, data = dormicum, xlab = "presence of withdrawal symptoms",
     ylab = "treatment dose (mg/kg)")
plot(days ~ symp, data = dormicum, xlab = "presence of withdrawal symptoms",
     ylab = "treatment days")
data(dormicum)
par(mfrow = c(1,2))
plot(dose ~ symp, data = dormicum, xlab = "presence of withdrawal symptoms",
     ylab = "treatment dose (mg/kg)")
plot(days ~ symp, data = dormicum, xlab = "presence of withdrawal symptoms",
     ylab = "treatment days")

Use family() on a “cond” object

Description

This is a method for the function family() for objects inheriting from class cond. See family for the general behaviour of this function.

Usage

## S3 method for class 'cond'
family(object, ...)
## S3 method for class 'cond'
family(object, ...)

Arguments

`object`	any object from which a `family` object can be extracted.
`...`	absorbs any additional argument.

Use family() on a “summary.cond” object

Description

This is a method for the function family() for objects inheriting from class summary.cond. See family for the general behaviour of this function.

Usage

## S3 method for class 'summary.cond'
family(object, ...)
## S3 method for class 'summary.cond'
family(object, ...)

Arguments

`object`	any object from which a `family` object can be extracted.
`...`	absorbs any additional argument.

Fraudulent Automobile Insurance Claims Data

Description

The fraudulent data frame has 42 rows and 12 columns.

127 claims arising from automobile accidents in 1989 in Massachusetts (USA). Each claim was classified as either fraudulent or legitimate by consensus among four independent claims adjusters who examined each case file thoroughly. An exploratory analysis by Derrig and Weisberg (1993) identified 10 binary indicators, each of which denotes the presence or absence of a potential fraud characteristic in the claim situation. They fall into three broad groups relating to “Accident” (AC1, AC9 and AC16), “Claimant” (CL7 and CL11), and “Injury” (IJ2, IJ3, IJ4, IJ6 and IJ12).

Usage

data(fraudulent)data(fraudulent)

Format

This data frame contains the following columns:

r1: the number of frauds detected;
r2: the total number of automobile insurance claims;
AC1,AC9,AC16: potential fraud characteristics pertaining to “Accident”. The presence of the fraud characteristic is indicated by a 1, the absence is indicated by a 0.
CL7,CL11: potential fraud characteristics pertaining to “Claimer”. The presence of the fraud characteristic is indicated by a 1, the absence is indicated by a 0.
IJ2,IJ3,IJ4,IJ6,IJ12: potential fraud characteristics pertaining to “Injury”. The presence of the fraud characteristic is indicated by a 1, the absence is indicated by a 0.

Source

The data were supplied by Dr. Richard Derrig of the Automobile Insurers Bureau of Massachusetts.

References

Mehta, C. R., Patel, N. T. and Senchaudhuri, P. (2000) Efficient Monte Carlo methods for conditional logistic regression. J. Amer. Statist. Ass., 95, 99–108.

Derrig, R. A. and Weisberg, H. I. (1993). Quantitative methods for detecting fraudulent automobile bodily injury claims. Manuscript.

Examples

data(fraudulent)
summary(fraudulent)
data(fraudulent)
summary(fraudulent)

Fungal Infections Treatment Data

Description

The fungal data frame has 10 rows and 4 columns.

Clinical trial on the success of a particular treatment for fungal infections. The study was carried out in five different research units. Interest focuses on the treatment effect.

Usage

data(fungal)data(fungal)

Format

This data frame contains the following columns:

success: the number of patients that benefited from the treatment;
failure: the number of patients with no benefit from the treatment;
group: an indicator variable for treatment (T=treatment, P=placebo);
center: a factor variable indicating the research unit where the study was carried out.

Source

The data were supplied by Sandoz Pharmaceuticals.

Examples

## (full data analysis)
data(fungal)
fungal.glm <- glm(cbind(success, failure) ~ center + group - 1, 
                  family = binomial, data = fungal, 
                  control = glm.control(maxit = 50, epsilon = 1e-005))
fungal.cond <- cond(fungal.glm, groupT)
plot(fungal.cond, which = 2)
## (partial data analysis) 
fungal.glm <- glm(cbind(success, failure) ~ center + group - 1, 
                  family = binomial, data = fungal, subset = -c(1,2,5,6), 
                  control = glm.control(maxit = 50, epsilon = 1e-005))
fungal.cond <- cond(fungal.glm, groupT)
plot(fungal.cond, which = 2)
## (Tables 1 and 3 are omitted).
## (full data analysis)
data(fungal)
fungal.glm <- glm(cbind(success, failure) ~ center + group - 1, 
                  family = binomial, data = fungal, 
                  control = glm.control(maxit = 50, epsilon = 1e-005))
fungal.cond <- cond(fungal.glm, groupT)
plot(fungal.cond, which = 2)
## (partial data analysis) 
fungal.glm <- glm(cbind(success, failure) ~ center + group - 1, 
                  family = binomial, data = fungal, subset = -c(1,2,5,6), 
                  control = glm.control(maxit = 50, epsilon = 1e-005))
fungal.cond <- cond(fungal.glm, groupT)
plot(fungal.cond, which = 2)
## (Tables 1 and 3 are omitted).

Generate Plots for an Approximate Conditional Inference Object

Description

Creates a set of plots for an object of class cond.

Usage

## S3 method for class 'cond'
plot(x = stop("nothing to plot"), from = x.axis[1], to = x.axis[n], 
     which = NULL, alpha = 0.05, add.leg = TRUE, loc.leg = FALSE,
     add.labs = TRUE, cex = 0.7, cex.lab = 1, cex.axis = 1, 
     cex.main = 1, lwd1 = 1, lwd2 = 2, lty1 = "solid", 
     lty2 = "dashed", col1 = "black", col2 = "blue", tck = 0.02, 
     las = 1, adj = 0.5, lab = c(15, 15, 5), ...)
## S3 method for class 'cond'
plot(x = stop("nothing to plot"), from = x.axis[1], to = x.axis[n], 
     which = NULL, alpha = 0.05, add.leg = TRUE, loc.leg = FALSE,
     add.labs = TRUE, cex = 0.7, cex.lab = 1, cex.axis = 1, 
     cex.main = 1, lwd1 = 1, lwd2 = 2, lty1 = "solid", 
     lty2 = "dashed", col1 = "black", col2 = "blue", tck = 0.02, 
     las = 1, adj = 0.5, lab = c(15, 15, 5), ...)

Arguments

`x`	a `cond` object. This is assumed to be the result returned by the `cond.glm` function.
`from`	starting value for the x-axis range. The default value has been set by `cond.glm`.
`to`	ending value for the x-axis range. The default value has been set by `cond.glm`.
`which`	which plot should be printed. Admissible values are `2` to `8` corresponding to the choices in the menu below.
`alpha`	the level used to read off confidence intervals; default is 5%.
`add.leg`	if `TRUE`, a legend is added to each plot; default is `TRUE`.
`loc.leg`	if `TRUE`, position of the legend can be located by hand; default is `FALSE`.
`add.labs`	if `TRUE`, labels are added; default is `TRUE`.
`cex`, `cex.lab`, `cex.axis`, `cex.main`	character expansions relative to the standard size of the device to be used for printing text, labels, axes and main title. See `par` for details.
`lwd1`, `lwd2`	line width used to compare different curves in the same plot; default is `lwd2 = 2` for higher order solutions and `lwd1 = 1` for first order solutions.
`lty1`, `lty2`	line type used to compare different curves in the same plot; default is `lty2 = "dashed"` for the Wald statistic and `lty1 = "solid"` for the remaining first- and higher order statistics.
`col1`, `col2`	colors used to compare different curves in the same plot; default is `col2 = "blue"` for higher order solutions, and `col1 = "black"` for the remaining first order statistics.
`tck`, `las`, `adj`, `lab`	further graphical parameters. See `par` for details.
`...`	optional graphical parameters; see `par` for details.

Details

Several plots are produced for an object of class cond. A menu lists all the plots that can be produced. They may be one or all of the following ones:

 Make a plot selection (or 0 to exit)

1:plot: All
2:plot: Profile and modified profile log likelihoods
3:plot: Profile and modified profile likelihood ratios
4:plot: Profile and modified likelihood roots
5:plot: Modified and continuity corrected likelihood roots
6:plot: Lugannani-Rice approximations
7:plot: Confidence intervals
8:plot: Diagnostics based on INF/NP decomposition

Selection:

If no nuisance parameters are presented, a subset of the above pictures is produced. More details on the implementation are given in Brazzale (1999, 2000).

This function is a method for the generic function plot() for class cond. It can be invoked by calling plot or directly plot.cond for an object of the appropriate class.

Value

A plot is created on the current graphics device.

Side Effects

The current device is cleared. When add.leg = TRUE, a legend is added to each plot, and if loc.leg = TRUE, it can be set by the user. All screens are closed, but not cleared, on termination of the function.

Note

The diagnostic plots only represent a preliminary version and need further development.

The two panels on the right trace the information and nuisance correction terms, INF and NP, against the likelihood root statistic. These are generally smooth functions and used to approximate the information and nuisance parameter aspects as a function of the parameter of interest, as shown in the two panels on the left. This procedure has the advantage of largely eliminating the numerical instabilities that affect the statistics around the MLE. The circles in the two leftmost panels represent the limit of INF and NP at the MLE calculated exactly using numerical derivatives. All four pictures are intended to give an idea of the order of magnitude of the two correction terms while trying to deal with the numerical problems that likely occur for these kinds of data.

More details can be found in Brazzale (2000, Appendix B.2).

References

Brazzale, A. R. (1999) Approximate conditional inference for logistic and loglinear models. J. Comput. Graph. Statist., 8, 1999, 653–661.

Brazzale, A. R. (2000) Practical Small-Sample Parametric Inference, Ph.D. Thesis N. 2230, Department of Mathematics, Swiss Federal Institute of Technology Lausanne.

Examples

## Crying Babies Data
data(babies)
babies.glm <- glm(formula = cbind(r1, r2) ~ day + lull - 1, 
                  family = binomial, data = babies)
babies.cond <- cond(object = babies.glm, offset = lullyes)
## Not run: 
plot(babies.cond)

## End(Not run)

## Urine Data 
data(urine)
urine.glm <- glm(r ~ I(gravity * 100) + ph + osmo + conduct + urea + calc, 
                 family = binomial, data = urine)
urine.cond <- cond(urine.glm, I(gravity * 100))
plot(urine.cond, which=4)
## Crying Babies Data
data(babies)
babies.glm <- glm(formula = cbind(r1, r2) ~ day + lull - 1, 
                  family = binomial, data = babies)
babies.cond <- cond(object = babies.glm, offset = lullyes)
## Not run: 
plot(babies.cond)

## End(Not run)

## Urine Data 
data(urine)
urine.glm <- glm(r ~ I(gravity * 100) + ph + osmo + conduct + urea + calc, 
                 family = binomial, data = urine)
urine.cond <- cond(urine.glm, I(gravity * 100))
plot(urine.cond, which=4)

Use print() on a “summary.cond” object

Description

This is a method for the function print() for objects inheriting from class summary.cond. See print and print.default for the general behaviour of this function and for the interpretation of digits.

Usage

## S3 method for class 'summary.cond'
print(x, all = x$all, Coef = x$cf, int = x$int, test = x$hyp,
      digits = if(!is.null(x$digits)) x$digits else max(3, getOption("digits")-3), 
      ...)
## S3 method for class 'summary.cond'
print(x, all, Coef, int, test, digits, ...)
## S3 method for class 'summary.cond'
print(x, all = x$all, Coef = x$cf, int = x$int, test = x$hyp,
      digits = if(!is.null(x$digits)) x$digits else max(3, getOption("digits")-3), 
      ...)
## S3 method for class 'summary.cond'
print(x, all, Coef, int, test, digits, ...)

Arguments

`x`	a `summary.cond` object. This is assumed to be the result returned by the `summary.cond` function.
`all`	if `TRUE` all the information stored in the `summary.cond` object is printed, else only a subset of it. The default is `FALSE`.
`Coef`	if `TRUE`, the unconditional and conditional parameter estimates are printed. The default is `TRUE`.
`int`	if `TRUE`, confidence intervals are printed. The default is `TRUE`.
`test`	if `TRUE`, tests statistics and tail probabilities are printed. The default is `FALSE`.
`digits`	number of significant digits to be printed. The default depends on the value of `digits` set by `options`.
`...`	additional arguments.

Details

Changing the default values of all, Coef, int and test allows only a subset of the information in the summary.cond object to be printed. With all = FALSE, one-sided confidence intervals and the Lugannani-Rice tail approximations are omitted. See summary.cond for more details.

Note

The amount of information printed may vary depending on whether there are any nuisance parameters.

Examples

## Urine Data 
data(urine)
urine.glm <- glm(r ~ gravity + ph + osmo + conduct + urea + calc, 
                 family = binomial, data = urine)
urine.cond <- cond(urine.glm, urea)
print(summary(urine.cond, all = TRUE), digits = 4)
print(summary(urine.cond), Coef = FALSE)
## Urine Data 
data(urine)
urine.glm <- glm(r ~ gravity + ph + osmo + conduct + urea + calc, 
                 family = binomial, data = urine)
urine.cond <- cond(urine.glm, urea)
print(summary(urine.cond, all = TRUE), digits = 4)
print(summary(urine.cond), Coef = FALSE)

Rabbits Data

Description

The rabbits data frame has 10 rows and 4 columns.

Five different doses of penicillin were administered to rabbits suffering from a streptococci infection and the number of recovering rabbits recorded. The rabbits are cross-classified according to whether the drug is administered immediately or delayed by an hour and a half. Interest focuses on whether the delay effects the treatment.

Usage

data(rabbits)
data(rabbits)

Format

This data frame contains the following columns:

cured: the number of rabbits that recovered;
died: the number of rabbits that died;
delay: an indicator variable indicating whether the administration of penicillin was delayed by 1 1/2 hours;
penicil: the penicillin dose.

Source

Unknown.

Examples

data(rabbits)
attach(rabbits)
fc <- cured/(cured + died)
coplot(fc ~ log(penicil) | delay, data = rabbits)
data(rabbits)
attach(rabbits)
fc <- cured/(cured + died)
coplot(fc ~ log(penicil) | delay, data = rabbits)

Summary Method for Objects of Class “cond”

Description

Returns a summary list for objects of class cond.

Usage

## S3 method for class 'cond'
summary(object, alpha = 0.05, test = NULL, all = FALSE, coef = TRUE, 
        int = ifelse( (is.null(test) || all), TRUE, FALSE), 
        digits = NULL, ...)
## S3 method for class 'cond'
summary(object, alpha = 0.05, test = NULL, all = FALSE, coef = TRUE, 
        int = ifelse( (is.null(test) || all), TRUE, FALSE), 
        digits = NULL, ...)

Arguments

`object`	a `cond` object. This is assumed to be the result returned by the `cond.glm` function.
`alpha`	vector of levels for confidence intervals. The default is 5%.
`test`	vector of values of the parameter of interest one wants to test for. If `NULL`, no test is performed. The default is `NULL`.
`all`	logical value; if `TRUE`, all the information stored in the `summary.cond` object is printed, else only a subset of it. The default is `FALSE`.
`coef`	logical value; if `TRUE`, the unconditional and conditional parameter estimates are printed. The default is `TRUE`.
`int`	logical value; if `TRUE` confidence intervals are printed. The default is `TRUE`.
`digits`	number of significant digits to be printed. The default depends on the value of `digits` set by `options`.
`...`	absorbs any additional argument.

Details

This function is a method for the generic function summary() for objects of class cond. It can be invoked by calling summary or directly summary.cond for an object of the appropriate class.

Value

A list is returned with the following components.

`coefficients`	a $2\times 2$ matrix containing the unconditional and approximate conditional MLEs and their standard errors.
`conf.int`	a matrix containing, for each level given in `alpha`, the upper and lower confidence bounds derived from several first- and higher order test statistics. One-sided and two-sided confidence intervals are considered. See `cond.object` for details on the test statistics.
`signif.tests`	a list with two elements. The first (`stats`) contains, for each value given in `test`, the values and tail probabilities of several first- and higher order test statistics. See `cond.object` for details on the test statistics.The second element of the list (`qTerm`) contains for each tested hypothesis the correction term used in the higher order solutions.
`call`	the function call that created the `cond` object.
`formula`	the model formula.
`family`	the variance function.
`offset`	the covariate occurring in the model formula whose coefficient represents the parameter of interest.
`alpha`	vector of levels used to compute the confidence intervals.
`hypotheses`	values for the parameter of interest that have been tested for.
`diagnostics`	information and nuisance parameters aspects; see `cond.object` for details.
`n.approx`	number of output points that have been calculated exactly.
`all`	logical value; if `TRUE`, all the information stored in the `summary.cond` object is printed.
`cf`	logical value; if `TRUE`, the unconditional and conditional parameter estimates are printed.
`int`	logical value; if `TRUE`, confidence intervals are printed.
`is.scalar`	a logical value indicating whether there are any nuisance parameters. If `FALSE` there are none.
`digits`	number of significant digits to be printed.

Note

The amount of information calculated may vary depending on whether there are any nuisance parameters.

Examples

## Crying Babies Data
data(babies)
babies.glm <- glm(formula = cbind(r1, r2) ~ day + lull - 1, 
                  family = binomial, data = babies)
babies.cond <- cond(object = babies.glm, offset = lullyes)
summary(babies.cond, test = 0, coef = FALSE)
## Crying Babies Data
data(babies)
babies.glm <- glm(formula = cbind(r1, r2) ~ day + lull - 1, 
                  family = binomial, data = babies)
babies.cond <- cond(object = babies.glm, offset = lullyes)
summary(babies.cond, test = 0, coef = FALSE)

Urine Data

Description

The urine data frame has 77 rows and 7 columns.

79 urine specimens were analyzed in an effort to determine if certain physical characteristics of the urine might be related to the formation of calcium oxalate crystals.

Usage

data(urine)data(urine)

Format

This data frame contains the following columns:

r: indicator of the presence of calcium oxalate crystals;
gravity: the specific gravity of the urine, i.e. the density of urine relative to water;
ph: the pH reading of the urine;
osmo: the osmolarity of the urine. Osmolarity is proportional to the concentration of molecules in solution (mOsm).
conduct: The conductivity of the urine. Conductivity is proportional to the concentration of charged ions in solution (mMho milliMho).
urea: the urea concentration in millimoles per litre;
calc: the calcium concentration in millimoles per litre.

Source

The data were obtained from

Andrews, D. F. and Herzberg, A. M. (1985) Data: A Collection of Problems from Many Fields for the Student and Research Worker, Cambridge: Cambridge University Press.

References

Davison, A. C. and Hinkley, D. V. (1997). Bootstrap Methods and Their Application (Example 7.8). Cambridge: Cambridge University Press.

Examples

data(urine)
summary(urine)
pairs(urine)
##
data(urine)
urine.glm <- glm(r ~ gravity + ph + osmo + conduct + urea + log(calc), 
                 family = binomial, data = urine)
labels(coef(urine.glm))
urine.cond <- cond(urine.glm, log(calc))
##
## (large estimate of regression coefficient)
urine.glm <- glm(r ~ gravity + ph + osmo + conduct + urea + calc, 
                 family = binomial, data = urine)
coef(urine.glm)
urine.glm <- glm(r ~ I(gravity * 100) + ph + osmo + conduct + urea + calc, 
                 family = binomial, data = urine)
coef(urine.glm)
urine.cond <- cond(urine.glm, I(gravity * 100))
data(urine)
summary(urine)
pairs(urine)
##
data(urine)
urine.glm <- glm(r ~ gravity + ph + osmo + conduct + urea + log(calc), 
                 family = binomial, data = urine)
labels(coef(urine.glm))
urine.cond <- cond(urine.glm, log(calc))
##
## (large estimate of regression coefficient)
urine.glm <- glm(r ~ gravity + ph + osmo + conduct + urea + calc, 
                 family = binomial, data = urine)
coef(urine.glm)
urine.glm <- glm(r ~ I(gravity * 100) + ph + osmo + conduct + urea + calc, 
                 family = binomial, data = urine)
coef(urine.glm)
urine.cond <- cond(urine.glm, I(gravity * 100))

Package 'cond'

Help Index

Approximate conditional inference for logistic and loglinear models

Description

Details

Author(s)

AIDS Symptoms and AZT Use Data

Description

Usage

Format

Source

Examples

Airway Data

Description

Usage

Format

Source

Examples

Crying Babies Data

Description

Usage

Format

Source

References

Examples

Approximate Conditional Inference - Generic Function

Description

Usage

Arguments

Details

Value

References

See Also

Examples

Approximate Conditional Inference for Logistic and Loglinear Models

Description

Usage

Arguments

Details

Value

Note

References

See Also

Examples

Approximate Conditional Inference Object

Description

Arguments

Generation

Methods

References

See Also

Dormicum Data

Description

Usage

Format

Source

References

Examples

Use family() on a “cond” object

Description

Usage

Arguments

See Also

Use family() on a “summary.cond” object

Description

Usage

Arguments

See Also

Fraudulent Automobile Insurance Claims Data

Description

Usage

Format

Source

References

Examples

Fungal Infections Treatment Data

Description

Usage

Format

Source