Help for package cond (original) (raw)
Version: | 1.2-4 | ||
---|---|---|---|
Date: | 2025-05-22 | ||
Title: | Approximate Conditional Inference for Logistic and Loglinear Models | ||
Maintainer: | Alessandra R. Brazzale alessandra.brazzale@unipd.it | ||
Depends: | R (≥ 3.0.0), statmod, survival | ||
Suggests: | boot, csampling, marg, nlreg | ||
Description: | Implements higher order likelihood-based inference for logistic and loglinear models. | ||
License: | GPL-2 | GPL-3 | file LICENCE [expanded from: GPL (≥ 2) | file LICENCE] |
URL: | https://www.r-project.org | ||
LazyLoad: | yes | ||
LazyData: | yes | ||
NeedsCompilation: | no | ||
Packaged: | 2025-05-25 17:40:32 UTC; brazzale | ||
Author: | Alessandra R. Brazzale [aut, cre] (author of original code for S, author of R port following earlier work by Douglas Bates) | ||
Repository: | CRAN | ||
Date/Publication: | 2025-05-25 18:20:01 UTC |
Approximate conditional inference for logistic and loglinear models
Description
Higher order likelihood-based inference for logistic and loglinear models
Details
Package: | cond |
---|---|
Version: | 1.2-4 |
Date: | 2025-05-22 |
Depends: | R (>= 3.0.0), statmod, survival |
Suggests: | csampling, marg, nlreg |
License: | GPL (>= 2) |
URL: | http://www.r-project.org |
LazyLoad: | yes |
LazyData: | yes |
Index:
Functions:
cond Approximate Conditional Inference - Generic Function cond.glm Approximate Conditional Inference for Logistic and Loglinear Models cond.object Approximate Conditional Inference Object family.cond Use family() on a "cond" object family.summaryCond Use family() on a "summary.cond" object plot.cond Generate Plots for an Approximate Conditional Inference Object print.summaryCond Use print() on a "summary.cond" object summary.cond Summary Method for Objects of Class "cond"
Datasets:
aids AIDS Symptoms and AZT Use Data airway Airway Data babies Crying Babies Data dormicum Dormicum Data fraudulent Fraudulent Automobile Insurance Claims Data fungal Fungal Infections Treatment Data rabbits Rabbits Data urine Urine Data
Further information is available in the following vignettes:
Rnews-paper | hoa: An R Package Bundle for Higher Order Likelihood Inference (source, pdf) |
---|---|
Author(s)
S original by Alessandra R. Brazzale alessandra.brazzale@unipd.it. R port by Alessandra R. Brazzale alessandra.brazzale@unipd.it, following earlier work by Douglas Bates.
Maintainer: Alessandra R. Brazzale alessandra.brazzale@unipd.it
AIDS Symptoms and AZT Use Data
Description
The aids
data frame has 4 rows and 4 columns.
On February 15, 1991, the New York Times published the results of a study on the presence of AIDS symptoms and AZT use. The data were cross-classified according to the race of the patients.
Usage
data(aids)
Format
This data frame contains the following columns:
yes
the number of patients with AIDS symptoms;
no
the number of patients without AIDS symptoms;
azt
an indicator variable for AZT use;
race
an indicator variable for the race (w
=white, b
=black).
Source
The data were obtained from the New York Times (2/15/91).
Examples
data(aids)
summary(aids)
Airway Data
Description
The airway
data frame has 35 rows and 6 columns.
Study to compare two devices (tracheal tube and laryngeal mask) used to secure airway in patients undergoing surgery. The response variable is the presence of a sore throat. Further information on age, sex, use of a lubricant, and duration of the surgery is available.
Usage
data(airway)
Format
This data frame contains the following columns:
response
an indicator variable for sore throat (1=yes, 0=no);
type
the type of airway used (1=tracheal tube, 0=laryngeal mask);
age
the age of the patient (in years);
sex
an indicator variable for sex (1=male, 0=female);
lubricant
an indicator variable for lubricant use (1=yes, 0=no);
duration
the duration of the surgery (in minutes).
Source
The data were obtained from
“Binary Data” by D. Collet in Encyclopedia of Biostatistics (1998).
Examples
data(airway)
summary(airway)
par(mfrow=c(1,2))
plot(age ~ response, data = airway)
plot(duration ~ response, data = airway)
Crying Babies Data
Description
The babies
data frame has 36 rows and 4 columns.
Matched pairs of binary observations concerning the crying of babies. The babies were observed on 18 days and on each day one child was lulled. Interest focuses on the treatment effect “lulling”.
Usage
data(babies)
Format
This data frame contains the following columns:
r1
number of children not crying on one day;
r2
number of children crying on one day;
lull
indicator variable for the treatment;
day
factor variable for the days.
Source
The data were obtained from
Cox, D. R. (1970) Analysis of Binary Data (page 61). London: Chapman & Hall.
References
Davison, A. C. (1988) Approximate conditional inference in generalized linear models. J. R. Statist. Soc. B, 50, 445–461.
Examples
data(babies)
coplot(r2/(r1+r2) ~ day | lull, data = babies)
##
babies.glm <- glm(formula = cbind(r1, r2) ~ day + lull - 1,
family = binomial, data = babies)
babies.cond <- cond(object = babies.glm, offset = lullyes)
babies.cond
##
## If one wishes to avoid the generalized linear model fit:
babies.cond <- cond.glm(formula = cbind(r1, r2) ~ day + lull - 1,
family = binomial, data = babies, offset = lullyes)
babies.cond
Approximate Conditional Inference - Generic Function
Description
Performs approximate conditional inference.
Usage
cond(object, offset, ...)
Arguments
object | a fitted model object. Families supported are binomial and Poisson with canonical link function (class glm), and regression-scale models (class rsm). |
---|---|
offset | the covariate occurring in the model formula whose coefficient represents the parameter of interest. May be numerical or a two-level factor. In case of a two-level factor, it must be coded by contrasts and not appear as two dummy variables in the model. Can also be a call to a mathematical function (such as exp, sin, ...) or to a mathematical operator (\^, /, ...) applied to a numerical variable. The call must always agree with the label used to identify the corresponding parameter in the fitted model object passed through the object argument. Beware that the label includes the identity function I() if an arithmetic operator was used. Other function types (e.g. factor) and interactions are not admitted. |
... | absorbs any additional arguments. See cond.glmand cond.rsm for details. |
Details
This function is generic (see [methods](../../../../doc/manuals/r-patched/packages/utils/refman/utils.html#topic+methods)
); method functions can be written to handle specific classes of data. Classes which already have methods for this function include: glm
and rsm
.
Value
The returned value is an _approximate conditional inference_object. Classes already supported are cond
and marg
depending on whether the fitted model object passed through the object
argument has class glm
or rsm
. See [cond.object](#topic+cond.object)
or [marg.object](../../marg/refman/marg.html#topic+marg.object)
for more details.
References
Brazzale, A. R. (2000) Practical Small-Sample Parametric Inference. Ph.D. Thesis N. 2230, Department of Mathematics, Swiss Federal Institute of Technology Lausanne. Chapter 6.
See Also
[cond.glm](#topic+cond.glm)
, [cond.rsm](../../marg/refman/marg.html#topic+cond.rsm)
, [cond.object](#topic+cond.object)
, [marg.object](../../marg/refman/marg.html#topic+marg.object)
Examples
## Urine Data
data(urine)
urine.glm <- glm(r ~ gravity + ph + osmo + conduct + urea + log(calc),
family = binomial, data = urine)
##
## function call as offset variable
labels(coef(urine.glm))
cond(urine.glm, log(calc))
##
## large estimate of regression coefficient
urine.glm <- glm(r ~ gravity + ph + osmo + conduct + urea + calc,
family = binomial, data = urine)
coef(urine.glm)
urine.glm <- glm(r ~ I(gravity * 100) + ph + osmo + conduct + urea + calc,
family = binomial, data = urine)
coef(urine.glm)
urine.cond <- cond(urine.glm, I(gravity * 100))
plot(urine.cond, which = 4)
## House Price Data
## Not run:
data(houses)
houses.rsm <- rsm(price ~ ., family = student(5), data = houses)
##
## parameter of interest: scale parameter
houses.marg <- cond(houses.rsm, scale)
plot(houses.marg, which = 2)
## End(Not run)
Approximate Conditional Inference for Logistic and Loglinear Models
Description
Performs approximate conditional inference on a scalar parameter of interest in logistic and loglinear models. The output is stored in an object of class cond
.
Usage
## S3 method for class 'glm'
cond(object, offset, formula = NULL, family = NULL,
data = sys.frame(sys.parent()), pts = 20,
n = max(100, 2*pts), tms = 0.6, from = NULL, to = NULL,
control = glm.control(...), trace = FALSE, ...)
Arguments
object | a glm object. Families supported are binomial and Poisson with canonical link function. |
---|---|
offset | the covariate occurring in the model formula whose coefficient represents the parameter of interest. May be numerical or a two-level factor. In case of a two-level factor, it must be coded by contrasts and not appear as two dummy variables in the model. Can also be a call to a mathematical function (such as exp, sin, ...) or to a mathematical operator (^, /, ...) applied to a numerical variable. The call must always agree with the label used to identify the corresponding parameter in the glm object passed through the object argument or defined by formula and family. Beware that the label includes the identity function I() if an arithmetic operator was used. Other function types (e.g. factor) and interactions are not admitted. |
formula | a formula expression (only if no glm object is defined). |
family | a family object defining the variance function (only if noglm object is defined). Families supported are binomial and Poisson with canonical link function. |
data | an optional data frame in which to interpret the variables occurring in the formula (only if no glm object is defined). |
pts | number of output points (minimum 10) that are calculated exactly. The default is 20. |
n | approximate number of output points (minimum 50) produced by the spline interpolation. The default is the maximum between 100 and twice pts. |
tms | defines the range MLE +/- tms * s.e.where cubic spline interpolation is replaced by polynomial interpolation. The default is 0.6. |
from | starting value of the sequence that contains the values of the parameter of interest for which output points are calculated exactly. The default is MLE - 3.5 * s.e. |
to | ending value of the sequence that contains the values of the parameter of interest for which output points are calculated exactly. The default is MLE + 3.5 * s.e. |
control | a list of iteration and algorithmic constants that controls the GLM fit. See \ glm.control for their names and default values. |
trace | if TRUE, iteration numbers will be printed. |
... | additional arguments, such as subset etc., used by the glm fitting routine if the glm object is defined through formula and family. See glm for their definition and use. The arguments weights, offset and contrasts are not admitted. The returned value is an object of class cond; see cond.object for details. |
Details
This function is a method for the generic function cond
for class glm
. It can be invoked by calling [cond](#topic+cond)
for an object of the appropriate class, or directly by calling cond.glm
regardless of the class of the object. cond.glm
has also to be used if the glm
object is not provided throught the object
argument but specified by formula
and family
.
The function cond.glm
implements several small sample asymptotic methods for approximate conditional inference in logistic and loglinear models. Approximations for both the conditional log likelihood function and conditional tail area probabilities are available (see [cond.object](#topic+cond.object)
for details). Attention is restricted to a scalar parameter of interest. The associated covariate can be either numerical or a two-level factor.
Approximate conditional inference is performed by either updating a fitted generalized linear model or defining the model formula and family. All approximations are calculated exactly for pts
equally spaced points ranging from from
to to
. A cubic spline interpolation is used to extend them over the whole interval of interest, except for the range of values defined by MLE +/- tms
* s.e. where the spline interpolation is replaced by a higher order polynomial interpolation. This is done in order to avoid numerical instabilities which are likely to occur for values of the parameter of interest close to the MLE. Results are stored in an object of class cond
. Method functions like print
, summary
and plot
can be used to examine the output or represent it graphically. Components can be extracted using coef
, formula
and family
.
Main references for the methods considered are the papers by Pierce and Peters (1992) and Davison (1988). More details on the implementation are given in Brazzale (1999, 2000).
Value
The returned value is an object of class cond
; see [cond.object](#topic+cond.object)
for details.
Note
In rare occasions, cond.glm
dumps because of non-convergence of the function glm
which is used to refit the model for a fixed value of the parameter of interest. This happens for instance if this value is too extreme. The arguments from
and to
may then be used to limit the default range of MLE +/- 3.5 * s.e. A further possibility is to fine-tuning the constants (number of iterations, convergence threshold) that control the GLM fit through the control
argument.
cond.glm
may also dump if the estimate of the parameter of interest is large (tipically > 400) in absolute value. This may be avoided by reparametrizing the model.
The output of cond.glm
may be unreliable if part of the data have a degenerate distribution. For example take the fungal infections treatment data contained in the [fungal](#topic+fungal)
data frame. Of the five 2\times 2
contingency tables, two (the first and the third) are degenerate. As they make no contribution to the exact conditional likelihood, they should be omitted from the approximate conditional fit.
References
Brazzale, A. R. (1999) Approximate conditional inference for logistic and loglinear models. J. Comput. Graph. Statist., 8, 1999, 653–661.
Brazzale, A. R. (2000) Practical Small-Sample Parametric Inference. Ph.D. Thesis N. 2230, Department of Mathematics, Swiss Federal Institute of Technology Lausanne.
Davison, A. C. (1988) Approximate conditional inference in generalized linear models. J. R. Statist. Soc. B, 50, 445–461.
Pierce, D. A. and Peters, D. (1992) Practical use of higher order asymptotics for multiparameter exponential families (with Discussion). J. R. Statist. Soc. B, 54, 701–737.
See Also
[cond.object](#topic+cond.object)
, [summary.cond](#topic+summary.cond)
, [plot.cond](#topic+plot.cond)
, [glm](../../../../doc/manuals/r-patched/packages/stats/refman/stats.html#topic+glm)
Examples
## Crying Babies Data
data(babies)
babies.glm <- glm(formula = cbind(r1, r2) ~ day + lull - 1,
family = binomial, data = babies)
babies.cond <- cond(object = babies.glm, offset = lullyes)
babies.cond
##
## If one wishes to avoid the generalized linear model fit:
babies.cond <- cond.glm(formula = cbind(r1, r2) ~ day + lull - 1,
family = binomial, data = babies, offset = lullyes)
babies.cond
## Urine Data
## (function call as offset variable)
data(urine)
urine.glm <- glm(r ~ gravity + ph + osmo + conduct + urea + log(calc),
family = binomial, data = urine)
labels(coef(urine.glm))
urine.cond <- cond(urine.glm, log(calc))
##
## (large estimate of regression coefficient)
urine.glm <- glm(r ~ gravity + ph + osmo + conduct + urea + calc,
family = binomial, data = urine)
coef(urine.glm)
urine.glm <- glm(r ~ I(gravity * 100) + ph + osmo + conduct + urea + calc,
family = binomial, data = urine)
coef(urine.glm)
urine.cond <- cond(urine.glm, I(gravity * 100))
## Fungal Infections Treatment Data (numerical instabilities around the
## MLE)
## (full data analysis)
data(fungal)
fungal.glm <- glm(cbind(success, failure) ~ center + group - 1,
family = binomial, data = fungal,
control = glm.control(maxit = 50, epsilon = 1e-005))
fungal.cond <- cond(fungal.glm, groupT)
plot(fungal.cond, which = 2)
## (partial data analysis)
fungal.glm <- glm(cbind(success, failure) ~ center + group - 1,
family = binomial, data = fungal, subset = -c(1,2,5,6),
control = glm.control(maxit = 50, epsilon = 1e-005))
fungal.cond <- cond(fungal.glm, groupT)
plot(fungal.cond, which = 2)
## (Tables 1 and 3 are omitted).
Approximate Conditional Inference Object
Description
Class of objects returned when performing approximate conditional inference for logistic and loglinear models.
Arguments
Objects of class cond
are implemented as a list. The following components are included:
workspace | a list whose elements are the spline interpolations of several first order and higher order statistics. They are used to implement the following likelihood quantities: - the profile and modified profile log likelihoods; - the Wald pivots from the unconditional and conditional MLEs; - the profile and modified likelihood roots (the latter one with a suitable continuity correction); - the Lugannani-Rice tail area approximation (with suitable continuity correction); - the correction term used in the higher order statistics; - the information and nuisance parameter aspects. Method functions work mainly on this part of the object. In order to avoid errors in the calculation of confidence intervals and tail probabilities, this part of the object should not be modified. |
---|---|
coefficients | a 2\times 2 matrix containing the unconditional and approximate conditional MLEs and their standard errors. |
call | function call that created the cond object. |
formula | the model formula. |
family | the variance function. |
offset | the covariate occurring in the model formula whose coefficient represents the parameter of interest. |
diagnostics | diagnostics related to the decomposition of the higher order adjustments into an information and a nuisance parameters term. A value larger than 0.2 in absolute value is an index that higher order methods are needed. See Pierce and Peters (1992) for details. |
n.approx | number of output points that have been calculated exactly. |
omitted.val | range of values omitted in the spline interpolation of some of the higher order statistics considered. The aim is to avoid numerical instabilities around the maximum likelihood estimate. |
is.scalar | a logical value indicating whether there are any nuisance parameters. If FALSE there are none. |
Main references for the methods considered are the papers by Pierce and Peters (1992) and Davison (1988). More details on the implementation and the methods considered are given in Brazzale (1999, 2000).
Generation
This class of objects is returned from calls to the function [cond.glm](#topic+cond.glm)
.
Methods
The class cond
has methods for [summary](../../../../doc/manuals/r-patched/packages/base/refman/base.html#topic+summary)
, [plot](../../../../doc/manuals/r-patched/packages/graphics/refman/graphics.html#topic+plot)
, [print](../../../../doc/manuals/r-patched/packages/base/refman/base.html#topic+print)
, [coef](../../../../doc/manuals/r-patched/packages/stats/refman/stats.html#topic+coef)
and [family](../../../../doc/manuals/r-patched/packages/stats/refman/stats.html#topic+family)
, amongst others.
References
Brazzale, A. R. (1999) Approximate conditional inference for logistic and loglinear models. J. Comput. Graph. Statist., 8, 653–661.
Brazzale, A. R. (2000) Practical Small-Sample Parametric Inference, Ph.D. Thesis N. 2230, Department of Mathematics, Swiss Federal Institute of Technology Lausanne.
Davison, A. C. (1988) Approximate conditional inference in generalized linear models. J. R. Statist. Soc. B, 50, 445–461.
Pierce, D. A. and Peters, D. (1992) Practical use of higher order asymptotics for multiparameter exponential families (with Discussion). J. R. Statist. Soc. B, 54, 701–737.
See Also
[cond.glm](#topic+cond.glm)
, [summary.cond](#topic+summary.cond)
, [plot.cond](#topic+plot.cond)
Dormicum Data
Description
The dormicum
data frame has 37 rows and 3 columns.
37 children in a pediatric intensive care unit were treated with varying doses and for varying duration with the drug Dormicum. The response variable is 1 if withdrawal symptoms were exhibited and 0 otherwise.
Usage
data(dormicum)
Format
This data frame contains the following columns:
symp
indicator of the presence of withdrawal symptoms;
dose
the drug dose in mg/kg;
days
the number of days treated.
Source
The data were supplied by Spadille Biostatistik, Denmark.
References
Mehta, C. R., Patel, N. T. and Senchaudhuri, P. (2000) Efficient Monte Carlo methods for conditional logistic regression. J. Amer. Statist. Ass., 95, 99–108.
Examples
data(dormicum)
par(mfrow = c(1,2))
plot(dose ~ symp, data = dormicum, xlab = "presence of withdrawal symptoms",
ylab = "treatment dose (mg/kg)")
plot(days ~ symp, data = dormicum, xlab = "presence of withdrawal symptoms",
ylab = "treatment days")
Use family() on a “cond” object
Description
This is a method for the function family()
for objects inheriting from class cond
. See [family](../../../../doc/manuals/r-patched/packages/stats/refman/stats.html#topic+family)
for the general behaviour of this function.
Usage
## S3 method for class 'cond'
family(object, ...)
Arguments
object | any object from which a family object can be extracted. |
---|---|
... | absorbs any additional argument. |
See Also
[family](../../../../doc/manuals/r-patched/packages/stats/refman/stats.html#topic+family)
Use family() on a “summaryCond” object
Description
This is a method for the function family()
for objects inheriting from class summaryCond
. See [family](../../../../doc/manuals/r-patched/packages/stats/refman/stats.html#topic+family)
for the general behaviour of this function.
Usage
## S3 method for class 'summaryCond'
family(object, ...)
Arguments
object | any object from which a family object can be extracted. |
---|---|
... | absorbs any additional argument. |
See Also
[family](../../../../doc/manuals/r-patched/packages/stats/refman/stats.html#topic+family)
Fraudulent Automobile Insurance Claims Data
Description
The fraudulent
data frame has 42 rows and 12 columns.
127 claims arising from automobile accidents in 1989 in Massachusetts (USA). Each claim was classified as either fraudulent or legitimate by consensus among four independent claims adjusters who examined each case file thoroughly. An exploratory analysis by Derrig and Weisberg (1993) identified 10 binary indicators, each of which denotes the presence or absence of a potential fraud characteristic in the claim situation. They fall into three broad groups relating to “Accident” (AC1, AC9 and AC16), “Claimant” (CL7 and CL11), and “Injury” (IJ2, IJ3, IJ4, IJ6 and IJ12).
Usage
data(fraudulent)
Format
This data frame contains the following columns:
r1
the number of frauds detected;
r2
the total number of automobile insurance claims;
AC1
,AC9
,AC16
potential fraud characteristics pertaining to “Accident”. The presence of the fraud characteristic is indicated by a 1, the absence is indicated by a 0.
CL7
,CL11
potential fraud characteristics pertaining to “Claimer”. The presence of the fraud characteristic is indicated by a 1, the absence is indicated by a 0.
IJ2
,IJ3
,IJ4
,IJ6
,IJ12
potential fraud characteristics pertaining to “Injury”. The presence of the fraud characteristic is indicated by a 1, the absence is indicated by a 0.
Source
The data were supplied by Dr. Richard Derrig of the Automobile Insurers Bureau of Massachusetts.
References
Mehta, C. R., Patel, N. T. and Senchaudhuri, P. (2000) Efficient Monte Carlo methods for conditional logistic regression. J. Amer. Statist. Ass., 95, 99–108.
Derrig, R. A. and Weisberg, H. I. (1993). Quantitative methods for detecting fraudulent automobile bodily injury claims. Manuscript.
Examples
data(fraudulent)
summary(fraudulent)
Fungal Infections Treatment Data
Description
The fungal
data frame has 10 rows and 4 columns.
Clinical trial on the success of a particular treatment for fungal infections. The study was carried out in five different research units. Interest focuses on the treatment effect.
Usage
data(fungal)
Format
This data frame contains the following columns:
success
the number of patients that benefited from the treatment;
failure
the number of patients with no benefit from the treatment;
group
an indicator variable for treatment (T
=treatment, P
=placebo);
center
a factor variable indicating the research unit where the study was carried out.
Source
The data were supplied by Sandoz Pharmaceuticals.
Examples
## (full data analysis)
data(fungal)
fungal.glm <- glm(cbind(success, failure) ~ center + group - 1,
family = binomial, data = fungal,
control = glm.control(maxit = 50, epsilon = 1e-005))
fungal.cond <- cond(fungal.glm, groupT)
plot(fungal.cond, which = 2)
## (partial data analysis)
fungal.glm <- glm(cbind(success, failure) ~ center + group - 1,
family = binomial, data = fungal, subset = -c(1,2,5,6),
control = glm.control(maxit = 50, epsilon = 1e-005))
fungal.cond <- cond(fungal.glm, groupT)
plot(fungal.cond, which = 2)
## (Tables 1 and 3 are omitted).
Generate Plots for an Approximate Conditional Inference Object
Description
Creates a set of plots for an object of class cond
.
Usage
## S3 method for class 'cond'
plot(x = stop("nothing to plot"), from = x.axis[1], to = x.axis[n],
which = NULL, alpha = 0.05, add.leg = TRUE, loc.leg = FALSE,
add.labs = TRUE, cex = 0.7, cex.lab = 1, cex.axis = 1,
cex.main = 1, lwd1 = 1, lwd2 = 2, lty1 = "solid",
lty2 = "dashed", col1 = "black", col2 = "blue", tck = 0.02,
las = 1, adj = 0.5, lab = c(15, 15, 5), ...)
Arguments
x | a cond object. This is assumed to be the result returned by the cond.glm function. |
---|---|
from | starting value for the x-axis range. The default value has been set by cond.glm. |
to | ending value for the x-axis range. The default value has been set by cond.glm. |
which | which plot should be printed. Admissible values are 2 to 8 corresponding to the choices in the menu below. |
alpha | the level used to read off confidence intervals; default is 5%. |
add.leg | if TRUE, a legend is added to each plot; default is TRUE. |
loc.leg | if TRUE, position of the legend can be located by hand; default is FALSE. |
add.labs | if TRUE, labels are added; default is TRUE. |
cex, cex.lab, cex.axis, cex.main | character expansions relative to the standard size of the device to be used for printing text, labels, axes and main title. See par for details. |
lwd1, lwd2 | line width used to compare different curves in the same plot; default is lwd2 = 2 for higher order solutions and lwd1 = 1 for first order solutions. |
lty1, lty2 | line type used to compare different curves in the same plot; default is lty2 = "dashed" for the Wald statistic and lty1 = "solid" for the remaining first- and higher order statistics. |
col1, col2 | colors used to compare different curves in the same plot; default is col2 = "blue" for higher order solutions, and col1 = "black" for the remaining first order statistics. |
tck, las, adj, lab | further graphical parameters. See par for details. |
... | optional graphical parameters; see par for details. |
Details
Several plots are produced for an object of class cond
. A menu lists all the plots that can be produced. They may be one or all of the following ones:
Make a plot selection (or 0 to exit)
1:plot: All 2:plot: Profile and modified profile log likelihoods 3:plot: Profile and modified profile likelihood ratios 4:plot: Profile and modified likelihood roots 5:plot: Modified and continuity corrected likelihood roots 6:plot: Lugannani-Rice approximations 7:plot: Confidence intervals 8:plot: Diagnostics based on INF/NP decomposition
Selection:
If no nuisance parameters are presented, a subset of the above pictures is produced. More details on the implementation are given in Brazzale (1999, 2000).
This function is a method for the generic function plot()
for class cond
. It can be invoked by calling plot
or directly plot.cond
for an object of the appropriate class.
Value
A plot is created on the current graphics device.
Side Effects
The current device is cleared. When add.leg = TRUE
, a legend is added to each plot, and if loc.leg = TRUE
, it can be set by the user. All screens are closed, but not cleared, on termination of the function.
Note
The diagnostic plots only represent a preliminary version and need further development.
The two panels on the right trace the information and nuisance correction terms, INF and NP, against the likelihood root statistic. These are generally smooth functions and used to approximate the information and nuisance parameter aspects as a function of the parameter of interest, as shown in the two panels on the left. This procedure has the advantage of largely eliminating the numerical instabilities that affect the statistics around the MLE. The circles in the two leftmost panels represent the limit of INF and NP at theMLE calculated exactly using numerical derivatives. All four pictures are intended to give an idea of the order of magnitude of the two correction terms while trying to deal with the numerical problems that likely occur for these kinds of data.
More details can be found in Brazzale (2000, Appendix B.2).
References
Brazzale, A. R. (1999) Approximate conditional inference for logistic and loglinear models. J. Comput. Graph. Statist., 8, 1999, 653–661.
Brazzale, A. R. (2000) Practical Small-Sample Parametric Inference, Ph.D. Thesis N. 2230, Department of Mathematics, Swiss Federal Institute of Technology Lausanne.
See Also
[cond.glm](#topic+cond.glm)
, [cond.object](#topic+cond.object)
, [summary.cond](#topic+summary.cond)
Examples
## Crying Babies Data
data(babies)
babies.glm <- glm(formula = cbind(r1, r2) ~ day + lull - 1,
family = binomial, data = babies)
babies.cond <- cond(object = babies.glm, offset = lullyes)
## Not run:
plot(babies.cond)
## End(Not run)
## Urine Data
data(urine)
urine.glm <- glm(r ~ I(gravity * 100) + ph + osmo + conduct + urea + calc,
family = binomial, data = urine)
urine.cond <- cond(urine.glm, I(gravity * 100))
plot(urine.cond, which=4)
Use print() on a “cond” object
Description
This is a method for the function print()
for objects inheriting from class cond
. See [print](../../../../doc/manuals/r-patched/packages/base/refman/base.html#topic+print)
and [print.default](../../../../doc/manuals/r-patched/packages/base/refman/base.html#topic+print.default)
for the general behaviour of this function and for the interpretation of digits
.
Usage
## S3 method for class 'cond'
print(x, digits=max(3, getOption("digits")-3), ...)
## S3 method for class 'cond'
print(x, digits, ...)
See Also
[cond.object](#topic+cond.object)
, [print](../../../../doc/manuals/r-patched/packages/base/refman/base.html#topic+print)
, [print.default](../../../../doc/manuals/r-patched/packages/base/refman/base.html#topic+print.default)
Use print() on a “summaryCond” object
Description
This is a method for the function print()
for objects inheriting from class summaryCond
. See [print](../../../../doc/manuals/r-patched/packages/base/refman/base.html#topic+print)
and [print.default](../../../../doc/manuals/r-patched/packages/base/refman/base.html#topic+print.default)
for the general behaviour of this function and for the interpretation of digits
.
Usage
## S3 method for class 'summaryCond'
print(x, all = x$all, Coef = x$cf, int = x$int, test = x$hyp,
digits = if(!is.null(x$digits)) x$digits else max(3, getOption("digits")-3),
...)
## S3 method for class 'summaryCond'
print(x, all, Coef, int, test, digits, ...)
Arguments
x | a summaryCond object. This is assumed to be the result returned by the summary.cond function. |
---|---|
all | if TRUE all the information stored in the summaryCond object is printed, else only a subset of it. The default is FALSE. |
Coef | if TRUE, the unconditional and conditional parameter estimates are printed. The default is TRUE. |
int | if TRUE, confidence intervals are printed. The default is TRUE. |
test | if TRUE, tests statistics and tail probabilities are printed. The default is FALSE. |
digits | number of significant digits to be printed. The default depends on the value of digits set by options. |
... | additional arguments. |
Details
Changing the default values of all
, Coef
, int
and test
allows only a subset of the information in the summaryCond
object to be printed. With all = FALSE
, one-sided confidence intervals and the Lugannani-Rice tail approximations are omitted. See [summary.cond](#topic+summary.cond)
for more details.
Note
The amount of information printed may vary depending on whether there are any nuisance parameters.
See Also
[summary.cond](#topic+summary.cond)
, [cond.object](#topic+cond.object)
, [print.default](../../../../doc/manuals/r-patched/packages/base/refman/base.html#topic+print.default)
Examples
## Urine Data
data(urine)
urine.glm <- glm(r ~ gravity + ph + osmo + conduct + urea + calc,
family = binomial, data = urine)
urine.cond <- cond(urine.glm, urea)
print(summary(urine.cond, all = TRUE), digits = 4)
print(summary(urine.cond), Coef = FALSE)
Rabbits Data
Description
The rabbits
data frame has 10 rows and 4 columns.
Five different doses of penicillin were administered to rabbits suffering from a streptococci infection and the number of recovering rabbits recorded. The rabbits are cross-classified according to whether the drug is administered immediately or delayed by an hour and a half. Interest focuses on whether the delay effects the treatment.
Usage
data(rabbits)
Format
This data frame contains the following columns:
cured
the number of rabbits that recovered;
died
the number of rabbits that died;
delay
an indicator variable indicating whether the administration of penicillin was delayed by 1 1/2 hours;
penicil
the penicillin dose.
Source
Unknown.
Examples
data(rabbits)
attach(rabbits)
fc <- cured/(cured + died)
coplot(fc ~ log(penicil) | delay, data = rabbits)
Summary Method for Objects of Class “cond”
Description
Returns a summary list for objects of class cond
.
Usage
## S3 method for class 'cond'
summary(object, alpha = 0.05, test = NULL, all = FALSE, coef = TRUE,
int = ifelse( (is.null(test) || all), TRUE, FALSE),
digits = NULL, ...)
Arguments
object | a cond object. This is assumed to be the result returned by the cond.glm function. |
---|---|
alpha | vector of levels for confidence intervals. The default is 5%. |
test | vector of values of the parameter of interest one wants to test for. If NULL, no test is performed. The default is NULL. |
all | logical value; if TRUE, all the information stored in the summaryCond object is printed, else only a subset of it. The default is FALSE. |
coef | logical value; if TRUE, the unconditional and conditional parameter estimates are printed. The default is TRUE. |
int | logical value; if TRUE confidence intervals are printed. The default is TRUE. |
digits | number of significant digits to be printed. The default depends on the value of digits set by options. |
... | absorbs any additional argument. |
Details
This function is a method for the generic function summary()
for objects of class cond
. It can be invoked by calling summary
or directly summary.cond
for an object of the appropriate class.
Value
A list is returned with the following components.
coefficients | a 2\times 2 matrix containing the unconditional and approximate conditional MLEs and their standard errors. |
---|---|
conf.int | a matrix containing, for each level given in alpha, the upper and lower confidence bounds derived from several first- and higher order test statistics. One-sided and two-sided confidence intervals are considered. See cond.object for details on the test statistics. |
signif.tests | a list with two elements. The first (stats) contains, for each value given in test, the values and tail probabilities of several first- and higher order test statistics. See cond.object for details on the test statistics.The second element of the list (qTerm) contains for each tested hypothesis the correction term used in the higher order solutions. |
call | the function call that created the cond object. |
formula | the model formula. |
family | the variance function. |
offset | the covariate occurring in the model formula whose coefficient represents the parameter of interest. |
alpha | vector of levels used to compute the confidence intervals. |
hypotheses | values for the parameter of interest that have been tested for. |
diagnostics | information and nuisance parameters aspects; see cond.object for details. |
n.approx | number of output points that have been calculated exactly. |
all | logical value; if TRUE, all the information stored in the summaryCond object is printed. |
cf | logical value; if TRUE, the unconditional and conditional parameter estimates are printed. |
int | logical value; if TRUE, confidence intervals are printed. |
is.scalar | a logical value indicating whether there are any nuisance parameters. If FALSE there are none. |
digits | number of significant digits to be printed. |
Note
The amount of information calculated may vary depending on whether there are any nuisance parameters.
See Also
[summary](../../../../doc/manuals/r-patched/packages/base/refman/base.html#topic+summary)
, [cond.object](#topic+cond.object)
Examples
## Crying Babies Data
data(babies)
babies.glm <- glm(formula = cbind(r1, r2) ~ day + lull - 1,
family = binomial, data = babies)
babies.cond <- cond(object = babies.glm, offset = lullyes)
summary(babies.cond, test = 0, coef = FALSE)
Urine Data
Description
The urine
data frame has 77 rows and 7 columns.
79 urine specimens were analyzed in an effort to determine if certain physical characteristics of the urine might be related to the formation of calcium oxalate crystals.
Usage
data(urine)
Format
This data frame contains the following columns:
r
indicator of the presence of calcium oxalate crystals;
gravity
the specific gravity of the urine, i.e. the density of urine relative to water;
ph
the pH reading of the urine;
osmo
the osmolarity of the urine. Osmolarity is proportional to the concentration of molecules in solution (mOsm).
conduct
The conductivity of the urine. Conductivity is proportional to the concentration of charged ions in solution (mMho milliMho).
urea
the urea concentration in millimoles per litre;
calc
the calcium concentration in millimoles per litre.
Source
The data were obtained from
Andrews, D. F. and Herzberg, A. M. (1985) Data: A Collection of Problems from Many Fields for the Student and Research Worker, Cambridge: Cambridge University Press.
References
Davison, A. C. and Hinkley, D. V. (1997). Bootstrap Methods and Their Application (Example 7.8). Cambridge: Cambridge University Press.
Examples
data(urine)
summary(urine)
pairs(urine)
##
data(urine)
urine.glm <- glm(r ~ gravity + ph + osmo + conduct + urea + log(calc),
family = binomial, data = urine)
labels(coef(urine.glm))
urine.cond <- cond(urine.glm, log(calc))
##
## (large estimate of regression coefficient)
urine.glm <- glm(r ~ gravity + ph + osmo + conduct + urea + calc,
family = binomial, data = urine)
coef(urine.glm)
urine.glm <- glm(r ~ I(gravity * 100) + ph + osmo + conduct + urea + calc,
family = binomial, data = urine)
coef(urine.glm)
urine.cond <- cond(urine.glm, I(gravity * 100))