Title: | Log-Concave Density Estimation in Arbitrary Dimensions |
---|---|
Description: | Software for computing a log-concave (maximum likelihood) estimator for independent and identically distributed data in any number of dimensions. For a detailed description of the method see Cule, Samworth and Stewart (2010, Journal of Royal Statistical Society Series B, <doi:10.1111/j.1467-9868.2010.00753.x>). |
Authors: | Madeleine Cule [aut], Robert Gramacy [aut], Richard Samworth [aut], Yining Chen [aut, cre], Lutz Duembgen [ctb] (helped improve the numerical stability), Vikneswaran Gopal [ctb] (proposed the Metropolis-Hastings scheme for draw samples from a log-concave distribution), George Casella [ctb] (proposed the Metropolis-Hastings scheme for draw samples from a log-concave distribution), C. Bradford Barber [cph] (author of the Qhull library), The Geometry Center [cph], Kai Habel [cph] (author of the function that imports convex hull computation from Qhull to R), Raoul Grasman [cph] (author of the function that imports convex hull computation from Qhull to R), Alexei Kuntsevich [cph] (author of the C function for Shor's r algorithm), Franz Kappel [cph] (author of the C function for Shor's r algorithm) |
Maintainer: | Yining Chen <[email protected]> |
License: | GPL (>= 2) |
Version: | 1.6-10 |
Built: | 2024-12-11 07:20:17 UTC |
Source: | CRAN |
This package contains a function to compute the maximum
likelihood estimator of a log-concave density in any number of
dimensions using Shor's -algorithm.
Functions to plot (for 1- and 2-d data), evaluate and draw samples from the maximum likelihood estimator are provided.
This package contains a selection of functions for maximum likelihood estimation under the constraint of log-concavity.
mlelcd
computes the maximum likelihood estimator
(specified via its value at data points). Output is a list of class
"LogConcDEAD"
which is used as input to various auxiliary functions.
hatA
calculates the difference between the sample covariance and the fitted covariance.
dlcd
evaluates the estimated density at a particular point.
dslcd
evaluates the smoothed version of estimated density at a particular point.
rlcd
draws samples from the estimated density.
rslcd
draws samples from the smoothed version of estimated density.
interplcd
interpolates the estimated density on
a grid for plotting purposes.
dmarglcd
evaluates the estimated marginal density by
integrating the estimated density over an appropriate subspace.
interpmarglcd
evaluates a marginal density estimate at
equally spaced points along the axis for plotting purposes. This is
done by integrating the estimated density over an appropriate subspace.
plot.LogConcDEAD
produces plots of the maximum likelihood
estimator, optionally using the rgl package.
print
and
summary
methods are also available.
The authors gratefully acknowledge the assistance of Lutz Duembgen at the University of Bern for his insight into the objective function in mlelcd.
For one dimensional data, the active set algorithm in
logcondens
is much faster.
Yining Chen (maintainer) [email protected]
Madeleine Cule
Robert Gramacy
Richard Samworth
Barber, C.B., Dobkin, D.P., and Huhdanpaa, H.T. (1996) The Quickhull algorithm for convex hulls ACM Trans. on Mathematical Software, 22(4) p.469-483 http://www.qhull.org
Chen, Y. and Samworth, R. J. (2013) Smoothed log-concave maximum likelihood estimation with applications Statist. Sinica, 23, 1373-1398. https://arxiv.org/abs/1102.1191v4
Cule, M. L. and Duembgen, L. (2008) On an auxiliary function for log-density estimation, University of Bern technical report. https://arxiv.org/abs/0807.4719
Cule, M. L., Samworth, R. J., and Stewart, M. I. (2010) Maximum likelihood estimation of a multi-dimensional log-concave density J. Roy. Statist. Soc., Ser. B. (with discussion), 72, 545-600.
Gopal, V. and Casella, G. (2010) Discussion of Maximum likelihood estimation of a log-concave density by Cule, Samworth and Stewart J. Roy. Statist. Soc., Ser. B., 72, 580-582.
Grundmann, A. and Moeller, M. (1978) Invariant Integration Formulas for the N-Simplex by Combinatorial Methods SIAM Journal on Numerical Analysis, Volume 15, Number 2, 282-290.
Kappel, F. and Kuntsevich, A. V. (2000) An implementation of Shor's r-algorithm Computational Optimization and Applications, Volume 15, Issue 2, 193-205.
Shor, N. Z. (1985) Minimization methods for nondifferentiable functions Springer-Verlag
## Some simple normal data, and a few plots x <- matrix(rnorm(200),ncol=2) lcd <- mlelcd(x) g <- interplcd(lcd) oldpar <- par(mfrow = c(1,1)) par(mfrow=c(2,2), ask=TRUE) plot(lcd, g=g, type="c") plot(lcd, g=g, type="c", uselog=TRUE) plot(lcd, g=g, type="i") plot(lcd, g=g, type="i", uselog=TRUE) par(oldpar) ## Some plots of marginal estimates g.marg1 <- interpmarglcd(lcd, marg=1) g.marg2 <- interpmarglcd(lcd, marg=2) plot(lcd, marg=1, g.marg=g.marg1) plot(lcd, marg=2, g.marg=g.marg2) ## generate some points from the fitted density generated <- rlcd(100, lcd) genmean <- colMeans(generated) ## evaluate the fitted density mypoint <- c(0, 0) dlcd(mypoint, lcd, uselog=FALSE) mypoint <- c(10, 0) dlcd(mypoint, lcd, uselog=FALSE) ## evaluate the marginal density dmarglcd(0, lcd, marg=1) dmarglcd(1, lcd, marg=2)
## Some simple normal data, and a few plots x <- matrix(rnorm(200),ncol=2) lcd <- mlelcd(x) g <- interplcd(lcd) oldpar <- par(mfrow = c(1,1)) par(mfrow=c(2,2), ask=TRUE) plot(lcd, g=g, type="c") plot(lcd, g=g, type="c", uselog=TRUE) plot(lcd, g=g, type="i") plot(lcd, g=g, type="i", uselog=TRUE) par(oldpar) ## Some plots of marginal estimates g.marg1 <- interpmarglcd(lcd, marg=1) g.marg2 <- interpmarglcd(lcd, marg=2) plot(lcd, marg=1, g.marg=g.marg1) plot(lcd, marg=2, g.marg=g.marg2) ## generate some points from the fitted density generated <- rlcd(100, lcd) genmean <- colMeans(generated) ## evaluate the fitted density mypoint <- c(0, 0) dlcd(mypoint, lcd, uselog=FALSE) mypoint <- c(10, 0) dlcd(mypoint, lcd, uselog=FALSE) ## evaluate the marginal density dmarglcd(0, lcd, marg=1) dmarglcd(1, lcd, marg=2)
This function computes the covariance matrix of a log-concave maximum likelihood estimator.
cov.LogConcDEAD(lcd)
cov.LogConcDEAD(lcd)
lcd |
Object of class |
This function evaluates the covariance matrix of a given log-concave maximum likelihood estimator using the second order partial derivatives of the auxiliary function studied in Cule, M. L. and Duembgen, L. (2008).
For examples, see mlelcd
.
A matrix
equals the covariance matrix of the
log-concave maximum likelihood density estimator.
Yining Chen
Madeleine Cule
Robert Gramacy
Richard Samworth
Cule, M. L. and Duembgen, L. (2008) On an auxiliary function for log-density estimation, University of Bern technical report. https://arxiv.org/abs/0807.4719
This function evaluates the density function of a log-concave maximum likelihood estimator at a point or points.
dlcd(x,lcd, uselog=FALSE, eps=10^-10)
dlcd(x,lcd, uselog=FALSE, eps=10^-10)
x |
Point (or |
lcd |
Object of class |
uselog |
Scalar |
eps |
Tolerance for numerical stability |
A log-concave maximum likelihood estimate
is satisfies
for some
, where
Functions of this form may equivalently be specified by dividing
, the convex hull of the data into simplices
for
(triangles in 2d, tetrahedra in 3d etc), and setting
for , and
for
. The estimated density is zero outside the convex
hull of the data.
The estimate may therefore be evaluated by finding the appropriate
simplex , then evaluating
(if
, set
).
For examples, see mlelcd
.
A vector
of maximum likelihood estimate (or log
maximum likelihood estimate) values, as evaluated at the points x
.
Madeleine Cule
Robert Gramacy
Richard Samworth
Integrates the log-concave maximum likelihood estimator of multivariate data to evaluate the marginal density at a point.
dmarglcd(x=0, lcd, marg=1)
dmarglcd(x=0, lcd, marg=1)
x |
Point (or |
lcd |
Object of class |
marg |
Which margin is required? |
Given a multivariate log-concave maximum likelihood estimator in the
form of an object of class "LogConcDEAD"
, a margin marg
,
and a real-valued point x
, this function evaluates the estimated
marginal density ,
as obtained by integrating over all the other dimensions.
For examples, see mlelcd
.
A vector
containing the values of the marginal density at the points
x
.
Madeleine Cule
Robert Gramacy
Richard Samworth
This function evaluates the density function of a smoothed log-concave maximum likelihood estimator at a point or points.
dslcd(x, lcd, A=hatA(lcd))
dslcd(x, lcd, A=hatA(lcd))
x |
Point (or |
lcd |
Object of class |
A |
A positive definite |
The smoothed log-concave maximum likelihood estimator is a fully automatic
nonparametric density estimator, obtained as a canonical smoothing of the
log-concave maximum likelihood estimator. More precisely, it equals the convolution
, where
is the density function of
d-dimensional multivariate normal with covariance matrix
.
Typically,
is taken as the difference between the sample covariance and
the covariance of fitted log-concave maximum likelihood density. Therefore, this estimator
matches both the empirical mean and empirical covariance.
The estimate is evaluated numerically either by Gaussian quadrature in two dimensions, or in
higher dimensions, via a combinatorial method proposed by Grundmann and Moeller (1978).
Details of the computational aspects can be found in Chen and Samworth (2011). In one
dimension, explicit expression can be derived. See
logcondens
for more information.
For examples, see mlelcd
A vector
of smoothed log-concave maximum likelihood estimate
values, as evaluated at the points x
.
Yining Chen
Madeleine Cule
Robert Gramacy
Richard Samworth
Chen, Y. and Samworth, R. J. (2013) Smoothed log-concave maximum likelihood estimation with applications Statist. Sinica, 23, 1373-1398. https://arxiv.org/abs/1102.1191v4
Grundmann, A. and Moeller, M. (1978) Invariant Integration Formulas for the N-Simplex by Combinatorial Methods SIAM Journal on Numerical Analysis, Volume 15, Number 2, 282-290.
Uses EM algorithm to estimate the mixture proportions and the component
densities. The output is an object of class "lcdmix"
which
contains mixture proportions at each observation and all the information of the
estimated component densities.
EMmixlcd( x, k = 2, y, props, epsratio=10^-6, max.iter=50, epstheta=10^-8, verbose=-1 )
EMmixlcd( x, k = 2, y, props, epsratio=10^-6, max.iter=50, epstheta=10^-8, verbose=-1 )
x |
Data in |
k |
The number of components, equals 2 by default |
y |
An |
props |
Vector of length |
epsratio |
EM algorithm will terminate if the increase in the proportion of the likelihood is
less than this specified ratio. Default value is |
max.iter |
The maximum number of iterations for the EM algorithm |
epstheta |
|
verbose |
|
An introduction to the Em algorithm can be found in McLachlan and Krishnan (1997).
Briefly, given the current estimates of the mixture proportions and component densities, we first
update the estimates of the mixture prroportions. We then update the estimates of the component
densities by using mlelcd
. In fact, the incorporation of the weights in the
maximization process in mlelcd
presents no additional complication.
In our case, because of the computational intensity of the method, we first cluster the points according to ta hierarchical Gaussian clustering model and then iterate the EM algorithm until the increase in the proportion of the likelihood is less than a pre-specified quantity at each step.
More technical details can be found in Cule, Samworth and Stewart(2010)
An object of class "lcdmix"
, with the following components:
x |
Data copied from input (may be reordered) |
logf |
An |
props |
Vector containing the estimated proportions of components |
niter |
Number of iterations of the EM algorithm |
lcdloglik |
The log-likelihood after the final iteration |
Yining Chen
Madeleine Cule
Robert B. Gramacy
Richard Samworth
Cule, M. L., Samworth, R. J., and Stewart, M. I. (2010) Maximum likelihood estimation of a log-concave density, Journal of the Royal Statistical Society, Series B, 72(5) p.545-607.
McLachlan, G. J. and Krishnan, T. (1997) The EM Algorithm and Extensions, New York: Wiley.
mclust
,
logcondens
,
plot.LogConcDEAD
,mlelcd
, dlcd
##Simple bivariate normal data set.seed( 1 ) n = 15 d = 2 props=c( 0.6, 0.4 ) shift=2 x <- matrix( rnorm( n*d ), ncol = d ) shiftvec <- ifelse( runif( n ) > props[ 1 ], 0, shift ) x[,1] <- x[,1] + shiftvec EMmixlcd( x, k = 2, max.iter = 2)
##Simple bivariate normal data set.seed( 1 ) n = 15 d = 2 props=c( 0.6, 0.4 ) shift=2 x <- matrix( rnorm( n*d ), ncol = d ) shiftvec <- ifelse( runif( n ) > props[ 1 ], 0, shift ) x[,1] <- x[,1] + shiftvec EMmixlcd( x, k = 2, max.iter = 2)
A function to construct an object of class LogConcDEAD
from a
dataset (given as a matrix) and the value of the log maximum
likelihood estimator at datapoints.
getinfolcd(x, y, w = rep(1/length(y), length(y)), chtol = 10^-6, MinSigma = NA, NumberOfEvaluations = NA)
getinfolcd(x, y, w = rep(1/length(y), length(y)), chtol = 10^-6, MinSigma = NA, NumberOfEvaluations = NA)
x |
Data in |
y |
Value of log of maximum likelihood estimator at data points |
w |
Vector of weights
subject to the restriction that |
chtol |
Tolerance for computation of convex hull. Altering this is not recommended. |
MinSigma |
Real-valued scalar giving minimum value of the objective function |
NumberOfEvaluations |
Vector containing the number of steps, number of function
evaluations, and number of subgradient evaluations. If the SolvOpt
algorithm fails, the first component will be an error code |
This function is used in mlelcd
An object of class "LogConcDEAD"
, with the following
components:
x |
Data copied from input (may be reordered) |
w |
weights copied from input (may be reordered) |
logMLE |
|
NumberOfEvaluations |
Vector containing the number of steps, number of function
evaluations, and number of subgradient evaluations. If the SolvOpt
algorithm fails, the first component will be an error code |
MinSigma |
Real-valued scalar giving minimum value of the objective function |
b |
|
beta |
|
triang |
|
verts |
|
vertsoffset |
|
chull |
Vector containing vertices of faces of the convex hull of the data |
outnorm |
|
outoffset |
|
Madeleine Cule
Robert B. Gramacy
Richard Samworth
Yining Chen
This function takes takes a matrix
of (possibly
binned) data and returns a matrix
containing the distinct
observations, and a vector
of weights as described below.
getweights(x)
getweights(x)
x |
a data |
Given an
matrix
of points in
, this function removes duplicated observations, and
counts the number of times each observation occurs. This is used to
compute a
vector
such that
This function is called by mlelcd
in order to compute
the maximum likelihood estimator when the observed data values are not
distinct. In this case, the log likelihood function is of the form
where the sum is over distinct observations.
xout |
A |
w |
A real-valued |
Madeleine Cule
Robert Gramacy
Richard Samworth
## simple normal example x <- matrix(rnorm(200),ncol=2) tmp <- getweights(x) lcd <- mlelcd(tmp$x,tmp$w) plot(lcd,type="ic")
## simple normal example x <- matrix(rnorm(200),ncol=2) tmp <- getweights(x) lcd <- mlelcd(tmp$x,tmp$w) plot(lcd,type="ic")
This function computes the matrix of the smoothed log-concave maximum
likelihood estimator
hatA(lcd)
hatA(lcd)
lcd |
Object of class |
This function evaluates the the matrix of the smoothed log-concave maximum
likelihood estimator, which is positive definite, and equals the difference between the
sample covariance matrix and the covariance matrix of the fitted log-concave maximum
likelihood density estimator.
For examples, see mlelcd
A matrix
equals of the smoothed log-concave maximum
likelihood estimator
Details of the computational aspects can be found in Chen and Samworth (2011).
Yining Chen
Madeleine Cule
Robert Gramacy
Richard Samworth
Chen, Y. and Samworth, R. J. (2013) Smoothed log-concave maximum likelihood estimation with applications Statist. Sinica, 23, 1373-1398. https://arxiv.org/abs/1102.1191v4
Uses tkrplot
to create a GUI for two-class classification
in two dimensions using the smoothed log-concave maximum likelihood estimates
interactive2D(data, cl)
interactive2D(data, cl)
data |
Data in |
cl |
factor of true classifications of the data set |
This function uses tkrplot
to create a GUI for two-class classification
in two dimensions using the smoothed log-concave maximum likelihood estimates. The construction of the
classifier is standard, and can be found in Chen and Samworth (2013). The slider controls the risk
ratio of two classes (equals one by default), which provides a way of demonstrating how the decision boundaries
change as the ratio varies. Observations from different classes are plotted in red and green respectively.
A GUI with a slider
Yining Chen
Madeleine Cule
Robert B. Gramacy
Richard Samworth
Chen, Y. and Samworth, R. J. (2013) Smoothed log-concave maximum likelihood estimation with applications Statist. Sinica, 23, 1373-1398. https://arxiv.org/abs/1102.1191v4
Cule, M. L., Samworth, R. J., and Stewart, M. I. (2010) Maximum likelihood estimation of a log-concave density, Journal of the Royal Statistical Society, Series B, 72(5) p.545-607.
## Simple bivariate normal data ## only works interactively, not run as a test example here if(interactive()){ set.seed( 1 ) n = 15 d = 2 props=c( 0.6, 0.4 ) x <- matrix( rnorm( n*d ), ncol = d ) shiftvec <- ifelse( runif( n ) > props[ 1 ], 0, 1) x[,1] <- x[,1] + shiftvec interactive2D( x, shiftvec ) }
## Simple bivariate normal data ## only works interactively, not run as a test example here if(interactive()){ set.seed( 1 ) n = 15 d = 2 props=c( 0.6, 0.4 ) x <- matrix( rnorm( n*d ), ncol = d ) shiftvec <- ifelse( runif( n ) > props[ 1 ], 0, 1) x[,1] <- x[,1] + shiftvec interactive2D( x, shiftvec ) }
Evaluates the logarithm of the
log-concave maximum likelihood estimator on a grid for 2-d data, for use
in plot.LogConcDEAD
.
interplcd(lcd, gridlen=100 )
interplcd(lcd, gridlen=100 )
lcd |
Object of class |
gridlen |
A scalar indicating the size of the grid |
Interpolates the MLE over a grid.
The output is of a form readily usable by
plot.LogConcDEAD
, image
,
contour
, etc, as illustrated in the examples below.
For examples, please see mlelcd
.
x |
Vector of |
y |
Vector of |
z |
A |
Madeleine Cule
Robert Gramacy
Richard Samworth
Integrates the maximum likelihood estimator of multivariate data over an appropriate subspace to
produce axis-aligned marginals for use in plot.LogConcDEAD
.
interpmarglcd(lcd, marg=1, gridlen=100)
interpmarglcd(lcd, marg=1, gridlen=100)
lcd |
Output from |
marg |
An (integer) scalar indicating which margin is required |
gridlen |
An (integer) scalar indicating the size of the grid |
Given a multivariate log-concave maximum likelihood
estimator in the form of an object of class "LogConcDEAD"
and
a margin marg
, this function will compute the marginal
density estimate .
The estimate is evaluated at
gridlen
equally spaced points in
the range where the density estimate is nonzero. These points are
given in the vector
xo
.
is evaluated by
integrating the log-concave maximum likelihood estimator
over the other components. The marginal
density is zero outside the range of
xo
.
For examples, see mlelcd
.
xo |
Vector of values at which the marginal density is estimate is computed. |
marg |
Vector of values of the integrated maximum likelihood
estimator at the locations |
Madeleine Cule
Robert Gramacy
Richard Samworth
Uses Shor's -algorithm to compute the maximum
likelihood estimator of a log-concave density based on an
i.i.d. sample. The estimator is uniquely determined by its value at the data
points. The output is an object of class
"LogConcDEAD"
which
contains all the information needed to plot the estimator using the
plot
method, or to evaluate it using
the function dlcd
.
mlelcd(x, w=rep(1/nrow(x),nrow(x)), y=initialy(x), verbose=-1, alpha=5, c=1, sigmatol=10^-8, integraltol=10^-4, ytol=10^-4, Jtol=0.001, chtol=10^-6)
mlelcd(x, w=rep(1/nrow(x),nrow(x)), y=initialy(x), verbose=-1, alpha=5, c=1, sigmatol=10^-8, integraltol=10^-4, ytol=10^-4, Jtol=0.001, chtol=10^-6)
x |
Data in |
w |
Vector of weights
|
subject to the restriction that is
log-concave. The default is
for all
,
which corresponds to i.i.d. observations.
y |
Vector giving starting point for the |
verbose |
|
alpha |
Scalar parameter for SolvOpt |
c |
Scalar giving starting step size |
sigmatol |
Real-valued scalar giving one of the stopping
criteria: Relative change in |
ytol |
Real-valued scalar giving on of the stopping criteria: Relative change in |
integraltol |
Real-valued scalar giving one of the stopping
criteria: |
Jtol |
Parameter controlling when Taylor expansion is used in
computing the function |
chtol |
Parameter controlling convex hull computations |
The log-concave maximum likelihood density estimator based on data
is the function that maximizes
subject to the constraint that is
log-concave. For i.i.d.~data, the weights
should be
for each
.
This is a function of the form for some
, where
Functions of this form may equivalently be specified by dividing
, the convex hull of the data, into simplices
for
(triangles in 2d, tetrahedra in 3d etc), and setting
for , and
for
.
This function uses Shor's -algorithm (an iterative
subgradient-based procedure) to minimize over vectors
in
the function
This is equivalent to finding the log-concave maximum likelihood estimator, as demonstrated in Cule, Samworth and Stewart (2008).
An implementation of Shor's -algorithm based on SolvOpt
is used.
Computing makes use of the qhull library.
Code from this C-based library is copied here as it is not currently
possible to use compiled code from another library.
For points not in general position, this requires a Taylor expansion of
, discussed in Cule and Duembgen (2008).
An object of class "LogConcDEAD"
, with the following
components:
x |
Data copied from input (may be reordered) |
w |
weights copied from input (may be reordered) |
logMLE |
|
NumberOfEvaluations |
Vector containing the number of steps, number of function
evaluations, and number of subgradient evaluations. If the SolvOpt
algorithm fails, the first component will be an error code |
MinSigma |
Real-valued scalar giving minimum value of the objective function |
b |
|
beta |
|
triang |
|
verts |
|
vertsoffset |
|
chull |
Vector containing vertices of faces of the convex hull of the data |
outnorm |
|
outoffset |
|
For one-dimensional data, the active set algorithm of
logcondens
is faster, and may be
preferred.
The authors gratefully acknowledge the assistance of Lutz Duembgen
at the University of Bern for his insight into the objective function
.
Further references, including definitions and background material, may be found in Cule, Samworth and Stewart (2010).
Madeleine Cule
Robert B. Gramacy
Richard Samworth
Yining Chen
Barber, C.B., Dobkin, D.P., and Huhdanpaa, H.T. (1996) The Quickhull algorithm for convex hulls ACM Trans. on Mathematical Software, 22(4) p.469-483 http://www.qhull.org
Cule, M. L. and Duembgen, L. (2008) On an auxiliary function for log-density estimation, University of Bern technical report. https://arxiv.org/abs/0807.4719
Cule, M. L., Samworth, R. J., and Stewart, M. I. (2010) Maximum likelihood estimation of a log-concave density, Journal of the Royal Statistical Society, Series B, 72(5) p.545-607.
Kappel, F. and Kuntsevich, A. V. (2000) An implementation of Shor's r-algorithm, Computational Optimization and Applications, Volume 15, Issue 2, 193-205.
Shor, N. Z. (1985) Minimization methods for nondifferentiable functions, Springer-Verlag
logcondens
,
interplcd
, plot.LogConcDEAD
,
interpmarglcd
, rlcd
, dlcd
,
## Some simple normal data, and a few plots x <- matrix(rnorm(200),ncol=2) lcd <- mlelcd(x) g <- interplcd(lcd) oldpar <- par(mfrow = c(1,1)) par(mfrow=c(2,2), ask=TRUE) plot(lcd, g=g, type="c") plot(lcd, g=g, type="c", uselog=TRUE) plot(lcd, g=g, type="i") plot(lcd, g=g, type="i", uselog=TRUE) par(oldpar) ## 2D interactive plot (need rgl package, not run here) if(interactive()) {plot(lcd, type="r")} ## Some plots of marginal estimates g.marg1 <- interpmarglcd(lcd, marg=1) g.marg2 <- interpmarglcd(lcd, marg=2) plot(lcd, marg=1, g.marg=g.marg1) plot(lcd, marg=2, g.marg=g.marg2) ## generate some points from the fitted density ## via independent rejection sampling generated1 <- rlcd(100, lcd) colMeans(generated1) ## via Metropolis-Hastings algorithm generated2 <- rlcd(100, lcd, "MH") colMeans(generated2) ## evaluate the fitted density mypoint <- c(0, 0) dlcd(mypoint, lcd, uselog=FALSE) mypoint <- c(1, 0) dlcd(mypoint, lcd, uselog=FALSE) ## evaluate the marginal density dmarglcd(0, lcd, marg=1) dmarglcd(1, lcd, marg=2) ## evaluate the covariance matrix of the fitted density covariance <- cov.LogConcDEAD(lcd) ## find the hat matrix for the smoothed log-concave that ## matches empirical mean and covariance A <- hatA(lcd) ## evaluate the fitted smoothed log-concave density mypoint <- c(0, 0) dslcd(mypoint, lcd, A) mypoint <- c(1, 0) dslcd(mypoint, lcd, A) ## generate some points from the fitted smoothed log-concave density generated <- rslcd(100, lcd, A)
## Some simple normal data, and a few plots x <- matrix(rnorm(200),ncol=2) lcd <- mlelcd(x) g <- interplcd(lcd) oldpar <- par(mfrow = c(1,1)) par(mfrow=c(2,2), ask=TRUE) plot(lcd, g=g, type="c") plot(lcd, g=g, type="c", uselog=TRUE) plot(lcd, g=g, type="i") plot(lcd, g=g, type="i", uselog=TRUE) par(oldpar) ## 2D interactive plot (need rgl package, not run here) if(interactive()) {plot(lcd, type="r")} ## Some plots of marginal estimates g.marg1 <- interpmarglcd(lcd, marg=1) g.marg2 <- interpmarglcd(lcd, marg=2) plot(lcd, marg=1, g.marg=g.marg1) plot(lcd, marg=2, g.marg=g.marg2) ## generate some points from the fitted density ## via independent rejection sampling generated1 <- rlcd(100, lcd) colMeans(generated1) ## via Metropolis-Hastings algorithm generated2 <- rlcd(100, lcd, "MH") colMeans(generated2) ## evaluate the fitted density mypoint <- c(0, 0) dlcd(mypoint, lcd, uselog=FALSE) mypoint <- c(1, 0) dlcd(mypoint, lcd, uselog=FALSE) ## evaluate the marginal density dmarglcd(0, lcd, marg=1) dmarglcd(1, lcd, marg=2) ## evaluate the covariance matrix of the fitted density covariance <- cov.LogConcDEAD(lcd) ## find the hat matrix for the smoothed log-concave that ## matches empirical mean and covariance A <- hatA(lcd) ## evaluate the fitted smoothed log-concave density mypoint <- c(0, 0) dslcd(mypoint, lcd, A) mypoint <- c(1, 0) dslcd(mypoint, lcd, A) ## generate some points from the fitted smoothed log-concave density generated <- rslcd(100, lcd, A)
plot
method for class "LogConcDEAD"
. Plots of various
types are available for 1- and 2-d data. For dimension greater than 1,
plots of axis-aligned marginal density estimates are available.
## S3 method for class 'LogConcDEAD' plot(x, uselog=FALSE, type="ic", addp=TRUE, drawlabels=TRUE, gridlen=400, g, marg, g.marg, main, xlab, ylab, ...)
## S3 method for class 'LogConcDEAD' plot(x, uselog=FALSE, type="ic", addp=TRUE, drawlabels=TRUE, gridlen=400, g, marg, g.marg, main, xlab, ylab, ...)
x |
Object of class |
uselog |
Scalar |
type |
Plot type: |
addp |
Scalar |
drawlabels |
Scalar |
gridlen |
Integer scalar indicating the number of points at which the maximum likelihood estimator is evaluated in each dimension |
g |
(optional) a |
marg |
If non- |
g.marg |
If |
main |
Title |
xlab |
x-axis label |
ylab |
y-axis label |
... |
Other arguments to be passed to the generic
|
The density estimate is evaluated on a grid of points using the
interplcd
function. If several plots are required, this
may be computed separately and passed to plot
using the
g
argument.
For two dimensional data, the default plot type is "ic"
,
corresponding to image
and contour
plots.
These may be obtained separately using plot type "i"
or "c"
respectively. Where available, the use of plot type "r"
is
recommended. This uses the rgl package
to produce a 3-d plot that may be rotated by the user. The option
"p"
produces perspective plots.
For data of dimension at least 2, axis-aligned marginals may be
plotted by setting the marg
argument. This integrates the
estimated density over the remaining dimensions. If several plots are
required, the estimate may be computed using the function
interpmarglcd
and passed using the argument
g.marg
.
Where relevant, the colors were obtained from the function
heat_hcl
in the colorspace package. Thanks to
Achim Zeileis for this suggestion.
For examples, see mlelcd
.
No return value, plot will display
Madeleine Cule
Robert B. Gramacy
Richard Samworth
Yining Chen
mlelcd
, interplcd
, interpmarglcd
, heat_hcl
Generic print
and summary
method for objects
of class "LogConcDEAD"
## S3 method for class 'LogConcDEAD' print(x, ...) ## S3 method for class 'LogConcDEAD' summary(object, ...)
## S3 method for class 'LogConcDEAD' print(x, ...) ## S3 method for class 'LogConcDEAD' summary(object, ...)
x |
Object of class |
object |
Object of class |
... |
Other arguments passed to |
print
and summary
currently perform the same function.
If there has been an error computing the maximum likelihood estimator, an error message is printed.
Otherwise, the value of the log maximum likelihood estimator at observation points is printed. The number of interations required by the subgradient and the number of function evaluations are also printed.
No return value, log MLE at observation points will be printed out on the screen.
Madeleine Cule
Robert B. Gramacy
Richard Samworth
Draws samples from a log-concave maximum likelihood
estimate. The estimate should be specified in the form of an object of
class "LogConcDEAD"
, the result of a call to
mlelcd
.
rlcd(n=1, lcd, method=c("Independent","MH"))
rlcd(n=1, lcd, method=c("Independent","MH"))
n |
A scalar integer indicating the number of samples required |
lcd |
Object of class |
method |
Indicator of the method used to draw samples, either via independent rejection sampling (default choice) or via Metropolis-Hastings |
This function by default uses a simple rejection sampling scheme to draw independent random samples from a log-concave maximum likelihood estimator. One can also use the Metropolis-Hastings option to draw (dependent) samples with a higher acceptance rate.
For examples, see mlelcd
.
A numeric matrix
with nsample
rows, each row corresponding to a point
in drawn from the distribution with density defined by
lcd
.
Details of the rejection sampling can be found in Appendix B.3 of Cule, Samworth and Stewart (2010). Details of the Metropolis-Hastings scheme can be found in Gopal and Casella (2010)
Yining Chen
Madeleine Cule
Robert Gramacy
Richard Samworth
Cule, M. L., Samworth, R. J., and Stewart, M. I. (2010) Maximum likelihood estimation of a multi-dimensional log-concave density J. Roy. Statist. Soc., Ser. B. (with discussion), 72, 545-600.
Gopal, V. and Casella, G. (2010) Discussion of Maximum likelihood estimation of a log-concave density by Cule, Samworth and Stewart J. Roy. Statist. Soc., Ser. B., 72, 580-582.
Draws samples from a smoothed log-concave maximum likelihood
estimate. The estimate should be specified in the form of an object of
class "LogConcDEAD"
, the result of a call to mlelcd
,
and a positive definite matrix.
rslcd(n=1, lcd, A=hatA(lcd), method=c("Independent","MH"))
rslcd(n=1, lcd, A=hatA(lcd), method=c("Independent","MH"))
n |
A scalar integer indicating the number of samples required |
lcd |
Object of class |
A |
A positive definite |
method |
Indicator of the method used to draw samples, either via independent rejection sampling (default choice) or via Metropolis-Hastings |
This function by default uses a simple rejection sampling scheme to draw independent random samples from a smoothed log-concave maximum likelihood estimator. One can also use the Metropolis-Hastings option to draw (dependent) samples with a higher acceptance rate.
For examples, see mlelcd
.
A numeric matrix
with n
rows, each row corresponding to a point
in drawn from the distribution with density defined by
lcd
and A
.
Yining Chen
Madeleine Cule
Robert Gramacy
Richard Samworth
Chen, Y. and Samworth, R. J. (2013) Smoothed log-concave maximum likelihood estimation with applications Statist. Sinica, 23, 1373-1398. https://arxiv.org/abs/1102.1191v4
Cule, M. L., Samworth, R. J., and Stewart, M. I. (2010) Maximum likelihood estimation of a multi-dimensional log-concave density J. Roy. Statist. Soc., Ser. B. (with discussion), 72, 545-600.
Gopal, V. and Casella, G. (2010) Discussion of Maximum likelihood estimation of a log-concave density by Cule, Samworth and Stewart J. Roy. Statist. Soc., Ser. B., 72, 580-582.