Package 'MLCOPULA'

Title: Classification Models with Copula Functions
Description: Provides several classifiers based on probabilistic models. These classifiers allow to model the dependence structure of continuous features through bivariate copula functions and graphical models, see Salinas-Gutiérrez et al. (2014) <doi:10.1007/s00180-013-0457-y>.
Authors: Rogelio Salinas Gutiérrez [aut, cre, cph] , Angélica Hernández Quintero [aut, cph] , Pedro Abraham Montoya Calzada [aut, cph] , Carlos Alberto López Hernández [aut, cph] , Juan Manuel Marquez Romero [aut, cph]
Maintainer: Rogelio Salinas Gutiérrez <[email protected]>
License: GPL-3
Version: 1.0.1
Built: 2024-12-23 06:19:02 UTC
Source: CRAN

Help Index


Calculates classification performance metrics.

Description

Calculates the confusion matrix and several performance metrics.

Usage

classification_report(
y_true,
y_pred
)

Arguments

y_true

A vector with the true labels.

y_pred

A vector with the predicted labels.

Value

Returns a list with the following entries:

metrics

A table with the precision, recall and f1-score for each class.

confusion_matrix

The confusion matrix.

accuracy

The accuracy.

mutual_information

The mutual information between the true and the predicted classes.

Examples

#Example 1
X <- iris[,1:4]
y <- iris$Species
model <- copulaClassifier(X = X, y = y, copula = "frank",
                      distribution = "kernel", graph_model = "tree")
y_pred <- copulaPredict(X = X, model = model)
classification_report(y_true = y, y_pred = y_pred$class)

#Example 2
X <- iris[,1:4]
y <- iris$Species
model <- copulaClassifier(X = X, y = y, copula = c("frank","clayton"), 
                        distribution = "kernel", graph_model = "chain")
y_pred <- copulaPredict(X = X, model = model)
classification_report(y_true = y, y_pred = y_pred$class)

Trains a classification model using copula functions.

Description

It trains a classification model based on copulas. The dependence structure of the joint density is built by using a graphical model along with bivariate copulas, as shown in Salinas-Gutiérrez et al., 2014.

Usage

copulaClassifier(
  X,
  y,
  distribution = "kernel",
  copula = "frank",
  weights = "likelihood",
  graph_model = "tree",
  k = 7,
  m = 7,
  method_grid = "ml"
)

Arguments

X

Data frame with nn samples and d>1d>1 predictor variables.

y

a vector of size nn, with the classes to predict.

distribution

Marginal distribution to be used: "normal" or "kernel", by default kernel.

copula

Either a character or a string vector with the name of the copula to be used: "amh", "clayton", "frank", "gaussian", "grid", "gumbel", "independent" and "joe", by default "frank". For parametric copulas, "amh", "clayton", "frank", "gaussian", "gumbel", and "joe", one or more copulas can be selected. For nonparametric copula, only "grid" can be selected. See the examples for more details.

weights

A character with the weight construction method for the graphical model: "likelihood" or "mutual_information", by default "likelihood".

graph_model

A character with the graphical model structure: "tree" or "chain", by default "tree".

k

Only for the grid copula. Positive integer indicating the number of subintervals for the U2U_2 variable.

m

Only for the grid copula. Positive integer indicating the number of subintervals for the U1U_1 variable.

method_grid

Only for the grid copula. Fitting method, least squares "ls" or maximum likelihood "ml", by default "ml".

Value

Returns a trained model.

References

Salinas-Gutiérrez, R., Hernández-Aguirre, A., Villa-Diharce, E.R. (2014). Copula selection for graphical models in continuous Estimation of Distribution Algorithms. Computational Statistics, 29(3–4):685–713. doi:10.1007/s00180-013-0457-y

Examples

# Example 1
X <- iris[,1:4]
y <- iris$Species
model <- copulaClassifier(X = X, y = y, copula = "frank",
                      distribution = "kernel", graph_model = "tree")
y_pred <- copulaPredict(X = X, model = model)
classification_report(y_true = y, y_pred = y_pred$class)

# Example 2
X <- iris[,1:4]
y <- iris$Species
model <- copulaClassifier(X = X, y = y, copula = c("frank","clayton"), 
                        distribution = "kernel", graph_model = "chain")
y_pred <- copulaPredict(X = X, model = model)
classification_report(y_true = y, y_pred = y_pred$class)

Gets predictions from a classification model.

Description

Use the models trained with copula functions to generate new predictions.

Usage

copulaPredict(X, model)

Arguments

X

Data frame with predictor variables.

model

A classification model given by copulaClassifier.

Value

A list with the following entries:

class

a vector with the predicted class.

prob

a data frame with the probabilities of each class.

Examples

# Example 1
X <- iris[,1:4]
y <- iris$Species
model <- copulaClassifier(X = X, y = y, copula = "frank",
                      distribution = "kernel", graph_model = "tree")
y_pred <- copulaPredict(X = X, model = model)
classification_report(y_true = y, y_pred = y_pred$class)

# Example 2
X <- iris[,1:4]
y <- iris$Species
model <- copulaClassifier(X = X, y = y, copula = c("frank","clayton"), 
                        distribution = "kernel", graph_model = "chain")
y_pred <- copulaPredict(X = X, model = model)
classification_report(y_true = y, y_pred = y_pred$class)