API Documentation

The AbstractClassifier Interface

Lighthouse.AbstractClassifier — Type

    AbstractClassifier

An abstract type whose subtypes C<:AbstractClassifier must implement:

- Lighthouse.classes
- Lighthouse.train!
- Lighthouse.loss_and_prediction

Subtypes may additionally overload default implementations for:

- Lighthouse.onehot
- Lighthouse.onecold
- Lighthouse.is_early_stopping_exception

The AbstractClassifier interface is built upon the expectation that any multiclass label will be represented in one of two standardized forms:

- "soft label": a probability distribution vector whose ith element is the probability assigned to the ith class in classes(classifier).
- "hard label": the integer index of the corresponding class in classes(classifier).

Internally, Lighthouse converts hard labels to soft labels via onehot and soft labels to hard labels via onecold.

See also: learn!
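To make the contract above concrete, here is a minimal sketch of an AbstractClassifier subtype. ConstantClassifier is a hypothetical toy (not part of Lighthouse) that predicts one fixed soft label for every sample and performs no actual optimization:

    using Lighthouse

    struct ConstantClassifier <: Lighthouse.AbstractClassifier
        soft_label::Vector{Float32}  # one probability per class; should sum to 1
    end

    # Required: the ordered collection of class values.
    Lighthouse.classes(::ConstantClassifier) = ["negative", "positive"]

    # Required: train for a single epoch, logging the loss for each batch.
    function Lighthouse.train!(c::ConstantClassifier, batches, logger)
        for _ in batches
            batch_loss = 0.0  # a real classifier would update parameters here
            Lighthouse.log_value!(logger, "train/loss_per_batch", batch_loss)
        end
        return c
    end

    # Required: return (loss, soft_label_batch) for an input batch, where
    # soft_label_batch has one row per class and one column per sample.
    function Lighthouse.loss_and_prediction(c::ConstantClassifier,
                                            input_batch::AbstractMatrix, args...)
        n_samples = size(input_batch, 2)  # assumes one sample per column
        return 0.0, repeat(c.soft_label, 1, n_samples)
    end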
Lighthouse.classes — Function

    Lighthouse.classes(classifier::AbstractClassifier)

Return a Vector or Tuple of class values for classifier.

This method must be implemented for each AbstractClassifier subtype.
Lighthouse.train! — Function

    Lighthouse.train!(classifier::AbstractClassifier, batches, logger)

Train classifier on the iterable batches for a single epoch. This function is called once per epoch by learn!.

This method must be implemented for each AbstractClassifier subtype. Implementers should ensure that the training loss is properly logged to logger by calling Lighthouse.log_value!(logger, "train/loss_per_batch", batch_loss) for each batch in batches.
Lighthouse.loss_and_prediction — Function

    Lighthouse.loss_and_prediction(classifier::AbstractClassifier,
                                   input_batch::AbstractArray,
                                   args...)

Return (loss, soft_label_batch) given input_batch and any additional args provided by the caller; loss is a scalar, while soft_label_batch is a matrix with length(classes(classifier)) rows and one column per sample in input_batch.

Specifically, the ith column of soft_label_batch is classifier's soft label prediction for the ith sample in input_batch.

This method must be implemented for each AbstractClassifier subtype.
Lighthouse.onehot — Function

    Lighthouse.onehot(classifier::AbstractClassifier, hard_label)

Return the one-hot encoded probability distribution vector corresponding to the given hard_label. hard_label must be an integer index in the range 1:length(classes(classifier)).
Lighthouse.onecold — Function

    Lighthouse.onecold(classifier::AbstractClassifier, soft_label)

Return the hard label (an integer index in the range 1:length(classes(classifier))) corresponding to the given soft_label (a one-hot encoded probability distribution vector).

By default, this function returns argmax(soft_label).
Lighthouse.is_early_stopping_exception — Function

    Lighthouse.is_early_stopping_exception(classifier::AbstractClassifier, exception)

Return true if exception should be considered an "early-stopping exception" (e.g. Flux.Optimise.StopException), rather than rethrown from learn!.

This function returns false by default, but can be overloaded by subtypes of AbstractClassifier that employ exceptions as early-stopping mechanisms.
The learn! Interface

Lighthouse.learn! — Function

    learn!(model::AbstractClassifier, logger,
           get_train_batches, get_test_batches, votes,
           elected=majority.(eachrow(votes), (1:length(classes(model)),));
           epoch_limit=100, post_epoch_callback=(_ -> nothing),
           optimal_threshold_class::Union{Nothing,Integer}=nothing,
           test_set_logger_prefix="test_set")

Return model after optimizing its parameters across multiple epochs of training and testing, logging Lighthouse's standardized suite of classifier performance metrics to logger throughout the optimization process.

The following phases are executed at each epoch (note: in the below lists of logged values, $resource takes the values of the field names of Lighthouse.ResourceInfo):

1. Train model by calling train!(model, get_train_batches(), logger). The following quantities are logged to logger during this phase:
   - train/loss_per_batch
   - any additional quantities logged by the relevant model/framework-specific implementation of train!
2. Compute model's predictions on the test set provided by get_test_batches() (see below for details). The following quantities are logged to logger during this phase:
   - <test_set_logger_prefix>_prediction/loss_per_batch
   - <test_set_logger_prefix>_prediction/mean_loss_per_epoch
   - <test_set_logger_prefix>_prediction/$resource_per_batch
3. Compute a battery of metrics to evaluate model's performance on the test set based on the test set prediction phase. The following quantities are logged to logger during this phase:
   - <test_set_logger_prefix>_evaluation/metrics_per_epoch
   - <test_set_logger_prefix>_evaluation/$resource_per_epoch
4. Call post_epoch_callback(current_epoch).

Where...

- get_train_batches is a zero-argument function that returns an iterable of training set batches. Internally, learn! uses this function when it calls train!(model, get_train_batches(), logger).
- get_test_batches is a zero-argument function that returns an iterable of test set batches used during the current epoch's test phase. Each element of the iterable takes the form (batch, votes_locations). Internally, batch is passed to loss_and_prediction as loss_and_prediction(model, batch...), and votes_locations[i] is expected to yield the row index of votes that corresponds to the ith sample in batch.
- votes is a matrix of hard labels whose columns correspond to voters and whose rows correspond to the samples in the test set that have been voted on. If votes[sample, voter] is not a valid hard label for model, then voter will simply be considered to have not assigned a hard label to sample.
- elected is a vector of hard labels where the ith element is the hard label elected as "ground truth" out of votes[i, :].
- optimal_threshold_class is the class index (1 or 2) for which to calculate an optimal threshold for converting predicted_soft_labels to predicted_hard_labels. This is only a valid parameter when length(classes) == 2. If optimal_threshold_class is present, test set evaluation will be based on predicted hard labels calculated with this threshold; if optimal_threshold_class is nothing, predicted hard labels will be calculated via onecold(classifier, soft_label).
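Putting the pieces together, a hypothetical invocation might look like the following, reusing the toy ConstantClassifier sketched earlier. The batch shapes, voter count, and logger binding are all illustrative assumptions (logger would typically be a LearnLogger, described below):

    get_train_batches() = (rand(Float32, 4, 8) for _ in 1:10)  # ten 4×8 training batches
    # Three test batches of 8 samples each; votes_locations (8i - 7):(8i) maps
    # the ith batch's samples onto rows 1:24 of the votes matrix.
    get_test_batches() = ((rand(Float32, 4, 8), (8i - 7):(8i)) for i in 1:3)
    votes = rand(1:2, 24, 5)  # 24 voted-on test samples, 5 voters
    model = ConstantClassifier(Float32[0.5, 0.5])
    learn!(model, logger, get_train_batches, get_test_batches, votes;
           epoch_limit=5)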
Lighthouse.evaluate! — Function

    evaluate!(predicted_hard_labels::AbstractVector,
              predicted_soft_labels::AbstractMatrix,
              elected_hard_labels::AbstractVector,
              classes, logger;
              logger_prefix, logger_suffix,
              votes::Union{Nothing,AbstractMatrix}=nothing,
              thresholds=0.0:0.01:1.0,
              optimal_threshold_class::Union{Nothing,Integer}=nothing)

Return nothing after computing and logging a battery of classifier performance metrics that each compare predicted_soft_labels and/or predicted_hard_labels against elected_hard_labels.

The following quantities are logged to logger:

- <logger_prefix>/metrics<logger_suffix>
- <logger_prefix>/$resource<logger_suffix>

Where...

- predicted_soft_labels is a matrix of soft labels whose columns correspond to classes and whose rows correspond to samples in the evaluation set.
- predicted_hard_labels is a vector of hard labels where the ith element is the hard label predicted by the model for sample i in the evaluation set.
- elected_hard_labels is a vector of hard labels where the ith element is the hard label elected as "ground truth" for sample i in the evaluation set.
- thresholds are the range of thresholds used by metrics (e.g. PR curves) that are calculated on the predicted_soft_labels for a range of thresholds.
- votes is a matrix of hard labels whose columns correspond to voters and whose rows correspond to the samples in the test set that have been voted on. If votes[sample, voter] is not a valid hard label for model, then voter will simply be considered to have not assigned a hard label to sample.
- optimal_threshold_class is the class index (1 or 2) for which to calculate an optimal threshold for converting the predicted_soft_labels to predicted_hard_labels. If present, the input predicted_hard_labels will be ignored and new predicted_hard_labels will be recalculated from the new threshold. This is only a valid parameter when length(classes) == 2.
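For illustration, a minimal direct call on made-up three-sample binary data; the class names and logger binding are assumptions (logger must implement the logging interface described below):

    predicted_soft_labels = [0.9 0.1; 0.2 0.8; 0.6 0.4]  # rows: samples, columns: classes
    predicted_hard_labels = [1, 2, 1]
    elected_hard_labels = [1, 2, 2]
    evaluate!(predicted_hard_labels, predicted_soft_labels, elected_hard_labels,
              ["class_A", "class_B"], logger;
              logger_prefix="test_set_evaluation", logger_suffix="_per_epoch")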
Lighthouse.predict! — Function

    predict!(model::AbstractClassifier,
             predicted_soft_labels::AbstractMatrix,
             batches, logger::LearnLogger;
             logger_prefix::AbstractString)

Return the mean loss of all batches after using model to predict their soft labels and storing those results in predicted_soft_labels.

The following quantities are logged to logger:

- <logger_prefix>/loss_per_batch
- <logger_prefix>/mean_loss_per_epoch
- <logger_prefix>/$resource_per_batch

Where...

- model is a model that outputs soft labels when called on a batch of batches, i.e. model(batch).
- predicted_soft_labels is a matrix whose columns correspond to classes and whose rows correspond to samples in batches, and which is filled in with soft-label predictions.
- batches is an iterable of batches, where each element of the iterable takes the form (batch, votes_locations). Internally, batch is passed to loss_and_prediction as loss_and_prediction(model, batch...).
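For instance, a hypothetical prediction pass continuing the learn! sketch above (model, get_test_batches, and logger are assumed from that sketch; 24 samples, 2 classes):

    # Pre-allocate one row per test sample and one column per class;
    # predict! fills this in while iterating over the batches.
    predicted_soft_labels = zeros(Float32, 24, 2)
    mean_loss = predict!(model, predicted_soft_labels, get_test_batches(), logger;
                         logger_prefix="test_set_prediction")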
The logging interface

The following "primitives" must be defined for a logger to be used with Lighthouse:

Lighthouse.log_value! — Function

    log_value!(logger, field::AbstractString, value)

Log the given value to field.

Lighthouse.log_line_series! — Function

    log_line_series!(logger, field::AbstractString, curves, labels=1:length(curves))

Logs a series plot to logger under field, where...

- curves is an iterable of the form Tuple{Vector{Real},Vector{Real}}, where each tuple contains (x-values, y-values), as in the Lighthouse.EvaluationV1 field per_class_roc_curves
- labels is the class label for each curve, which defaults to the numeric index of each curve.

Lighthouse.log_plot! — Function

    log_plot!(logger, field::AbstractString, plot, plot_data)

Log a plot to logger under field.

- plot: the plot itself
- plot_data: an unstructured dictionary of values used in creating plot.

See also log_line_series!.

Lighthouse.step_logger! — Function

    step_logger!(logger)

Increments the logger's step, if any. Defaults to doing nothing.

These must be defined in addition to Base.flush(logger) (which can be a no-op by defining Base.flush(::MyLoggingType) = nothing).
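As a sketch of what satisfying these primitives looks like, here is a hypothetical in-memory logger; DictLogger is illustrative, not part of Lighthouse:

    # Stores every logged value in a Dict keyed by field name.
    struct DictLogger
        logged::Dict{String,Vector{Any}}
    end
    DictLogger() = DictLogger(Dict{String,Vector{Any}}())

    Lighthouse.log_value!(logger::DictLogger, field::AbstractString, value) =
        push!(get!(Vector{Any}, logger.logged, field), value)

    Lighthouse.log_line_series!(logger::DictLogger, field::AbstractString,
                                curves, labels=1:length(curves)) =
        Lighthouse.log_value!(logger, field, (curves, labels))

    Lighthouse.log_plot!(logger::DictLogger, field::AbstractString, plot, plot_data) =
        Lighthouse.log_value!(logger, field, plot_data)

    Lighthouse.step_logger!(::DictLogger) = nothing  # this logger has no step counter
    Base.flush(::DictLogger) = nothing               # nothing transient to persist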
These primitives can be used in implementations of train!, evaluate!, and predict!, as well as in the following composite logging functions, which by default call the above primitives. Loggers may provide custom implementations of these.
Lighthouse.log_event! — Function

    log_event!(logger, value::AbstractString)

Logs a string event given by value to logger. Defaults to calling log_value! with a field named event.

Lighthouse.log_evaluation_row! — Function

    log_evaluation_row!(logger, field::AbstractString, metrics)

From fields in EvaluationV1, generate and plot the composite evaluation_metrics_plot as well as spearman_correlation (if present).

Lighthouse.log_values! — Function

    log_values!(logger, values)

Logs an iterable of (field, value) pairs to logger. Falls back to calling log_value! in a loop. Loggers may specialize this method for improved performance.

Lighthouse.log_array! — Function

    log_array!(logger::Any, field::AbstractString, value)

Log an array value to field.

Defaults to log_value!(logger, mean(value)).

Lighthouse.log_arrays! — Function

    log_arrays!(logger, values)

Logs an iterable of (field, array) pairs to logger. Falls back to calling log_array! in a loop. Loggers may specialize this method for improved performance.
LearnLoggers

LearnLogger is a TensorBoard-backed logger that complies with the above logging interface. It also supports additional callback functionality via upon:

Lighthouse.LearnLogger — Type

    LearnLogger

A struct that wraps a TensorBoardLogger.TBLogger in order to enforce the following:

- all values logged to TensorBoard should be accessible to the post_epoch_callback argument to learn!
- all values that are cached during learn! should be logged to TensorBoard

To access values logged to a LearnLogger instance, inspect the instance's logged field.
Lighthouse.upon — Function

    upon(logger::LearnLogger, field::AbstractString; condition, initial)
    upon(logged::Dict{String,Any}, field::AbstractString; condition, initial)

Return a closure that can be called to check the most recent state of logger.logged[field] and trigger a caller-provided function when condition(recent_state, previously_chosen_state) is true.

For example:

    upon_loss_decrease = upon(logger, "test_set_prediction/mean_loss_per_epoch";
                              condition=<, initial=Inf)

    save_upon_loss_decrease = _ -> begin
        upon_loss_decrease(new_lowest_loss -> save_my_model(model, new_lowest_loss),
                           consecutive_failures -> consecutive_failures > 10 && Flux.stop())
    end

    learn!(model, logger, get_train_batches, get_test_batches, votes;
           post_epoch_callback=save_upon_loss_decrease)

Specifically, the form of the returned closure is f(on_true, on_false) where on_true(state) is called if condition(state, previously_chosen_state) is true. Otherwise, on_false(consecutive_falses) is called where consecutive_falses is the number of condition calls that have returned false since the last condition call returned true.

Note that the returned closure is a no-op if logger.logged[field] has not been updated since the most recent call.
Lighthouse.forward_logs — Function

    forwarding_task = forward_logs(channel, logger::LearnLogger)

Forwards logs with values supported by TensorBoardLogger to logger::LearnLogger:

- string events of type AbstractString
- scalars of type Union{Real,Complex}
- plots that TensorBoardLogger can convert to raster images

Returns the forwarding_task::Task that does the forwarding. To cleanly stop forwarding, close(channel) and wait(forwarding_task).

channel is a Channel or RemoteChannel of Pair{String,Any}; field names starting with "plot" are forwarded to TensorBoardLogger.log_image.
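A hypothetical forwarding setup (the field name and value are made up, and a LearnLogger binding logger is assumed):

    channel = Channel{Pair{String,Any}}(Inf)
    forwarding_task = Lighthouse.forward_logs(channel, logger)
    put!(channel, "train/loss_per_batch" => 0.37)  # re-logged to logger
    close(channel)          # stop accepting new logs...
    wait(forwarding_task)   # ...and let the forwarding task finish cleanly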
Base.flush — Method

    Base.flush(logger::LearnLogger)

Persist possibly transient logger state.
Performance Metrics

Lighthouse.confusion_matrix — Function

    confusion_matrix(class_count::Integer, hard_label_pairs = ())

Given the iterable hard_label_pairs whose kth element takes the form (first_classifiers_label_for_sample_k, second_classifiers_label_for_sample_k), return the corresponding confusion matrix where matrix[i, j] is the number of samples that the first classifier labeled i and the second classifier labeled j.

Note that the returned confusion matrix can be updated in-place with new labels via Lighthouse.increment_at!(matrix, more_hard_label_pairs).
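A small worked example with made-up labels:

    # Two classifiers each label three samples among 2 classes.
    hard_label_pairs = [(1, 1), (2, 1), (2, 2)]
    confusion = Lighthouse.confusion_matrix(2, hard_label_pairs)
    # confusion[2, 1] == 1: one sample that the first classifier labeled 2
    # and the second classifier labeled 1.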
Lighthouse.accuracy — Function

    accuracy(confusion::AbstractMatrix)

Returns the percentage of matching classifications out of total classifications, or NaN if all(iszero, confusion).

Note that accuracy(confusion) is equivalent to overall percent agreement between confusion's row classifier and column classifier.
Lighthouse.binary_statistics — Function

    binary_statistics(confusion::AbstractMatrix, class_index)

Treating the rows of confusion as corresponding to predicted classifications and the columns as corresponding to true classifications, return a NamedTuple with the following fields for the given class_index:

- predicted_positives
- predicted_negatives
- actual_positives
- actual_negatives
- true_positives
- true_negatives
- false_positives
- false_negatives
- true_positive_rate
- true_negative_rate
- false_positive_rate
- false_negative_rate
- precision
- f1
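Continuing the confusion-matrix example above, treating class 2 as the positive class:

    stats = Lighthouse.binary_statistics(confusion, 2)
    stats.true_positives   # predicted class 2 (row) and truly class 2 (column)
    stats.false_positives  # predicted class 2 but truly another class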
Lighthouse.cohens_kappa — Function

    cohens_kappa(class_count, hard_label_pairs)

Return (κ, p₀) where κ is Cohen's kappa and p₀ is the percent agreement given class_count and hard_label_pairs (these arguments take the same form as their equivalents in confusion_matrix).
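For example, with made-up ratings:

    # Two raters label four samples among 3 classes, agreeing on 3 of 4.
    κ, p₀ = Lighthouse.cohens_kappa(3, [(1, 1), (2, 2), (3, 1), (2, 2)])
    p₀  # 0.75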
Lighthouse.calibration_curve — Function

    calibration_curve(probabilities, bitmask; bin_count=10)

Given probabilities (the predicted probabilities of the positive class) and bitmask (a vector of Bools indicating whether or not each element actually belonged to the positive class), return (bins, fractions, totals, mean_squared_error) where:

- bins: a vector with bin_count Pairs specifying the calibration curve's probability bins
- fractions: a vector where fractions[i] is the fraction of values falling within bin[i] whose corresponding bitmask entry is true (i.e., the number of actual positives in bin[i] over the total number of values in bin[i]), or NaN if the total number of values in bin[i] is zero.
- totals: a vector where totals[i] is the total number of values within bin[i].
- mean_squared_error: the mean squared error of fractions vs. an ideal calibration curve.

This method is similar to the corresponding scikit-learn method:

https://scikit-learn.org/stable/modules/generated/sklearn.calibration.calibration_curve.html
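A hypothetical check on six made-up predictions:

    probabilities = [0.1, 0.35, 0.4, 0.65, 0.8, 0.9]  # predicted P(positive)
    bitmask = [false, true, false, true, true, true]   # actual positive-class membership
    bins, fractions, totals, mse = Lighthouse.calibration_curve(probabilities, bitmask;
                                                                bin_count=5)
    # totals sums to 6; fractions[i] is NaN for any empty bin.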
Lighthouse.EvaluationV1 — Type

    @version EvaluationV1 begin
        class_labels::Union{Missing,Vector{String}}
        confusion_matrix::Union{Missing,Array{Int64,1},Array{Int64,2}} = vec_to_mat(confusion_matrix)
        discrimination_calibration_curve::Union{Missing,GenericCurve}
        discrimination_calibration_score::Union{Missing,Float64}
        multiclass_IRA_kappas::Union{Missing,Float64}
        multiclass_kappa::Union{Missing,Float64}
        optimal_threshold::Union{Missing,Float64}
        optimal_threshold_class::Union{Missing,Int64}
        per_class_IRA_kappas::Union{Missing,Vector{Float64}}
        per_class_kappas::Union{Missing,Vector{Float64}}
        stratified_kappas::Union{Missing,
                                 Vector{@NamedTuple{per_class::Vector{Float64},
                                                    multiclass::Float64,
                                                    n::Int64}}}
        per_class_pr_curves::Union{Missing,Vector{GenericCurve}}
        per_class_reliability_calibration_curves::Union{Missing,Vector{GenericCurve}}
        per_class_reliability_calibration_scores::Union{Missing,Vector{Float64}}
        per_class_roc_aucs::Union{Missing,Vector{Float64}}
        per_class_roc_curves::Union{Missing,Vector{GenericCurve}}
        per_expert_discrimination_calibration_curves::Union{Missing,Vector{GenericCurve}}
        per_expert_discrimination_calibration_scores::Union{Missing,Vector{Float64}}
        spearman_correlation::Union{Missing,
                                    @NamedTuple{ρ::Float64, # Note: is rho not 'p' 😢
                                                n::Int64,
                                                ci_lower::Float64,
                                                ci_upper::Float64}}
        thresholds::Union{Missing,Vector{Float64}}
    end

A Legolas record representing the output metrics computed by evaluation_metrics_record and evaluation_metrics.

See Legolas.jl for details regarding Legolas record types.
Lighthouse.ObservationV1 — Type

    @version ObservationV1 begin
        predicted_hard_label::Int64
        predicted_soft_labels::Vector{Float32}
        elected_hard_label::Int64
        votes::Union{Missing,Vector{Int64}}
    end

A Legolas record representing the per-observation input values required to compute evaluation_metrics_record.
Lighthouse.evaluation_metrics — Function

    evaluation_metrics(args...; optimal_threshold_class=nothing, kwargs...)

Return the result of evaluation_metrics_record after converting the output EvaluationV1 into a Dict. For argument details, see evaluation_metrics_record.
Lighthouse._evaluation_dict — Function

    _evaluation_row_dict(row::EvaluationV1) -> Dict{String,Any}

Convert an EvaluationV1 into Dict{String,Any} results, as are output by evaluation_metrics (a format that predates the use of EvaluationV1 in Lighthouse <v0.14.0).
Lighthouse.evaluation_metrics_record — Function

    evaluation_metrics_record(observation_table, classes, thresholds=0.0:0.01:1.0;
                              strata::Union{Nothing,AbstractVector{Set{T}} where T}=nothing,
                              optimal_threshold_class::Union{Missing,Nothing,Integer}=missing)
    evaluation_metrics_record(predicted_hard_labels::AbstractVector,
                              predicted_soft_labels::AbstractMatrix,
                              elected_hard_labels::AbstractVector,
                              classes,
                              thresholds=0.0:0.01:1.0;
                              votes::Union{Nothing,Missing,AbstractMatrix}=nothing,
                              strata::Union{Nothing,AbstractVector{Set{T}} where T}=nothing,
                              optimal_threshold_class::Union{Missing,Nothing,Integer}=missing)

Returns an EvaluationV1 containing a battery of classifier performance metrics that each compare predicted_soft_labels and/or predicted_hard_labels against elected_hard_labels.

Where...

- predicted_soft_labels is a matrix of soft labels whose columns correspond to classes and whose rows correspond to samples in the evaluation set.
- predicted_hard_labels is a vector of hard labels where the ith element is the hard label predicted by the model for sample i in the evaluation set.
- elected_hard_labels is a vector of hard labels where the ith element is the hard label elected as "ground truth" for sample i in the evaluation set.
- thresholds are the range of thresholds used by metrics (e.g. PR curves) that are calculated on the predicted_soft_labels for a range of thresholds.
- votes is a matrix of hard labels whose columns correspond to voters and whose rows correspond to the samples in the test set that have been voted on. If votes[sample, voter] is not a valid hard label for model, then voter will simply be considered to have not assigned a hard label to sample.
- strata is a vector of sets of (arbitrarily typed) groups/strata for each sample in the evaluation set, or nothing. If not nothing, per-class and multiclass kappas will also be calculated per group/stratum.
- optimal_threshold_class is the class index (1 or 2) for which to calculate an optimal threshold for converting the predicted_soft_labels to predicted_hard_labels. If present, the input predicted_hard_labels will be ignored and new predicted_hard_labels will be recalculated from the new threshold. This is only a valid parameter when length(classes) == 2.

Alternatively, an observation_table that consists of rows of type ObservationV1 can be passed in place of predicted_soft_labels, predicted_hard_labels, elected_hard_labels, and votes. Supply a function to the keyword argument binarize which takes as input (soft_label, threshold) and outputs a Bool indicating whether or not the prediction is of the class of interest.

See also evaluation_metrics_plot.
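For illustration, a hypothetical binary evaluation over four made-up samples:

    predicted_soft_labels = [0.8 0.2; 0.3 0.7; 0.6 0.4; 0.1 0.9]  # rows: samples
    predicted_hard_labels = [1, 2, 1, 2]
    elected_hard_labels = [1, 2, 2, 2]
    evaluation = evaluation_metrics_record(predicted_hard_labels, predicted_soft_labels,
                                           elected_hard_labels, ["class_A", "class_B"])
    evaluation.per_class_roc_aucs  # one AUC per class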
Lighthouse.ClassV1 — Type

    @version ClassV1 begin
        class_index::Union{Int64,Symbol} = check_valid_class(class_index)
        class_labels::Union{Missing,Vector{String}}
    end

A Legolas record representing a single column class_index that holds either an integer or the value :multiclass, along with the class names associated with the integer class indices.
Lighthouse.TradeoffMetricsV1 — Type

    @version TradeoffMetricsV1 > ClassV1 begin
        roc_curve::Curve = lift(Curve, roc_curve)
        roc_auc::Float64
        pr_curve::Curve = lift(Curve, pr_curve)
        spearman_correlation::Union{Missing,Float64}
        spearman_correlation_ci_upper::Union{Missing,Float64}
        spearman_correlation_ci_lower::Union{Missing,Float64}
        n_samples::Union{Missing,Int}
        reliability_calibration_curve::Union{Missing,Curve} = lift(Curve,
                                                                   reliability_calibration_curve)
        reliability_calibration_score::Union{Missing,Float64}
    end

A Legolas record representing metrics calculated over predicted soft labels. See also get_tradeoff_metrics and get_tradeoff_metrics_binary_multirater.
Lighthouse.get_tradeoff_metrics — Function

    get_tradeoff_metrics(predicted_soft_labels, elected_hard_labels, class_index;
                         thresholds, binarize=binarize_by_threshold, class_labels=missing)

Return a TradeoffMetricsV1 calculated for the given class_index, with the following fields guaranteed to be non-missing: roc_curve, roc_auc, pr_curve, reliability_calibration_curve, reliability_calibration_score. Supply a function to the keyword argument binarize which takes as input (soft_label, threshold) and outputs a Bool indicating whether or not the prediction is of the class of interest (class_index).
Lighthouse.get_tradeoff_metrics_binary_multirater — Function

    get_tradeoff_metrics_binary_multirater(predicted_soft_labels, elected_hard_labels, class_index;
                                           thresholds, binarize=binarize_by_threshold, class_labels=missing)

Return a TradeoffMetricsV1 calculated for the given class_index. In addition to the metrics calculated by get_tradeoff_metrics, additionally calculates spearman_correlation-based metrics. Supply a function to the keyword argument binarize which takes as input (soft_label, threshold) and outputs a Bool indicating whether or not the prediction is of the class of interest (class_index).
Lighthouse.HardenedMetricsV1 — Type

    @version HardenedMetricsV1 > ClassV1 begin
        confusion_matrix::Union{Missing,Array{Int64,1},Array{Int64,2}} = vec_to_mat(confusion_matrix)
        discrimination_calibration_curve::Union{Missing,Curve} = lift(Curve,
                                                                      discrimination_calibration_curve)
        discrimination_calibration_score::Union{Missing,Float64}
        ea_kappa::Union{Missing,Float64}
    end

A Legolas record representing metrics calculated over predicted hard labels. See also get_hardened_metrics, get_hardened_metrics_multirater, and get_hardened_metrics_multiclass.
Lighthouse.get_hardened_metrics — Function

    get_hardened_metrics(predicted_hard_labels, elected_hard_labels, class_index;
                         class_labels=missing)

Return a HardenedMetricsV1 calculated for the given class_index, with the following field guaranteed to be non-missing: expert-algorithm agreement (ea_kappa).
Lighthouse.get_hardened_metrics_multirater — Function

    get_hardened_metrics_multirater(predicted_hard_labels, elected_hard_labels, class_index;
                                    class_labels=missing)

Return a HardenedMetricsV1 calculated for the given class_index. In addition to the metrics calculated by get_hardened_metrics, additionally calculates discrimination_calibration_curve and discrimination_calibration_score.
Lighthouse.get_hardened_metrics_multiclass — Function

    get_hardened_metrics_multiclass(predicted_hard_labels, elected_hard_labels,
                                    class_count; class_labels=missing)

Return a HardenedMetricsV1 calculated over all class_count classes. Calculates expert-algorithm agreement (ea_kappa) over all classes, as well as the multiclass confusion_matrix.
Lighthouse.LabelMetricsV1 — Type

    @version LabelMetricsV1 > ClassV1 begin
        ira_kappa::Union{Missing,Float64}
        per_expert_discrimination_calibration_curves::Union{Missing,Vector{Curve}} = lift(v -> Curve.(v),
                                                                                          per_expert_discrimination_calibration_curves)
        per_expert_discrimination_calibration_scores::Union{Missing,Vector{Float64}}
    end

A Legolas record representing metrics calculated over labels provided by multiple labelers. See also get_label_metrics_multirater and get_label_metrics_multirater_multiclass.
Lighthouse.get_label_metrics_multirater — Function

    get_label_metrics_multirater(votes, class_index; class_labels=missing)

Return a LabelMetricsV1 calculated for the given class_index, with the following fields guaranteed to be non-missing: per_expert_discrimination_calibration_curves, per_expert_discrimination_calibration_scores, and interrater agreement (ira_kappa).
Lighthouse.get_label_metrics_multirater_multiclass — Function

    get_label_metrics_multirater_multiclass(votes, class_count; class_labels=missing)

Return a LabelMetricsV1 calculated over all class_count classes. Calculates the multiclass interrater agreement (ira_kappa).
Lighthouse._evaluation_record — Function

    _evaluation_record(tradeoff_metrics_table, hardened_metrics_table, label_metrics_table;
                       optimal_threshold_class=missing, class_labels, thresholds,
                       optimal_threshold, stratified_kappas=missing)

Helper function to create an EvaluationV1 from tables of the constituent metrics schemas, to support evaluation_metrics_record:

- tradeoff_metrics_table: a table of TradeoffMetricsV1s
- hardened_metrics_table: a table of HardenedMetricsV1s
- label_metrics_table: a table of LabelMetricsV1s
Lighthouse._calculate_ea_kappas — Function

    _calculate_ea_kappas(predicted_hard_labels, elected_hard_labels, classes)

Return a NamedTuple with keys :per_class_kappas and :multiclass_kappa containing the Cohen's kappa per class and over all classes, respectively. The value of output key :per_class_kappas is an Array such that item i is the Cohen's kappa calculated for class i.

Where...

- predicted_hard_labels is a vector of hard labels where the ith element is the hard label predicted by the model for sample i in the evaluation set.
- elected_hard_labels is a vector of hard labels where the ith element is the hard label elected as "ground truth" for sample i in the evaluation set.
- classes are the possible classes.
Lighthouse._calculate_ira_kappas — Function

    _calculate_ira_kappas(votes, classes)

Return a NamedTuple with keys :per_class_IRA_kappas and :multiclass_IRA_kappas containing the Cohen's kappa for inter-rater agreement (IRA) per class and over all classes, respectively. The value of output key :per_class_IRA_kappas is an Array such that item i is the IRA kappa calculated for class i.

Where...

- votes is a matrix of hard labels whose columns correspond to voters and whose rows correspond to the samples in the test set that have been voted on. If votes[sample, voter] is not a valid hard label for model, then voter will simply be considered to have not assigned a hard label to sample.
- classes are all possible classes voted on.

Returns (per_class_IRA_kappas=missing, multiclass_IRA_kappas=missing) if votes has only a single voter (i.e., a single column) or if no two voters rated the same sample. Note that vote entries of 0 are taken to mean that the voter did not rate that sample.
Lighthouse._calculate_spearman_correlation — Function

    _calculate_spearman_correlation(predicted_soft_labels, votes, classes)

Return a NamedTuple with keys :ρ, :n, :ci_lower, and :ci_upper that are the Spearman correlation constant ρ and its 95% confidence interval bounds. Only valid for binary classification problems (i.e., length(classes) == 2).

Where...

- predicted_soft_labels is a matrix of soft labels whose columns correspond to the two classes and whose rows correspond to the samples in the test set that have been classified. For a given sample, the two class column values must sum to 1 (i.e., softmax has been applied to the classification output).
- votes is a matrix of hard labels whose columns correspond to voters and whose rows correspond to the samples in the test set that have been voted on. If votes[sample, voter] is not a valid hard label for model, then voter will simply be considered to have not assigned a hard label to sample. May contain a single voter (i.e., a single column).
- classes are the two classes voted on.
Utilities

Lighthouse.majority — Function

    majority([rng::AbstractRNG=Random.GLOBAL_RNG], hard_labels, among::UnitRange)

Return the majority label within among out of hard_labels:

    julia> majority([1, 2, 1, 3, 2, 2, 3], 1:3)
    2

    julia> majority([1, 2, 1, 3, 2, 2, 3, 4], 3:4)
    3

In the event of a tie, a winner is randomly selected from the tied labels via rng.
Lighthouse.area_under_curve — Function

    area_under_curve(x, y)

Calculates the area under the curve specified by the x vector and y vector using the trapezoidal rule. If the inputs are empty, returns NaN. Excludes NaN entries.
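For example:

    # Trapezoidal area under the identity line from 0 to 1:
    Lighthouse.area_under_curve([0.0, 0.5, 1.0], [0.0, 0.5, 1.0])  # 0.5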
Lighthouse.area_under_curve_unit_square — Function

    area_under_curve_unit_square(x, y)

Calculates the area under the curve specified by the x vector and y vector for a unit square, using the trapezoidal rule. If the inputs are empty, returns missing.
Lighthouse.Curve — Type

    Curve(x, y)

Represents a (plot) curve of x and y points.

When constructing a Curve, missings are replaced with NaN, and values are converted to Float64. Curve objects c support iteration, x, y = c, and indexing, x = c[1], y = c[2].
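A brief illustration of the conversions and access patterns described above:

    c = Lighthouse.Curve([0, 0.5, 1], [0.0, missing, 1.0])
    x, y = c   # iteration: x == [0.0, 0.5, 1.0]; y's middle entry is NaN
    x == c[1]  # indexing: c[1] is the x vector, c[2] the y vector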