GitHub - mrapp-ke/MLRL-Boomer at 28c81908afe1bb26dc4c7a61d6cc070dfc410bb0

5 Branches 14 Tags

Name	Name	Last commit message	Last commit date
Latest commit boomer-merge-bot[bot] Merge pull request #1158 from mrapp-ke/update-github-actions Dec 16, 2024 28c8190 · Dec 16, 2024 History 12,531 Commits
.github/workflows	.github/workflows	Update GitHub Actions.	Dec 16, 2024
assets	assets	Remove redundant SVG files.	Oct 23, 2024
build_system	build_system	Fix writing lines to files.	Dec 16, 2024
cpp	cpp	Dynamically register targets and modules for checking and enforcing t…	Dec 8, 2024
doc	doc	Update documentation.	Dec 15, 2024
python	python	Add emojis to READMEs.	Nov 17, 2024
.changelog-bugfix.md	.changelog-bugfix.md	Update changelog.	Dec 15, 2024
.gitignore	.gitignore	Replace scons with custom implementation.	Dec 13, 2024
.readthedocs.yaml	.readthedocs.yaml	Format YAML files.	Jun 22, 2024
.version	.version	[Bot] Update version to 0.11.2.	Sep 24, 2024
.version-dev	.version-dev	Rename file VERSION.dev to .version-dev.	Jul 7, 2024
.version-python	.version-python	[Bot] Increase minimum Python version to 3.10.	Jul 7, 2024
CHANGELOG.md	CHANGELOG.md	Use term "pull request" instead of "merge request".	Dec 15, 2024
CITATION.cff	CITATION.cff	Update links to the repository.	Dec 24, 2023
CODE_OF_CONDUCT.md	CODE_OF_CONDUCT.md	Add code of conduct to documentation.	Mar 27, 2024
CONTRIBUTORS.md	CONTRIBUTORS.md	Format Markdown files.	Jan 3, 2024
LICENSE.md	LICENSE.md	Update copyright information.	Dec 21, 2023
README.md	README.md	Add subsections to the README.	Nov 17, 2024
build	build	Replace scons with custom implementation.	Dec 13, 2024
build.bat	build.bat	Replace scons with custom implementation.	Dec 13, 2024

Repository files navigation

BOOMER - Gradient Boosted Multi-Label Classification Rules

This software package provides the official implementation of BOOMER - an algorithm for learning gradient boosted multi-output rules that uses gradient boosting for learning an ensemble of rules that is built with respect to a specific multivariate loss function. It integrates with the popular scikit-learn machine learning framework.

The problem domains addressed by this algorithm include the following:

Multi-label classification: The goal of multi-label classification is the automatic assignment of sets of labels to individual data points, for example, the annotation of text documents with topics.
Multi-output regression: Multivariate regression problems require to predict for more than a single numerical output variable.

To provide a versatile tool for different use cases, great emphasis is put on the efficiency of the implementation. Moreover, to ensure its flexibility, it is designed in a modular fashion and can therefore easily be adjusted to different requirements. This modular approach enables implementing different kind of rule learning algorithms. For example, this project does also provide a Separate-and-Conquer (SeCo) algorithm based on traditional rule learning techniques that are particularly well-suited for learning interpretable models.

📖 References

The algorithm was first published in the following paper. A preprint version is publicly available here.

Michael Rapp, Eneldo Loza Mencía, Johannes Fürnkranz Vu-Linh Nguyen and Eyke Hüllermeier. Learning Gradient Boosted Multi-label Classification Rules. In: Proceedings of the European Conference on Machine Learning and Knowledge Discovery in Databases (ECML-PKDD), 2020, Springer.

If you use the algorithm in a scientific publication, we would appreciate citations to the mentioned paper. An overview of publications that are concerned with the BOOMER algorithm, together with information on how to cite them, can be found in the section References of the documentation.

🔧 Functionalities

The algorithm that is provided by this project currently supports the following core functionalities for learning ensembles of boosted classification or regression rules.

Deliberate Loss Optimization

Decomposable or non-decomposable loss functions can be optimized in expectation.
$L_{1}$ and $L_{2}$ regularization can be used.
Shrinkage (a.k.a. the learning rate) can be adjusted for controlling the impact of individual rules on the overall ensemble.

Different Prediction Strategies

Various strategies for predicting scores, binary labels or probabilities are available, depending on whether a classification or regression model is used.
Isotonic regression models can be used to calibrate marginal and joint probabilities predicted by a classification model.

Flexible Handling of Input Data

Native support for numerical, ordinal, and nominal features eliminates the need for pre-processing techniques such as one-hot encoding.
Handling of missing feature values, i.e., occurrences of NaN in the feature matrix, is implemented by the algorithm.

Fine-grained Control over Model Characteristics

Rules can be constructed via a greedy search or a beam search. The latter may help to improve the quality of individual rules.
Single-output, partial, or complete heads can be used by rules, i.e., they can predict for a single output, a subset of the available outputs, or all of them. Predicting for multiple outputs simultaneously enables to model local dependencies between them.
Fine-grained control over the specificity/generality of rules is provided via hyperparameters.

Support for Post-Optimization and Pruning

Incremental reduced error pruning can be used for removing overly specific conditions from rules and preventing overfitting.
Post- and pre-pruning (a.k.a. early stopping) allows to determine the optimal number of rules to be included in an ensemble.
Sequential post-optimization may help improving the predictive performance of a model by reconstructing each rule in the context of the other rules.

⌚ Runtime and Memory Optimizations

In addition to the features mentioned above, several techniques that may speed up training or reduce the memory footprint are currently implemented.

Approximation Techniques

Unsupervised feature binning can be used to speed up the evaluation of a rule's potential conditions when dealing with numerical features.
Sampling techniques and stratification methods can be used for learning new rules on a subset of the available training examples, features, or output variables.
Gradient-based label binning (GBLB) can be used for assigning the labels included in a multi-label classification data set to a limited number of bins. This may speed up training significantly when minimizing a non-decomposable loss function using rules with partial or complete heads.

Sparse Data Structures

Sparse feature matrices can be used for training and prediction. This may speed up training significantly on some data sets.
Sparse ground truth matrices can be used for training. This may reduce the memory footprint in case of large data sets.
Sparse prediction matrices can be used for storing predicted labels. This may reduce the memory footprint in case of large data sets.
Sparse matrices for storing gradients and Hessians can be used if supported by the loss function. This may speed up training significantly on data sets with many output variables.

Parallelization

Multi-threading can be used for parallelizing the evaluation of a rule's potential refinements across several features, updating the gradients and Hessians of individual examples in parallel, or obtaining predictions for several examples in parallel.

📚 Documentation

An extensive user guide, as well as an API documentation for developers, is available at https://mlrl-boomer.readthedocs.io. If you are new to the project, you probably want to read about the following topics:

Instructions for installing the software package or building the project from source.
Examples of how to use the algorithm in your own Python code or how to use the command line API.
An overview of available parameters.

A collection of benchmark datasets that are compatible with the algorithm are provided in a separate repository.

For an overview of changes and new features that have been included in past releases, please refer to the changelog.

📜 License

This project is open source software licensed under the terms of the MIT license. We welcome contributions to the project to enhance its functionality and make it more accessible to a broader audience. A frequently updated list of contributors is available here.

All contributions to the project and discussions on the issue tracker are expected to follow the code of conduct.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📖 References

🔧 Functionalities

Deliberate Loss Optimization

Different Prediction Strategies

Flexible Handling of Input Data

Fine-grained Control over Model Characteristics

Support for Post-Optimization and Pruning

⌚ Runtime and Memory Optimizations

Approximation Techniques

Sparse Data Structures

Parallelization

📚 Documentation

📜 License

About

Releases 14

Contributors 4

Languages

License

mrapp-ke/MLRL-Boomer

Folders and files

Latest commit

History

Repository files navigation

📖 References

🔧 Functionalities

Deliberate Loss Optimization

Different Prediction Strategies

Flexible Handling of Input Data

Fine-grained Control over Model Characteristics

Support for Post-Optimization and Pruning

⌚ Runtime and Memory Optimizations

Approximation Techniques

Sparse Data Structures

Parallelization

📚 Documentation

📜 License

About

Topics

Resources

License

Code of conduct

Citation

Stars

Watchers

Forks

Releases 14

Contributors 4

Languages