
Inclusion of more regressors and classifiers in the Model Selection #1186

Closed
ankitrajixr opened this issue Mar 7, 2021 · 2 comments

Comments

@ankitrajixr

Hi team,

Thank you for such a helpful library. While using TPOT, we found that certain regressors and classifiers are not included in the model selection of the machine learning pipeline.
It would be great to add regressors such as GaussianProcessRegressor and VotingRegressor, and classifiers such as VotingClassifier and AdaBoostClassifier.

How to recreate it?

  1. Create a TPOT instance.
  2. Call TPOT's fit() function with training data.

Current result

The above-mentioned regressors and classifiers are not included in the model selection.

@JDRomano2
Contributor

Hi @ankitrajixr, these estimators may have previously been found not to play well with other parts of TPOT, which may be why they are not included by default.

However, you can add any scikit-learn classifier or regressor to TPOT by simply including it in a custom configuration dictionary. Please see:
https://epistasislab.github.io/tpot/using/#customizing-tpots-operators-and-parameters

If you can use them and they perform well, we can look into adding them to the built-in configuration dictionaries. I'd recommend giving it a try and letting us know (on this thread) how they perform.
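For reference, a minimal sketch of the custom configuration dictionary format the linked docs describe — the AdaBoostClassifier entry and its parameter grid below are illustrative choices, not TPOT's built-in configuration:

```python
# Illustrative sketch: a custom TPOT config dict adding one of the requested
# classifiers, using the documented {'module.ClassName': {param: values}} format.
custom_config = {
    'sklearn.ensemble.AdaBoostClassifier': {
        'n_estimators': [50, 100, 500],       # candidate values TPOT may search over
        'learning_rate': [1e-2, 1e-1, 0.5, 1.0],
    },
}

# It would then be passed to TPOT via the config_dict argument, e.g.:
# from tpot import TPOTClassifier
# tpot = TPOTClassifier(config_dict=custom_config, generations=5, population_size=20)
# tpot.fit(X_train, y_train)
```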

ankitrajixr added a commit to ankitrajixr/tpot that referenced this issue Mar 17, 2021
@ankitrajixr
Author

ankitrajixr commented Mar 17, 2021

Thank you for your response, @JDRomano2. I tried a custom TPOT config dictionary; below is the code snippet:

```python
from sklearn.gaussian_process.kernels import (
    RBF, RationalQuadratic, ExpSineSquared, ConstantKernel, DotProduct, Matern)

# Custom config dict keyed by the estimator's full import path,
# as described in the TPOT docs on customizing operators.
tpot_config = {
    'sklearn.gaussian_process.GaussianProcessRegressor': {
        'kernel': [1.0 * RBF(length_scale=0.5, length_scale_bounds=(1e-05, 100000.0)),
                   1.0 * RationalQuadratic(length_scale=0.5, alpha=0.1),
                   1.0 * ExpSineSquared(length_scale=0.5, periodicity=3.0,
                                        length_scale_bounds=(1e-05, 100000.0),
                                        periodicity_bounds=(1.0, 10.0)),
                   ConstantKernel(0.1, (0.01, 10.0))
                   * (DotProduct(sigma_0=1.0, sigma_0_bounds=(0.1, 10.0)) ** 2),
                   1.0 ** 2 * Matern(length_scale=0.5,
                                     length_scale_bounds=(1e-05, 100000.0), nu=0.5)],
        'alpha': [5e-9, 1e-3, 1e-2, 1e-1, 1., 10., 100.],
        'normalize_y': [True, False],
        'optimizer': ['fmin_l_bfgs_b'],
    }
}
```

The above configuration works fine for smaller datasets.
