Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Segmentation issue keeps crashing the kernel #4444

Open
enfeizhan opened this issue Jun 5, 2024 · 2 comments
Open

Segmentation issue keeps crashing the kernel #4444

enfeizhan opened this issue Jun 5, 2024 · 2 comments
Labels
bug Issues tracking problems with existing features.

Comments

@enfeizhan
Copy link

issue.csv
The AutoMLSearch keeps crashing with the simple dataset. Running the code in terminal gives segmentation error. If run in Jupyter Notebook, the notebook crashes and gets restarted.

import pandas as pd
import evalml

fm = pd.read_csv('issue.csv')
fm.ww.init()

fm.ww.describe()

y = fm.ww.pop('label')

automl = evalml.AutoMLSearch(
    X_train=fm,
    y_train=y,
    problem_type='binary',
    random_seed=3,
    max_batches=5
)
automl.search()

The data doesn't have infinity or null values. In principle, it shouldn't crash the kernel even though it won't get an amazing model.

@enfeizhan enfeizhan added the bug Issues tracking problems with existing features. label Jun 5, 2024
@enfeizhan
Copy link
Author

enfeizhan commented Jun 5, 2024

The search went through once the search scope is limited to random forest and linear_model:
allowed_model_families=["random_forest", "linear_model"].
Further investigation finds the problem is with lightgbm. As long as lightgbm isn't here, the search would be fine.

@eccabay
Copy link
Contributor

eccabay commented Jun 13, 2024

Thanks for reporting and investigating @enfeizhan. Could you share what evalml and lightgbm versions you're running with, as well as a bit more information about your data (types, size, etc)?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Issues tracking problems with existing features.
Projects
None yet
Development

No branches or pull requests

2 participants