XGBoost incremental training, issue with ONNX Conversion #18841

kiransarv · 2023-12-15T18:43:47Z

Describe the issue

Trained an XGBoost with incremental learning.

    batch_size = 1024
    print(vectors.shape, labels.shape, len(np.unique(labels)))
    self.model: XGBClassifier = XGBClassifier(**self.init_param)
    for start in range(0, vectors.shape[0], batch_size):
        itr_vector = vectors[start : start + batch_size]
        itr_label = labels[start : start + batch_size]
        if start == 0:
           self.model.fit(itr_vector, itr_label, **fit_params)
        else:
           fit_params["xgb_model"] = self.model
           self.model.fit(itr_vector, itr_label, **fit_params)

facing an issue with ONNX model
RUNTIME_EXCEPTION : Non-zero status code returned while running TreeEnsembleClassifier node. Name:'TreeEnsembleClassifier' Status Message: /onnxruntime_src/onnxruntime/core/providers/cpu/ml/tree_ensemble_aggregator.h:201 void onnxruntime::ml::detail::TreeAggregatorSum<InputType, ThresholdType, OutputType>::ProcessTreeNodePrediction(onnxruntime::InlinedVector<onnxruntime::ml::detail::ScoreValue >&, const onnxruntime::ml::detail::TreeNodeElement&, gsl::span<const onnxruntime::ml::detail::SparseValue >) const [with InputType = float; ThresholdType = float; OutputType = float; onnxruntime::InlinedVector<onnxruntime::ml::detail::ScoreValue > = absl::lts_20220623::InlinedVector<onnxruntime::ml::detail::ScoreValue, 6, std::allocator<onnxruntime::ml::detail::ScoreValue > >] it->i < (int64_t)predictions.size() was false.

if not incremental model, only fitting one time self.model.fit(vectors, labels, **fit_params)
No issue with ONNX model, predictions are working fine.

To reproduce

Steps are detailed above.

Urgency

No response

Platform

Mac

OS Version

MacOS Ventura

ONNX Runtime Installation

Released Package

ONNX Runtime Version or Commit ID

1.16.1

ONNX Runtime API

Python

Architecture

X64

Execution Provider

Default CPU

Execution Provider Library Version

No response

The text was updated successfully, but these errors were encountered:

baijumeswani · 2024-01-03T18:46:08Z

@xadupre would you please help with this issue?

xadupre · 2024-01-04T10:07:02Z

This error means that a leaf returns a class index outside the expected number of classes. The attribute classlabels_int64s probably shorter than max(class_ids) but I wonder why it would happen. I'll need to know the version you used to train and convert the model (version of xgboost and onnxmltools).

kiransarv · 2024-01-04T10:16:30Z

XGBoost version 2.0.2
ONNX Version 1.16.1

xadupre · 2024-01-04T11:29:59Z

What about onnxmltools?

kiransarv · 2024-01-04T11:44:35Z

onnxmltools 1.11.2

xadupre · 2024-01-04T14:00:13Z

Is it possible to try with 1.12.0? We released it last month. It fixes some bugs with xgboost >= 2.0.

kiransarv · 2024-01-04T14:25:57Z

Sure Thanks...

kiransarv · 2024-01-08T10:42:20Z

Same error even after upgrading

xadupre · 2024-01-08T15:35:52Z

Thanks for trying. I'll try to replicate your issue unless you already have a full script to share.

github-actions · 2024-02-08T15:00:57Z

This issue has been automatically marked as stale due to inactivity and will be closed in 30 days if no further activity occurs. If further support is needed, please provide an update and/or more details.

addisonklinke · 2024-07-11T13:35:33Z

the attribute classlabels_int64s probably shorter than max(class_ids)

Thanks for the tip @xadupre. I've been trying to convert a PySpark XGBoost model, and because it doesn't have .classes_ from the sklearn implementation I had to fill that attribute myself. Initially I had the column names hardcoded and then realized I was fitting on one and setting the attribute with another which would indeed lead to len(classlabels_int64s) != max(class_ids)

yf711 added the training issues related to ONNX Runtime training; typically submitted using template label Dec 22, 2023

kiransarv changed the title ~~XGBoost issue with ONNX~~ XGBoost incremental training, issue with ONNX Conversion Jan 2, 2024

github-actions bot added the stale issues that have not been addressed in a while; categorized by a bot label Feb 8, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

XGBoost incremental training, issue with ONNX Conversion #18841

XGBoost incremental training, issue with ONNX Conversion #18841

kiransarv commented Dec 15, 2023

baijumeswani commented Jan 3, 2024

xadupre commented Jan 4, 2024

kiransarv commented Jan 4, 2024

xadupre commented Jan 4, 2024

kiransarv commented Jan 4, 2024

xadupre commented Jan 4, 2024

kiransarv commented Jan 4, 2024

kiransarv commented Jan 8, 2024

xadupre commented Jan 8, 2024

github-actions bot commented Feb 8, 2024

addisonklinke commented Jul 11, 2024

XGBoost incremental training, issue with ONNX Conversion #18841

XGBoost incremental training, issue with ONNX Conversion #18841

Comments

kiransarv commented Dec 15, 2023

Describe the issue

To reproduce

Urgency

Platform

OS Version

ONNX Runtime Installation

ONNX Runtime Version or Commit ID

ONNX Runtime API

Architecture

Execution Provider

Execution Provider Library Version

baijumeswani commented Jan 3, 2024

xadupre commented Jan 4, 2024

kiransarv commented Jan 4, 2024

xadupre commented Jan 4, 2024

kiransarv commented Jan 4, 2024

xadupre commented Jan 4, 2024

kiransarv commented Jan 4, 2024

kiransarv commented Jan 8, 2024

xadupre commented Jan 8, 2024

github-actions bot commented Feb 8, 2024

addisonklinke commented Jul 11, 2024