[Training] Strange training model created with crossEntropyLoss #17875

elephantpanda · 2023-10-10T22:26:42Z

Describe the issue

Trying out the training model creation on the MNIST-12 from here, when I set the loss to artifacts.LossType.CrossEntropyLoss,
it does something strange.

Even though the output label is of shape (1,10), it creates an input node called "labels" with shape of size (1)

Surely the labels shape should also be of size (1,10) ?

When using artifacts.LossType.MSELoss is creates an input node called "target" of shape (1,10) which is correct.

Is this a bug?

Edit:
I just realised that perhaps the label is not of the form (0,0,0,1,0,0,0,0,0,0) but just the index 4, in this case. Is this correct?

OK, I think I get it. you can delete this post.

To reproduce

AS above

Urgency

No response

ONNX Runtime Installation

Released Package

ONNX Runtime Version or Commit ID

1.15.1

PyTorch Version

1.13.1

Execution Provider

CUDA

Execution Provider Library Version

1.13.1

baijumeswani · 2023-10-10T22:43:27Z

I just realised that perhaps the label is not of the form (0,0,0,1,0,0,0,0,0,0) but just the index 4, in this case. Is this correct?

This is correct. Thanks for checking. Closing this now.

elephantpanda added the training issues related to ONNX Runtime training; typically submitted using template label Oct 10, 2023

github-actions bot added the ep:CUDA issues related to the CUDA execution provider label Oct 10, 2023

baijumeswani closed this as completed Oct 10, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Training] Strange training model created with crossEntropyLoss #17875

[Training] Strange training model created with crossEntropyLoss #17875

elephantpanda commented Oct 10, 2023 •

edited

Loading

baijumeswani commented Oct 10, 2023

[Training] Strange training model created with crossEntropyLoss #17875

[Training] Strange training model created with crossEntropyLoss #17875

Comments

elephantpanda commented Oct 10, 2023 • edited Loading

Describe the issue

To reproduce

Urgency

ONNX Runtime Installation

ONNX Runtime Version or Commit ID

PyTorch Version

Execution Provider

Execution Provider Library Version

baijumeswani commented Oct 10, 2023

elephantpanda commented Oct 10, 2023 •

edited

Loading