
[WIP] export weights as a constants in graph, so can do constant folding to them #19278

Open

wants to merge 1 commit into base: main
Conversation

@zhijxu-MS (Contributor) commented Jan 26, 2024
While supporting inference for one 1p (first-party) model, we found that the model's weights are graph inputs, so ORT's constant-folding optimizer cannot fold anything that depends on them. After exporting the weights as constants in the graph, we see a 10%+ gain on this model.

This is a draft.

We still need to think through the scenario: model train > model eval > model train > model eval > ...; in that case the weights change, so we can't simply export them as constants once. One idea is to keep a model_version in both the training session and the inference session; if the version numbers mismatch, we re-export the graph.
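The version-check idea above can be sketched in a few lines. This is a hypothetical sketch, not ORTModule code: the class and method names (`TrainingSession`, `InferenceSession`, `update_weights`, `_reexport_with_constant_weights`) are assumptions made for illustration.

```python
# Hypothetical sketch of the model_version idea: the training session
# bumps a version on every weight update, and the inference session
# re-exports the weights as graph constants whenever versions diverge.

class TrainingSession:
    def __init__(self):
        self.model_version = 0

    def update_weights(self):
        self.model_version += 1  # weights changed -> bump the version

class InferenceSession:
    def __init__(self, trainer):
        self.trainer = trainer
        self.exported_version = -1  # nothing exported yet

    def run(self):
        # Stale export detected: re-export before running inference.
        if self.exported_version != self.trainer.model_version:
            self._reexport_with_constant_weights()
        return "inference output"

    def _reexport_with_constant_weights(self):
        # Placeholder: here the current weights would be exported as
        # graph constants so constant folding can optimize them.
        self.exported_version = self.trainer.model_version
```

In the train > eval > train > eval loop, each eval after a training step triggers exactly one re-export, and repeated evals with unchanged weights reuse the already-folded graph.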

@zhijxu-MS zhijxu-MS changed the title WIP] export weights as a constants in graph, so can do constant folding to them [WIP] export weights as a constants in graph, so can do constant folding to them Jan 26, 2024
@zhijxu-MS zhijxu-MS force-pushed the zhijxu/improve-ortmodule-inference branch from b70cd75 to 604695a on January 31, 2024 11:31
@zhijxu-MS zhijxu-MS force-pushed the zhijxu/improve-ortmodule-inference branch from 604695a to 8bcd1d8 on February 19, 2024 08:54
@zhijxu-MS zhijxu-MS force-pushed the zhijxu/improve-ortmodule-inference branch from 8bcd1d8 to 955cb5b on February 19, 2024 09:26