@fsatka -- ModelMesh was designed to optimize resource utilization. Why would you want to load additional instances of the same model/predictor/ISVC on all serving runtime pods regardless of inference request traffic? Just for testing purposes?
Currently the model is loaded on only one instance, and the other pods load it lazily when a request arrives.
Can we modify internal ModelMesh parameters so that the model is loaded on all ServingRuntime instances by default?
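For context, ModelMesh decides how many copies of a model to keep loaded based on request volume; a second copy is only loaded when traffic warrants it. A minimal sketch of the kind of tuning being asked about, assuming the runtime exposes a scale-up threshold via a container environment variable (the variable name `MM_SCALEUP_RPM_THRESHOLD` and its placement here are assumptions, not confirmed API):

```yaml
# Hypothetical sketch: lower the per-model scale-up threshold so additional
# copies are loaded on other runtime pods sooner under traffic.
# MM_SCALEUP_RPM_THRESHOLD is an assumed parameter name -- verify against
# the ModelMesh configuration docs before relying on it.
apiVersion: serving.kserve.io/v1alpha1
kind: ServingRuntime
metadata:
  name: example-runtime
spec:
  containers:
    - name: mm
      env:
        - name: MM_SCALEUP_RPM_THRESHOLD
          value: "1"   # assumed: requests/min before a second copy is loaded
```

Even with such tuning, this would only make additional copies load earlier under traffic; eagerly loading a model on every pod regardless of traffic runs against ModelMesh's utilization-driven design, as noted above.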