Triton GPU deployment suddenly became very slow (from 0.03s to 12s), how to solve it? #7638

Open
yiluzhuimeng opened this issue Sep 20, 2024 · 1 comment
Labels
question Further information is requested

Comments

@yiluzhuimeng

Description
A clear and concise description of what the bug is.

Triton Information
What version of Triton are you using?

Are you using the Triton container or did you build it yourself?

To Reproduce
Steps to reproduce the behavior.

Describe the models (framework, inputs, outputs), ideally include the model configuration file (if using an ensemble include the model configuration file for that as well).

Expected behavior
A clear and concise description of what you expected to happen.

@oandreeva-nv
Contributor

Hi @yiluzhuimeng, could you please fill out the question template? This will help us tremendously in assisting you with the issue.

@oandreeva-nv oandreeva-nv added the question Further information is requested label Sep 27, 2024