-
Notifications
You must be signed in to change notification settings - Fork 10
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Update GenAI-Perf README and tutorial doc #89
Conversation
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Great job on these updates!
* update doc * remove duplicate commands * make commands more simpler for user * Remove redundant sdk container docker commands * add links * update template * fix type * add article and capitalize github * set env directly
…t as well (#88) (#89) * Replace binding index-based methods with name-based alternatives * Remove unused variables * Remove unused variables * Remove allInput*Specified() * Delete TRTV1Interface * Replace getProfileShapeValues() with getProfileTensorValues() * Remove buffer_bindings_ * Enhancements * Replace isExecutionBinding() * Add INT64 support * Remove hasImplicitBatchDimension() * Update Copyright * Remove unused variables * Undo copyright * Undo Copyright * Undo copyright * Fix the handling in INT64 shape tensors output * Fix data dependent output shapes * Fix pre commit errors * Update copyright * Resolve review comments * Include source for building on TRT 8 (#86) (#87) * Include source for building on TRT 8 * Apply suggestions from code review --------- * Fix envvar access in CMake --------- Co-authored-by: Sai Kiran Polisetty <[email protected]> Co-authored-by: Misha Chornyi <[email protected]>
export RELEASE="{{ release }}" | ||
|
||
docker run -it --net=host --gpus=all nvcr.io/nvidia/tritonserver:${RELEASE}-py3-sdk |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@nv-hwoo @debermudez @dyastremsky why was the SDK container run command removed from the tutorial?
I think generally a tutorial user is going to be unfamiliar, so we should give them as much copy-pasteable commands as possible so they don't get stuck.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I don't know. The GenAi-Perf README covers installation and these are supposed to be additional examples, so it's possible Hyunjae wanted to remove redundancy.
Hyunjae will be back in a couple of weeks. We could look at whether these need to be re-added then.
* update doc * remove duplicate commands * make commands more simpler for user * Remove redundant sdk container docker commands * add links * update template * fix type * add article and capitalize github * set env directly
This PR
Checkout the updated main README here
Checkout the updated Tutorial doc here