-
Notifications
You must be signed in to change notification settings - Fork 23
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Standardization] GPU naming convention needs further refinements #366
Comments
The order is and always has been fixed. |
Agreed. The wording suggests to use it for HBM memory, which I think is well-defined (and meaningful, as it does make a difference.) The other option is to narrow things down and say that |
True. We could easily add this. |
That can easily be addressed. I just added it to the PR #374 as it fit nicely. |
Signed-off-by: Kurt Garloff <[email protected]>
Added it also to PR #374. |
The number of SMs should give you an indication of how much performance to expect. Together with maybe the |
True, that is a real limitation. Another (and maybe more important) missing piece is that we don't specify the amount of VRAM that is available to the user, which may be a serious limitation. Does my 30b LLM model (in 4bit+ quantization, so it will require ~18GiB) fit or not? So this would need a real extension: |
As I feel myself not competent enough to judge this approach, I will forward the improvement of GPU definition in SCS flavor standard to our GPU expert. This may take some time. |
* Clarify SMs/CUs/EUs belonging to nVidia/AMD/Intel. Also add a line clarifying the order of extensions being fixed. * Explain CUs, EUs, SMs. This addresses #375. * Add GPU example as desired by #366. * Correct example. * Escape all underscores in examples. They get interpreted otherwise by markdown. * Use vendor-neutral terminology for GPU processing units * Typo and minor wording Signed-off-by: Kurt Garloff <[email protected]> Signed-off-by: Matthias Büchse <[email protected]> Co-authored-by: Matthias Büchse <[email protected]>
Any feedback? |
Just for the record. Microsoft Azure https://learn.microsoft.com/de-de/azure/virtual-machines/ncads-h100-v5
Google Cloud https://cloud.google.com/compute/docs/gpus?hl=de#a100-gpus
AWS https://aws.amazon.com/de/ec2/instance-types/
|
Sidenote: The multi-GPU feature is not yet included in #780. |
Discussion seems to have continued on #546 |
Can this be closed now thanks to #780? |
I'm closing this. Please open a new one with whatever's remaining. |
GPU name in SCS Flavor Naming Standard need further refinement. The following aspects are described insufficiently:
h
, which can be set multiple times indicating a high-performance GPU. However "high-performance" is neither explained in detail, nor there is a mapping fromh
,hh
,hhh
... to measure a GPU properly as it is done for CPU Frequency`. In favor of comparison and interoperability, standard for GPU naming SHOULD be very strict and clear here.The text was updated successfully, but these errors were encountered: