-
-
Notifications
You must be signed in to change notification settings - Fork 408
Issues: lyogavin/airllm
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Author
Label
Projects
Milestones
Assignee
Sort
Issues list
support for https://huggingface.co/nvidia/Nemotron-4-340B-Instruct ?
#196
opened Oct 28, 2024 by
mahald
taking about 40 minutes to generate one sentence,Is this speed normal?
#186
opened Sep 26, 2024 by
kingdoom1
Issue
model.safetensors.index.json should exist
with loading model in safetensors format
#185
opened Sep 26, 2024 by
LeMoussel
RuntimeError: shape '[1, 5, 8, 128]' is invalid for input of size 10240 LLama 405B 4-bit on Layer 1
#178
opened Aug 31, 2024 by
TitleOS
RuntimeError: shape '[1, 13, 8, 128]' is invalid for input of size 26624
#172
opened Aug 14, 2024 by
zhuojun1024
name 'dynamically_import_QuantLinear' is not defined
enhancement
New feature or request
future work
#163
opened Aug 4, 2024 by
gyyixr
Previous Next
ProTip!
Follow long discussions with comments:>50.