-
-
Notifications
You must be signed in to change notification settings - Fork 408
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
RuntimeError: shape '[1, 13, 8, 128]' is invalid for input of size 26624 #172
Comments
I'm getting something similar.
|
same error here, exact same stacktrace on
|
Same,any body could find the reason? |
me
me too ,Have you solved it |
Did any one solved this? |
EDIT: My test run finished without error and returned expected results. So, I have a potential quick fix. My test hasn't gone through yet, but it has already been running for quite a while. I didn't dig into why exactly this is, but so in
with
This seems to make it work for this specific airllm setup for me. However it might mess up some other use case as I suspect the error originates elsewhere and the wrong value is just passed to this file. |
I encountered an error while trying to run Llama3.1 405B according to the documentation. Can you help me identify the problem?
error message
code
env
windows 11
python 3.10 env in minicanda
cuda 12.1
pip list
The text was updated successfully, but these errors were encountered: