Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Llama2-Chinese-13b-Chat-4bit量化模型无法运行 #356

Open
Kevin0412 opened this issue Aug 3, 2024 · 0 comments
Open

Llama2-Chinese-13b-Chat-4bit量化模型无法运行 #356

Kevin0412 opened this issue Aug 3, 2024 · 0 comments

Comments

@Kevin0412
Copy link
Contributor

FileNotFoundError: Could not find a model in Llama2-Chinese-13b-Chat-4bit with a name in gptq_model-4bit-128g.safetensors.index.json, gptq_model-4bit-128g.safetensors, model.safetensors.index.json, model.safetensors. Please specify the argument model_basename to use a custom file name.
文件夹里有的是gptq_model-4bit-128g.bin,是早期格式,估计现在的transformer已经不支持这一格式了,只能支持.safetensors格式,然后就出问题了,希望可以及时更新文件格式。

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant