Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Badcase]: qwen2.5-72b 在昇腾910推理结果不符合预期 #992

Open
4 tasks done
tianshiyisi opened this issue Sep 28, 2024 · 3 comments
Open
4 tasks done

[Badcase]: qwen2.5-72b 在昇腾910推理结果不符合预期 #992

tianshiyisi opened this issue Sep 28, 2024 · 3 comments
Labels
help wanted Extra attention is needed

Comments

@tianshiyisi
Copy link

Model Series

Qwen2.5

What are the models used?

qwen2.5-72b-instruct

What is the scenario where the problem happened?

qwen2.5-72b-instruct 在昇腾910b上推理异常

Is this badcase known and can it be solved using avaiable techniques?

  • I have followed the GitHub README.
  • I have checked the Qwen documentation and cannot find a solution there.
  • I have checked the documentation of the related framework and cannot find useful information.
  • I have searched the issues and there is not a similar one.

Information about environment

OS: Ubuntu 22.04
Python: Python 3.10.9
GPUs: 4 x 910B3
NPU driver: 24.1.rc1 (from npu-smi)
mindie version: 1.0.T59
mindietorch : 1.0-t59-torch2.1.0.abi0
torch-npu : 2.1.0.post3-20240523

Description

qwen2.5-72b-instruct
image

qwen2-72b-instruct
image

@paul-yangmy
Copy link

mark

@qiangruoyu
Copy link

请问下这个配置文件是咋修改的,模型初始化的时候就报错了。

@940910941
Copy link

需要多大的显存?需要什么样的显卡才能运行?

@jklj077 jklj077 added the help wanted Extra attention is needed label Oct 9, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

5 participants