Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

910B机器 #314

Open
Jayc-Z opened this issue Nov 1, 2024 · 0 comments
Open

910B机器 #314

Jayc-Z opened this issue Nov 1, 2024 · 0 comments

Comments

@Jayc-Z
Copy link

Jayc-Z commented Nov 1, 2024

Environment

Hardware Environment(Ascend/GPU/CPU):

Uncomment only one /device <> line, hit enter to put that in a new line, and remove leading whitespaces from that line:
Ascend 910B

Software Environment:

  • MindSpore 2.4.0:
  • mindpet 1.0.4:
  • mindformers 1.3.0:
  • Python 3.10:
  • Ubuntu aarch64:

Describe the current behavior

利用mindformers/research/qwen1_5/run_qwen1_5_chat.py进行推理,配置如下:
image
单次推理时间74s,token生成速度6tokens/s,这个速度是正常的吗
image

Describe the expected behavior

Steps to reproduce the issue

Related log / screenshot

Special notes for this issue

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant