Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Feature Request] Leverage Arm64 SME/SME2 instructions in onnxruntime #23031

Open
mcfi opened this issue Dec 5, 2024 · 0 comments
Open

[Feature Request] Leverage Arm64 SME/SME2 instructions in onnxruntime #23031

mcfi opened this issue Dec 5, 2024 · 0 comments
Labels
feature request request for unsupported feature or enhancement

Comments

@mcfi
Copy link
Contributor

mcfi commented Dec 5, 2024

Describe the feature request

Apple M4 supports SME2 instructions that can greatly improve matrix multiplication performance, and onnxruntime may want to leverage those instructions.

This https://github.com/tzakharko/m4-sme-exploration/tree/main shows M4 SME support details and performance.

Describe scenario use case

This should make onnxruntime run faster inference on Arm64 CPUs with SME support.

@mcfi mcfi added the feature request request for unsupported feature or enhancement label Dec 5, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
feature request request for unsupported feature or enhancement
Projects
None yet
Development

No branches or pull requests

1 participant