-
Notifications
You must be signed in to change notification settings - Fork 42
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Feature like SAXPY but with divide #36
Comments
Single-precision scalar-vector multiply: vfmul.s.vs vv1, vv0, vs0 Most Hwacha vector compute instructions support using shared (scalar) registers for the Page 15 of the Hwacha ISA manual explains the instruction encoding:
There are currently no instructions for fused floating-point divide and add. |
Hello,
I am currently optimizing a code to run using Hwacha and I have this scenario.
for(int i = 0; i < n; i++){
out[i] = vec1[i]* const_float / vec2[i] + const_float2;
}
So I know I can do the vector division on Hwacha, but it would be nice to have an operation to perform the multiplication by scalar and and operation like SAXPY but with divide. For example, SADPY so that I could do the following:
mul_scalar_vec_hwacha(out, vec1, const_float)
sadpy(out, vec2, const_float2)
Is there any instruction on the ISA to perform this? I have been looking for it but I can't find any.
Thanks in advance!
The text was updated successfully, but these errors were encountered: