Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
This change contains the fixes for following issues:-
1) When AMD optimized FFTW (using --enable-amd-opt) is configured with --enable-amd-trans, the hybrid OpenMP+MPI tests were failing. With this fix, AMD_OPT_TRANS is made undefined in case of hybrid OpenMP+MPI configuration. So, even using --enable-amd-trans, AMD optimized transpose would not be used when running for hybrid OpenMP+MPI configuration. For, single-threaded FFTW, AMD_OPT_TRANS will remain enabled. 2) long double and quad precision tests with --enable-amd-opt were failing. The cpy2d routine will use normal C version now in case of long double and quad precision. For single and double precision, AMD optimized cpy2d routine will remained enabled. Change-Id: I30cdb461bd6d24f5563faba9f4c85b17f1c08006
- Loading branch information