AMD Optimized FFTW version 2.2
AMD Optimized FFTW version 2.2
Highlights of improvements on AMD EPYCTM processor family CPUs
- Improved performance of in-place MPI FFT by employing a faster in-place MPI transpose routine.
- Improved performance of copy function cpy2d_pair used for rank-0 transform and buffering plans.
- Added DFT kernels of higher radix sizes for q1fv, t1fv and q1fv FFT codelets.