Skip to content

AMD Optimized FFTW version 2.2

Compare
Choose a tag to compare
@pradeeptrgit pradeeptrgit released this 30 Jun 09:07
· 3201 commits to amd-fftw since this release

AMD Optimized FFTW version 2.2

Highlights of improvements on AMD EPYCTM processor family CPUs

  • Improved performance of in-place MPI FFT by employing a faster in-place MPI transpose routine.
  • Improved performance of copy function cpy2d_pair used for rank-0 transform and buffering plans.
  • Added DFT kernels of higher radix sizes for q1fv, t1fv and q1fv FFT codelets.