Skip to content

AMD Optimized FFTW version 3.0.1

Compare
Choose a tag to compare
@pradeeptrgit pradeeptrgit released this 06 Jul 08:42
· 3189 commits to amd-fftw since this release

AMD Optimized FFTW version 3.0.1

Highlights of improvements on AMD EPYCTM processor family CPUs

  • A new planner feature called Top N planner is introduced that minimizes single-threaded run-to-run variations.
  • New parallel MPI transpose algorithm enabled via configure option "--enable-amd-mpi-vader-limit"
    • When using this configure option, the user needs to set --mca btl_vader_eager_limit appropriately (current preference is 65536) in the MPIRUN command.