-
Notifications
You must be signed in to change notification settings - Fork 11
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Merge branch 'master' of https://github.com/reger-men/HPL_GPU
- Loading branch information
Showing
21 changed files
with
413 additions
and
33 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file was deleted.
Oops, something went wrong.
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,31 @@ | ||
HPLinpack benchmark input file | ||
Innovative Computing Laboratory, University of Tennessee | ||
HPL.out output file name (if any) | ||
0 device out (6=stdout,7=stderr,file) | ||
1 # of problems sizes (N) | ||
4104192 N | ||
1 # of NBs | ||
384 NBs | ||
0 PMAP process mapping (0=Row-,1=Column-major) | ||
1 # of process grids (P x Q) | ||
64 Ps | ||
32 Qs | ||
16.0 threshold | ||
1 # of panel fact | ||
2 PFACTs (0=left, 1=Crout, 2=Right) | ||
1 # of recursive stopping criterium | ||
2 NBMINs (>= 1) | ||
1 # of panels in recursion | ||
2 NDIVs | ||
1 # of recursive panel fact. | ||
2 RFACTs (0=left, 1=Crout, 2=Right) | ||
1 # of broadcast | ||
6 BCASTs (0=1rg,1=1rM,2=2rg,3=2rM,4=Lng,5=LnM,6=ibcast) | ||
1 # of lookahead depth | ||
1 DEPTHs (>=0) | ||
1 SWAP (0=bin-exch,1=long,2=mix) | ||
64 swapping threshold | ||
1 L1 in (0=transposed,1=no-transposed) form | ||
0 U in (0=transposed,1=no-transposed) form | ||
0 Equilibration (0=no,1=yes) | ||
8 memory alignment in double (> 0) |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,31 @@ | ||
HPLinpack benchmark input file | ||
Innovative Computing Laboratory, University of Tennessee | ||
HPL.out output file name (if any) | ||
0 device out (6=stdout,7=stderr,file) | ||
1 # of problems sizes (N) | ||
5786112 N | ||
1 # of NBs | ||
384 NBs | ||
0 PMAP process mapping (0=Row-,1=Column-major) | ||
1 # of process grids (P x Q) | ||
64 Ps | ||
64 Qs | ||
16.0 threshold | ||
1 # of panel fact | ||
2 PFACTs (0=left, 1=Crout, 2=Right) | ||
1 # of recursive stopping criterium | ||
2 NBMINs (>= 1) | ||
1 # of panels in recursion | ||
2 NDIVs | ||
1 # of recursive panel fact. | ||
2 RFACTs (0=left, 1=Crout, 2=Right) | ||
1 # of broadcast | ||
6 BCASTs (0=1rg,1=1rM,2=2rg,3=2rM,4=Lng,5=LnM,6=ibcast) | ||
1 # of lookahead depth | ||
1 DEPTHs (>=0) | ||
1 SWAP (0=bin-exch,1=long,2=mix) | ||
64 swapping threshold | ||
1 L1 in (0=transposed,1=no-transposed) form | ||
0 U in (0=transposed,1=no-transposed) form | ||
0 Equilibration (0=no,1=yes) | ||
8 memory alignment in double (> 0) |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,20 @@ | ||
# modules | ||
module reset | ||
|
||
module load craype-accel-amd-gfx90a | ||
module load PrgEnv-amd | ||
module load amd/5.2.0 | ||
module load rocm/5.2.0 | ||
module load cray-mpich/8.1.17 | ||
module load openblas/0.3.17-omp | ||
|
||
# | ||
# env | ||
# | ||
export LD_LIBRARY_PATH="${CRAY_LD_LIBRARY_PATH}:${LD_LIBRARY_PATH}" | ||
# enable GPU aware MPI | ||
export MPICH_GPU_SUPPORT_ENABLED=1 | ||
# to work around the OFI registration cache issue for > 8 nodes | ||
export FI_MR_CACHE_MAX_COUNT=0 | ||
#export MPICH_SMP_SINGLE_COPY_MODE=NONE # does not work | ||
export MPICH_RANK_REORDER_DISPLAY=1 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,22 @@ | ||
# Any other commands must follow the #SBATCH directives | ||
module load LUMI/22.06 partition/G | ||
#module load rocm/5.1.4 | ||
|
||
module use /project/project_462000075/paklui/modulefiles | ||
#module load rocm/5.3.0-10584 | ||
#module load rocm/5.3.0-10619 | ||
module load rocm/5.3.0-10670 | ||
#module load openblas/0.3.17-omp | ||
#module load cce/14.0.2 | ||
#module load cray-libsci/22.08.1.1 | ||
module load cray-mpich/8.1.18 | ||
#module load craype/2.7.17 | ||
|
||
# | ||
# env | ||
# | ||
# enable GPU aware MPI | ||
export MPICH_GPU_SUPPORT_ENABLED=1 | ||
# to work around the OFI registration cache issue for > 8 nodes | ||
#export FI_MR_CACHE_MAX_COUNT=0 | ||
export MPICH_RANK_REORDER_DISPLAY=1 |
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.