improve 05_saxpy, 09_matAdd and 10_matMul
Showing 12 changed files with 94 additions and 52 deletions.
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,14 @@ | ||
hallo from gpu029 | ||
The system supports 1 ns time resolution | ||
total size of x and y is 512.0 MB | ||
tests are averaged over 32 loops | ||
saxpy on host (0) : 65092.0 MB/s 65093.6 MB/s maxabserr = 0.0 | ||
saxpy on host (1) : 70769.4 MB/s 70772.0 MB/s maxabserr = 0.0 | ||
saxpy on accl (1) : 1400.7 MB/s 4648.9 MB/s maxabserr = 0.0 | ||
saxpy on accl (2) : 1371.6 MB/s 4653.9 MB/s maxabserr = 0.0 | ||
saxpy on accl (3) : 2046.7 MB/s 227586.6 MB/s maxabserr = 0.0 | ||
saxpy on accl (4) : 2062.4 MB/s 224540.3 MB/s maxabserr = 0.0 | ||
saxpy on accl (5) : 2073.9 MB/s 276659.5 MB/s maxabserr = 0.0 | ||
saxpy on accl (6) : 2045.0 MB/s 271431.4 MB/s maxabserr = 0.0 | ||
saxpy on accl (7) : 2025.2 MB/s 280631.7 MB/s maxabserr = 0.0 | ||
saxpy on accl (8) : 2025.7 MB/s 279577.4 MB/s maxabserr = 0.0 |
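
The output above reports rates in MB/s averaged over 32 loops, but it does not say which bytes are counted or what the two rate columns measure. As a minimal sketch of the arithmetic, assuming the rate is bytes moved divided by the mean per-loop time and that 1 MB means 2**20 bytes (both are assumptions, and `effective_bandwidth` and `mean_time` are hypothetical helper names, not taken from the repository):

```python
def effective_bandwidth(bytes_moved, seconds, unit=2**20):
    """Return bytes_moved / seconds, expressed in multiples of `unit` per second.

    unit=2**20 gives MB/s under the binary-megabyte assumption;
    pass unit=1e6 for decimal megabytes instead.
    """
    if seconds <= 0.0:
        raise ValueError("seconds must be positive")
    return bytes_moved / seconds / unit


def mean_time(loop_times):
    """Average per-loop wall time over all measured loops."""
    if not loop_times:
        raise ValueError("need at least one timing")
    return sum(loop_times) / len(loop_times)


# Hypothetical timings (not measured data): 32 loops of 8 ms each
# over a 512 MB working set, counting each byte once.
times = [0.008] * 32
rate = effective_bandwidth(512 * 2**20, mean_time(times))
print(f"{rate:.1f} MB/s")
```

Counting each array access separately (saxpy reads `x` and `y` and writes `y`) would give a larger byte count than the 512 MB footprint, so reported rates are only comparable when the counting convention is the same.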
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,21 @@ | ||
hallo from gpu028 | ||
matrix dim: 4096 x 4096 | ||
time averaged over 64 loops | ||
matAddAB (0) : 1.9 GB/s 86.6 GB/s maxabserr = 0.0 | ||
matAddAB (1) : 1.6 GB/s 35.7 GB/s maxabserr = 0.0 | ||
matAddAB (2) : 1.6 GB/s 48.1 GB/s maxabserr = 0.0 | ||
matAddAB (3) : 1.7 GB/s 166.6 GB/s maxabserr = 0.0 | ||
matAddAB (4) : 2.0 GB/s 183.3 GB/s maxabserr = 0.0 | ||
matAddAB (5) : 1.9 GB/s 183.7 GB/s maxabserr = 0.0 | ||
matAddAB (6) : 1.9 GB/s 185.3 GB/s maxabserr = 0.0 | ||
matAddAB (7) : 1.8 GB/s 185.4 GB/s maxabserr = 0.0 | ||
matrix dim: 8192 x 8192 | ||
time averaged over 64 loops | ||
matAddAB (0) : 1.9 GB/s 172.2 GB/s maxabserr = 0.0 | ||
matAddAB (1) : 1.9 GB/s 34.0 GB/s maxabserr = 0.0 | ||
matAddAB (2) : 1.6 GB/s 8.4 GB/s maxabserr = 0.0 | ||
matAddAB (3) : 1.9 GB/s 265.8 GB/s maxabserr = 0.0 | ||
matAddAB (4) : 1.9 GB/s 265.4 GB/s maxabserr = 0.0 | ||
matAddAB (5) : 1.9 GB/s 265.8 GB/s maxabserr = 0.0 | ||
matAddAB (6) : 1.9 GB/s 264.9 GB/s maxabserr = 0.0 | ||
matAddAB (7) : 1.9 GB/s 269.0 GB/s maxabserr = 0.0 |
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,14 @@ | ||
hallo from gpu028 | ||
matrix dim: 4096 x 4096 | ||
time averaged over 16 loops | ||
matMulAB (0) : 24.9 GFLOPS 25.5 GFLOPS maxabserr = 0.0 | ||
matMulAB (1) : 9.8 GFLOPS 9.9 GFLOPS maxabserr = 0.0 | ||
matMulAB (2) : 184.5 GFLOPS 228.9 GFLOPS maxabserr = 0.0 | ||
matMulAB (3) : 5.0 GFLOPS 5.1 GFLOPS maxabserr = 1018.4 | ||
matMulAB (4) : 176.1 GFLOPS 216.2 GFLOPS maxabserr = 0.0 | ||
matMulAB (5) : 340.9 GFLOPS 531.9 GFLOPS maxabserr = 0.0 | ||
matMulAB (6) : 610.3 GFLOPS 1708.9 GFLOPS maxabserr = 0.0 | ||
matMulAB (7) : 218.9 GFLOPS 284.6 GFLOPS maxabserr = 0.0 | ||
matMulAB (8) : 233.8 GFLOPS 310.4 GFLOPS maxabserr = 0.0 | ||
matMulAB (9) : 254.5 GFLOPS 348.1 GFLOPS maxabserr = 0.0 | ||
matMulAB (10) : 931.6 GFLOPS 10126.1 GFLOPS maxabserr = 0.0 |
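
In the output above, every variant except (3) reports `maxabserr = 0.0`; variant (3) reports 1018.4, which suggests it produced wrong results in this run. The following is a rough sketch of how such figures could be derived, assuming the conventional count of 2·n³ floating-point operations for a dense n × n matrix product and a maximum absolute element-wise difference against a reference result. Both function names are hypothetical, and the repository may compute these values differently.

```python
def matmul_gflops(n, seconds):
    """GFLOPS for an n x n dense matrix product, using the 2*n**3 convention
    (n**3 multiplications plus roughly n**3 additions)."""
    if seconds <= 0.0:
        raise ValueError("seconds must be positive")
    return 2.0 * n**3 / seconds / 1e9


def max_abs_err(result, reference):
    """Largest absolute element-wise difference between two flat sequences."""
    if len(result) != len(reference):
        raise ValueError("sequences must have the same length")
    return max(abs(r - e) for r, e in zip(result, reference))


# Hypothetical values for illustration only (not measured data).
print(f"{matmul_gflops(4096, 1.0):.1f} GFLOPS")
print(f"maxabserr = {max_abs_err([1.0, 2.5, 3.0], [1.0, 2.0, 3.0]):.1f}")
```

A nonzero error on its own is not always a bug, since floating-point summation order can differ between variants, but an error of this size on a correctness check is worth investigating before the variant's throughput is compared with the others.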