Skip to content

Commit

Permalink
Update README.md
Browse files Browse the repository at this point in the history
Improve MPI example to avoid confusion of number of processes / total number of GPUs.

#54 (comment)
  • Loading branch information
sjeaugey authored Jan 3, 2023
1 parent 0b4c4cb commit 2cbb968
Showing 1 changed file with 2 additions and 2 deletions.
4 changes: 2 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -29,9 +29,9 @@ Run on 8 GPUs (`-g 8`), scanning from 8 Bytes to 128MBytes :
$ ./build/all_reduce_perf -b 8 -e 128M -f 2 -g 8
```

Run with MPI on 40 processes (potentially on multiple nodes) with 4 GPUs each :
Run with MPI on 10 processes (potentially on multiple nodes) with 4 GPUs each, for a total of 40 GPUs:
```shell
$ mpirun -np 40 ./build/all_reduce_perf -b 8 -e 128M -f 2 -g 4
$ mpirun -np 10 ./build/all_reduce_perf -b 8 -e 128M -f 2 -g 4
```

### Performance
Expand Down

0 comments on commit 2cbb968

Please sign in to comment.