Merge pull request #602 from stdweird/shinx_pilot_v3
shinx pilot phase stage 2
boegel authored Apr 19, 2024
2 parents 1950db5 + e1c9588 commit 668fd6c
Showing 1 changed file with 4 additions and 4 deletions.
8 changes: 4 additions & 4 deletions mkdocs/docs/HPC/only/gent/2023/shinx.md
@@ -68,7 +68,7 @@ optimizations (like GCC's `-march=native`, or Intel compiler's `-xHost`) might s

### Other remarks

-* Possible issues with `OpenMP` thread pinning: we have seen, especially on `Tier-1 dodrio` cluster, that in certain cases
+* Possible issues with thread pinning: we have seen, especially on `Tier-1 dodrio` cluster, that in certain cases
thread pinning is invoked where it is not expected. Typical symptom is that all the processes that are started are pinned
to a single core. Always report this issue when it occurs.
You can try yourself to mitigate this by setting `export OMP_PROC_BIND=false`, but always report it so we can keep track of this problem.
@@ -111,11 +111,11 @@ As such, we will have an extended pilot phase in 3 stages:
* Racking of last 16 nodes
* Installation of NDR/NDR-200 infiniband network

-### Stage 2 (01/03/2024-20/05/2024)
+### Stage 2 (19/04/2024-20/05/2024)

* Full size cluster
-* 48 nodes
-* NDR-200 Infiniband
+* 48 nodes (no job size limit)
+* NDR-200 Infiniband (single switch Infiniband topology)
* EL9 OS

* We expect to plan a full Tier-2 downtime in May 2024 to cleanup, refactor and renew the core networks
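
The thread pinning remark in the first hunk describes processes ending up pinned to a single core. As a rough illustration only, and not part of this commit or the HPC documentation, the sketch below shows one way a user might check which cores a process is allowed to run on before reporting the issue. The function names `parse_cpu_list`, `pinned_to_single_core`, and `current_affinity` are hypothetical; `os.sched_getaffinity` is a standard-library call that is only available on Linux. Note that a job that requested exactly one core will also report a single allowed core, so the check is a hint and not proof of a problem.

```python
import os


def parse_cpu_list(text):
    """Expand a Linux CPU list such as '0-3,8' into a sorted list of core ids.

    This is the format used by e.g. the Cpus_allowed_list field
    in /proc/<pid>/status.
    """
    cores = set()
    for part in text.strip().split(","):
        if not part:
            continue
        if "-" in part:
            first, last = part.split("-", 1)
            cores.update(range(int(first), int(last) + 1))
        else:
            cores.add(int(part))
    return sorted(cores)


def pinned_to_single_core(cores):
    """Return True when the given core list contains exactly one core."""
    return len(cores) == 1


def current_affinity():
    """Return the cores this process may run on, or None if unsupported."""
    if hasattr(os, "sched_getaffinity"):
        return sorted(os.sched_getaffinity(0))
    return None


if __name__ == "__main__":
    cores = current_affinity()
    print("allowed cores:", cores)
    if cores is not None and pinned_to_single_core(cores):
        # Hypothetical hint only; a single-core job looks the same.
        print("process is restricted to one core; consider reporting this")
```

If the check suggests unexpected pinning, the mitigation quoted in the diff (`export OMP_PROC_BIND=false`) can be tried before rerunning, and the issue should still be reported as the documentation asks.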