-
Notifications
You must be signed in to change notification settings - Fork 36
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add Scotch library with intel and gnu compiler on Hera and Orion #501
Comments
EPIC has handled the installations on Hera and Orion. Jong is the contact person. |
Hi @jkbk2004 do you have any status updates or a timeline? |
@aliabdolali Thanks for reminding! On EPIC side, I think we can follow up by early next week (say Monday). There are a few lib update check list. Doe it work on your side? |
@jkbk2004 Yes, it works for us. Thanks in advance. |
@natalie-perlin @ulmononian We need to follow up the scotch instrallation. |
I updated the instructions on Hera and Orion. |
@aliabdolali thank you for providing your installation instructions. i am currently working to add this to the hpc-stack on hera (@natalie-perlin will be handling orion), and will update you once it is added. to aid in determining which stack to add this to, do you require netcdf-4.9.0 or higher to run scotch w/ your applications? thank you! |
@aliabdolali is the expert, but from everything I have read and tested, we do not require netcdf-4.9.0 or higher. @MatthewMasarik-NOAA and I are testing the install instructions above to see if we need the lines:
after the make install as this does not appear needed from the install instructions on SCOTCH seen here: https://gitlab.inria.fr/scotch/scotch/-/blob/master/INSTALL.txt#L60-151 |
I revised the install instructions with these updates plus edit for Hera SCOTCH Install
|
@JessicaMeixner-NOAA @aliabdolali @MatthewMasarik-NOAA has anyone on the team had success installing scotch on cheyenne? also: is version 7.0.1 still your preference? i noticed 7.0.3 is out w/ full |
@DeniseWorthen can you jump in? |
@ulmononian regarding version, we would like Neither @JessicaMeixner-NOAA or @aliabdolali or myself have access to Cheyenne, so for us, no. |
@jkbk2004 As long as it compiles and runs, I have no preferences. |
I will note that we've been investigating what build flags to use which might be contributing to scaling problems we are seeing with SCOTCH. See a github issue here: NOAA-EMC/WW3#879 Also I have a ticket open with the orion helpdesk to update the intel/gnu issue so that you do not have to modify the gnu module file to download. Also I don't know if spack has the 7.0.3 version already in spack-stack. |
@JessicaMeixner-NOAA @aliabdolali @DeniseWorthen thanks for the updates and information. i've tried installing 7.0.1 on cheyenne (again have to use a gnu module file hack) but it is having issues in the make step with parsing using bison. i'm not sure what the min. required version is, but cheyenne only has 2.7. i tried installing the newest bison but that was not successful. i will try 7.0.3 using the cmake system... |
@JessicaMeixner-NOAA i will follow up with the scotch spack issue and see where it stands. we may be able to get 7.0.3 in the preferred versions list at least in the emc fork for now. will report back. |
I just looked at the SCOTCH library and found this for bison/flex: I'm not sure what "most recent" really is. I have an issue open on the scotch repository asking about the required version of gnu (for the header) and will add a request to find about about flex and bison minimum versions as well. |
I know for orion we needed to turn of pthreads to run successfully (-DTHREADS="OFF") and turning off MPI multiple threads (-DMPI_THREAD_MULTIPLE="OFF") was also helpful, but for now seems unrelated to your issues, just thought I'd mention it all the same. |
Here's my scotch issue asking for minimum versions: https://gitlab.inria.fr/scotch/scotch/-/issues/21 |
thanks for those tips and also for inquiring about the minimum versions w/ the scotch team, @JessicaMeixner-NOAA. some updates from my side on cheyenne: i was able to successfully install
@DeniseWorthen are you going to be testing scotch on cheyenne? if so, i will add it to the given that each machine so far has required some ad-hoc approaches for installing scotch, i am hopeful that it can be supported more smoothly using spack and the spack-stack. i will be posting updates on that effort here NOAA-EMC/spack-stack issue #465. |
FWIW,
|
@ulmononian Yes, I hope to be able to test on cheyenne, where the Q waits and job turn-around is much better for debugging purposes. |
@ulmononian Thanks for the scotch install. It will be a few days before I can give it a road test. I'll let you know if I have issues. |
@ulmononian I have heard from the SCOTCH developers and the minimum version of bison needed is 3.4 --- see response here: https://gitlab.inria.fr/scotch/scotch/-/issues/21 There is also a response about the minimum version of gnu --- which can be found here: https://gitlab.inria.fr/scotch/scotch/-/issues/19#note_808510 and relevant parts are repeated here:
|
On orion, a new module has been created, so that we can use later gcc with intel, to load:
|
this is very helpful. i wonder if the same thing could happen on cheyenne. @DeniseWorthen did you have time to test ww3 against the scotch install there? |
@ulmononian I'm sorry, I have not had a chance to test this yet. |
@ulmononian I did build and run on cheyenne using the scotch lib using my test setup. All I did was add
and then my usual compile.sh command. I'm using intel. |
Hi, I wanted to report a seg fault when testing the I'm testing using the WW3 standalone regression tests. I've found that I can build WW3 successfully, but when I try to run it I get a seg fault during model initialization. After doing the module loads, here the output of
I have set the needed environment parameter
And here is the part of the log file when the model is initializing, then crashes
I can confirm that running the WW3
|
Is this Intel or GNU? I also found a problem that only showed up when I tried GNU; it isn't related to SCOTCH. All my development work was done w/ Intel debug. This may or may not be the same issue, but this is the fix:
|
This is for That is very interesting. I don't know how it fits into the puzzle right now because I can run the same code, but with a different SCOTCH install.. Very interesting though. |
thanks for this information. can i ask how you built 7.0.3 on hera? perhaps i need to modify the build script in some way. |
@ulmononian, yes certainly. I'll get back early Fri with instructions. |
@ulmononian, there isn't much new here, but for completeness this is how I built SCOTCH on hera.
|
@MatthewMasarik-NOAA thanks for those build instructions. based on your use of on a related note, @natalie-perlin will be taking over the scotch work for the hpc-stack from this point forward. i'm happy to help where i can and will stay tuned to the discussion, but i will be focusing on the spack-stack scotch installations. the pr to add scotch to spack-stack is here: JCSDA/spack-stack#550. it would be great if anyone from ww3 could test the spack-stack UE w/ scotch on hera or orion. feel free to contact me or comment over at that PR if interested. |
Hi @ulmononian, @natalie-perlin, please use the following
Also, I have been testing the spack-stack SCOTCH install on hera. I post my current status on that at JCSDA/spack-stack/pull/550 |
@MatthewMasarik-NOAA -
|
Hi @natalie-perlin, sure thing.
|
@MatthewMasarik-NOAA @JessicaMeixner-NOAA - Updated/installed scotch 7.0.3 on Hera and Orion, with intel/2022.1.2 compilers. Please test these installations. This is the configuration from the actual log files:
Few additional notes: (FYI, @jkbk2004 , @ulmononian )
|
@natalie-perlin Sure thing, I'll start testing these. |
@natalie-perlin, for orion, there is a typo (ahelp vs. help) in the scotch lua file that prevents it from loading:
Thank you for the finding! Fixed both in the actual modulefile for scotch and in the template. No such issue on Hera, checked. |
Great, thank you for fixing those. I'll test it out |
@natalie-perlin these new installs on |
Hi @natalie-perlin. I have some good news, SCOTCH v7.0.4 was just released containing bug fixes for the scaling issue we saw, as well as an issue exposed when using gnu + openmpi, found by @AlexanderRichert-NOAA. Please find this new version here: https://gitlab.inria.fr/scotch/scotch/-/releases/v7.0.4 We are looking forward to having this updated version installed. Is there anything else I can provide to start the installations? |
@natalie-perlin I wanted to amend my comment from yesterday, and see if we could pause this process for a few days? With the new version released just yesterday I need some time to produce new build instructions. We have been using the safest SCOTCH compile options while debugging, though now we may be able to get some performance gains using different SCOTCH options. I'm testing different builds now, then I can pass you the complete instructions we decide on. My intention is to have those instructions ready by Monday. |
Fyi @natalie-perlin For the SCOTCH v7.0.4 installs I created a new Install issue. Since this issue was for the initial Package Addition, and I've posted the install request for |
Hi @MatthewMasarik-NOAA - yes, it has been added and tested successfully on Orion, Hera (and other current hpc-stack locations). I hope the issue could be closed now |
just fyi @MatthewMasarik-NOAA @natalie-perlin: scotch 7.4.0 will be included with spack-stack 1.5.0. |
@natalie-perlin Yes, from my perspective this can be closed now. |
@ulmononian Awesome! That's great news. I will be following up with the corresponding spack-stack install issue, it will be Monday. |
Please describe the package or library you would like to add to hpc-stack.
The Scotch distribution is a set of programs and libraries which implement the static mapping and sparse matrix reordering algorithms developed within the Scotch project. We would like to utilize the graph partitioning capability of Scotch in the WW3 model. Here is the link to the Scotch repository:
https://gitlab.inria.fr/scotch/scotch
What applications at NOAA will be using this package or library?
UFS-WEATHER-MODEL:
UFS-COASTAL
Is there already a package or library in hpc-stack that provides this, or related, functionality?
NA
Additional context
I have tested the compilation of SCOTCH with GNU and Intel compiler on both Hera and Orion. The compilation with intel is a bit tricky, so here I added the step-by-step instruction:
Hera
I compiled scotch following this instruction on Hera in
/scratch2/COASTAL/coastal/save/Ali.Abdolali/hpc-stack/scotch/install
Orion
I compiled scotch following this instruction on Orion in
/work/noaa/marine/ali.abdolali/Source/hpc-stack/scotch/install
The Intel compiler depends on GNU for some of its functionality, and needs a more recent version of GNU compiler to function correctly. The default version of GNU that comes with most distribution is pretty old.
So if WCOSS2 has a more recent version as the default then it should simply work. Based on the GNU module loaded on hera, it seems to work fine with:
gcc (Spack GCC) 9.2.0
If WCOSS2 had an older version of GNU then you see if there is a module for GNU with a more recent compiler.
Will This Package be Needed in an Operational Application?
Yes
WCOSS System Software Request Checklist
General questions:
Supervisor or sponsor of the requester: @AvichalMehra-NOAA
The license of the package is approved by NCO.
Software name and version, specific URL to the software SCOTCH https://gitlab.inria.fr/scotch/scotch
Software type - New or Upgrade existing? New
Justification ( List NPS models using the software)
Completion Time requested
Software License including all Dependency Software Licenses
Support contact(s) who must have a WCOSS account
Dependency Software list
Installation instructions
Test and verification instructions
Technical Review Checklist for open source software - Review the source code to answer the following questions
Licenses for the requested software and its dependencies
Licenses - confirm the software Licenses are acceptable
Maturity
Acceptable - Stable, production, or equivalent
Self-contained
No external http, https, ftp, or other URI exists except that in comments
No binary files in the package unless they are in the approved list
No publicly disclosed cybersecurity vulnerabilities and exposures
Searching https://cve.mitre.org/cve/
Security High Level Checklists
Is it prohibited by DHS/DOC/NOAA/NWS
Is it provided by a trusted source? Trusted sources include other NWS, NOAA, or DOC, agencies, or other Federal agencies that operate at a FISMA high or equivalent level. Additionally, trusted sources could be third-party agencies through which there is an existing SLA on file (such as RedHat).
Is software support offered (is it being updated and patched). Yes, the main developers agreed to support the software.
If maintained by a private entity, does the entity operate in a foreign country, especially a prohibited foreign country (China, Russia, Iran, North Korea, etc.).
Is there sufficient documentation to support maintenance Yes
Are there known vulnerabilities or weaknesses No
Is there a need for privileged processes
Are there software dependencies, are those dependencies approved or do they have any security concerns
Are there any other concerns related to SA, SI, and SC control families
The text was updated successfully, but these errors were encountered: