add OpenFOAM v9 to EESSI pilot 2021.12 #195

Closed
wants to merge 6 commits into from

boegel
Contributor

@boegel boegel commented Nov 7, 2022

checklist:

@boegel boegel added enhancement New feature or request pilot-2021.12 labels Nov 7, 2022
@boegel
Contributor Author

boegel commented Nov 10, 2022

I'm hitting a weird error during make install of Qt5 while building this on AWS, see easybuilders/easybuild-easyconfigs#16614 for more details...

edit: fixed with new build container, cfr. EESSI/filesystem-layer#133

@boegel boegel marked this pull request as ready for review January 22, 2023 08:06
@EESSI EESSI deleted a comment from eessi-bot bot Jan 31, 2023
@EESSI EESSI deleted a comment from eessi-bot bot Jan 31, 2023
@EESSI EESSI deleted a comment from eessi-bot bot Jan 31, 2023
@EESSI EESSI deleted a comment from eessi-bot bot Jan 31, 2023
@EESSI EESSI deleted a comment from eessi-bot bot Jan 31, 2023
@boegel
Contributor Author

boegel commented Jan 31, 2023

The last iteration of builds seemed to fail due to a variety of issues (mostly while building Qt5; one failed to build GObject-Introspection, another failed during the OpenFOAM sanity check).

I've reconfigured the bot to use larger instances (16-core with 32GB of RAM, was 8-core with 16GB of RAM), let's see if that helps...

@EESSI EESSI deleted a comment from eessi-bot bot Jan 31, 2023
@EESSI EESSI deleted a comment from eessi-bot bot Jan 31, 2023
@EESSI EESSI deleted a comment from eessi-bot bot Jan 31, 2023
@EESSI EESSI deleted a comment from eessi-bot bot Jan 31, 2023
@EESSI EESSI deleted a comment from eessi-bot bot Jan 31, 2023
@EESSI EESSI deleted a comment from eessi-bot bot Jan 31, 2023
@EESSI EESSI deleted a comment from eessi-bot bot Jan 31, 2023
@EESSI EESSI deleted a comment from eessi-bot bot Jan 31, 2023
@EESSI EESSI deleted a comment from eessi-bot bot Jan 31, 2023
@eessi-bot

eessi-bot bot commented Jan 31, 2023

New job on instance eessi-bot-citc-aws for architecture x86_64-amd-zen2 in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.01/pr_195/3582

| date | job status | comment |
|------|------------|---------|
| Jan 31 22:09:41 UTC 2023 | submitted | job id 3582 awaits release by job manager |
| Jan 31 22:10:12 UTC 2023 | released | job awaits launch by Slurm scheduler |
| Jan 31 22:11:31 UTC 2023 | running | job 3582 is running |
| Jan 31 22:17:16 UTC 2023 | finished | 😁 SUCCESS tarball eessi-2021.12-software-linux-x86_64-amd-zen2-1675203381.tar.gz (0.000 GiB) in job dir |

@eessi-bot

eessi-bot bot commented Jan 31, 2023

New job on instance eessi-bot-citc-aws for architecture x86_64-amd-zen3 in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.01/pr_195/3583

| date | job status | comment |
|------|------------|---------|
| Jan 31 22:09:43 UTC 2023 | submitted | job id 3583 awaits release by job manager |
| Jan 31 22:10:09 UTC 2023 | released | job awaits launch by Slurm scheduler |
| Jan 31 22:11:28 UTC 2023 | running | job 3583 is running |
| Jan 31 22:16:10 UTC 2023 | finished | 😁 SUCCESS tarball eessi-2021.12-software-linux-x86_64-amd-zen3-1675203324.tar.gz (0.000 GiB) in job dir |
| Feb 01 07:18:10 UTC 2023 | uploaded | transfer of eessi-2021.12-software-linux-x86_64-amd-zen3-1675203324.tar.gz to S3 bucket succeeded |

@eessi-bot

eessi-bot bot commented Jan 31, 2023

New job on instance eessi-bot-citc-aws for architecture aarch64-generic in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.01/pr_195/3584

| date | job status | comment |
|------|------------|---------|
| Jan 31 22:09:44 UTC 2023 | submitted | job id 3584 awaits release by job manager |
| Jan 31 22:10:06 UTC 2023 | released | job awaits launch by Slurm scheduler |
| Jan 31 22:11:26 UTC 2023 | running | job 3584 is running |
| Feb 01 01:49:12 UTC 2023 | finished | 😁 SUCCESS tarball eessi-2021.12-software-linux-aarch64-generic-1675215914.tar.gz (1.140 GiB) in job dir |
| Feb 01 07:18:04 UTC 2023 | uploaded | transfer of eessi-2021.12-software-linux-aarch64-generic-1675215914.tar.gz to S3 bucket succeeded |

@eessi-bot

eessi-bot bot commented Jan 31, 2023

New job on instance eessi-bot-citc-aws for architecture aarch64-graviton2 in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.01/pr_195/3585

| date | job status | comment |
|------|------------|---------|
| Jan 31 22:09:46 UTC 2023 | submitted | job id 3585 awaits release by job manager |
| Jan 31 22:10:04 UTC 2023 | released | job awaits launch by Slurm scheduler |
| Jan 31 22:11:25 UTC 2023 | running | job 3585 is running |
| Jan 31 22:16:08 UTC 2023 | finished | 😢 FAILURE |

  • Found slurm output slurm-3585.out in job dir
  • Slurm output lacks message "No missing modules!".

edit (@boegel): build of GObject-Introspection-1.68.0-GCCcore-10.3.0.eb failed, error unclear, will re-trigger and see if it's consistent

@eessi-bot

eessi-bot bot commented Jan 31, 2023

New job on instance eessi-bot-citc-aws for architecture aarch64-graviton3 in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.01/pr_195/3586

| date | job status | comment |
|------|------------|---------|
| Jan 31 22:09:48 UTC 2023 | submitted | job id 3586 awaits release by job manager |
| Jan 31 22:10:00 UTC 2023 | released | job awaits launch by Slurm scheduler |
| Jan 31 22:11:23 UTC 2023 | running | job 3586 is running |
| Feb 01 00:56:40 UTC 2023 | finished | 😁 SUCCESS tarball eessi-2021.12-software-linux-aarch64-graviton3-1675212769.tar.gz (1.138 GiB) in job dir |
| Feb 01 07:18:48 UTC 2023 | uploaded | transfer of eessi-2021.12-software-linux-aarch64-graviton3-1675212769.tar.gz to S3 bucket succeeded |

@boegel boegel added the bot:deploy Ask bot to deploy missing software installations to EESSI label Feb 1, 2023
@boegel boegel added bot:build and removed bot:build bot:deploy Ask bot to deploy missing software installations to EESSI labels Feb 2, 2023
@eessi-bot

eessi-bot bot commented Feb 2, 2023

New job on instance eessi-bot-citc-aws for architecture aarch64-graviton2 in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.02/pr_195/3587

| date | job status | comment |
|------|------------|---------|
| Feb 02 08:29:38 UTC 2023 | submitted | job id 3587 awaits release by job manager |
| Feb 02 08:30:10 UTC 2023 | released | job awaits launch by Slurm scheduler |
| Feb 02 08:31:12 UTC 2023 | running | job 3587 is running |
| Feb 02 08:37:21 UTC 2023 | finished | 😢 FAILURE |

  • Found slurm output slurm-3587.out in job dir
  • Slurm output lacks message "No missing modules!".
  • Slurm output lacks message about created tarball.
  • No tarball matching eessi-*software-*.tar.gz found in job dir.
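
The failure reasons listed in these bot reports suggest a small set of checks on the job directory. The following is only a rough sketch of how such checks could look, not the bot's actual implementation: the function name `check_job_dir` and the constant names are hypothetical, and the check for the "created tarball" message is left out because its exact wording is not shown here.

```python
from pathlib import Path

# Strings taken from the bot reports in this thread.
SUCCESS_MESSAGE = "No missing modules!"
TARBALL_PATTERN = "eessi-*software-*.tar.gz"


def check_job_dir(job_dir, job_id):
    """Return a list of problems found in a job directory (empty list = looks fine).

    Hypothetical helper: it mirrors the reasons the bot reports,
    but it is not taken from the bot's source code.
    """
    job_dir = Path(job_dir)
    slurm_out = job_dir / f"slurm-{job_id}.out"

    # Without the Slurm output file there is nothing else to inspect.
    if not slurm_out.is_file():
        return [f"No slurm output {slurm_out.name} found in job dir."]

    problems = []
    text = slurm_out.read_text(errors="replace")

    # The build is only considered complete if this message was printed.
    if SUCCESS_MESSAGE not in text:
        problems.append(f'Slurm output lacks message "{SUCCESS_MESSAGE}".')

    # A successful job leaves a tarball in the job directory.
    if not any(job_dir.glob(TARBALL_PATTERN)):
        problems.append(f"No tarball matching {TARBALL_PATTERN} found in job dir.")

    return problems
```

Calling `check_job_dir(job_dir, 3587)` on the job directory of a run like the one above would be expected to return the missing-message and missing-tarball problems.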

@boegel boegel added bot:build and removed bot:build labels Feb 2, 2023
@eessi-bot

eessi-bot bot commented Feb 2, 2023

New job on instance eessi-bot-citc-aws for architecture aarch64-graviton2 in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.02/pr_195/3588

| date | job status | comment |
|------|------------|---------|
| Feb 02 08:55:45 UTC 2023 | submitted | job id 3588 awaits release by job manager |
| Feb 02 08:56:26 UTC 2023 | released | job awaits launch by Slurm scheduler |
| Feb 02 08:57:28 UTC 2023 | running | job 3588 is running |
| Feb 02 09:02:36 UTC 2023 | finished | 😢 FAILURE |

  • Found slurm output slurm-3588.out in job dir
  • Slurm output lacks message "No missing modules!".

@boegel boegel added bot:build and removed bot:build labels Feb 9, 2023
@eessi-bot

eessi-bot bot commented Feb 9, 2023

New job on instance eessi-bot-citc-aws for architecture aarch64-graviton2 in job dir /mnt/shared/home/bot/eessi-bot-software-layer/jobs/2023.02/pr_195/3607

| date | job status | comment |
|------|------------|---------|
| Feb 09 14:20:02 UTC 2023 | submitted | job id 3607 awaits release by job manager |
| Feb 09 14:20:58 UTC 2023 | released | job awaits launch by Slurm scheduler |
| Feb 09 14:22:00 UTC 2023 | running | job 3607 is running |
| Feb 09 14:28:09 UTC 2023 | finished | 😢 FAILURE |

  • Found slurm output slurm-3607.out in job dir
  • Slurm output lacks message "No missing modules!".

@boegel
Contributor Author

boegel commented Feb 16, 2023

I've reproduced the build problem on graviton2 manually, and this is the culprit for the failing build of GObject-Introspection-1.68.0-GCCcore-10.3.0.eb:

Illegal instruction (core dumped)

🤔

@ocaisa
Member

ocaisa commented Feb 16, 2023

> I've reproduced the build problem on graviton2 manually, and this is the culprit for the failing build of GObject-Introspection-1.68.0-GCCcore-10.3.0.eb:
>
> Illegal instruction (core dumped)
>
> 🤔

Hmm, that sounds familiar #196

@eessi-bot

eessi-bot bot commented Mar 17, 2023

New job on instance bot-aws-PR155 for architecture aarch64-graviton2 for repository EESSI-pilot in job dir /mnt/shared/home/bot/PR155/jobs/2023.03/pr_195/4169

| date | job status | comment |
|------|------------|---------|
| Mar 17 01:27:49 PM UTC 2023 | submitted | job id 4169 awaits release by job manager |
| Mar 17 01:28:35 PM UTC 2023 | released | job awaits launch by Slurm scheduler |
| Mar 17 01:31:40 PM UTC 2023 | running | job 4169 is running |
| Mar 17 01:34:48 PM UTC 2023 | finished | 😢 FAILURE |

  • Found slurm output slurm-4169.out in job dir
  • Slurm output lacks message "No missing modules!".

@eessi-bot

eessi-bot bot commented Mar 17, 2023

New job on instance bot-aws-PR155 for architecture x86_64-generic for repository eessi-2021.12 in job dir /mnt/shared/home/bot/PR155/jobs/2023.03/pr_195/4170

| date | job status | comment |
|------|------------|---------|
| Mar 17 01:27:51 PM UTC 2023 | submitted | job id 4170 awaits release by job manager |
| Mar 17 01:28:33 PM UTC 2023 | released | job awaits launch by Slurm scheduler |
| Mar 17 01:31:38 PM UTC 2023 | running | job 4170 is running |
| Mar 17 01:38:54 PM UTC 2023 | finished | 😁 SUCCESS tarball eessi-2021.12-software-linux-x86_64-generic-1679060321.tar.gz (0.000 GiB) in job dir |

@boegel boegel changed the base branch from main to 2021.12 June 5, 2023 08:46
@boegel
Contributor Author

boegel commented Jun 6, 2023

I may have found the culprit for the Illegal instruction error...

OpenBLAS/0.3.15-GCC-10.3.0 was not correctly built, it wasn't using the tweaked OpenBLAS easyblock from easybuilders/easybuild-easyblocks#1946, which caused DYNAMIC_ARCH=1 to not be used in the options to make.

The build of OpenBLAS/0.3.15-GCC-10.3.0 for aarch64/generic was done on a c7g.8xlarge instance (Graviton 3), so that effectively leads to an OpenBLAS/0.3.15-GCC-10.3.0 installation that is incompatible with Graviton 2...

That was meant to be fixed by #211, but it seems like the rebuild for aarch64/generic was done incorrectly, not using the required --include-easyblocks-from-pr 1946.

It's a perfect example of why the builds should be fully automated with the bot, rather than being done by a human who's prone to making mistakes...
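
A binary built for a newer CPU than the one it runs on typically dies with SIGILL, which the shell reports as "Illegal instruction". As a minimal sketch (not part of the EESSI tooling), the helper below runs a command and reports whether it was killed that way on a POSIX system. The function name `died_from_illegal_instruction` is hypothetical, and which command to probe with (for example, something that exercises the OpenBLAS installation) is left to the reader.

```python
import signal
import subprocess


def died_from_illegal_instruction(cmd):
    """Run cmd (a list of arguments) and report whether it was killed by SIGILL.

    Hypothetical helper for illustration. On POSIX systems, subprocess
    reports death by signal N as a return code of -N.
    """
    result = subprocess.run(
        cmd,
        stdout=subprocess.DEVNULL,
        stderr=subprocess.DEVNULL,
    )
    return result.returncode == -signal.SIGILL
```

Running such a probe on each target architecture (here, Graviton 2 as well as Graviton 3) before deploying a generic build would be one way to catch this class of problem early.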

@boegel boegel removed the bot:build label Aug 29, 2023
TopRichard added a commit to TopRichard/bot-software-layer1 that referenced this pull request Oct 8, 2023
…/2.27.0-gompi/2022a

{2023.06}[gompi/2022a] ecCodes V2.27.0
@bedroge
Collaborator

bedroge commented Apr 16, 2024

Since this involved manual builds and since the pilot repo is no longer supported, I'm going to close this PR.

@bedroge bedroge closed this Apr 16, 2024