Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[v1.21.x] contrib/intel/jenkins Cherry Pick Intel CI update from main #10547

Merged
merged 7 commits into from
Nov 16, 2024

Conversation

zachdworkin
Copy link
Contributor

No description provided.

fi_rdm_tagged_peek is failing on cleanup path.
ft_free_res() -> ft_close_fids() -> fi_close() -> ucx_ep_close()
-> ucp_worker_destroy() -> ucp_worker_discard_uct_ep_progress()
-> ucp_ep_destroy_base() -> __funlockfile()

The reported error is:
"Segmentation fault: address not mapped to object at address 0x8"

This is a race condition and does not occur every time.
To reproduce run:
server: fi_rdm_tagged_peek -p ucx -E
client: fi_rdm_tagged_peek -p ucx -E server_address

Issue 10126 is tracking this bug. Re-enable this test when it is resolved.

Signed-off-by: Zach Dworkin <[email protected]>
Add health check stage to reboot nodes as needed if they
are stuck in a bad state from previous jobs.

Signed-off-by: Zach Dworkin <[email protected]>
New CI will be in charge of building everything that
needs to be built that it is in charge of running.
Libfabric, Fabtests, SHMEM.

New CI prefers a tarball to build from so we will checkout
scm from the plugin, tar it, and then feed it to new CI to use.

All builds will eventually move from middlewares to pre-build.

Signed-off-by: Zach Dworkin <[email protected]>
Make sure code is rebased on target branch before running.
Re-order git commands to not need adding remote several times

Signed-off-by: Zach Dworkin <[email protected]>
Separate tcp and tcp;ofi_rxm testing so that both providers
are covered. Previously tcp was testing everything and
picking up rxm as needed. Instead it is better practice to
separate them and just test both usecases entirely. Update
both tcp ubertest lists to correctly run only supported tests.

Signed-off-by: Zach Dworkin <[email protected]>
@zachdworkin
Copy link
Contributor Author

@j-xiong this is good to merge whenever its convenient

@j-xiong j-xiong merged commit deead6e into ofiwg:v1.21.x Nov 16, 2024
10 of 11 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants