Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

UCX blocked after .sendStreamNonBlocking(sendBuffer, new SendCallback(sendBuffer, this)); #9964

Open
pereverges opened this issue Jun 17, 2024 · 3 comments
Assignees
Labels

Comments

@pereverges
Copy link

pereverges commented Jun 17, 2024

Describe the bug

After running for the second or third time the same program it gets blocked at the .sendStreamNonBlocking(sendBuffer, new SendCallback(sendBuffer, this)); call, it is trying to access a port that is not listening. The application then gets stuck in the send call on a progress while.

Steps to Reproduce

Using multiple threads, performing the .sendStreamNonBlocking(sendBuffer, new SendCallback(sendBuffer, this)); call
Using 1.17 UCX version
UCX_TLS=tcp UCX_NET_DEVICES=lo UCX_LOG_LEVEL=DIAG

Setup and versions

I am using linux ubuntu 20.4 with JUXC

Additional information (depending on the issue)

[1718657516.879778] [verges:58764:a] wireup_cm.c:128 UCX DIAG client ep 0x7fd095044000 failed to connect to 127.0.0.1:43008 using tcp cms
[1718657516.879912] [verges:58764:a] uct_cm.c:97 UCX DIAG resolve callback failed with error: Endpoint is not connected
[1718657516.880768] [verges:58764:0] ucp_ep.c:1768 UCX WARN disconnect failed: Endpoint is not connected
UCX: ERROR org.openucx.jucx.UcxException: Endpoint is not connected

@pereverges pereverges added the Bug label Jun 17, 2024
@gleon99
Copy link
Contributor

gleon99 commented Jun 18, 2024

Hi @pereverges
please add more information, according to our bug report template.

@pereverges
Copy link
Author

I have added more information.

@evgeny-leksikov
Copy link
Contributor

hi @pereverges , thanks for reporting! can you share a test application which reproduces the issue?

@evgeny-leksikov evgeny-leksikov self-assigned this Jul 17, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

3 participants