-
Dear community, TL;DRIs there a way to flush an endpoint in a blocking manner without progressing the ucp worker? In-depthI'm currently struggling with an implementation of a middleware utilizing UCX and its RMA capabilities. In my current setup, I'm utilizing one-sided RC write/reads for While ucx_ep_flush_nbx seems like a potential solution, it returns a request handle which necessitates further progression of the worker. But I want to avoid a complete progression of the worker, because: As far as I understand Why am I sure that
|
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 1 reply
-
The progress should not affect scalability as we poll on CQs (one per UCT iface) and not on QPs. The performance change above can we caused by switching from RC to UD. Which HCA/NIC is being used? Can you try forcing RC transport by running with UCX_TLS=self,sm,rc also on large scale? |
Beta Was this translation helpful? Give feedback.
-
After conducting several investigations to identify the bad scaling, we've determined that the issue does not lie in ucx, but rather with our hardware. Sorry for the inconvenience and thanks for the reply. |
Beta Was this translation helpful? Give feedback.
After conducting several investigations to identify the bad scaling, we've determined that the issue does not lie in ucx, but rather with our hardware.
Sorry for the inconvenience and thanks for the reply.