-
Notifications
You must be signed in to change notification settings - Fork 48
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Optimize RPC on UCX #1
Comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Currently a pingpong RPC on UCX/IB costs 20-30us. We expect to optimize it to ~5us.
The main overhead is coroutine switch: Due to the limitation of UCX, all UCX operations must run on a single thread. So a send request have to be sent from user coroutine to the network coroutine, and a received message have to be sent from the network coroutine to user coroutine. A RPC handler can be directly called from the network coroutine, but now it has 2 extra switches.
The text was updated successfully, but these errors were encountered: