Vancouver, BC, Canada
August 27 & 28 - Co-Located Events, Tutorials, Labs & Lightning Talks
August 29-31 - Conference
Click Here For Information & Registration
Tuesday, August 28 • 5:25pm - 5:30pm
Lightning Talk: RDMA Programming Design and Case Studies - For Better Performance Distributed Applications - Yoshiro Yamabe, NTT

Sign up or log in to save this to your schedule, view media, leave feedback and see who's attending!

Feedback form is now closed.
Remote Direct Memory Access (RDMA) is a network protocol that offers low latency, high bandwidth, and low CPU utilization. So recently, RDMA has been expected to improve performance of applications which need frequent network communications, and there are several attempts applying RDMA to OSS. For instance, Tensorflow (deep learning framework) and Apache Spark (data analytics platform) have RDMA implementations.

However, RDMA programming offers many decisions for programmers which may affect performance (e.g. RDMA operation, application's data layout on memory region). Depending on them, extra overheads such as memcpy() will occur and reduce RDMA's benefits. To make matters more complex, they rely on target application's features.

This presentation shows overviews of RDMA, choices of design, and example of applying RDMA to MXNet, an OSS deep learning framework, by using Infiniband and its native API, ibverbs.


Yoshiro Yamabe

Software engineer, NTT
Yoshiro Yamabe is a software engineer at NTT Laboratories. He has been investigating RDMA and tries to apply RDMA to OSS (MXNet).

Tuesday August 28, 2018 5:25pm - 5:30pm PDT
West Ballroom A/B