Loading…
Vancouver, BC, Canada
August 27 & 28 - Co-Located Events, Tutorials, Labs & Lightning Talks
August 29-31 - Conference
Click Here For Information & Registration
Friday, August 31 • 11:50am - 12:30pm
Deploy and Use a Multi Framework Distributed Deep Learning Platform on Kubernetes - Animesh Singh & Tommy Li, IBM

Sign up or log in to save this to your schedule and see who's attending!

Feedback form is now closed.
Training deep neural network models requires a highly tuned system with the right combination of software, drivers, compute, memory, network, and storage resources. Deep learning frameworks like TensorFlow, PyTorch, Caffe, Torch, Theano, and MXNet have contributed to the popularity of deep learning by reducing the effort and skill needed to design, train, and use deep learning models. Fabric for Deep Learning (FfDL, pronounced “fiddle”) provides a consistent way to run these deep learning frameworks as a service on Kubernetes. FfDL uses a microservices architecture to reduce coupling between components, keep each component simple and as stateless as possible, isolate component failures, and allow each component to be developed, tested, deployed, scaled, and upgraded independently.

Animesh Singh, and Tommy Li will share lessons learned while building and using FfDL and demonstrate how to leverage it to execute distributed deep learning training for models written using multiple frameworks, using GPUs and object storage constructs. They then explain how to take models from IBM’s Model Asset Exchange, train them using FfDL, and deploy them on Kubernetes for serving and inferencing.



Speakers
avatar for Tommy Li

Tommy Li

Software Developer, IBM
Tommy Li is a software developer in IBM focusing on Cloud, Kubernetes, and Machine Learning. He is one of the Fabric for Deep Learning’s main contributors and worked on various developer code patterns on Kubernetes, Microservice, and deep learning application to provide use cases... Read More →
avatar for Animesh Singh

Animesh Singh

STSM and Program Director, IBM
Animesh Singh is an STSM and works with IBM Watson and Cloud Platform, where he leads machine learning and deep learning initiatives and works with communities and customers to design and implement deep learning, machine learning, and cloud computing frameworks. He has a proven track... Read More →


Friday August 31, 2018 11:50am - 12:30pm
Room 116/117