AI Infra Day | Hands-on Lab: CV Model Training with PyTorch & Alluxio on Kubernetes
October 25, 2023
By 
Lu Qiu

This hands-on session discusses best practices for using PyTorch and Alluxio during model training on AWS. Shawn and Lu provide a step-by-step demonstration of how to use Alluxio on EKS as a distributed cache to accelerate computer vision model training jobs that read datasets from S3. This architecture significantly improves the utilization of GPUs from 30% to 90%+, archives ~5x faster training, and lower cloud storage costs.

This hands-on session discusses best practices for using PyTorch and Alluxio during model training on AWS. Shawn and Lu provide a step-by-step demonstration of how to use Alluxio on EKS as a distributed cache to accelerate computer vision model training jobs that read datasets from S3. This architecture significantly improves the utilization of GPUs from 30% to 90%+, archives ~5x faster training, and lower cloud storage costs.

Video:

Presentation slides:

AI Infra Day | Hands-on Lab: CV Model Training with PyTorch & Alluxio on Kubernetes from Alluxio, Inc

Complete the form below to access the full overview:

Videos

Sign-up for a Live Demo or Book a Meeting with a Solutions Engineer