Databricks - Scalable Deep Learning with TensorFlow and Apache Spark

Duration: 2 days

Industry: Information Technology

About this course

What is the Scalable Deep Learning with TensorFlow and Apache Spark course about?

This course starts with the basics of the tf.keras API including defining model architectures, optimizers, and saving/loading models. You then implement more advanced concepts such as callbacks, regularization, TensorBoard, and activation functions.

After training your models, you will integrate the MLflow tracking API to reproduce and version your experiments. You will also apply model interpretability libraries such as LIME and SHAP to understand how the network generates predictions. You will also learn about various Convolutional Neural Networks (CNNs) architectures and use them as a basis for transfer learning to reduce model training time.

Substantial class time is spent on scaling your deep learning applications, from distributed inference with pandas UDFs to distributed hyperparameter search with Hyperopt to distributed model training with Horovod. This course is taught fully in Python.

What is Google TensorFlow

According to Guru99, Google TensorFlow is an open-source end-to-end platform for creating Machine Learning applications. It is a symbolic math library that uses dataflow and differentiable programming to perform various tasks focused on the training and inference of deep neural networks. It allows developers to create machine learning applications using various tools, libraries, and community resources.

For more information about this course, please check this blog from P2L.

Who can benefit?

Data scientist
Machine learning engineer

This is what you'll learn

Build deep learning models using Keras/TensorFlow
Scale the following:
- Model inference with pandas UDFs & pandas function API
- Hyperparameter tuning with HyperOpt
- Training of distributed TensorFlow models with Horovod
Track, version, and reproduce experiments using MLflow
Apply model interpretability libraries to understand & visualize model predictions
Use CNNs (convolutional neural networks) and perform transfer learning & data augmentation to improve model performance
Deploy deep learning models

Prerequisite Skills

Intermediate experience with Python/pandas
Familiarity with machine learning concepts
Experience with PySpark

Schedule (iMVP)

Sep 29-30, 2022

Oct 27-28, 2022

Nov 17-18, 2022

Enroll