
Databricks - Scalable Deep Learning with TensorFlow and Apache Spark
Duration: 2 days
Industry: Information Technology
About this course
What is the Scalable Deep Learning with TensorFlow and Apache Spark course about?
This course starts with the basics of the tf.keras API including defining model architectures, optimizers, and saving/loading models. You then implement more advanced concepts such as callbacks, regularization, TensorBoard, and activation functions.
After training your models, you will integrate the MLflow tracking API to reproduce and version your experiments. You will also apply model interpretability libraries such as LIME and SHAP to understand how the network generates predictions. You will also learn about various Convolutional Neural Networks (CNNs) architectures and use them as a basis for transfer learning to reduce model training time.
Substantial class time is spent on scaling your deep learning applications, from distributed inference with pandas UDFs to distributed hyperparameter search with Hyperopt to distributed model training with Horovod. This course is taught fully in Python.
What is Google TensorFlow
According to Guru99, Google TensorFlow is an open-source end-to-end platform for creating Machine Learning applications. It is a symbolic math library that uses dataflow and differentiable programming to perform various tasks focused on the training and inference of deep neural networks. It allows developers to create machine learning applications using various tools, libraries, and community resources.
For more information about this course, please check this blog from P2L.
Who can benefit?
- Data scientist
- Machine learning engineer
This is what you'll learn
- Build deep learning models using Keras/TensorFlow
- Scale the following:
- Model inference with pandas UDFs & pandas function API
- Hyperparameter tuning with HyperOpt
- Training of distributed TensorFlow models with Horovod
- Track, version, and reproduce experiments using MLflow
- Apply model interpretability libraries to understand & visualize model predictions
- Use CNNs (convolutional neural networks) and perform transfer learning & data augmentation to improve model performance
- Deploy deep learning models
Prerequisite Skills
- Intermediate experience with Python/pandas
- Familiarity with machine learning concepts
- Experience with PySpark
Schedule (iMVP)
Sep 29-30, 2022
Oct 27-28, 2022
Nov 17-18, 2022