Deep Learning Reference Stack Release Notes

The Deep Learning Reference Stack is an integrated, highly-performant open source stack optimized for Intel® Xeon® Scalable platforms. This open source community release is part of our effort to ensure AI developers have easy access to all of the features and functionality of the Intel platforms. The Deep Learning Reference Stack is highly-tuned and built for cloud native environments. With this stack, we are enabling developers to quickly prototype by reducing the complexity associated with integrating multiple software components, while still giving users the flexibility to customize their solutions. This version includes additional components to provide greater flexibility and a more comprehensive take on the deep learning environment.

To offer more flexibility, we are releasing multiple versions of the Deep Learning Reference Stack. We offer versions built on either Clear Linux OS or Ubuntu* to meet differing needs.

The Deep Learning Reference Stack Release built on Clear Linux OS

Note: The minimum validated version of Clear Linux for this stack is 32690.

The Deep Learning Reference Stack with Tensorflow 2.2, oneDNN and Intel® AVX512-Deep Learning Boost built on Clear Linux OS

The dlrs-tensorflow2-clearlinux version v0.6.0 includes:

Clear Linux* OS version 32690
Runtimes (Python 3)
TensorFlow 2.2.0-rc0 optimized using oneAPI Deep Neural Network Library (oneDNN) primitives and Intel® AVX-512 Deep Learning Boost (Formerly Intel® VNNI)
Intel OpenVINO™ model server v2020.1
Transformers - State-of-the-art Natural Language Processing for TensorFlow 2.2.0-rc0
Jupyterhub 1.1.0
Seldon-core 1.0.1

The Deep Learning Reference Stack with Tensorflow 1.15, oneDNN and Intel® AVX512-Deep Learning Boost built on Clear Linux OS

The dlrs-tensorflow-clearlinux version v0.6.0 includes:

Clear Linux* OS version 32690
Runtimes (Python 3)
TensorFlow 1.15 optimized using oneAPI Deep Neural Network Library (oneDNN) primitives and Intel® AVX-512 Deep Learning Boost (Formerly Intel® VNNI)
Intel OpenVINO™ model server v2020.1
Jupyterhub 1.1.0
Seldon-core 1.0.1

The Deep Learning Reference Stack with Eigen built on Clear Linux OS

The dlrs-tensorflow-clearlinux:v0.6.0-oss version v0.6.0 includes:

Clear Linux* OS version 32690
TensorFlow 2.0 compiled with AVX2 and AVX512 optimizations
Runtimes (Python 3)
Jupyter-notebook 6.0.3
Seldon-core 1.0.1

The Deep Learning Reference Stack with PyTorch and Intel® MKL built on Clear Linux OS

The dlrs-pytorch-clearlinux version v0.6.0 includes:

Clear Linux* OS version 32690
Runtimes (Python 3)
PyTorch version 1.4.0 optimized using Intel® Math Kernel Library, oneAPI Deep Neural Network Library (oneDNN) primitives and Intel® AVX-512 Deep Learning Boost (Formerly Intel® VNNI)
PyTorch lightning version v0.6.0
Transformers - State-of-the-art Natural Language Processing for PyTorch 1.14
Flair for Natural Language Processing version v0.4.5
Jupyterhub 1.1.0
Seldon-core 1.0.1

The Deep Learning Reference Stack with PyTorch built on Clear Linux OS

The dlrs-pytorch-clearlinux:v0.6.0-oss version v0.6.0 includes:

Clear Linux* OS version 32690
Runtimes (Python 3)
PyTorch version 1.4.0 with OpenBLAS version 0.3.9
Jupyter-notebook 6.0.3
Seldon-core 1.0.1

Deep Learning Compilers built on Clear Linux OS

The dlrs-ml-compiler-clearlinux version v0.6.0 includes:

Clear Linux* OS version 32690
TVM version 0.6 optimized using the Intel® Math Kernel Library
Jupyter-notebook 6.0.3

The Deep Learning Reference Stack Release built on Ubuntu

The Deep Learning Reference Stack TensorFlow 2.2.0 with oneDNN primitives, Intel® MKL, Intel® DLBoost and OpenVINO™ - Deep Learning Deployment Toolkit v2020.1 (TBB) built on Ubuntu

The dlrs-tensorflow2-ubuntu version v0.6.1 includes:

Ubuntu 20.04
Runtimes (Python 3)
TensorFlow 2.2.0 optimized using oneAPI Deep Neural Network Library (oneDNN) primitives and Intel® AVX-512 Deep Learning Boost (Formerly Intel® VNNI)
Intel OpenVINO™ model server v2020.1
Transformers - State-of-the-art Natural Language Processing for TensorFlow 2.2.0
Jupyterhub 1.1.0
Seldon-core 1.0.1

The Deep Learning Reference Stack with Tensorflow 1.15.2, Intel® MKL-DNN and Intel® AVX512-Deep Learning Boost (v0.6.1) built on Ubuntu

The dlrs-tensorflow-ubuntu version v0.6.1 includes:

Ubuntu 20.04
Runtimes (Python 3)
TensorFlow 1.15.2 optimized using oneAPI Deep Neural Network Library (oneDNN) primitives and Intel® AVX-512 Deep Learning Boost (Formerly Intel® VNNI)
Intel OpenVINO™ model server v2020.1
Jupyterhub 1.1.0
Seldon-core 1.0.1

The Deep Learning Reference Stack with PyTorch built on Clear Linux OS

The dlrs-pytorch-ubuntu version v0.6.1 includes:

Ubuntu 20.04
Runtimes (Python 3)
PyTorch version 1.4.0
Jupyterhub 1.1.0
Seldon-core 1.0.1

How to get the Deep Learning Reference Stack

The official Deep Learning Reference Stack Docker images are hosted at: https://hub.docker.com/r/sysstacks/.

Note: The System Stacks team is transitioning into a new organization in Github and Dockerhub, please note all images are now under the sysstacks namespace.

Clear Linux OS versions

Pull from the Tensorflow 1.15 and oneDNN version
Pull from the Tensorflow 2.2.0-rc0 and oneDNN version
Pull from the Tensorflow with Eigen version
Pull from the PyTorch with oneDNN and Intel® MKL version
Pull from the PyTorch with OpenBLAS version
Pull from the ML Compiler with Intel® MKL version

Ubuntu versions

Pull from the TensorFlow 2.2.0 with oneDNN primitives version
Pull from the Tensorflow 1.15, Intel® MKL-DNN and Intel® AVX512-Deep Learning Boost version
Pull from the PyTorch and Intel® MKL version

Note: To take advantage of the AVX-512, and AVX-512 Deep Learning Boost functionality with the Deep Learning Reference Stack, please use the following hardware: * AVX 512 images require an Intel® Xeon® Scalable Platform * AVX-512 Deep Learning Boost requires a Second-Generation Intel® Xeon® Scalable Platform

Licensing

The Deep Learning Reference Stack is guided by these Terms of Use. The Docker images are hosted on https://hub.docker.com and as with all Docker images, these likely also contain other software which may be under other licenses (such as Bash, etc. from the base distribution, along with any direct or indirect dependencies of the primary software being contained).

Working with the Deep Learning Reference Stack

The Deep Learning Reference Stack includes TensorFlow and Kubeflow support. These software components were selected because they are most popular/widely used by developers and CSPs. Clear Linux provides optimizations across the entire OS stack for the ultimate end user performance and is customizable to meet your unique needs. TensorFlow was selected as it is the leading deep learning and machine learning framework. oneAPI Deep Neural Network Library (oneDNN) is an open source performance library for Deep Learning (DL) applications intended for acceleration of DL frameworks on Intel® architecture. Intel® oneDNN includes highly vectorized and threaded building blocks to implement convolutional neural networks (CNN) with C and C++ interfaces. Kubeflow is a project that provides a straightforward way to deploy simple, scalable and portable Machine Learning workflows on Kubernetes. This combination of an operating system and the deep learning framework and libraries results in a performant deep learning software stack.

Please refer to the Deep Learning Reference Stack guide for detailed instructions for running the TensorFlow and Kubeflow Benchmarks on the docker images.

Note: Although the DLRS images and dockerfiles may be modified for your needs, there are some modifications that may cause unexpected or undesirable results. For example, using the Clear Linux swupd bundle-add command to add packages to a Clear Linux based container may overwrite the DLRS core components. Please use care when modifying the contents of the containers. A listing of the core components can be found in the release notes for each release.

Performance tuning configurations

Single Node Configuration

Key	Value
num_inter_threads	Socket number
num_intra_threads	Physical cores number
data_format	NHWC for Eigen; NCHW for MKL as MKL is optimized for this format

Example: For Intel® Xeon® Gold 6140 CPU @ 2.30GHz with 2 Sockets and 18 Cores/Socket MKL training with batch size 32:

  python tf_cnn_benchmarks.py --device=cpu --mkl=True --nodistortions --model=resnet50 --data_format=NCHW --batch_size=32 --num_inter_threads=2 --num_intra_threads=36 --data_dir=/imagenet-TFrecord --data_name=imagenet

Multi-node Configuration

The Deep Learning Reference Stack can be used in a multi node configuration using Kubernetes and different machine learning frameworks and libraries. Please refer to the System Stacks usecases repo to find examples on how to setup the Deep Learning Reference Stack with Kubeflow for multi-node.

For multi-node training please refer to:

Contributing to the Deep Learning Reference Stack

We encourage your contributions to this project, through the established Clear Linux community tools. Our team uses typical open source collaboration tools that are described on the Clear Linux community page.

Reporting Security Issues

If you have discovered potential security vulnerability in an Intel product, please contact the iPSIRT at secure@intel.com.

It is important to include the following details:

The products and versions affected
Detailed description of the vulnerability
Information on known exploits

Vulnerability information is extremely sensitive. The iPSIRT strongly recommends that all security vulnerability reports sent to Intel be encrypted using the iPSIRT PGP key. The PGP key is available here: https://www.intel.com/content/www/us/en/security-center/pgp-public-key.html

Software to encrypt messages may be obtained from:

PGP Corporation
GnuPG