Intel® Neural Compressor Bench

Intel® Neural Compressor Bench is a web application for easier use of Intel® Neural Compressor.

Table of Contents

  1. Introduction

  2. Home Screen

  3. Create New Project

    3.1. Optimization Tab

    3.2. Benchmark Tab

    3.3. Profiling Tab

    3.4. Diagnosis Tab

    3.5. Dataset Tab

    3.6. Project Information

  4. System Information

  5. Security

Introduction

Install Intel® Neural Compressor with Bench

To install Install Intel® Neural Compressor with GUI install full version of Intel® Neural Compressor in one of following ways:

Option 1: Install from Binary

# install stable full version from pip (including GUI)
pip install neural-compressor-full

# install nightly full version from pip (including GUI)
pip install -i https://test.pypi.org/simple/ neural-compressor-full

# install stable full version from from conda (including GUI)
conda install neural-compressor-full -c conda-forge -c intel  

Option 2: Install from Source

git clone https://github.com/intel/neural-compressor.git
cd neural-compressor
pip install -r requirements.txt
# build with full functionality (including GUI)
python setup.py --full install

Start the Intel® Neural Compressor Bench

To start the Intel® Neural Compressor Bench server execute inc_bench command:

inc_bench

The server generates a self-signed TLS certificate and prints instruction how to access the Web UI.

Intel(r) Neural Compressor Bench Server started.

Open address https://10.11.12.13:5000/?token=338174d13706855fc6924cec7b3a8ae8

Server generated certificate is not trusted by your web browser, you will need to accept usage of such certificate.

You might also use additional parameters and settings:

  • Intel® Neural Compressor Bench listens on port 5000. Make sure that port 5000 is accessible to your browser (you might need to open it in your firewall), or specify different port that is already opened, for example 8080:

    inc_bench -p 8080
    
  • When using official TF>=2.6.0, set environment variable TF_ENABLE_ONEDNN_OPTS=1 for INT8 tuning:

    TF_ENABLE_ONEDNN_OPTS=1 inc_bench
    
  • To start the Intel® Neural Compressor Bench server with your own TLS certificate add --cert and --key parameters:

    inc_bench --cert path_to_cert.crt --key path_to_private_key.key
    
  • To start the Intel® Neural Compressor Bench server without TLS encryption use --allow-insecure-connections parameter:

    inc_bench --allow-insecure-connections
    

    This enables access to the server from any machine in your local network (or the whole Internet if your server is exposed to it).

    You are forfeiting security, confidentiality and integrity of all client-server communication. Your server is exposed to external threats.

Home Screen

This view shows introduction to Intel® Neural Compressor Bench and a button for creating new project. After clicking this button, pop-up with project wizard will be shown.

Home

Create New Project

To create a new project, in first step you need to choose its name.

Project1

In second step there are 2 possible options to choose from:

  • predefined model - you choose model from predefined examples list, you don’t need to set any additional parameters,

  • custom model - in this scenario you can set more parameters and customize your model.

Project2

Predefined Model

First you need to choose domain for the model (image recognition or object detection). For each domain there are few available models to choose from. When you click Finish the chosen model will be downloaded.

Project3-predefined

Custom Model

First you have to choose the model path. When it is chosen, in most cases all other fields will be completed automatically. You can edit its input and output nodes, see the model graph (if it is available for this model) and set shape for synthetic dataset. If model domain was not detected, you need to choose it from the list. Model domain is used to set some default parameters for the model.

Project3-custom

Display Model Graph

For several model types there will be a button available Show graph in the project wizard. It is also possible to see the graph in Diagnosis tab. The graph by default is collapsed, but when you click on plus icon, sections will be unfolded.

Bert model graph.

Project List

On the left hand side there is a panel with list of created projects. When you click on the project name, you can see its details. “Create new project” button navigates to new project wizard pop-up described in previous section.

Menu

Remove Project

If you want to remove project, you have to click the trash icon next to project name (it is visible when the cursor is on the project name).

Remove1

Then you will be prompted to confirm your choice by typing the project name. Project removal is not reversible.

Remove2

Develop Project

Optimization Tab

Optimization Table

In Optimizations tab you can see list of optimizations in the project. Currently UI supports three optimization precisions and two types of optimization. Optimizations-table

Optimization Wizard

To add new optimization, click “Add new optimization” button at the bottom of the table and follow the steps. Optimizations-wizard

Editing Optimization Entries

There is a possibility to modify some optimization parameters even after exit from Wizard. If optimization has not been run yet, the pencil icon on the right hand side should be in light blue color. That indicates that it can be modified. After click on that pencil icon you can select different precision or dataset.

For Quantization you can also modify Tuning details before optimizing model. Optimization-tuning-details

Optimization Details

To perform optimization click “Run” button. Once process is finished you can click on row with specific optimization to display details about optimization parameters and optimized model. When you click on blue arrow icon in model path line, you can download optimized model. Optimization-details

Pruning

To add pruning, click “Add new optimization” and in the Precision step choose fp32 and then check the pruning option. Choose the dataset and click Finish. Pruning-wizard Pruning parameters are visible after in Optimization details section described before. Pruning-details Click Run button in the optimization table to execute the pruning. When the pruning is finished, path to the optimized model can be found in the details section.

Benchmark Tab

Benchmark Table

For each optimization and input model you can add benchmark. Benchmark have 2 modes: accuracy and performance. In benchmark tab you can see all your benchmarks. When you check checkboxes in the last column you can choose benchmark you want to compare in the chart (visible after clicking “Compare selected”).

Benchmarks-table

Benchmark Wizard

To add new benchmark, click “Add new benchmark” button at the bottom of the table and follow the steps.

Benchmarks-wizard

Editing Benchmark Entries

As for optimizations you can also modify benchmark parameters. You can modify benchmark mode, dataset and benchmark parameters like batch size, number of instances and number of cores per instance. Benchmark-edit-wizard

Benchmark Details

When the benchmark is added, you can click “Run” button to execute it. Results will be filled in the table and in details view visible after clicking row in the table. You can also see config and output logs when clicking links highlighted in blue.

Benchmark-details

Profiling Tab

Profiling Table

It is also possible to do profiling of all Tensorflow frozen models in project. Profiling-table

Profiling Wizard

To profile model, click “Add new profiling” button at the bottom of the table and follow the steps. Profiling-wizard

Editing Profiling Entries

In Profiling tab you can edit dataset and number or threads. Profiling-edit-wizard

Profiling Details

Once profiling entry is added, you can click “Run” button to execute it. After completing the process, the results will appear in the form of a bar chart and a table with full profiling data. The table is also used to control which operations are included in the chart. Check the box next to the selected row and click “Update chart” button to include it in the bar chart. Click “Download .csv file” button to get profiling data in .csv file. Profiling-details

Diagnosis Tab

Diagnosis tab offers convenient debug information for optimizations with easy way for generating new one with requested variations.

To get OP list you need to execute quantization optimization and select optimized model on left hand side. In OP table you can see list of OPs with MSE and min/max activation values. Selecting one of OP in table highlights its position in graph. Configuration for currently selected OP can be set in section under OP table.

Diagnosis-tab

You can set model wise parameters that apply to whole model by clicking button with “Model wise”. When you set specific configuration you can view summary and generate new optimization config.

Diagnosis-actions

Model wise configuration provides separate settings for weights and activations. Diagnosis-model-wise-wizard

Dataset Tab

Dataset List

Dataset tab presents list of datasets assigned to a project. In most cases the “dummy” dataset consisting of synthetic data should be automatically added while creating a project. Datasets-table

Dataset Wizard

New dataset can be defined by clicking “Add new profiling” button at the bottom of the table and follow the steps. Datasets-wizard

Dataset Details

Dataset details can be inspected by clicking specific row. Dataset-details

Custom Dataset

When adding the dataset, you can choose custom in dataloader and metric field. In that case a template file will be created. The path to the template file will be available in dataset details. You should edit this file to add your custom configuration before using this dataset in optimizations or benchmarks. Small yellow warning will remind about it.

Custom dataset

Project Information

Last tab is called “Project info”. You can find here details about the project, when it was created and modified, what is the framework and some details about input model. It is also possible to add some notes about the project.

Project info

System Information

One can see system information by clicking System info button. The result is details dialog:

System info table

Security

Intel® Neural Compressor Bench uses encrypted connections to ensure security, confidentiality and integrity of all client-server communication.

You can use automatically generated self-signed certificate or provide your own trusted certificate.

You can also choose to start the server without encryption exposing it to threats from network.

Intel® Neural Compressor Bench uses external packages to run the web-server and provide encryption. Please report any security issues to correct organizations: