cnncodegen

Generate code for a deep learning network to target the ARM Mali GPU

Syntax

cnncodegen(net,'targetlib','arm-compute-mali')

cnncodegen(net,'targetlib','arm-compute-mali',targetparams)

Description

cnncodegen(net,'targetlib','arm-compute-mali') generates C++ code for the specified network object by using the ARM^® Compute Library for Mali GPUs.

Requires the GPU Coder™ product and the GPU Coder Interface for Deep Learning.

cnncodegen(net,'targetlib','arm-compute-mali',targetparams) generates C++ code for the specified network object by using the ARM Compute Library for Mali GPUs with additional code generation options.

example

Examples

collapse all

Generate C++ Code for a Pretrained Network to Run on an ARM Processor

Use cnncodegen to generate C++ code for a pretrained network for deployment to an ARM Mali graphics processor.

Get the pretrained GoogLeNet model by using the googlenet (Deep Learning Toolbox) function. This function requires the Deep Learning Toolbox™ Model for GoogLeNet Network. If you have not installed this support package, the function provides a download link. Alternatively, see https://www.mathworks.com/matlabcentral/fileexchange/64456-deep-learning-toolbox-model-for-googlenet-network.

net = googlenet;

Generate code by using cnncodegen with 'targetlib' set to 'arm-compute-mali'. By default, the code generator targets version '19.05' of the ARM. To target a different version of the Compute Library, use the 'ArmComputeVersion' parameter.

cnncodegen(net,'targetlib','arm-compute-mali'...
,'targetparams',struct('ArmComputeVersion','19.02'));

------------------------------------------------------------------------
Compilation suppressed: generating code only.
------------------------------------------------------------------------
### Codegen Successfully Generated for arm device

The code generator generates the .cpp and header files in the '/pwd/codegen' folder. The DAG network is generated as a C++ class called CnnMain, containing an array of 87 layer classes. The code generator reduces the number of layers is by layer fusion optimization of convolutional and batch normalization layers. The setup() method of this class sets up handles and allocates resources for each layer object. The predict() method invokes prediction for each of the 87 layers in the network. The cleanup() method releases all the memory and system resources allocated for each layer object. All the binary weights (cnn_**_w) and the bias files (cnn_**_b) for the convolution layers of the network are stored in the codegen folder.

To build the library, move the generated code to the ARM target platform and use the generated makefile cnnbuild_rtw.mk.

Input Arguments

collapse all

`net` — Pretrained deep learning network object
character vector | string scalar

Pretrained SeriesNetwork or DAGNetwork object.

Note

cnncodegen does not support dlnetwork objects.

`targetparams` — Library-specific parameters
structure

ARM Compute Library-specific parameters specified as a 1-by-1 structure containing the fields described in these tables.

Field	Description
`ArmComputeVersion`	Version of ARM Compute Library on the target hardware, specified as `'19.02'` or `'19.05'`. The default value is `'19.05'`. If you set `ArmComputeVersion` to a version later than `'19.05'`, `ArmComputeVersion` is set to `'19.05'`.

Version History

Introduced in R2017b

expand all

R2021a: Changes to Target Library Support

From R2021b, the cnncodegen function generates C++ code and makefiles to build a static library for only the ARM Mali GPU processor by using the ARM Compute Library for computer vision and machine learning.

For all other targets, use the codegen command. Write an entry-point function in MATLAB^® that uses the coder.loadDeepLearningNetwork function to load a deep learning model and calls predict (Deep Learning Toolbox) to predict the responses. For example,

function out = googlenet_predict(in) %#codegen

persistent mynet;

if isempty(mynet)
    mynet = coder.loadDeepLearningNetwork('googlenet');
end

% pass in input   
out = predict(mynet,in);

This table shows some typical usages of cnncodegen and how to update your code to use codegen instead.

Target workflow Not recommended Recommended

Target workflow	Not recommended	Recommended
ARM CPU processor supporting `NEON` instructions	Set the `'targetlib'` parameter to `'arm-compute'`. Specify the ARM Compute Library version to generate code for and the ARM architecture on the target hardware by using the `'targetparams'` parameter. cnncodegen(net,'targetlib'... ,'arm-compute','targetparams' ... ,struct('ArmComputeVersion'... ,'19.02','ArmArchitecture'... ,'armv8')) Other supported versions of ARM Compute Library are `'18.11'`, `'19.02'`, `'19.05'`, or `'20.02.1'`. The default value is `'20.02.1'`. If you set `ArmComputeVersion` to a version later than `'20.02.1'`, `ArmComputeVersion` is set to `'20.02.1'` You can specify the ARM architecture as `'armv7` or `'armv8'`. The specified architecture must be the same as the architecture for the ARM Compute Library on the target hardware.	Create a `coder.config` configuration object for generation of a static library. cfg = coder.config('lib'); cfg.TargetLang = 'C++'; Create a `coder.ARMNEONConfig` deep learning configuration object. Specify target library-specific properties of the deep learning configuration object. Assign it to the `DeepLearningConfig` property of the `cfg` configuration object. dlcfg = coder.DeepLearningConfig ... ('arm-compute'); dlcfg.ArmArchitecture = 'armv8'; dlcfg.ArmComputeVersion = '19.02'; cfg.DeepLearningConfig = dlcfg; Use the `-config` option of the `codegen` function to specify the `cfg` configuration object. The `codegen` function must determine the size, class, and complexity of MATLAB function inputs. Use the `-args` option to specify the size of the input to the entry-point function. arg = {ones(224,224,3,'single')}; codegen -args arg ... -config cfg googlenet_predict For more information, see Code Generation for Deep Learning Networks with ARM Compute Library.
NVIDIA^® GPUs by using the CUDA^® Deep Neural Network library (cuDNN)	Set the `'targetlib'` parameter to `'cudnn'`. Specify cuDNN library-specific properties by using the `'targetparams'` parameter. cnncodegen(net,'targetlib'... ,'cudnn','ComputeCapability'... ,'7.0','targetparams' ... ,struct('AutoTuning',true ... ,'DataType','INT8'... ,'CalibrationResultFile' ... 'myInt8Cal.mat')) The auto tuning feature allows the cuDNN library to find the fastest convolution algorithms. The `'DataType'` parameter specifies the precision of the inference computations in supported layers. When performing inference in 32-bit floats, use `'FP32'`.	Create a `coder.gpuConfig` configuration object for generation of a static library. cfg = coder.gpuConfig('lib'); cfg.TargetLang = 'C++'; To set the minimum compute capability for code generation, use the `ComputeCapability` property of the GPU code configuration object. cfg.GpuConfig.ComputeCapability = '7.0'; Create a `coder.CuDNNConfig` deep learning configuration object. Specify target library-specific properties of the deep learning configuration object. Assign it to the `DeepLearningConfig` property of the `cfg` configuration object. dlcfg = coder.DeepLearningConfig('cudnn'); dlcfg.AutoTuning = true; dlcfg.DataType = 'int8'; dlcfg.CalibrationResultFile = 'myInt8Cal.mat'; cfg.DeepLearningConfig = dlcfg; Use the `-config` option of the `codegen` function to specify the `cfg` configuration object. The `codegen` function must determine the size, class, and complexity of MATLAB function inputs. Use the `-args` option to specify the size of the input to the entry-point function. arg = {ones(224,224,3,'single')}; codegen -args arg ... -config cfg googlenet_predict For more information, see Code Generation for Deep Learning Networks by Using cuDNN.
Intel^® CPU processor	To use the Intel Math Kernel Library for Deep Neural Networks (MKL-DNN) for Intel CPUs, set the `'targetlib'` parameter to `'mkldnn'`. cnncodegen(net,'targetlib'... ,'mkldnn');	Create a `coder.config` configuration object for generation of a static library. cfg = coder.config('lib'); cfg.TargetLang = 'C++'; Create a `coder.MklDNNConfig` deep learning configuration object. Assign it to the `DeepLearningConfig` property of the `cfg` configuration object. dlcfg = coder.DeepLearningConfig... ('mkldnn'); cfg.DeepLearningConfig = dlcfg; Use the `-config` option of the `codegen` function to specify the `cfg` configuration object. The `codegen` function must determine the size, class, and complexity of MATLAB function inputs. Use the `-args` option to specify the size of the input to the entry-point function. arg = {ones(224,224,3,'single')}; codegen -args arg ... -config cfg googlenet_predict For more information, see Code Generation for Deep Learning Networks with MKL-DNN.
NVIDIA GPUs by using NVIDIA TensorRT™, a high performance deep learning inference optimizer and run-time library	Set the `'targetlib'` parameter to `'tensorrt'`. Specify TensorRT library-specific properties by using the `'targetparams'` parameter. cnncodegen(net,'targetlib'... ,'tensorrt','ComputeCapability'... ,'7.0','targetparams' ... ,struct('DataType','INT8' ... 'DataPath','image_dataset'... ,'NumCalibrationBatches',50))	Create a `coder.gpuConfig` configuration object for generation of a static library. cfg = coder.gpuConfig('lib'); cfg.TargetLang = 'C++'; To set the minimum compute capability for code generation, use the `ComputeCapability` property of the GPU code configuration object. cfg.GpuConfig.ComputeCapability = '7.0'; Create a `coder.TensorRTConfig` deep learning configuration object. Specify target library-specific properties of the deep learning configuration object. Assign it to the `DeepLearningConfig` property of the `cfg` configuration object. dlcfg = coder.DeepLearningConfig... ('cudnn'); dlcfg.DataType = 'int8'; dlcfg.DataPath = 'image_dataset'; dlcfg.NumCalibrationBatches = 50; cfg.DeepLearningConfig = dlcfg; Use the `-config` option of the `codegen` function to specify the `cfg` configuration object. The `codegen` function must determine the size, class, and complexity of MATLAB function inputs. Use the `-args` option to specify the size of the input to the entry-point function. arg = {ones(224,224,3,'single')}; codegen -args arg ... -config cfg googlenet_predict For more information, see Deep Learning Prediction with NVIDIA TensorRT Library.
General options	Generate code without generating and building a makefile. For example, cnncodegen(net,'targetlib' ... ,'mkldnn','codegenonly',1);	To produces the source code without invoking the make command or build object code, use the `GenCodeOnly` property of the `coder.CodeConfig` or `coder.GPUCodeConfig` object. For example, cfg = coder.codeConfig('lib'); cfg.GenCodeOnly = true;
Specifying the NVIDIA GPU compute capability to compile for. Argument takes the format of `major#.minor#`. cnncodegen(net,'targetlib'... ,'cudnn','ComputeCapability','7.0');	To set the minimum compute capability for code generation, use the `ComputeCapability` property of the GPU code configuration object. cfg = coder.gpuConfig('lib'); cfg.GpuConfig.ComputeCapability = '7.0';

ARM CPU processor supporting NEON instructions

Set the 'targetlib' parameter to 'arm-compute'. Specify the ARM Compute Library version to generate code for and the ARM architecture on the target hardware by using the 'targetparams' parameter.

cnncodegen(net,'targetlib'...
,'arm-compute','targetparams' ...
,struct('ArmComputeVersion'...
,'19.02','ArmArchitecture'...
,'armv8'))

Other supported versions of ARM Compute Library are '18.11', '19.02', '19.05', or '20.02.1'. The default value is '20.02.1'. If you set ArmComputeVersion to a version later than '20.02.1', ArmComputeVersion is set to '20.02.1'

You can specify the ARM architecture as 'armv7 or 'armv8'. The specified architecture must be the same as the architecture for the ARM Compute Library on the target hardware.

Create a coder.config configuration object for generation of a static library.

cfg = coder.config('lib');
cfg.TargetLang = 'C++';

Create a coder.ARMNEONConfig deep learning configuration object. Specify target library-specific properties of the deep learning configuration object. Assign it to the DeepLearningConfig property of the cfg configuration object.

dlcfg = coder.DeepLearningConfig ...
('arm-compute');
dlcfg.ArmArchitecture = 'armv8';
dlcfg.ArmComputeVersion = '19.02';
cfg.DeepLearningConfig = dlcfg;

Use the -config option of the codegen function to specify the cfg configuration object. The codegen function must determine the size, class, and complexity of MATLAB function inputs. Use the -args option to specify the size of the input to the entry-point function.

arg = {ones(224,224,3,'single')};
codegen -args arg ...
-config cfg googlenet_predict

For more information, see Code Generation for Deep Learning Networks with ARM Compute Library.

NVIDIA^® GPUs by using the CUDA^® Deep Neural Network library (cuDNN)

Set the 'targetlib' parameter to 'cudnn'. Specify cuDNN library-specific properties by using the 'targetparams' parameter.

cnncodegen(net,'targetlib'...
,'cudnn','ComputeCapability'...
,'7.0','targetparams' ...
,struct('AutoTuning',true ...
,'DataType','INT8'...
,'CalibrationResultFile' ...
'myInt8Cal.mat'))

The auto tuning feature allows the cuDNN library to find the fastest convolution algorithms.

The 'DataType' parameter specifies the precision of the inference computations in supported layers. When performing inference in 32-bit floats, use 'FP32'.

Create a coder.gpuConfig configuration object for generation of a static library.

cfg = coder.gpuConfig('lib');
cfg.TargetLang = 'C++';

To set the minimum compute capability for code generation, use the ComputeCapability property of the GPU code configuration object.

cfg.GpuConfig.ComputeCapability = '7.0';

Create a coder.CuDNNConfig deep learning configuration object. Specify target library-specific properties of the deep learning configuration object. Assign it to the DeepLearningConfig property of the cfg configuration object.

dlcfg = coder.DeepLearningConfig('cudnn');
dlcfg.AutoTuning = true;
dlcfg.DataType = 'int8';
dlcfg.CalibrationResultFile = 'myInt8Cal.mat';
cfg.DeepLearningConfig = dlcfg;

arg = {ones(224,224,3,'single')};
codegen -args arg ...
-config cfg googlenet_predict

For more information, see Code Generation for Deep Learning Networks by Using cuDNN.

Intel^® CPU processor

To use the Intel Math Kernel Library for Deep Neural Networks (MKL-DNN) for Intel CPUs, set the 'targetlib' parameter to 'mkldnn'.

cnncodegen(net,'targetlib'...
,'mkldnn');

Create a coder.config configuration object for generation of a static library.

cfg = coder.config('lib');
cfg.TargetLang = 'C++';

Create a coder.MklDNNConfig deep learning configuration object. Assign it to the DeepLearningConfig property of the cfg configuration object.

dlcfg = coder.DeepLearningConfig...
('mkldnn');
cfg.DeepLearningConfig = dlcfg;

arg = {ones(224,224,3,'single')};
codegen -args arg ...
-config cfg googlenet_predict

For more information, see Code Generation for Deep Learning Networks with MKL-DNN.

NVIDIA GPUs by using NVIDIA TensorRT™, a high performance deep learning inference optimizer and run-time library

Set the 'targetlib' parameter to 'tensorrt'. Specify TensorRT library-specific properties by using the 'targetparams' parameter.

cnncodegen(net,'targetlib'...
,'tensorrt','ComputeCapability'...
,'7.0','targetparams' ...
,struct('DataType','INT8' ...
'DataPath','image_dataset'...
,'NumCalibrationBatches',50))

Create a coder.gpuConfig configuration object for generation of a static library.

cfg = coder.gpuConfig('lib');
cfg.TargetLang = 'C++';

To set the minimum compute capability for code generation, use the ComputeCapability property of the GPU code configuration object.

cfg.GpuConfig.ComputeCapability = '7.0';

Create a coder.TensorRTConfig deep learning configuration object. Specify target library-specific properties of the deep learning configuration object. Assign it to the DeepLearningConfig property of the cfg configuration object.

dlcfg = coder.DeepLearningConfig...
('cudnn');
dlcfg.DataType = 'int8';
dlcfg.DataPath = 'image_dataset';
dlcfg.NumCalibrationBatches = 50;
cfg.DeepLearningConfig = dlcfg;

arg = {ones(224,224,3,'single')};
codegen -args arg ...
-config cfg googlenet_predict

For more information, see Deep Learning Prediction with NVIDIA TensorRT Library.

General options

Generate code without generating and building a makefile. For example,

cnncodegen(net,'targetlib' ...
,'mkldnn','codegenonly',1);

To produces the source code without invoking the make command or build object code, use the GenCodeOnly property of the coder.CodeConfig or coder.GPUCodeConfig object. For example,

cfg = coder.codeConfig('lib');
cfg.GenCodeOnly = true;

Specifying the NVIDIA GPU compute capability to compile for. Argument takes the format of major#.minor#.

cnncodegen(net,'targetlib'...
,'cudnn','ComputeCapability','7.0');

To set the minimum compute capability for code generation, use the ComputeCapability property of the GPU code configuration object.

cfg = coder.gpuConfig('lib');
cfg.GpuConfig.ComputeCapability = '7.0';

cnncodegen

Syntax

Description

Examples

Generate C++ Code for a Pretrained Network to Run on an ARM Processor

Input Arguments

`net` — Pretrained deep learning network object
character vector | string scalar

`targetparams` — Library-specific parameters
structure

Version History

R2021a: Changes to Target Library Support

See Also

Functions

Topics

cnncodegen

Syntax

Description

Examples

Generate C++ Code for a Pretrained Network to Run on an ARM Processor

Input Arguments

net — Pretrained deep learning network object character vector | string scalar

targetparams — Library-specific parameters structure

Version History

R2021a: Changes to Target Library Support

See Also

Functions

Topics

`net` — Pretrained deep learning network object
character vector | string scalar

`targetparams` — Library-specific parameters
structure