QNN HTP Predict
Predict responses of a QNN model or QNN context binary for the HTP (NPU) backend
Since R2025b
Libraries:
Embedded Coder Support Package for Qualcomm Hexagon
Processors /
Hexagon /
QNN
Description
The QNN HTP Predict block predicts responses of a deep learning network represented as a QNN model or QNN context binary for the HTP (NPU) backend of Qualcomm® AI Direct Engine, based on the given input data.
To add the block to your Simulink model, open the model (for example,
myQNNModel), and enter this command at the MATLAB
prompt:
add_block("mwqnnlib/QNN HTP Predict","myQNNModel/QNN HTP Predict")
The QNN HTP Predict block allows you to select a QNN model as a compiled shared object (.so) for running on x86-based host. For the target, you can select either a compiled shared object (.so) or a QNN context binary file (.bin) that are optimized to run on HTP (NPU) backend.
The code generated using this block can be deployed to one of these boards that are available under the Hardware board parameter in Configuration Parameters:
Qualcomm Android Board
Qualcomm Linux Board
Qualcomm Hexagon Android Board, with Processor Version
cDSPQualcomm Hexagon Linux Board, with Processor Version
cDSP
The block also provides the option to dequantize outputs to single-precision, if required.
Ports
Input
Output
Parameters
Extended Capabilities
Version History
Introduced in R2025b