onnx2versal
Loading...
Searching...
No Matches
QLinearConv

The convolution operator consumes a quantized input tensor, its scale and zero point, a quantized filter, its scale and zero point, and output's scale and zero point, and computes the quantized output. 2D convolution on H, W dimensions of BCHW using kernels MCKK. Each c-th KxK kernel is applied on C dimension. This is done over M iterations to yield MxHxW per instance. This is done over B iterations to yield B batches. More...

Collaboration diagram for QLinearConv:

Modules

 QLinearConvKernels
 

Classes

class  QLinearConvGraph< PAD, QLINEARCONV, TT, TTPARAM, INP_H, INP_W, INP_W_PAD, OUT_W, OUT_W_PAD, STEP_H, STEP_W, B, C, M, KH, KW, GROUP, H0, H1, W0, W1 >
 Single instance graph that stores weights and biases Max size = 16384 and 4096 bytes respectively. More...
 
class  QLinearConvStreamGraph< PAD, QLINEARCONV, TT, TTPARAM, INP_H, INP_W, INP_W_PAD, OUT_W, OUT_W_PAD, STEP_H, STEP_W, B, C, M, KH, KW, GROUP, H0, H1, W0, W1 >
 Single instance graph that streams weights and biases, significantly slower. More...
 
class  QLinearConvChunkHGraph< QLINEARCONV, CONCAT, HCHUNK, TT, TTPARAM, INP_H, INP_W, INP_W_PAD, OUT_W, OUT_W_PAD, STEP_H, STEP_W, B, C, M, KH, KW, GROUP, H0, H1, W0, W1 >
 Multiinstance graph that stores weights and biases, chunks BCHW by H dimension, maximum 8 chunks. More...
 
class  QLinearConvChunkHStreamGraph< QLINEARCONV, CONCAT, HCHUNK, TT, TTPARAM, INP_H, INP_W, INP_W_PAD, OUT_W, OUT_W_PAD, STEP_H, STEP_W, B, C, M, KH, KW, GROUP, H0, H1, W0, W1 >
 Multiinstance graph that stores weights and biases, chunks BCHW by H dimension, maximum 8 chunks. More...
 
class  QLinearConvChunkHPktStreamGraph< QLINEARCONV, CONCAT, HCHUNK, TT, TTPARAM, INP_H, INP_W, INP_W_PAD, OUT_W, OUT_W_PAD, STEP_H, STEP_W, B, C, M, KH, KW, GROUP, H0, H1, W0, W1 >
 Multiinstance graph that stores weights and biases, chunks BCHW by H dimension, maximum 8 chunks. More...
 
class  QLinearConvChunkCGraph< QLINEARCONV0, QLINEARCONV1, QLINEARCONV2, CONCAT, CCHUNK, TT, TTPARAM, INP_H, INP_W, INP_W_PAD, OUT_W, OUT_W_PAD, STEP_H, STEP_W, B, C, M, KH, KW, GROUP, H0, H1, W0, W1 >
 Multiinstance graph that stores weights and biases, chunks BCHW by C dimension, maximum 8 chunks. More...
 

Detailed Description

Template Parameters
QLINEARCONVConv2D Kernel
TTinput/output dtype, int8_t or uint8_t only
TTPARAMweight dtype, int8_t or uint8_t only
INP_Hinput height
INP_Winput width to use
INP_W_PADinput width, padded to vector boundary
OUT_Woutput width to use, unable to infer due to width padding
OUT_W_PADoutput width, padded to vector boundary
STEP_Hstride in height dimension
STEP_Wstride in width dimension
Bbatch size
Cinput channels
Moutput channels
KHkernel height
KWkernel width
GROUPsplit input into groups, CGROUP==0
H0Pixels added before height (default 0)
H1Pixels added after height (default 0)
W0Pixels added before width (default 0)
W1Pixels added after width (default 0)