|
onnx2versal
|
Vector implementation using fastexp2 method for single axis, QLinearSoftmaxSingleaxis<10,10,16> takes 5185 cycles requires INP_W_PAD%16=0. Slightly less accurate due to srs after each mult.
#include <qlinearsoftmax.h>
Public Member Functions | |
| QLinearSoftmaxSingleaxis (float x_scale, float y_scale, TT x_zero, TT y_zero) | |
| void | filter (input_window< TT > *in, output_stream< TT > *out) |
Static Public Member Functions | |
| static void | registerKernelClass () |