|
onnx2versal
|
Scalar implementation for concatenating 2 chunked streams, ConcatTwo32bitStreams<f,4,32,32,64> takes ~1000 cycles.
#include <concat.h>
Public Member Functions | |
| void | filter (input_stream< TT > *in0, input_stream< TT > *in1, output_stream< TT > *out) |
Static Public Member Functions | |
| static void | registerKernelClass () |