A High-Performance and Power-Efficient SIMD Convolution Engine for FPGAs