Abstract
r_MRAM, addr_WeightCache, . . . )
4: convolution_loop:
5: #Load activation from activation memory to local
memory. Then do convolution.
6: MOV(row, col, addr_ActMem, addr_LocalMem, . . . )
7: CONV(row, col, channel, kernel_size, stride, . . . )
8: #Register control in NCU for for-loop
9: NCU_ADDI, row, block_size
10: NCU_BLT row, row_size, convolution_loop
11: NCU_MOVI, row, initial_row
12: NCU_ADDI, col, block_size
13: NCU_BLT col, col_size, convolution_loop
14: NCU_MOVI, col, initial_col
Fig.