Microarchitecture¶

Processing Element¶

Should support dot product

Accumulator: Adder that keeps result in storage

Inference in INT8 precision => Multipliers are INT8, because adders and accumulators need wide range to perform accurate accumulation of many numbers

Step
1
2

Step
1
2

Initiation interval: How often we can start computation of a new element in a loop

Break down computation into multiple steps with intermediate registers

Block Floating Point

Bit-width of address = no of data entries

Connecting RAM to MAC


Simple
Use separate memories for 2 operands
Increase no of read ports	Problems with adding many read ports to SRAM 1. Large size 2. Inc power consumption 3. Slow 4. In FPGA, you need to duplicate your memorie
Banking	Use multiple small memories

Processing		Why?
In-Sensor		Data movement from sensor to processor is costly For eg, if you only need class label as output, why unnecessarily transfer 8MP image to processor
Near-Memory
In-Memory (Analog Processing)		- Weights stored as charges - Activations delivered as analog voltages - By activating pre-charge circuity on the word & bit lines, we can perform multiplication between input activation voltage & stored weights

Last Updated: 2024-05-12 ; Contributors: AhmedThahir, web-flow