Statistical Array Allocation and Partitioning for Compute In-Memory Fabrics
In: IFIP Advances in Information and Communication Technology ; 28th IFIP/IEEE International Conference on Very Large Scale Integration - System on a Chip (VLSI-SoC) ; https://hal.inria.fr/hal-03759733 ; 28th IFIP/IEEE International Conference on Very Large Scale Integration - System on a Chip (VLSI-SoC), Oct 2020, Salt Lake City, UT, United States. pp.323-341, ⟨10.1007/978-3-030-81641-4_15⟩, 2020
Online
Konferenz
Zugriff:
International audience ; Compute in-memory (CIM) is a promising technique that minimizes data transport, the primary performance bottleneck and energy cost of most data intensive applications. This has found wide-spread adoption in accelerating neural networks for machine learning applications. Utilizing a crossbar architecture with emerging non-volatile memories (eNVM) such as dense resistive random access memory (RRAM) or phase change random access memory (PCRAM), various forms of neural networks can be implemented to greatly reduce power and increase on chip memory capacity. However, compute in-memory faces its own limitations at both the circuit and the device levels. Although compute in-memory using the crossbar architecture can greatly reduce data transport, the rigid nature of these large fixed weight matrices forfeits the flexibility of traditional CMOS and SRAM based designs. In this work, we explore the different synchronization barriers that occur from the CIM constraints. Furthermore, we propose a new allocation algorithm and data flow based on input data distributions to maximize utilization and performance for compute-in memory based designs. We demonstrate a 7.47$$\times $$× performance improvement over a naive allocation method for CIM accelerators on ResNet18.
Titel: |
Statistical Array Allocation and Partitioning for Compute In-Memory Fabrics
|
---|---|
Autor/in / Beteiligte Person: | Crafton, Brian ; Spetalnick, Samuel ; Murali, Gauthaman ; Krishna, Tushar ; Lim, Sung-Kyu ; Raychowdhury, Arijit ; Georgia Institute of Technology Atlanta ; Calimera, Andrea ; Gaillardon, Pierre-Emmanuel ; Korgaonkar, Kunal ; Kvatinsky, Shahar ; Reis, Ricardo ; 10, TC ; 10.5, WG |
Link: | |
Zeitschrift: | IFIP Advances in Information and Communication Technology ; 28th IFIP/IEEE International Conference on Very Large Scale Integration - System on a Chip (VLSI-SoC) ; https://hal.inria.fr/hal-03759733 ; 28th IFIP/IEEE International Conference on Very Large Scale Integration - System on a Chip (VLSI-SoC), Oct 2020, Salt Lake City, UT, United States. pp.323-341, ⟨10.1007/978-3-030-81641-4_15⟩, 2020 |
Veröffentlichung: | HAL CCSD ; Springer International Publishing, 2020 |
Medientyp: | Konferenz |
DOI: | 10.1007/978-3-030-81641-4_15 |
Schlagwort: |
|
Sonstiges: |
|