Skip to Main content Skip to Navigation
Journal articles

MOL-based In-Memory Computing of Binary Neural Networks

Khaled Alhaj Ali 1, 2 Amer Baghdadi 1, 2 Elsa Dupraz 3, 2 Mathieu Léonardon 1, 2 Mostafa Rizk 1, 2, 4, 5 Jean-Philippe Diguet 6 
1 Lab-STICC_2AI - Equipe Algorithm Architecture Interactions
Lab-STICC - Laboratoire des sciences et techniques de l'information, de la communication et de la connaissance : UMR6285
3 Lab-STICC_CODES - Equipe CODES
Lab-STICC - Laboratoire des sciences et techniques de l'information, de la communication et de la connaissance : UMR6285
Abstract : Convolutional neural networks (CNN) have proven very effective in a variety of practical applications involving Artificial Intelligence (AI). However, the layer depth of CNN deepens as user applications become more sophisticated, resulting in a huge number of operations and increased memory size. The massive amount of the produced intermediate data leads to intensive data movement between memory and computing cores causing a real bottleneck. In-Memory Computing (IMC) aims to address this bottleneck by directly computing inside memory, eliminating energy-intensive and time-consuming data movement. On the other hand, the emerging Binary Neural Networks (BNN), which is a special case of CNN, shows a number of hardware-friendly properties including memory saving. In BNN, the costly floating-point multiply-and-accumulate is replaced with lightweight bit-wise XNOR and popcount operations. In this paper, we propose an IMC programmable architecture targeting efficient implementation of BNN. Computational memories based on the recently introduced Memristor Overwrite Logic (MOL) design style are employed. The architecture, which is presented in semi-parallel and parallel models, efficiently executes the advanced quantization algorithm of XNOR-Net BNN. Performance evaluation based on CIFAR-10 dataset demonstrates between 1.24× to 3× speedup, and 49% to 99% energy saving compared to state-of-the-art implementations, and up to 273 image/sec/Watt throughput efficiency.
Complete list of metadata

https://hal-imt-atlantique.archives-ouvertes.fr/hal-03659297
Contributor : Amer BAGHDADI Connect in order to contact the contributor
Submitted on : Monday, May 9, 2022 - 3:11:19 PM
Last modification on : Friday, August 5, 2022 - 2:54:52 PM
Long-term archiving on: : Wednesday, August 10, 2022 - 6:02:44 PM

File

TVLSI_arXiv_version.pdf
Files produced by the author(s)

Identifiers

Citation

Khaled Alhaj Ali, Amer Baghdadi, Elsa Dupraz, Mathieu Léonardon, Mostafa Rizk, et al.. MOL-based In-Memory Computing of Binary Neural Networks. IEEE Transactions on Very Large Scale Integration (VLSI) Systems, IEEE, 2022, 30 (7), ⟨10.1109/TVLSI.2022.3163233⟩. ⟨hal-03659297⟩

Share

Metrics

Record views

78

Files downloads

33