Showing 1–28 of 28 results for author: Gallo, M L

Searching in archive cs.
  1. arXiv:2407.02353

    eess.SP cs.AR eess.SY

    Roadmap to Neuromorphic Computing with Emerging Technologies

    Authors: Adnan Mehonic, Daniele Ielmini, Kaushik Roy, Onur Mutlu, Shahar Kvatinsky, Teresa Serrano-Gotarredona, Bernabe Linares-Barranco, Sabina Spiga, Sergey Savelev, Alexander G Balanov, Nitin Chawla, Giuseppe Desoli, Gerardo Malavena, Christian Monzio Compagnoni, Zhongrui Wang, J Joshua Yang, Ghazi Sarwat Syed, Abu Sebastian, Thomas Mikolajick, Beatriz Noheda, Stefan Slesazeck, Bernard Dieny, Tuo-Hung Hou, Akhil Varri, et al. (28 additional authors not shown)

    Abstract: The roadmap is organized into several thematic sections, outlining current computing challenges, discussing the neuromorphic computing approach, analyzing mature and currently utilized technologies, providing an overview of emerging technologies, addressing material challenges, exploring novel computing concepts, and finally examining the maturity level of emerging technologies while determining t…

    Submitted 5 July, 2024; v1 submitted 2 July, 2024; originally announced July 2024.

    Comments: 90 pages, 22 figures, roadmap, neuromorphic

  2. arXiv:2406.03372

    physics.app-ph cs.LG

    Training of Physical Neural Networks

    Authors: Ali Momeni, Babak Rahmani, Benjamin Scellier, Logan G. Wright, Peter L. McMahon, Clara C. Wanjura, Yuhang Li, Anas Skalli, Natalia G. Berloff, Tatsuhiro Onodera, Ilker Oguz, Francesco Morichetti, Philipp del Hougne, Manuel Le Gallo, Abu Sebastian, Azalia Mirhoseini, Cheng Zhang, Danijela Marković, Daniel Brunner, Christophe Moser, Sylvain Gigan, Florian Marquardt, Aydogan Ozcan, Julie Grollier, Andrea J. Liu, et al. (3 additional authors not shown)

    Abstract: Physical neural networks (PNNs) are a class of neural-like networks that leverage the properties of physical systems to perform computation. While PNNs are so far a niche research area with small-scale laboratory demonstrations, they are arguably one of the most underappreciated important opportunities in modern AI. Could we train AI models 1000x larger than current ones? Could we do this and also…

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: 29 pages, 4 figures

  3. A Precision-Optimized Fixed-Point Near-Memory Digital Processing Unit for Analog In-Memory Computing

    Authors: Elena Ferro, Athanasios Vasilopoulos, Corey Lammie, Manuel Le Gallo, Luca Benini, Irem Boybat, Abu Sebastian

    Abstract: Analog In-Memory Computing (AIMC) is an emerging technology for fast and energy-efficient Deep Learning (DL) inference. However, a certain amount of digital post-processing is required to deal with circuit mismatches and non-idealities associated with the memory devices. Efficient near-memory digital logic is critical to retain the high area/energy efficiency and low latency of AIMC. Existing syst…

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: Accepted at ISCAS2024

  4. Improving the Accuracy of Analog-Based In-Memory Computing Accelerators Post-Training

    Authors: Corey Lammie, Athanasios Vasilopoulos, Julian Büchel, Giacomo Camposampiero, Manuel Le Gallo, Malte Rasch, Abu Sebastian

    Abstract: Analog-Based In-Memory Computing (AIMC) inference accelerators can be used to efficiently execute Deep Neural Network (DNN) inference workloads. However, to mitigate accuracy losses, due to circuit and device non-idealities, Hardware-Aware (HWA) training methodologies must be employed. These typically require significant information about the underlying hardware. In this paper, we propose two Post…

    Submitted 18 January, 2024; originally announced January 2024.

    Comments: Accepted at 2024 IEEE International Symposium on Circuits and Systems (ISCAS)

  5. Using the IBM Analog In-Memory Hardware Acceleration Kit for Neural Network Training and Inference

    Authors: Manuel Le Gallo, Corey Lammie, Julian Buechel, Fabio Carta, Omobayode Fagbohungbe, Charles Mackin, Hsinyu Tsai, Vijay Narayanan, Abu Sebastian, Kaoutar El Maghraoui, Malte J. Rasch

    Abstract: Analog In-Memory Computing (AIMC) is a promising approach to reduce the latency and energy consumption of Deep Neural Network (DNN) inference and training. However, the noisy and non-linear device characteristics, and the non-ideal peripheral circuitry in AIMC chips, require adapting DNNs to be deployed on such hardware to achieve equivalent accuracy to digital computing. In this tutorial, we prov…

    Submitted 26 January, 2024; v1 submitted 18 July, 2023; originally announced July 2023.

    Journal ref: APL Machine Learning (2023) 1 (4): 041102

  6. Gradient descent-based programming of analog in-memory computing cores

    Authors: Julian Büchel, Athanasios Vasilopoulos, Benedikt Kersting, Frederic Odermatt, Kevin Brew, Injo Ok, Sam Choi, Iqbal Saraf, Victor Chan, Timothy Philip, Nicole Saulnier, Vijay Narayanan, Manuel Le Gallo, Abu Sebastian

    Abstract: The precise programming of crossbar arrays of unit-cells is crucial for obtaining high matrix-vector-multiplication (MVM) accuracy in analog in-memory computing (AIMC) cores. We propose a radically different approach based on directly minimizing the MVM error using gradient descent with synthetic random input data. Our method significantly reduces the MVM error compared with conventional unit-cell…

    Submitted 26 May, 2023; originally announced May 2023.

    Journal ref: 2022 International Electron Devices Meeting (IEDM), San Francisco, CA, USA, 2022, pp. 33.1.1-33.1.4

  7. arXiv:2305.10459

    cs.AR cs.CV cs.LG

    AnalogNAS: A Neural Network Design Framework for Accurate Inference with Analog In-Memory Computing

    Authors: Hadjer Benmeziane, Corey Lammie, Irem Boybat, Malte Rasch, Manuel Le Gallo, Hsinyu Tsai, Ramachandran Muralidhar, Smail Niar, Ouarnoughi Hamza, Vijay Narayanan, Abu Sebastian, Kaoutar El Maghraoui

    Abstract: The advancement of Deep Learning (DL) is driven by efficient Deep Neural Network (DNN) design and new hardware accelerators. Current DNN design is primarily tailored for general-purpose use and deployment on commercially viable platforms. Inference at the edge requires low latency, compact and power-efficient models, and must be cost-effective. Digital processors based on typical von Neumann archi…

    Submitted 17 May, 2023; originally announced May 2023.

    Comments: Accepted to IEEE Edge

  8. arXiv:2302.08469

    cs.LG cs.ET

    Hardware-aware training for large-scale and diverse deep learning inference workloads using in-memory computing-based accelerators

    Authors: Malte J. Rasch, Charles Mackin, Manuel Le Gallo, An Chen, Andrea Fasoli, Frederic Odermatt, Ning Li, S. R. Nandakumar, Pritish Narayanan, Hsinyu Tsai, Geoffrey W. Burr, Abu Sebastian, Vijay Narayanan

    Abstract: Analog in-memory computing (AIMC) -- a promising approach for energy-efficient acceleration of deep learning workloads -- computes matrix-vector multiplications (MVMs) but only approximately, due to nonidealities that often are non-deterministic or nonlinear. This can adversely impact the achievable deep neural network (DNN) inference accuracy as compared to a conventional floating point (FP) impl…

    Submitted 16 February, 2023; originally announced February 2023.

    Comments: 35 pages, 7 figures, 5 tables

  9. A 64-core mixed-signal in-memory compute chip based on phase-change memory for deep neural network inference

    Authors: Manuel Le Gallo, Riduan Khaddam-Aljameh, Milos Stanisavljevic, Athanasios Vasilopoulos, Benedikt Kersting, Martino Dazzi, Geethan Karunaratne, Matthias Braendli, Abhairaj Singh, Silvia M. Mueller, Julian Buechel, Xavier Timoneda, Vinay Joshi, Urs Egger, Angelo Garofalo, Anastasios Petropoulos, Theodore Antonakopoulos, Kevin Brew, Samuel Choi, Injo Ok, Timothy Philip, Victor Chan, Claire Silvestre, Ishtiaq Ahsan, Nicole Saulnier, et al. (4 additional authors not shown)

    Abstract: The need to repeatedly shuttle around synaptic weight values from memory to processing units has been a key source of energy inefficiency associated with hardware implementation of artificial neural networks. Analog in-memory computing (AIMC) with spatially instantiated synaptic weights holds high promise to overcome this challenge, by performing matrix-vector multiplications (MVMs) directly withi…

    Submitted 6 December, 2022; originally announced December 2022.

    Journal ref: Nature Electronics 6, 680-693 (2023)

  10. arXiv:2111.06503

    cs.AR cs.ET cs.LG

    AnalogNets: ML-HW Co-Design of Noise-robust TinyML Models and Always-On Analog Compute-in-Memory Accelerator

    Authors: Chuteng Zhou, Fernando Garcia Redondo, Julian Büchel, Irem Boybat, Xavier Timoneda Comas, S. R. Nandakumar, Shidhartha Das, Abu Sebastian, Manuel Le Gallo, Paul N. Whatmough

    Abstract: Always-on TinyML perception tasks in IoT applications require very high energy efficiency. Analog compute-in-memory (CiM) using non-volatile memory (NVM) promises high efficiency and also provides self-contained on-chip model storage. However, analog CiM introduces new practical considerations, including conductance drift, read/write noise, fixed analog-to-digital (ADC) converter gain, etc. These…

    Submitted 10 November, 2021; originally announced November 2021.

  11. Energy Efficient In-memory Hyperdimensional Encoding for Spatio-temporal Signal Processing

    Authors: Geethan Karunaratne, Manuel Le Gallo, Michael Hersche, Giovanni Cherubini, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: The emerging brain-inspired computing paradigm known as hyperdimensional computing (HDC) has been proven to provide a lightweight learning framework for various cognitive tasks compared to the widely used deep learning-based approaches. Spatio-temporal (ST) signal processing, which encompasses biosignals such as electromyography (EMG) and electroencephalography (EEG), is one family of applications…

    Submitted 22 June, 2021; originally announced June 2021.

    Journal ref: IEEE Transactions on Circuits and Systems II: Express Briefs, vol. 68, no. 5, pp. 1725-1729, May 2021

  12. arXiv:2106.06270

    cond-mat.mtrl-sci cs.ET

    Measurement of onset of structural relaxation in melt-quenched phase change materials

    Authors: Benedikt Kersting, Syed Ghazi Sarwat, Manuel Le Gallo, Kevin Brew, Sebastian Walfort, Nicole Saulnier, Martin Salinga, Abu Sebastian

    Abstract: Chalcogenide phase change materials enable non-volatile, low-latency storage-class memory. They are also being explored for new forms of computing such as neuromorphic and in-memory computing. A key challenge, however, is the temporal drift in the electrical resistance of the amorphous states that encode data. Drift, caused by the spontaneous structural relaxation of the newly recreated melt-quenc…

    Submitted 11 June, 2021; originally announced June 2021.

  13. arXiv:2105.05956

    cs.ET cond-mat.dis-nn cond-mat.mtrl-sci

    2022 Roadmap on Neuromorphic Computing and Engineering

    Authors: Dennis V. Christensen, Regina Dittmann, Bernabé Linares-Barranco, Abu Sebastian, Manuel Le Gallo, Andrea Redaelli, Stefan Slesazeck, Thomas Mikolajick, Sabina Spiga, Stephan Menzel, Ilia Valov, Gianluca Milano, Carlo Ricciardi, Shi-Jun Liang, Feng Miao, Mario Lanza, Tyler J. Quill, Scott T. Keene, Alberto Salleo, Julie Grollier, Danijela Marković, Alice Mizrahi, Peng Yao, J. Joshua Yang, Giacomo Indiveri, et al. (34 additional authors not shown)

    Abstract: Modern computation based on the von Neumann architecture is today a mature cutting-edge science. In the von Neumann architecture, processing and memory units are implemented as separate blocks interchanging data intensively and continuously. This data transfer is responsible for a large part of the power consumption. The next generation computer technology is expected to solve problems at the exas…

    Submitted 13 January, 2022; v1 submitted 12 May, 2021; originally announced May 2021.

    Journal ref: Neuromorph. Comput. Eng. 2 022501 (2022)

  14. A flexible and fast PyTorch toolkit for simulating training and inference on analog crossbar arrays

    Authors: Malte J. Rasch, Diego Moreda, Tayfun Gokmen, Manuel Le Gallo, Fabio Carta, Cindy Goldberg, Kaoutar El Maghraoui, Abu Sebastian, Vijay Narayanan

    Abstract: We introduce the IBM Analog Hardware Acceleration Kit, a new and first of a kind open source toolkit to simulate analog crossbar arrays in a convenient fashion from within PyTorch (freely available at https://github.com/IBM/aihwkit). The toolkit is under active development and is centered around the concept of an "analog tile" which captures the computations performed on a crossbar array. Analog t…

    Submitted 5 April, 2021; originally announced April 2021.

    Comments: Submitted to AICAS2021

  15. Robust High-dimensional Memory-augmented Neural Networks

    Authors: Geethan Karunaratne, Manuel Schmuck, Manuel Le Gallo, Giovanni Cherubini, Luca Benini, Abu Sebastian, Abbas Rahimi

    Abstract: Traditional neural networks require enormous amounts of data to build their complex mappings during a slow training procedure that hinders their abilities for relearning and adapting to new data. Memory-augmented neural networks enhance neural networks with an explicit memory to overcome these issues. Access to this explicit memory, however, occurs via soft read and write operations involving ever…

    Submitted 19 March, 2021; v1 submitted 5 October, 2020; originally announced October 2020.

    Comments: This is a pre-print of an article accepted for publication in Nature Communications

    Journal ref: Nature Communications volume 12, Article number: 2468 (2021)

  16. arXiv:2004.03073

    cs.ET

    Accurate Emulation of Memristive Crossbar Arrays for In-Memory Computing

    Authors: Anastasios Petropoulos, Irem Boybat, Manuel Le Gallo, Evangelos Eleftheriou, Abu Sebastian, Theodore Antonakopoulos

    Abstract: In-memory computing is an emerging non-von Neumann computing paradigm where certain computational tasks are performed in memory by exploiting the physical attributes of the memory devices. Memristive devices such as phase-change memory (PCM), where information is stored in terms of their conductance levels, are especially well suited for in-memory computing. In particular, memristive devices, when…

    Submitted 6 April, 2020; originally announced April 2020.

    Comments: 5 pages, 4 figures, accepted for publication at ISCAS 2020

  17. arXiv:2003.11256

    cs.LG cs.NE

    ESSOP: Efficient and Scalable Stochastic Outer Product Architecture for Deep Learning

    Authors: Vinay Joshi, Geethan Karunaratne, Manuel Le Gallo, Irem Boybat, Christophe Piveteau, Abu Sebastian, Bipin Rajendran, Evangelos Eleftheriou

    Abstract: Deep neural networks (DNNs) have surpassed human-level accuracy in a variety of cognitive tasks but at the cost of significant memory/time requirements in DNN training. This limits their deployment in energy and memory limited applications that require real-time learning. Matrix-vector multiplications (MVM) and vector-vector outer product (VVOP) are the two most expensive operations associated wit…

    Submitted 25 March, 2020; originally announced March 2020.

    Comments: 5 pages. 5 figures. Accepted at ISCAS 2020 for publication

  18. arXiv:2002.00281

    physics.optics cond-mat.dis-nn cs.ET

    Parallel convolution processing using an integrated photonic tensor core

    Authors: Johannes Feldmann, Nathan Youngblood, Maxim Karpov, Helge Gehring, Xuan Li, Maik Stappers, Manuel Le Gallo, Xin Fu, Anton Lukashchuk, Arslan Raja, Junqiu Liu, David Wright, Abu Sebastian, Tobias Kippenberg, Wolfram Pernice, Harish Bhaskaran

    Abstract: With the proliferation of ultra-high-speed mobile networks and internet-connected devices, along with the rise of artificial intelligence, the world is generating exponentially increasing amounts of data - data that needs to be processed in a fast, efficient and smart way. These developments are pushing the limits of existing computing paradigms, and highly parallelized, fast and scalable hardware…

    Submitted 12 October, 2020; v1 submitted 1 February, 2020; originally announced February 2020.

  19. Mixed-precision deep learning based on computational memory

    Authors: S. R. Nandakumar, Manuel Le Gallo, Christophe Piveteau, Vinay Joshi, Giovanni Mariani, Irem Boybat, Geethan Karunaratne, Riduan Khaddam-Aljameh, Urs Egger, Anastasios Petropoulos, Theodore Antonakopoulos, Bipin Rajendran, Abu Sebastian, Evangelos Eleftheriou

    Abstract: Deep neural networks (DNNs) have revolutionized the field of artificial intelligence and have achieved unprecedented success in cognitive tasks such as image and speech recognition. Training of large DNNs, however, is computationally intensive and this has motivated the search for novel computing architectures targeting this application. A computational memory unit with nanoscale resistive memory…

    Submitted 31 January, 2020; originally announced January 2020.

    Journal ref: Frontiers in Neuroscience 14:406 (2020)

  20. Accurate deep neural network inference using computational phase-change memory

    Authors: Vinay Joshi, Manuel Le Gallo, Simon Haefeli, Irem Boybat, S. R. Nandakumar, Christophe Piveteau, Martino Dazzi, Bipin Rajendran, Abu Sebastian, Evangelos Eleftheriou

    Abstract: In-memory computing is a promising non-von Neumann approach for making energy-efficient deep learning inference hardware. Crossbar arrays of resistive memory devices can be used to encode the network weights and perform efficient analog matrix-vector multiplications without intermediate movements of data. However, due to device variability and noise, the network needs to be trained in a specific w…

    Submitted 11 April, 2020; v1 submitted 7 June, 2019; originally announced June 2019.

    Comments: This is a pre-print of an article accepted for publication in Nature Communications

    Journal ref: Nature Communications 11, Article number: 2473 (2020)

  21. arXiv:1906.01548

    cs.ET cs.AI physics.app-ph

    In-memory hyperdimensional computing

    Authors: Geethan Karunaratne, Manuel Le Gallo, Giovanni Cherubini, Luca Benini, Abbas Rahimi, Abu Sebastian

    Abstract: Hyperdimensional computing (HDC) is an emerging computational framework that takes inspiration from attributes of neuronal circuits such as hyperdimensionality, fully distributed holographic representation, and (pseudo)randomness. When employed for machine learning tasks such as learning and classification, HDC involves manipulation and comparison of large patterns within memory. Moreover, a key a…

    Submitted 9 April, 2020; v1 submitted 4 June, 2019; originally announced June 2019.

  22. arXiv:1905.11929

    cs.ET cs.NE

    Supervised Learning in Spiking Neural Networks with Phase-Change Memory Synapses

    Authors: S. R. Nandakumar, Irem Boybat, Manuel Le Gallo, Evangelos Eleftheriou, Abu Sebastian, Bipin Rajendran

    Abstract: Spiking neural networks (SNN) are artificial computational models that have been inspired by the brain's ability to naturally encode and process information in the time domain. The added temporal dimension is believed to render them more computationally efficient than the conventional artificial neural networks, though their full computational capabilities are yet to be explored. Recently, computa…

    Submitted 28 May, 2019; originally announced May 2019.

  23. arXiv:1801.06228

    cs.ET physics.app-ph physics.optics

    In-memory computing on a photonic platform

    Authors: Carlos Ríos, Nathan Youngblood, Zengguang Cheng, Manuel Le Gallo, Wolfram H. P. Pernice, C David Wright, Abu Sebastian, Harish Bhaskaran

    Abstract: Collocated data processing and storage are the norm in biological systems. Indeed, the von Neumann computing architecture, that physically and temporally separates processing and memory, was born more of pragmatism based on available technology. As our ability to create better hardware improves, new computational paradigms are being explored. Integrated photonic circuits are regarded as an attract…

    Submitted 18 January, 2018; originally announced January 2018.

  24. arXiv:1712.01192

    cs.ET

    Mixed-precision training of deep neural networks using computational memory

    Authors: Nandakumar S. R., Manuel Le Gallo, Irem Boybat, Bipin Rajendran, Abu Sebastian, Evangelos Eleftheriou

    Abstract: Deep neural networks have revolutionized the field of machine learning by providing unprecedented human-like performance in solving many real-world problems such as image and speech recognition. Training of large DNNs, however, is a computationally intensive task, and this necessitates the development of novel computing architectures targeting this application. A computational memory unit where re…

    Submitted 4 December, 2017; originally announced December 2017.

  25. Neuromorphic computing with multi-memristive synapses

    Authors: Irem Boybat, Manuel Le Gallo, S. R. Nandakumar, Timoleon Moraitis, Thomas Parnell, Tomas Tuma, Bipin Rajendran, Yusuf Leblebici, Abu Sebastian, Evangelos Eleftheriou

    Abstract: Neuromorphic computing has emerged as a promising avenue towards building the next generation of intelligent computing systems. It has been proposed that memristive devices, which exhibit history-dependent conductivity modulation, could efficiently represent the synaptic weights in artificial neural networks. However, precise modulation of the device conductance over a wide dynamic range, necessar…

    Submitted 24 February, 2019; v1 submitted 17 November, 2017; originally announced November 2017.

    Journal ref: Nature Communications, volume 9, page 2514 (2018)

  26. arXiv:1706.05563

    cs.NE cs.LG stat.ML

    Fatiguing STDP: Learning from Spike-Timing Codes in the Presence of Rate Codes

    Authors: Timoleon Moraitis, Abu Sebastian, Irem Boybat, Manuel Le Gallo, Tomas Tuma, Evangelos Eleftheriou

    Abstract: Spiking neural networks (SNNs) could play a key role in unsupervised machine learning applications, by virtue of strengths related to learning from the fine temporal structure of event-based signals. However, some spike-timing-related strengths of SNNs are hindered by the sensitivity of spike-timing-dependent plasticity (STDP) rules to input spike rates, as fine temporal correlations may be obstru…

    Submitted 17 June, 2017; originally announced June 2017.

    Comments: 8 pages, 8 figures, presented at IJCNN in May 2017

    Journal ref: 2017 International Joint Conference on Neural Networks (IJCNN), Anchorage, AK, 2017, pp. 1823-1830

  27. Temporal correlation detection using computational phase-change memory

    Authors: Abu Sebastian, Tomas Tuma, Nikolaos Papandreou, Manuel Le Gallo, Lukas Kull, Thomas Parnell, Evangelos Eleftheriou

    Abstract: For decades, conventional computers based on the von Neumann architecture have performed computation by repeatedly transferring data between their processing and their memory units, which are physically separated. As computation becomes increasingly data-centric and as the scalability limits in terms of performance and power are being reached, alternative computing paradigms are searched for in wh…

    Submitted 1 June, 2017; originally announced June 2017.

  28. Mixed-Precision In-Memory Computing

    Authors: Manuel Le Gallo, Abu Sebastian, Roland Mathis, Matteo Manica, Heiner Giefers, Tomas Tuma, Costas Bekas, Alessandro Curioni, Evangelos Eleftheriou

    Abstract: As CMOS scaling reaches its technological limits, a radical departure from traditional von Neumann systems, which involve separate processing and memory units, is needed in order to significantly extend the performance of today's computers. In-memory computing is a promising approach in which nanoscale resistive memory devices, organized in a computational memory unit, are used for both processing…

    Submitted 4 October, 2018; v1 submitted 16 January, 2017; originally announced January 2017.

    Journal ref: Nature Electronics volume 1, pages 246-253 (2018)