Showing 1–25 of 25 results for author: Gerstner, W

Searching in archive cs.
  1. arXiv:2311.01644  [pdf, other]

    cs.LG cs.NE stat.ML

    Should Under-parameterized Student Networks Copy or Average Teacher Weights?

    Authors: Berfin Şimşek, Amire Bendjeddou, Wulfram Gerstner, Johanni Brea

    Abstract: Any continuous function $f^*$ can be approximated arbitrarily well by a neural network with sufficiently many neurons $k$. We consider the case when $f^*$ itself is a neural network with one hidden layer and $k$ neurons. Approximating $f^*$ with a neural network with $n < k$ neurons can thus be seen as fitting an under-parameterized "student" network with $n$ neurons to a "teacher" network with…

    Submitted 15 January, 2024; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: 41 pages, presented at NeurIPS 2023

  2. arXiv:2306.08744  [pdf, other]

    cs.NE cs.LG

    High-performance deep spiking neural networks with 0.3 spikes per neuron

    Authors: Ana Stanojevic, Stanisław Woźniak, Guillaume Bellec, Giovanni Cherubini, Angeliki Pantazi, Wulfram Gerstner

    Abstract: Communication by rare, binary spikes is a key factor for the energy efficiency of biological brains. However, it is harder to train biologically-inspired spiking neural networks (SNNs) than artificial neural networks (ANNs). This is puzzling given that theoretical results provide exact mapping algorithms from ANNs to SNNs with time-to-first-spike (TTFS) coding. In this paper we analyze in theory a…

    Submitted 20 November, 2023; v1 submitted 14 June, 2023; originally announced June 2023.

  3. arXiv:2306.01690  [pdf, other]

    cs.LG cs.AI

    Context selectivity with dynamic availability enables lifelong continual learning

    Authors: Martin Barry, Wulfram Gerstner, Guillaume Bellec

    Abstract: "You never forget how to ride a bike", -- but how is that possible? The brain is able to learn complex skills, stop the practice for years, learn other skills in between, and still retrieve the original knowledge when necessary. The mechanisms of this capability, referred to as lifelong learning (or continual learning, CL), are unknown. We suggest a bio-plausible meta-plasticity rule building on c…

    Submitted 25 January, 2024; v1 submitted 2 June, 2023; originally announced June 2023.

  4. arXiv:2304.12794  [pdf, other]

    cs.NE

    Expand-and-Cluster: Parameter Recovery of Neural Networks

    Authors: Flavio Martinelli, Berfin Simsek, Wulfram Gerstner, Johanni Brea

    Abstract: Can we identify the weights of a neural network by probing its input-output mapping? At first glance, this problem seems to have many solutions because of permutation, overparameterisation and activation function symmetries. Yet, we show that the incoming weight vector of each neuron is identifiable up to sign or scaling, depending on the activation function. Our novel method 'Expand-and-Cluster'…

    Submitted 27 June, 2024; v1 submitted 25 April, 2023; originally announced April 2023.

    Comments: Accepted paper at ICML '24

  5. arXiv:2301.10638  [pdf, ps, other]

    cs.LG

    MLPGradientFlow: going with the flow of multilayer perceptrons (and finding minima fast and accurately)

    Authors: Johanni Brea, Flavio Martinelli, Berfin Şimşek, Wulfram Gerstner

    Abstract: MLPGradientFlow is a software package to solve numerically the gradient flow differential equation $\dot{\theta} = -\nabla \mathcal{L}(\theta; \mathcal{D})$, where $\theta$ are the parameters of a multi-layer perceptron, $\mathcal{D}$ is some data set, and $\nabla \mathcal{L}$ is the gradient of a loss function. We show numerically that adaptive first- or higher-order integration methods based on Runge-Kutta schemes h…

    Submitted 25 January, 2023; originally announced January 2023.

  6. arXiv:2212.12522  [pdf, other]

    cs.NE cs.LG

    An Exact Mapping From ReLU Networks to Spiking Neural Networks

    Authors: Ana Stanojevic, Stanisław Woźniak, Guillaume Bellec, Giovanni Cherubini, Angeliki Pantazi, Wulfram Gerstner

    Abstract: Deep spiking neural networks (SNNs) offer the promise of low-power artificial intelligence. However, training deep SNNs from scratch or converting deep artificial neural networks to SNNs without loss of performance has been a challenge. Here we propose an exact mapping from a network with Rectified Linear Units (ReLUs) to an SNN that fires exactly one spike per neuron. For our constructive proof,…

    Submitted 23 December, 2022; originally announced December 2022.

  7. arXiv:2208.09416  [pdf, other]

    cs.NE

    Kernel Memory Networks: A Unifying Framework for Memory Modeling

    Authors: Georgios Iatropoulos, Johanni Brea, Wulfram Gerstner

    Abstract: We consider the problem of training a neural network to store a set of patterns with maximal noise robustness. A solution, in terms of optimal weights and state update rules, is derived by training each individual neuron to perform either kernel classification or interpolation with a minimum weight norm. By applying this method to feed-forward and recurrent networks, we derive optimal models, term…

    Submitted 23 July, 2024; v1 submitted 19 August, 2022; originally announced August 2022.

    Comments: 24 pages, 5 figures. This is the version published in the NeurIPS 2022 proceedings

  8. arXiv:2205.13493  [pdf, other]

    q-bio.NC cs.LG stat.ML

    Mesoscopic modeling of hidden spiking neurons

    Authors: Shuqi Wang, Valentin Schmutz, Guillaume Bellec, Wulfram Gerstner

    Abstract: Can we use spiking neural networks (SNN) as generative models of multi-neuronal recordings, while taking into account that most neurons are unobserved? Modeling the unobserved neurons with large pools of hidden spiking neurons leads to severely underconstrained problems that are hard to tackle with maximum likelihood estimation. In this work, we use coarse-graining and mean-field approximations to…

    Submitted 7 January, 2023; v1 submitted 26 May, 2022; originally announced May 2022.

    Comments: 23 pages, 7 figures

  9. arXiv:2106.10064  [pdf, other]

    stat.ML cs.LG q-bio.NC

    Fitting summary statistics of neural data with a differentiable spiking network simulator

    Authors: Guillaume Bellec, Shuqi Wang, Alireza Modirshanechi, Johanni Brea, Wulfram Gerstner

    Abstract: Fitting network models to neural activity is an important tool in neuroscience. A popular approach is to model a brain area with a probabilistic recurrent spiking network whose parameters maximize the likelihood of the recorded activity. Although this is widely used, we show that the resulting model does not produce realistic neural activity. To correct for this, we suggest to augment the log-like…

    Submitted 14 November, 2021; v1 submitted 18 June, 2021; originally announced June 2021.

  10. arXiv:2105.12221  [pdf, other]

    cs.LG

    Geometry of the Loss Landscape in Overparameterized Neural Networks: Symmetries and Invariances

    Authors: Berfin Şimşek, François Ged, Arthur Jacot, Francesco Spadaro, Clément Hongler, Wulfram Gerstner, Johanni Brea

    Abstract: We study how permutation symmetries in overparameterized multi-layer neural networks generate `symmetry-induced' critical points. Assuming a network with $ L $ layers of minimal widths $ r_1^*, \ldots, r_{L-1}^* $ reaches a zero-loss minimum at $ r_1^*! \cdots r_{L-1}^*! $ isolated points that are permutations of one another, we show that adding one extra neuron to each layer is sufficient to conn…

    Submitted 12 September, 2021; v1 submitted 25 May, 2021; originally announced May 2021.

    Comments: 29 pages, 12 figures, ICML 2021

  11. arXiv:2010.08262  [pdf, other]

    cs.NE cs.AI cs.AR cs.CV cs.LG

    Local plasticity rules can learn deep representations using self-supervised contrastive predictions

    Authors: Bernd Illing, Jean Ventura, Guillaume Bellec, Wulfram Gerstner

    Abstract: Learning in the brain is poorly understood and learning rules that respect biological constraints, yet yield deep hierarchical representations, are still unknown. Here, we propose a learning rule that takes inspiration from neuroscience and recent advances in self-supervised deep learning. Learning minimizes a simple layer-specific loss function and does not need to back-propagate error signals wi…

    Submitted 25 October, 2021; v1 submitted 16 October, 2020; originally announced October 2020.

  12. arXiv:1910.10559  [pdf, other]

    q-bio.NC cs.NE

    Working memory facilitates reward-modulated Hebbian learning in recurrent neural networks

    Authors: Roman Pogodin, Dane Corneil, Alexander Seeholzer, Joseph Heng, Wulfram Gerstner

    Abstract: Reservoir computing is a powerful tool to explain how the brain learns temporal sequences, such as movements, but existing learning schemes are either biologically implausible or too inefficient to explain animal performance. We show that a network can learn complicated sequences with a reward-modulated Hebbian learning rule if the network of reservoir neurons is combined with a second network tha…

    Submitted 23 October, 2019; originally announced October 2019.

    Comments: NeurIPS 2019 workshop "Real Neurons & Hidden Units: Future directions at the intersection of neuroscience and artificial intelligence", Vancouver, Canada

  13. arXiv:1907.02936  [pdf, other]

    stat.ML cs.LG q-bio.NC stat.AP

    Learning in Volatile Environments with the Bayes Factor Surprise

    Authors: Vasiliki Liakoni, Alireza Modirshanechi, Wulfram Gerstner, Johanni Brea

    Abstract: Surprise-based learning allows agents to rapidly adapt to non-stationary stochastic environments characterized by sudden changes. We show that exact Bayesian inference in a hierarchical model gives rise to a surprise-modulated trade-off between forgetting old observations and integrating them with the new ones. The modulation depends on a probability ratio, which we call "Bayes Factor Surprise", t…

    Submitted 23 September, 2020; v1 submitted 5 July, 2019; originally announced July 2019.

  14. arXiv:1907.02911  [pdf, other]

    cs.LG stat.ML

    Weight-space symmetry in deep networks gives rise to permutation saddles, connected by equal-loss valleys across the loss landscape

    Authors: Johanni Brea, Berfin Simsek, Bernd Illing, Wulfram Gerstner

    Abstract: The permutation symmetry of neurons in each layer of a deep neural network gives rise not only to multiple equivalent global minima of the loss function, but also to first-order saddle points located on the path between the global minima. In a network of $d-1$ hidden layers with $n_k$ neurons in layers $k = 1, \ldots, d$, we construct smooth paths between equivalent global minima that lead through…

    Submitted 5 July, 2019; originally announced July 2019.

  15. Biologically plausible deep learning -- but how far can we go with shallow networks?

    Authors: Bernd Illing, Wulfram Gerstner, Johanni Brea

    Abstract: Training deep neural networks with the error backpropagation algorithm is considered implausible from a biological perspective. Numerous recent publications suggest elaborate models for biologically plausible variants of deep learning, typically defining success as reaching around 98% test accuracy on the MNIST data set. Here, we investigate how far we can go on digit (MNIST) and object (CIFAR10)…

    Submitted 17 June, 2019; v1 submitted 27 February, 2019; originally announced May 2019.

    Comments: 14 pages, 4 figures

    Journal ref: Neural Networks, Volume 118, October 2019, Pages 90-101

  16. arXiv:1812.06669  [pdf, other]

    cs.SD cs.LG eess.AS stat.ML

    Learning to Generate Music with BachProp

    Authors: Florian Colombo, Johanni Brea, Wulfram Gerstner

    Abstract: As deep learning advances, algorithms of music composition increase in performance. However, most of the successful models are designed for specific musical structures. Here, we present BachProp, an algorithmic composer that can generate music scores in many styles given sufficient training data. To adapt BachProp to a broad range of musical styles, we propose a novel representation of music and t…

    Submitted 12 June, 2019; v1 submitted 17 December, 2018; originally announced December 2018.

    Journal ref: in Proceedings of the 16th Sound and Music Computing Conference. 2019. p. 380-386

  17. arXiv:1802.05162  [pdf, other]

    cs.SD eess.AS

    BachProp: Learning to Compose Music in Multiple Styles

    Authors: Florian Colombo, Wulfram Gerstner

    Abstract: Hand in hand with deep learning advancements, algorithms of music composition increase in performance. However, most of the successful models are designed for specific musical structures. Here, we present BachProp, an algorithmic composer that can generate music scores in any style given sufficient training data. To adapt BachProp to a broad range of musical styles, we propose a novel normalized r…

    Submitted 20 February, 2018; v1 submitted 14 February, 2018; originally announced February 2018.

    Comments: Preliminary work. Under review by the 2018 International Conference on Machine Learning (ICML)

  18. arXiv:1802.04325  [pdf, other]

    cs.LG cs.AI stat.ML

    Efficient Model-Based Deep Reinforcement Learning with Variational State Tabulation

    Authors: Dane Corneil, Wulfram Gerstner, Johanni Brea

    Abstract: Modern reinforcement learning algorithms reach super-human performance on many board and video games, but they are sample inefficient, i.e. they typically require significantly more playing experience than humans to reach an equal performance level. To improve sample efficiency, an agent may build a model of the environment and use planning methods to update its policy. In this article we introduc…

    Submitted 11 June, 2018; v1 submitted 12 February, 2018; originally announced February 2018.

    Comments: Accepted at ICML 2018; camera-ready version

  19. arXiv:1712.10158  [pdf, other]

    q-bio.NC cs.LG cs.NE eess.SY stat.ML

    Non-linear motor control by local learning in spiking neural networks

    Authors: Aditya Gilra, Wulfram Gerstner

    Abstract: Learning weights in a spiking neural network with hidden neurons, using local, stable and online rules, to control non-linear body dynamics is an open problem. Here, we employ a supervised scheme, Feedback-based Online Local Learning Of Weights (FOLLOW), to train a network of heterogeneous spiking neurons with hidden layers, to control a two-link arm so as to reproduce a desired state trajectory.…

    Submitted 29 December, 2017; originally announced December 2017.

    Journal ref: Proceedings of the 35th International Conference on Machine Learning, PMLR 80:1773-1782, 2018

  20. arXiv:1712.10062  [pdf, other]

    q-bio.NC cs.LG cs.NE stat.ML

    Multi-timescale memory dynamics in a reinforcement learning network with attention-gated memory

    Authors: Marco Martinolli, Wulfram Gerstner, Aditya Gilra

    Abstract: Learning and memory are intertwined in our brain and their relationship is at the core of several recent neural network models. In particular, the Attention-Gated MEmory Tagging model (AuGMEnT) is a reinforcement learning network with an emphasis on biological plausibility of memory dynamics and learning. We find that the AuGMEnT network does not solve some hierarchical tasks, where higher-level s…

    Submitted 28 December, 2017; originally announced December 2017.

    Journal ref: Frontiers in Computational Neuroscience, 12 July 2018 | https://doi.org/10.3389/fncom.2018.00050

  21. arXiv:1702.06463  [pdf, other]

    q-bio.NC cs.LG cs.NE eess.SY

    Predicting non-linear dynamics by stable local learning in a recurrent spiking neural network

    Authors: Aditya Gilra, Wulfram Gerstner

    Abstract: Brains need to predict how the body reacts to motor commands. It is an open question how networks of spiking neurons can learn to reproduce the non-linear body dynamics caused by motor commands, using local, online and stable learning rules. Here, we present a supervised learning scheme for the feedforward and recurrent connections in a network of heterogeneous spiking neurons. The error in the ou…

    Submitted 26 April, 2017; v1 submitted 21 February, 2017; originally announced February 2017.

    Journal ref: eLife 2017;6:e28295

  22. arXiv:1612.03214  [pdf, other]

    cs.LG cs.NE q-bio.NC

    Towards deep learning with spiking neurons in energy based models with contrastive Hebbian plasticity

    Authors: Thomas Mesnard, Wulfram Gerstner, Johanni Brea

    Abstract: In machine learning, error back-propagation in multi-layer neural networks (deep learning) has been impressively successful in supervised and reinforcement learning tasks. As a model for learning in the brain, however, deep learning has long been regarded as implausible, since it relies in its basic form on a non-local plasticity rule. To overcome this problem, energy-based models with local contr…

    Submitted 9 December, 2016; originally announced December 2016.

  23. Algorithmic Composition of Melodies with Deep Recurrent Neural Networks

    Authors: Florian Colombo, Samuel P. Muscinelli, Alexander Seeholzer, Johanni Brea, Wulfram Gerstner

    Abstract: A big challenge in algorithmic composition is to devise a model that is both easily trainable and able to reproduce the long-range temporal dependencies typical of music. Here we investigate how artificial neural networks can be trained on a large corpus of melodies and turned into automated music composers able to generate new melodies coherent with the style they have been trained on. We employ…

    Submitted 23 June, 2016; originally announced June 2016.

    Comments: Proceeding of the 1st Conference on Computer Simulation of Musical Creativity, Huddersfield University

  24. arXiv:1606.05642  [pdf, other]

    stat.ML cs.LG q-bio.NC

    Balancing New Against Old Information: The Role of Surprise in Learning

    Authors: Mohammadjavad Faraji, Kerstin Preuschoff, Wulfram Gerstner

    Abstract: Surprise describes a range of phenomena from unexpected events to behavioral responses. We propose a measure of surprise and use it for surprise-driven learning. Our surprise measure takes into account data likelihood as well as the degree of commitment to a belief via the entropy of the belief distribution. We find that surprise-minimizing learning dynamically adjusts the balance between new and…

    Submitted 1 March, 2017; v1 submitted 17 June, 2016; originally announced June 2016.

  25. Nonlinear Hebbian learning as a unifying principle in receptive field formation

    Authors: Carlos S. N. Brito, Wulfram Gerstner

    Abstract: The development of sensory receptive fields has been modeled in the past by a variety of models including normative models such as sparse coding or independent component analysis and bottom-up models such as spike-timing dependent plasticity or the Bienenstock-Cooper-Munro model of synaptic plasticity. Here we show that the above variety of approaches can all be unified into a single common princi…

    Submitted 4 January, 2016; originally announced January 2016.