Learning Pitch with STDP: A Computational Model of Place and Temporal Pitch Perception Using Spiking Neural Networks

PLoS Comput Biol. 2016 Apr 6;12(4):e1004860. doi: 10.1371/journal.pcbi.1004860. eCollection 2016 Apr.

Abstract

Pitch perception is important for understanding speech prosody, music perception, recognizing tones in tonal languages, and perceiving speech in noisy environments. The two principal pitch perception theories consider the place of maximum neural excitation along the auditory nerve and the temporal pattern of the auditory neurons' action potentials (spikes) as pitch cues. This paper describes a biophysical mechanism by which fine-structure temporal information can be extracted from the spikes generated at the auditory periphery. Deriving meaningful pitch-related information from spike times requires neural structures specialized in capturing synchronous or correlated activity from amongst neural events. The emergence of such pitch-processing neural mechanisms is described through a computational model of auditory processing. Simulation results show that a correlation-based, unsupervised, spike-based form of Hebbian learning can explain the development of neural structures required for recognizing the pitch of simple and complex tones, with or without the fundamental frequency. The temporal code is robust to variations in the spectral shape of the signal and thus can explain the phenomenon of pitch constancy.

Publication types

  • Research Support, Non-U.S. Gov't

MeSH terms

  • Acoustic Stimulation
  • Action Potentials
  • Auditory Pathways / physiology
  • Biophysical Phenomena
  • Cochlear Implants
  • Computational Biology
  • Computer Simulation
  • Humans
  • Learning / physiology
  • Models, Neurological
  • Nerve Net / physiology
  • Neural Networks, Computer
  • Pitch Perception / physiology*
  • Time Perception

Grants and funding

This work was supported by the Australian Research Council Discovery Project Grant DP1094830 (DBG) and a Victorian Life Sciences Computation Initiative grant number VR0003 (DBG) on its Peak Computing Facility at the University of Melbourne, an initiative of the Victorian Government, Australia. The Bionics Institute acknowledges the support it receives from the Victorian Government through its Operational Infrastructure Support Program. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.