Showing 1–40 of 40 results for author: Balduzzi, D

  1. arXiv:2110.04041  [pdf, other]

    cs.AI

    Pick Your Battles: Interaction Graphs as Population-Level Objectives for Strategic Diversity

    Authors: Marta Garnelo, Wojciech Marian Czarnecki, Siqi Liu, Dhruva Tirumala, Junhyuk Oh, Gauthier Gidel, Hado van Hasselt, David Balduzzi

    Abstract: Strategic diversity is often essential in games: in multi-player games, for example, evaluating a player against a diverse set of strategies will yield a more accurate estimate of its performance. Furthermore, in games with non-transitivities, diversity allows a player to cover several winning strategies. However, despite the significance of strategic diversity, training agents that exhibit diverse…

    Submitted 8 October, 2021; originally announced October 2021.

  2. arXiv:2010.00575  [pdf, other]

    cs.MA cs.GT

    D3C: Reducing the Price of Anarchy in Multi-Agent Learning

    Authors: Ian Gemp, Kevin R. McKee, Richard Everett, Edgar A. Duéñez-Guzmán, Yoram Bachrach, David Balduzzi, Andrea Tacchetti

    Abstract: In multiagent systems, the complex interaction of fixed incentives can lead agents to outcomes that are poor (inefficient) not only for the group, but also for each individual. Price of anarchy is a technical, game-theoretic definition that quantifies the inefficiency arising in these scenarios -- it compares the welfare that can be achieved through perfect coordination against that achieved by se…

    Submitted 20 February, 2022; v1 submitted 1 October, 2020; originally announced October 2020.

    Comments: Published in AAMAS 2022

  3. arXiv:2004.09468  [pdf, other]

    cs.LG stat.ML

    Real World Games Look Like Spinning Tops

    Authors: Wojciech Marian Czarnecki, Gauthier Gidel, Brendan Tracey, Karl Tuyls, Shayegan Omidshafiei, David Balduzzi, Max Jaderberg

    Abstract: This paper investigates the geometrical properties of real world games (e.g. Tic-Tac-Toe, Go, StarCraft II). We hypothesise that their geometrical structure resembles a spinning top, with the upright axis representing transitive strength, and the radial axis, which corresponds to the number of cycles that exist at a particular transitive strength, representing the non-transitive dimension. We prove…

    Submitted 17 June, 2020; v1 submitted 20 April, 2020; originally announced April 2020.

  4. arXiv:2003.00799  [pdf, other]

    cs.GT cs.LG cs.MA stat.ML

    Learning to Resolve Alliance Dilemmas in Many-Player Zero-Sum Games

    Authors: Edward Hughes, Thomas W. Anthony, Tom Eccles, Joel Z. Leibo, David Balduzzi, Yoram Bachrach

    Abstract: Zero-sum games have long guided artificial intelligence research, since they possess both a rich strategy space of best-responses and a clear evaluation metric. What's more, competition is a vital mechanism in many real-world multi-agent systems capable of generating intelligent innovations: Darwinian evolution, the market economy and the AlphaZero algorithm, to name a few. In two-player zero-sum…

    Submitted 27 February, 2020; originally announced March 2020.

    Comments: Accepted for publication at AAMAS 2020

  5. arXiv:2002.08456  [pdf, other]

    cs.GT cs.LG stat.ML

    From Poincaré Recurrence to Convergence in Imperfect Information Games: Finding Equilibrium via Regularization

    Authors: Julien Perolat, Remi Munos, Jean-Baptiste Lespiau, Shayegan Omidshafiei, Mark Rowland, Pedro Ortega, Neil Burch, Thomas Anthony, David Balduzzi, Bart De Vylder, Georgios Piliouras, Marc Lanctot, Karl Tuyls

    Abstract: In this paper we investigate the Follow the Regularized Leader dynamics in sequential imperfect information games (IIG). We generalize existing results of Poincaré recurrence from normal-form games to zero-sum two-player imperfect information games and other sequential game settings. We then investigate how adapting the reward (by adding a regularization term) of the game can give strong convergen…

    Submitted 19 February, 2020; originally announced February 2020.

    Comments: 43 pages

  6. arXiv:2002.05820  [pdf, other]

    stat.ML cs.GT cs.LG

    A Limited-Capacity Minimax Theorem for Non-Convex Games or: How I Learned to Stop Worrying about Mixed-Nash and Love Neural Nets

    Authors: Gauthier Gidel, David Balduzzi, Wojciech Marian Czarnecki, Marta Garnelo, Yoram Bachrach

    Abstract: Adversarial training, a special case of multi-objective optimization, is an increasingly prevalent machine learning technique: some of its most notable applications include GAN-based generative modeling and self-play techniques in reinforcement learning which have been applied to complex games such as Go or Poker. In practice, a single pair of networks is typically trained in order to find…

    Submitted 15 March, 2021; v1 submitted 13 February, 2020; originally announced February 2020.

    Comments: Appears in: Proceedings of the 24th International Conference on Artificial Intelligence and Statistics (AISTATS 2021). 19 pages

  7. arXiv:2001.04678  [pdf, other]

    cs.LG cs.AI cs.GT cs.MA stat.ML

    Smooth markets: A basic mechanism for organizing gradient-based learners

    Authors: David Balduzzi, Wojciech M Czarnecki, Thomas W Anthony, Ian M Gemp, Edward Hughes, Joel Z Leibo, Georgios Piliouras, Thore Graepel

    Abstract: With the success of modern machine learning, it is becoming increasingly important to understand and control how learning algorithms interact. Unfortunately, negative results from game theory show there is little hope of understanding or controlling general n-player games. We therefore introduce smooth markets (SM-games), a class of n-player games with pairwise zero sum interactions. SM-games codi…

    Submitted 18 January, 2020; v1 submitted 14 January, 2020; originally announced January 2020.

    Comments: 18 pages, 3 figures

    Journal ref: ICLR 2020

  8. arXiv:1912.00953  [pdf, other]

    cs.LG stat.ML

    LOGAN: Latent Optimisation for Generative Adversarial Networks

    Authors: Yan Wu, Jeff Donahue, David Balduzzi, Karen Simonyan, Timothy Lillicrap

    Abstract: Training generative adversarial networks requires balancing of delicate adversarial dynamics. Even with careful tuning, training may diverge or end up in a bad equilibrium with dropped modes. In this work, we improve CS-GAN with natural gradient-based latent optimisation and show that it improves adversarial dynamics by enhancing interactions between the discriminator and the generator. Our experi…

    Submitted 1 July, 2020; v1 submitted 2 December, 2019; originally announced December 2019.

    Comments: Improved writing, added new analysis and evaluation

  9. arXiv:1905.04926  [pdf, other]

    cs.LG cs.GT cs.MA cs.NE stat.ML

    Differentiable Game Mechanics

    Authors: Alistair Letcher, David Balduzzi, Sebastien Racaniere, James Martens, Jakob Foerster, Karl Tuyls, Thore Graepel

    Abstract: Deep learning is built on the foundational guarantee that gradient descent on an objective function converges to local minima. Unfortunately, this guarantee fails in settings, such as generative adversarial nets, that exhibit multiple interacting losses. The behavior of gradient-based methods in games is not well understood -- and is becoming increasingly important as adversarial and multi-objecti…

    Submitted 13 May, 2019; originally announced May 2019.

    Comments: JMLR 2019, journal version of arXiv:1802.05642

    Journal ref: Journal of Machine Learning Research (JMLR), v20 (84) 1-40, 2019

  10. arXiv:1901.08106  [pdf, other]

    cs.LG cs.GT cs.MA stat.ML

    Open-ended Learning in Symmetric Zero-sum Games

    Authors: David Balduzzi, Marta Garnelo, Yoram Bachrach, Wojciech M. Czarnecki, Julien Perolat, Max Jaderberg, Thore Graepel

    Abstract: Zero-sum games such as chess and poker are, abstractly, functions that evaluate pairs of agents, for example labeling them 'winner' and 'loser'. If the game is approximately transitive, then self-play generates sequences of agents of increasing strength. However, nontransitive games, such as rock-paper-scissors, can exhibit strategic cycles, and there is no longer a clear objective -- we want agen…

    Submitted 13 May, 2019; v1 submitted 23 January, 2019; originally announced January 2019.

    Comments: ICML 2019, final version

  11. arXiv:1811.08469  [pdf, other]

    cs.MA cs.AI cs.LG

    Stable Opponent Shaping in Differentiable Games

    Authors: Alistair Letcher, Jakob Foerster, David Balduzzi, Tim Rocktäschel, Shimon Whiteson

    Abstract: A growing number of learning methods are actually differentiable games whose players optimise multiple, interdependent objectives in parallel -- from GANs and intrinsic curiosity to multi-agent RL. Opponent shaping is a powerful approach to improve learning dynamics in these games, accounting for player influence on others' updates. Learning with Opponent-Learning Awareness (LOLA) is a recent algo…

    Submitted 17 January, 2021; v1 submitted 20 November, 2018; originally announced November 2018.

    Comments: 20 pages, 7 figures

  12. arXiv:1806.02643  [pdf, other]

    cs.LG cs.GT stat.ML

    Re-evaluating Evaluation

    Authors: David Balduzzi, Karl Tuyls, Julien Perolat, Thore Graepel

    Abstract: Progress in machine learning is measured by careful evaluation on problems of outstanding common interest. However, the proliferation of benchmark suites and environments, adversarial attacks, and other complications has diluted the basic evaluation model by overwhelming researchers with choices. Deliberate or accidental cherry picking is increasingly likely, and designing well-balanced evaluation…

    Submitted 30 October, 2018; v1 submitted 7 June, 2018; originally announced June 2018.

    Comments: NIPS 2018, final version

  13. arXiv:1802.05642  [pdf, other]

    cs.LG cs.GT cs.MA cs.NE

    The Mechanics of n-Player Differentiable Games

    Authors: David Balduzzi, Sebastien Racaniere, James Martens, Jakob Foerster, Karl Tuyls, Thore Graepel

    Abstract: The cornerstone underpinning deep learning is the guarantee that gradient descent on an objective converges to local minima. Unfortunately, this guarantee fails in settings, such as generative adversarial nets, where there are multiple interacting losses. The behavior of gradient-based methods in games is not well understood -- and is becoming increasingly important as adversarial and multi-object…

    Submitted 6 June, 2018; v1 submitted 15 February, 2018; originally announced February 2018.

    Comments: ICML 2018, final version

    Journal ref: PMLR volume 80, 2018

  14. arXiv:1702.08591  [pdf, other]

    cs.NE cs.LG stat.ML

    The Shattered Gradients Problem: If resnets are the answer, then what is the question?

    Authors: David Balduzzi, Marcus Frean, Lennox Leary, JP Lewis, Kurt Wan-Duo Ma, Brian McWilliams

    Abstract: A long-standing obstacle to progress in deep learning is the problem of vanishing and exploding gradients. Although the problem has largely been overcome via carefully constructed initializations and batch normalization, architectures incorporating skip-connections such as highway and resnets perform much better than standard feedforward architectures despite well-chosen initialization and batch…

    Submitted 6 June, 2018; v1 submitted 27 February, 2017; originally announced February 2017.

    Comments: ICML 2017, final version

    Journal ref: PMLR volume 70 (2017)

  15. arXiv:1702.07450  [pdf, other]

    cs.LG cs.AI cs.GT

    Strongly-Typed Agents are Guaranteed to Interact Safely

    Authors: David Balduzzi

    Abstract: As artificial agents proliferate, it is becoming increasingly important to ensure that their interactions with one another are well-behaved. In this paper, we formalize a common-sense notion of when algorithms are well-behaved: an algorithm is safe if it does no harm. Motivated by recent progress in deep learning, we focus on the specific case where agents update their actions according to gradien…

    Submitted 6 June, 2018; v1 submitted 23 February, 2017; originally announced February 2017.

    Comments: ICML 2017, final version

    Journal ref: PMLR volume 70, 2017

  16. arXiv:1611.02345  [pdf, other]

    cs.LG cs.NE stat.ML

    Neural Taylor Approximations: Convergence and Exploration in Rectifier Networks

    Authors: David Balduzzi, Brian McWilliams, Tony Butler-Yeoman

    Abstract: Modern convolutional networks, incorporating rectifiers and max-pooling, are neither smooth nor convex; standard guarantees therefore do not apply. Nevertheless, methods from convex optimization such as gradient descent and Adam are widely used as building blocks for deep learning algorithms. This paper provides the first convergence guarantee applicable to modern convnets, which furthermore match…

    Submitted 6 June, 2018; v1 submitted 7 November, 2016; originally announced November 2016.

    Comments: ICML 2017, final version

    Journal ref: PMLR volume 70, 2017

  17. arXiv:1607.03516  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Deep Reconstruction-Classification Networks for Unsupervised Domain Adaptation

    Authors: Muhammad Ghifary, W. Bastiaan Kleijn, Mengjie Zhang, David Balduzzi, Wen Li

    Abstract: In this paper, we propose a novel unsupervised domain adaptation algorithm based on deep learning for visual object recognition. Specifically, we design a new model called Deep Reconstruction-Classification Network (DRCN), which jointly learns a shared encoding representation for two tasks: i) supervised classification of labeled source data, and ii) unsupervised reconstruction of unlabeled target…

    Submitted 1 August, 2016; v1 submitted 12 July, 2016; originally announced July 2016.

    Comments: to appear in European Conference on Computer Vision (ECCV) 2016

  18. arXiv:1604.01952  [pdf, ps, other]

    cs.LG cs.GT cs.NE stat.ML

    Deep Online Convex Optimization with Gated Games

    Authors: David Balduzzi

    Abstract: Methods from convex optimization are widely used as building blocks for deep learning algorithms. However, the reasons for their empirical success are unclear, since modern convolutional networks (convnets), incorporating rectifier units and max-pooling, are neither smooth nor convex. Standard guarantees therefore do not apply. This paper provides the first convergence rates for gradient descent o…

    Submitted 7 April, 2016; originally announced April 2016.

    Comments: 13 pages. This paper renders arXiv:1509.01851 obsolete. It contains the same basic results, with major changes to exposition and minor changes to terminology

  19. arXiv:1602.02852  [pdf, other]

    stat.ML cs.LG

    Compliance-Aware Bandits

    Authors: Nicolás Della Penna, Mark D. Reid, David Balduzzi

    Abstract: Motivated by clinical trials, we study bandits with observable non-compliance. At each step, the learner chooses an arm; afterwards, instead of observing only the reward, it also observes the action that took place. We show that such noncompliance can be helpful or hurtful to the learner in general. Unfortunately, naively incorporating compliance information into bandit algorithms loses guarantees on s…

    Submitted 8 February, 2016; originally announced February 2016.

  20. arXiv:1602.02218  [pdf, ps, other]

    cs.LG cs.NE

    Strongly-Typed Recurrent Neural Networks

    Authors: David Balduzzi, Muhammad Ghifary

    Abstract: Recurrent neural networks are increasingly popular models for sequential learning. Unfortunately, although the most effective RNN architectures are perhaps excessively complicated, extensive searches have not found simpler alternatives. This paper imports ideas from physics and functional programming into RNN design to provide guiding principles. From physics, we introduce type constraints, analogou…

    Submitted 24 May, 2016; v1 submitted 6 February, 2016; originally announced February 2016.

    Comments: 10 pages, final version, ICML 2016

  21. arXiv:1510.04373  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Scatter Component Analysis: A Unified Framework for Domain Adaptation and Domain Generalization

    Authors: Muhammad Ghifary, David Balduzzi, W. Bastiaan Kleijn, Mengjie Zhang

    Abstract: This paper addresses classification tasks on a particular target domain in which labeled training data are only available from source domains different from (but related to) the target. Two closely related frameworks, domain adaptation and domain generalization, are concerned with such tasks, where the only difference between those frameworks is the availability of the unlabeled target data: domai…

    Submitted 26 July, 2016; v1 submitted 14 October, 2015; originally announced October 2015.

    Comments: to appear in IEEE Transactions on Pattern Analysis and Machine Intelligence

    ACM Class: I.2.6; I.4

  22. arXiv:1509.08627  [pdf, ps, other]

    cs.LG cs.NE stat.ML

    Semantics, Representations and Grammars for Deep Learning

    Authors: David Balduzzi

    Abstract: Deep learning is currently the subject of intensive study. However, fundamental concepts such as representations are not formally defined -- researchers "know them when they see them" -- and there is no common language for describing and analyzing algorithms. This essay proposes an abstract framework that identifies the essential features of current practice and may provide a foundation for future…

    Submitted 29 September, 2015; originally announced September 2015.

    Comments: 20 pages, many diagrams

  23. arXiv:1509.03005  [pdf, other]

    cs.LG cs.AI cs.NE stat.ML

    Compatible Value Gradients for Reinforcement Learning of Continuous Deep Policies

    Authors: David Balduzzi, Muhammad Ghifary

    Abstract: This paper proposes GProp, a deep reinforcement learning algorithm for continuous policies with compatible function approximation. The algorithm is based on two innovations. Firstly, we present a temporal-difference based method for learning the gradient of the value-function. Secondly, we present the deviator-actor-critic (DAC) model, which comprises three neural networks that estimate the value…

    Submitted 10 September, 2015; originally announced September 2015.

    Comments: 27 pages

  24. arXiv:1509.01851   

    cs.LG cs.GT cs.NE

    Deep Online Convex Optimization by Putting Forecaster to Sleep

    Authors: David Balduzzi

    Abstract: Methods from convex optimization such as accelerated gradient descent are widely used as building blocks for deep learning algorithms. However, the reasons for their empirical success are unclear, since neural networks are not convex and standard guarantees do not apply. This paper develops the first rigorous link between online convex optimization and error backpropagation on convolutional networ…

    Submitted 7 April, 2016; v1 submitted 6 September, 2015; originally announced September 2015.

    Comments: Rendered obsolete by arXiv:1604.01952. The new version contains the same basic results, with major changes to exposition and minor changes to terminology

  25. arXiv:1508.07680  [pdf, other]

    cs.CV cs.AI cs.LG stat.ML

    Domain Generalization for Object Recognition with Multi-task Autoencoders

    Authors: Muhammad Ghifary, W. Bastiaan Kleijn, Mengjie Zhang, David Balduzzi

    Abstract: The problem of domain generalization is to take knowledge acquired from a number of related domains where training data is available, and to then successfully apply it to previously unseen domains. We propose a new feature learning algorithm, Multi-Task Autoencoder (MTAE), that provides good generalization performance for cross-domain object recognition. Our algorithm extends the standard denois…

    Submitted 31 August, 2015; originally announced August 2015.

    Comments: accepted in ICCV 2015

  26. arXiv:1411.6191  [pdf, other]

    cs.LG cs.NE q-bio.NC

    Kickback cuts Backprop's red-tape: Biologically plausible credit assignment in neural networks

    Authors: David Balduzzi, Hastagiri Vanchinathan, Joachim Buhmann

    Abstract: Error backpropagation is an extremely effective algorithm for assigning credit in artificial neural networks. However, weight updates under Backprop depend on lengthy recursive computations and require separate output and error messages -- features not shared by biological neurons, that are perhaps unnecessary. In this paper, we revisit Backprop and the credit assignment problem. We first decompos…

    Submitted 22 November, 2014; originally announced November 2014.

    Comments: 7 pages. To appear, AAAI-15

  27. arXiv:1408.6618  [pdf, ps, other]

    cs.LG math.ST stat.ML

    Falsifiable implies Learnable

    Authors: David Balduzzi

    Abstract: The paper demonstrates that falsifiability is fundamental to learning. We prove the following theorem for statistical learning and sequential prediction: If a theory is falsifiable then it is learnable -- i.e. admits a strategy that predicts optimally. An analogous result is shown for universal induction.

    Submitted 27 August, 2014; originally announced August 2014.

  28. arXiv:1401.1465  [pdf, other]

    cs.AI cs.GT cs.LG cs.MA q-bio.NC

    Cortical prediction markets

    Authors: David Balduzzi

    Abstract: We investigate cortical learning from the perspective of mechanism design. First, we show that discretizing standard models of neurons and synaptic plasticity leads to rational agents maximizing simple scoring rules. Second, our main result is that the scoring rules are proper, implying that neurons faithfully encode expected utilities in their synaptic weights and encode high-scoring outcomes in…

    Submitted 7 January, 2014; originally announced January 2014.

    Comments: To appear, AAMAS 2014

  29. arXiv:1310.6536  [pdf, ps, other]

    cs.LG q-bio.NC stat.ML

    Randomized co-training: from cortical neurons to machine learning and back again

    Authors: David Balduzzi

    Abstract: Despite its size and complexity, the human cortex exhibits striking anatomical regularities, suggesting there may be simple meta-algorithms underlying cortical learning and computation. We expect such meta-algorithms to be of interest since they need to operate quickly, scalably and effectively with little-to-no specialized assumptions. This note focuses on a specific question: How can neurons use…

    Submitted 24 October, 2013; originally announced October 2013.

    Comments: NIPS workshop: Randomized methods for machine learning

  30. arXiv:1306.5554  [pdf, ps, other]

    stat.ML cs.LG

    Correlated random features for fast semi-supervised learning

    Authors: Brian McWilliams, David Balduzzi, Joachim M. Buhmann

    Abstract: This paper presents Correlated Nystrom Views (XNV), a fast semi-supervised algorithm for regression and classification. The algorithm draws on two main ideas. First, it generates two views consisting of computationally inexpensive random features. Second, XNV applies multiview regression using Canonical Correlation Analysis (CCA) on unlabeled data to bias the regression towards useful features. It…

    Submitted 5 November, 2013; v1 submitted 24 June, 2013; originally announced June 2013.

    Comments: 15 pages, 3 figures, 6 tables

  31. arXiv:1301.2115  [pdf, ps, other]

    stat.ML cs.LG

    Domain Generalization via Invariant Feature Representation

    Authors: Krikamol Muandet, David Balduzzi, Bernhard Schölkopf

    Abstract: This paper investigates domain generalization: How to take knowledge acquired from an arbitrary number of related domains and apply it to previously unseen domains? We propose Domain-Invariant Component Analysis (DICA), a kernel-based optimization algorithm that learns an invariant transformation by minimizing the dissimilarity across domains, whilst preserving the functional relationship between…

    Submitted 10 January, 2013; originally announced January 2013.

    Comments: The 30th International Conference on Machine Learning (ICML 2013)

  32. arXiv:1210.4695  [pdf, ps, other]

    q-bio.NC cs.IT cs.LG

    Regulating the information in spikes: a useful bias

    Authors: David Balduzzi

    Abstract: The bias/variance tradeoff is fundamental to learning: increasing a model's complexity can improve its fit on training data, but potentially worsens performance on future samples. Remarkably, however, the human brain effortlessly handles a wide range of complex pattern recognition tasks. On the basis of these conflicting observations, it has been argued that useful biases in the form of "generic m…

    Submitted 17 October, 2012; originally announced October 2012.

    Comments: NIPS 2012 workshop on Information in Perception and Action

  33. arXiv:1209.5549  [pdf, other]

    q-bio.NC cs.LG stat.ML

    Towards a learning-theoretic analysis of spike-timing dependent plasticity

    Authors: David Balduzzi, Michel Besserve

    Abstract: This paper suggests a learning-theoretic perspective on how synaptic plasticity benefits global brain functioning. We introduce a model, the selectron, that (i) arises as the fast time constant limit of leaky integrate-and-fire neurons equipped with spike-timing dependent plasticity (STDP) and (ii) is amenable to theoretical analysis. We show that the selectron encodes reward estimates into spik…

    Submitted 25 September, 2012; originally announced September 2012.

    Comments: To appear in Adv. Neural Inf. Proc. Systems

  34. arXiv:1206.1898  [pdf, ps, other]

    stat.ML cs.AI math.ST

    A Nonparametric Conjugate Prior Distribution for the Maximizing Argument of a Noisy Function

    Authors: Pedro A. Ortega, Jordi Grau-Moya, Tim Genewein, David Balduzzi, Daniel A. Braun

    Abstract: We propose a novel Bayesian approach to solve stochastic optimization problems that involve finding extrema of noisy, nonlinear functions. Previous work has focused on representing possible functions explicitly, which leads to a two-step procedure of first, doing inference over the function space and second, finding the extrema of these functions. Here we skip the representation step and directly…

    Submitted 10 November, 2012; v1 submitted 8 June, 2012; originally announced June 2012.

    Comments: 9 pages, 5 figures

    Journal ref: Neural Information Processing Systems (NIPS) 2012

  35. arXiv:1202.4482  [pdf, other]

    q-bio.NC cs.LG nlin.AO

    Metabolic cost as an organizing principle for cooperative learning

    Authors: David Balduzzi, Pedro A Ortega, Michel Besserve

    Abstract: This paper investigates how neurons can use metabolic cost to facilitate learning at a population level. Although decision-making by individual neurons has been extensively studied, questions regarding how neurons should behave to cooperate effectively remain largely unaddressed. Under assumptions that capture a few basic features of cortical neurons, we show that constraining reward maximization…

    Submitted 9 February, 2013; v1 submitted 20 February, 2012; originally announced February 2012.

    Comments: 14 pages, 2 figures, to appear in Advances in Complex Systems

  36. arXiv:1111.5648  [pdf, other]

    stat.ML cs.IT cs.LG

    Falsification and future performance

    Authors: David Balduzzi

    Abstract: We information-theoretically reformulate two measures of capacity from statistical learning theory: empirical VC-entropy and empirical Rademacher complexity. We show these capacity measures count the number of hypotheses about a dataset that a learning algorithm falsifies when it finds the classifier in its repertoire minimizing empirical risk. It then follows that the future performance of p…

    Submitted 23 November, 2011; originally announced November 2011.

    Comments: 10 pages, 2 figures

  37. arXiv:1110.3592  [pdf, ps, other]

    cs.IT cs.LG stat.ML

    Information, learning and falsification

    Authors: David Balduzzi

    Abstract: There are (at least) three approaches to quantifying information. The first, algorithmic information or Kolmogorov complexity, takes events as strings and, given a universal Turing machine, quantifies the information content of a string as the length of the shortest program producing it. The second, Shannon information, takes events as belonging to ensembles and quantifies the information resultin…

    Submitted 28 November, 2011; v1 submitted 17 October, 2011; originally announced October 2011.

  38. arXiv:1107.1222  [pdf, other]

    cs.IT cs.DC cs.NE math.CT nlin.CG

    On the information-theoretic structure of distributed measurements

    Authors: David Balduzzi

    Abstract: The internal structure of a measuring device, which depends on what its components are and how they are organized, determines how it categorizes its inputs. This paper presents a geometric approach to studying the internal structure of measurements performed by distributed systems such as probabilistic cellular automata. It constructs the quale, a family of sections of a suitably defined presheaf…

    Submitted 30 July, 2012; v1 submitted 6 July, 2011; originally announced July 2011.

    Comments: In Proceedings DCM 2011, arXiv:1207.6821

    Journal ref: EPTCS 88, 2012, pp. 28-42

  39. arXiv:1105.0697  [pdf, ps, other]

    cs.SI cs.DS cs.IR physics.soc-ph

    Uncovering the Temporal Dynamics of Diffusion Networks

    Authors: Manuel Gomez Rodriguez, David Balduzzi, Bernhard Schölkopf

    Abstract: Time plays an essential role in the diffusion of information, influence and disease over networks. In many cases we only observe when a node copies information, makes a decision or becomes infected -- but the connectivity, transmission rates between nodes and transmission sources are unknown. Inferring the underlying dynamics is of outstanding interest since it enables forecasting, influencing and…

    Submitted 3 May, 2011; originally announced May 2011.

    Comments: To appear in the 28th International Conference on Machine Learning (ICML), 2011. Website: http://www.stanford.edu/~manuelgr/netrate/

    ACM Class: H.2.8

  40. arXiv:1105.0158  [pdf, other]

    cs.IT nlin.CG q-bio.NC

    Detecting emergent processes in cellular automata with excess information

    Authors: David Balduzzi

    Abstract: Many natural processes occur over characteristic spatial and temporal scales. This paper presents tools for (i) flexibly and scalably coarse-graining cellular automata and (ii) identifying which coarse-grainings express an automaton's dynamics well, and which express its dynamics badly. We apply the tools to investigate a range of examples in Conway's Game of Life and Hopfield networks and demonst…

    Submitted 22 September, 2011; v1 submitted 1 May, 2011; originally announced May 2011.

    Comments: 8 pages, 6 figures

    Report number: Advances in Artificial Life, ECAL 2011