Zum Hauptinhalt springen

Showing 51–58 of 58 results for author: Rowland, M

.
  1. arXiv:1807.00400  [pdf, other

    stat.ML cs.LG

    Antithetic and Monte Carlo kernel estimators for partial rankings

    Authors: Maria Lomeli, Mark Rowland, Arthur Gretton, Zoubin Ghahramani

    Abstract: In the modern age, rankings data is ubiquitous and it is useful for a variety of applications such as recommender systems, multi-object tracking and preference learning. However, most rankings data encountered in the real world is incomplete, which prevents the direct application of existing modelling tools for complete rankings. Our contribution is a novel way to extend kernel methods for complet… ▽ More

    Submitted 25 July, 2018; v1 submitted 1 July, 2018; originally announced July 2018.

  2. arXiv:1804.11271  [pdf, other

    stat.ML cs.LG

    Gaussian Process Behaviour in Wide Deep Neural Networks

    Authors: Alexander G. de G. Matthews, Mark Rowland, Jiri Hron, Richard E. Turner, Zoubin Ghahramani

    Abstract: Whilst deep neural networks have shown great empirical success, there is still much work to be done to understand their theoretical properties. In this paper, we study the relationship between random, wide, fully connected, feedforward networks with more than one hidden layer and Gaussian processes with a recursive kernel definition. We show that, under broad conditions, as we make the architectur… ▽ More

    Submitted 16 August, 2018; v1 submitted 30 April, 2018; originally announced April 2018.

    Comments: This work substantially extends the work of Matthews et al. (2018) published at the International Conference on Learning Representations (ICLR) 2018

  3. arXiv:1804.02395  [pdf, other

    cs.LG cs.RO stat.ML

    Structured Evolution with Compact Architectures for Scalable Policy Optimization

    Authors: Krzysztof Choromanski, Mark Rowland, Vikas Sindhwani, Richard E. Turner, Adrian Weller

    Abstract: We present a new method of blackbox optimization via gradient approximation with the use of structured random orthogonal matrices, providing more accurate estimators than baselines and with provable theoretical guarantees. We show that this algorithm can be successfully applied to learn better quality compact policies than those using standard gradient estimation techniques. The compact policies w… ▽ More

    Submitted 12 June, 2018; v1 submitted 6 April, 2018; originally announced April 2018.

  4. arXiv:1802.08163  [pdf, other

    stat.ML

    An Analysis of Categorical Distributional Reinforcement Learning

    Authors: Mark Rowland, Marc G. Bellemare, Will Dabney, Rémi Munos, Yee Whye Teh

    Abstract: Distributional approaches to value-based reinforcement learning model the entire distribution of returns, rather than just their expected values, and have recently been shown to yield state-of-the-art empirical performance. This was demonstrated by the recently proposed C51 algorithm, based on categorical distributional reinforcement learning (CDRL) [Bellemare et al., 2017]. However, the theoretic… ▽ More

    Submitted 22 February, 2018; originally announced February 2018.

  5. arXiv:1710.10044  [pdf, other

    cs.AI cs.LG stat.ML

    Distributional Reinforcement Learning with Quantile Regression

    Authors: Will Dabney, Mark Rowland, Marc G. Bellemare, Rémi Munos

    Abstract: In reinforcement learning an agent interacts with the environment by taking actions and observing the next state and reward. When sampled probabilistically, these state transitions, rewards, and actions can all induce randomness in the observed long-term return. Traditionally, reinforcement learning algorithms average over this randomness to estimate the value function. In this paper, we build on… ▽ More

    Submitted 27 October, 2017; originally announced October 2017.

  6. arXiv:1703.00864  [pdf, other

    stat.ML stat.CO

    The Unreasonable Effectiveness of Structured Random Orthogonal Embeddings

    Authors: Krzysztof Choromanski, Mark Rowland, Adrian Weller

    Abstract: We examine a class of embeddings based on structured random matrices with orthogonal rows which can be applied in many machine learning applications including dimensionality reduction and kernel approximation. For both the Johnson-Lindenstrauss transform and the angular kernel, we show that we can select matrices yielding guaranteed improved performance in accuracy and/or speed compared to earlier… ▽ More

    Submitted 3 September, 2018; v1 submitted 2 March, 2017; originally announced March 2017.

  7. arXiv:1607.02738  [pdf, other

    stat.ML

    Magnetic Hamiltonian Monte Carlo

    Authors: Nilesh Tripuraneni, Mark Rowland, Zoubin Ghahramani, Richard Turner

    Abstract: Hamiltonian Monte Carlo (HMC) exploits Hamiltonian dynamics to construct efficient proposals for Markov chain Monte Carlo (MCMC). In this paper, we present a generalization of HMC which exploits \textit{non-canonical} Hamiltonian dynamics. We refer to this algorithm as magnetic HMC, since in 3 dimensions a subset of the dynamics map onto the mechanics of a charged particle coupled to a magnetic fi… ▽ More

    Submitted 19 August, 2017; v1 submitted 10 July, 2016; originally announced July 2016.

    Comments: 34th International Conference on Machine Learning (ICML 2017)

  8. arXiv:1511.03243  [pdf, other

    stat.ML

    Black-box $α$-divergence Minimization

    Authors: José Miguel Hernández-Lobato, Yingzhen Li, Mark Rowland, Daniel Hernández-Lobato, Thang Bui, Richard E. Turner

    Abstract: Black-box alpha (BB-$α$) is a new approximate inference method based on the minimization of $α$-divergences. BB-$α$ scales to large datasets because it can be implemented using stochastic gradient descent. BB-$α$ can be applied to complex probabilistic models with little effort since it only requires as input the likelihood function and its gradients. These gradients can be easily obtained using a… ▽ More

    Submitted 1 June, 2016; v1 submitted 10 November, 2015; originally announced November 2015.

    Comments: Accepted at ICML 2016. The first version (v1) was presented at NIPS workshops on Advances in Approximate Bayesian Inference and Black Box Learning and Inference