Zum Hauptinhalt springen

Showing 1–23 of 23 results for author: Harley, T

.
  1. arXiv:2404.10179  [pdf, other

    cs.RO cs.AI cs.HC cs.LG

    Scaling Instructable Agents Across Many Simulated Worlds

    Authors: SIMA Team, Maria Abi Raad, Arun Ahuja, Catarina Barros, Frederic Besse, Andrew Bolt, Adrian Bolton, Bethanie Brownfield, Gavin Buttimore, Max Cant, Sarah Chakera, Stephanie C. Y. Chan, Jeff Clune, Adrian Collister, Vikki Copeman, Alex Cullum, Ishita Dasgupta, Dario de Cesare, Julia Di Trapani, Yani Donchev, Emma Dunleavy, Martin Engelcke, Ryan Faulkner, Frankie Garcia, Charles Gbadamosi , et al. (68 additional authors not shown)

    Abstract: Building embodied AI systems that can follow arbitrary language instructions in any 3D environment is a key challenge for creating general AI. Accomplishing this goal requires learning to ground language in perception and embodied actions, in order to accomplish complex tasks. The Scalable, Instructable, Multiworld Agent (SIMA) project tackles this by training agents to follow free-form instructio… ▽ More

    Submitted 17 April, 2024; v1 submitted 13 March, 2024; originally announced April 2024.

  2. arXiv:2112.03763  [pdf, other

    cs.LG

    Creating Multimodal Interactive Agents with Imitation and Self-Supervised Learning

    Authors: DeepMind Interactive Agents Team, Josh Abramson, Arun Ahuja, Arthur Brussee, Federico Carnevale, Mary Cassin, Felix Fischer, Petko Georgiev, Alex Goldin, Mansi Gupta, Tim Harley, Felix Hill, Peter C Humphreys, Alden Hung, Jessica Landon, Timothy Lillicrap, Hamza Merzic, Alistair Muldal, Adam Santoro, Guy Scully, Tamara von Glehn, Greg Wayne, Nathaniel Wong, Chen Yan, Rui Zhu

    Abstract: A common vision from science fiction is that robots will one day inhabit our physical spaces, sense the world as we do, assist our physical labours, and communicate with us through natural language. Here we study how to design artificial agents that can interact naturally with humans using the simplification of a virtual environment. We show that imitation learning of human-human interactions in a… ▽ More

    Submitted 2 February, 2022; v1 submitted 7 December, 2021; originally announced December 2021.

  3. arXiv:2012.05672  [pdf, other

    cs.LG cs.AI cs.MA

    Imitating Interactive Intelligence

    Authors: Josh Abramson, Arun Ahuja, Iain Barr, Arthur Brussee, Federico Carnevale, Mary Cassin, Rachita Chhaparia, Stephen Clark, Bogdan Damoc, Andrew Dudzik, Petko Georgiev, Aurelia Guy, Tim Harley, Felix Hill, Alden Hung, Zachary Kenton, Jessica Landon, Timothy Lillicrap, Kory Mathewson, Soňa Mokrá, Alistair Muldal, Adam Santoro, Nikolay Savinov, Vikrant Varma, Greg Wayne , et al. (4 additional authors not shown)

    Abstract: A common vision from science fiction is that robots will one day inhabit our physical spaces, sense the world as we do, assist our physical labours, and communicate with us through natural language. Here we study how to design artificial agents that can interact naturally with humans using the simplification of a virtual environment. This setting nevertheless integrates a number of the central cha… ▽ More

    Submitted 20 January, 2021; v1 submitted 10 December, 2020; originally announced December 2020.

  4. arXiv:2005.09382  [pdf, other

    cs.CL

    Human Instruction-Following with Deep Reinforcement Learning via Transfer-Learning from Text

    Authors: Felix Hill, Sona Mokra, Nathaniel Wong, Tim Harley

    Abstract: Recent work has described neural-network-based agents that are trained with reinforcement learning (RL) to execute language-like commands in simulated worlds, as a step towards an intelligent agent or robot that can be instructed by human users. However, the optimisation of multi-goal motor policies via deep RL from scratch requires many episodes of experience. Consequently, instruction-following… ▽ More

    Submitted 19 May, 2020; originally announced May 2020.

  5. arXiv:1902.01894  [pdf, other

    cs.AI cs.DC cs.LG cs.NE

    A Generalized Framework for Population Based Training

    Authors: Ang Li, Aleksandra Spyra, Sagi Perel, Valentin Dalibard, Max Jaderberg, Chenjie Gu, David Budden, Tim Harley, Pramod Gupta

    Abstract: Population Based Training (PBT) is a recent approach that jointly optimizes neural network weights and hyperparameters which periodically copies weights of the best performers and mutates hyperparameters during training. Previous PBT implementations have been synchronized glass-box systems. We propose a general, black-box PBT framework that distributes many asynchronous "trials" (a small number of… ▽ More

    Submitted 5 February, 2019; originally announced February 2019.

    Comments: 9 pages

  6. Dynamic Control Flow in Large-Scale Machine Learning

    Authors: Yuan Yu, Martín Abadi, Paul Barham, Eugene Brevdo, Mike Burrows, Andy Davis, Jeff Dean, Sanjay Ghemawat, Tim Harley, Peter Hawkins, Michael Isard, Manjunath Kudlur, Rajat Monga, Derek Murray, Xiaoqiang Zheng

    Abstract: Many recent machine learning models rely on fine-grained dynamic control flow for training and inference. In particular, models based on recurrent neural networks and on reinforcement learning depend on recurrence relations, data-dependent conditional execution, and other features that call for dynamic control flow. These applications benefit from the ability to make rapid control-flow decisions a… ▽ More

    Submitted 4 May, 2018; originally announced May 2018.

    Comments: Appeared in EuroSys 2018. 14 pages, 16 figures

    Journal ref: EuroSys 2018: Thirteenth EuroSys Conference, April 23-26, 2018, Porto, Portugal. ACM, New York, NY, USA

  7. arXiv:1803.10760  [pdf, other

    cs.LG stat.ML

    Unsupervised Predictive Memory in a Goal-Directed Agent

    Authors: Greg Wayne, Chia-Chun Hung, David Amos, Mehdi Mirza, Arun Ahuja, Agnieszka Grabska-Barwinska, Jack Rae, Piotr Mirowski, Joel Z. Leibo, Adam Santoro, Mevlana Gemici, Malcolm Reynolds, Tim Harley, Josh Abramson, Shakir Mohamed, Danilo Rezende, David Saxton, Adam Cain, Chloe Hillier, David Silver, Koray Kavukcuoglu, Matt Botvinick, Demis Hassabis, Timothy Lillicrap

    Abstract: Animals execute goal-directed behaviours despite the limited range and scope of their sensors. To cope, they explore environments and store memories maintaining estimates of important information that is not presently available. Recently, progress has been made with artificial intelligence (AI) agents that learn to perform tasks from sensory input, even at a human level, by merging reinforcement l… ▽ More

    Submitted 28 March, 2018; originally announced March 2018.

  8. arXiv:1802.01561  [pdf, other

    cs.LG cs.AI

    IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

    Authors: Lasse Espeholt, Hubert Soyer, Remi Munos, Karen Simonyan, Volodymir Mnih, Tom Ward, Yotam Doron, Vlad Firoiu, Tim Harley, Iain Dunning, Shane Legg, Koray Kavukcuoglu

    Abstract: In this work we aim to solve a large collection of tasks using a single reinforcement learning agent with a single set of parameters. A key challenge is to handle the increased amount of data and extended training time. We have developed a new distributed agent IMPALA (Importance Weighted Actor-Learner Architecture) that not only uses resources more efficiently in single-machine training but also… ▽ More

    Submitted 28 June, 2018; v1 submitted 5 February, 2018; originally announced February 2018.

  9. arXiv:1612.08810  [pdf, other

    cs.LG cs.AI cs.NE

    The Predictron: End-To-End Learning and Planning

    Authors: David Silver, Hado van Hasselt, Matteo Hessel, Tom Schaul, Arthur Guez, Tim Harley, Gabriel Dulac-Arnold, David Reichert, Neil Rabinowitz, Andre Barreto, Thomas Degris

    Abstract: One of the key challenges of artificial intelligence is to learn models that are effective in the context of planning. In this document we introduce the predictron architecture. The predictron consists of a fully abstract model, represented by a Markov reward process, that can be rolled forward multiple "imagined" planning steps. Each forward pass of the predictron accumulates internal rewards and… ▽ More

    Submitted 20 July, 2017; v1 submitted 28 December, 2016; originally announced December 2016.

    Comments: Camera-ready version, ICML 2017, with supplement

  10. arXiv:1610.09027  [pdf, other

    cs.LG

    Scaling Memory-Augmented Neural Networks with Sparse Reads and Writes

    Authors: Jack W Rae, Jonathan J Hunt, Tim Harley, Ivo Danihelka, Andrew Senior, Greg Wayne, Alex Graves, Timothy P Lillicrap

    Abstract: Neural networks augmented with external memory have the ability to learn algorithmic solutions to complex tasks. These models appear promising for applications such as language modeling and machine translation. However, they scale poorly in both space and time as the amount of memory grows --- limiting their applicability to real-world domains. Here, we present an end-to-end differentiable memory… ▽ More

    Submitted 27 October, 2016; originally announced October 2016.

    Comments: in 30th Conference on Neural Information Processing Systems (NIPS 2016), Barcelona, Spain

  11. arXiv:1602.01783  [pdf, other

    cs.LG

    Asynchronous Methods for Deep Reinforcement Learning

    Authors: Volodymyr Mnih, Adrià Puigdomènech Badia, Mehdi Mirza, Alex Graves, Timothy P. Lillicrap, Tim Harley, David Silver, Koray Kavukcuoglu

    Abstract: We propose a conceptually simple and lightweight framework for deep reinforcement learning that uses asynchronous gradient descent for optimization of deep neural network controllers. We present asynchronous variants of four standard reinforcement learning algorithms and show that parallel actor-learners have a stabilizing effect on training allowing all four methods to successfully train neural n… ▽ More

    Submitted 16 June, 2016; v1 submitted 4 February, 2016; originally announced February 2016.

    Journal ref: ICML 2016

  12. arXiv:1112.5507  [pdf, other

    math.OC cs.DS q-bio.NC

    Fast Approximate Quadratic Programming for Large (Brain) Graph Matching

    Authors: Joshua T. Vogelstein, John M. Conroy, Vince Lyzinski, Louis J. Podrazik, Steven G. Kratzer, Eric T. Harley, Donniell E. Fishkind, R. Jacob Vogelstein, Carey E. Priebe

    Abstract: Quadratic assignment problems (QAPs) arise in a wide variety of domains, ranging from operations research to graph theory to computer vision to neuroscience. In the age of big data, graph valued data is becoming more prominent, and with it, a desire to run algorithms on ever larger graphs. Because QAP is NP-hard, exact algorithms are intractable. Approximate algorithms necessarily employ an accura… ▽ More

    Submitted 13 September, 2014; v1 submitted 22 December, 2011; originally announced December 2011.

    Comments: 17 pages, 5 figures, 2 tables

  13. arXiv:1010.2142  [pdf, ps, other

    cond-mat.mes-hall cond-mat.other

    Spin-orbit fields in asymmetric (001) quantum wells

    Authors: P. S. Eldridge, J. Hübner, S. Oertel, R. T. Harley, M. Henini, M. Oestreich

    Abstract: We measure simultaneously the in-plane electron g-factor and spin relaxation rate in a series of undoped inversion-asymmetric (001)-oriented GaAs/AlGaAs quantum wells by spin-quantum beat spectroscopy. In combination the two quantities reveal the absolute values of both the Rashba and the Dresselhaus coefficients and prove that the Rashba coefficient can be negligibly small despite huge conduction… ▽ More

    Submitted 11 October, 2010; originally announced October 2010.

    Journal ref: Phys. Rev. B 83, 041301(R) (2011)

  14. arXiv:0907.5152  [pdf, other

    cond-mat.other

    Increased color conversion efficiency in hybrid light emitting diodes utilizing non-radiative energy transfer

    Authors: S. Chanyawadee, P. G. Lagoudakis, R. T. Harley, M. D. B. Charlton, D. V. Talapin, S. Lin

    Abstract: We fabricate a highly efficient colour conversion light emitting diode consisting of surface-patterned blue emitters and semiconductor colloidal nanocrystal quantum dots (NQDs). Electrically injected carriers in the blue emitter (donor) are efficiently transferred to the NQDs (acceptor) via nonradiative energy transfer in addition to conventional radiative energy transfer. The existence of nonra… ▽ More

    Submitted 29 July, 2009; originally announced July 2009.

  15. Absence of the Rashba effect in undoped asymmetric quantum wells

    Authors: P. S. Eldridge, W. J. H Leyland, J. D. Mar, P. G. Lagoudakis, R. Winkler, O. Z. Karimov, M. Henini, D Taylor, R. T. Phillips, R T Harley

    Abstract: To an electron moving in free space an electric field appears as a magnetic field which interacts with and can reorient the electron spin. In semiconductor quantum wells this spin-orbit interaction seems to offer the possibility of gate-voltage control in spintronic devices but, as the electrons are subject to both ion-core and macroscopic structural potentials, this over-simple picture has lead… ▽ More

    Submitted 30 July, 2008; originally announced July 2008.

    Comments: 7 pages, 5 figs

    Journal ref: Physical Review B82 045317 (2010)

  16. arXiv:0803.1230  [pdf

    cond-mat.other cond-mat.mes-hall

    Energy-Dependent Electron-Electron Scattering and Spin Dynamics in a Two Dimensional Electron Gas

    Authors: W. J. H. Leylanda, R. T. Harley, M. Henini, A. J. Shields, I. Farrer, D. A. Ritchie

    Abstract: Measurements of spin dynamics of electrons in a degenerate two dimensional electron gas, where the Dyakonov-Perel mechanism is dominant, have been used to investigate the electron scattering time (tp*) as a function of energy near the Fermi energy. Close to the Fermi energy the spin evolution is oscillatory, indicating a quasi-collision-free regime of spin dynamics. As the energy is increased a… ▽ More

    Submitted 8 March, 2008; originally announced March 2008.

    Comments: 9 pages 3 figures

  17. Non-radiative exciton energy transfer in hybrid organic-inorganic heterostructures

    Authors: S. Chanyawadee, P. G. Lagoudakis, R. T. Harley, D. G. Lidzey, M. Henini

    Abstract: Non-radiative optical energy transfer from a GaAs quantum well to a thin overlayer of an infrared organic semiconductor dye is unambiguously demonstrated. The dynamics of exciton transfer are studied in the time-domain using pump-probe spectroscopy at the donor site and fluorescence spectroscopy at the acceptor site. The effect is observed as simultaneous increase of the population decay rate at… ▽ More

    Submitted 16 November, 2007; originally announced November 2007.

  18. arXiv:0707.4493  [pdf

    cond-mat.other

    Temperature Dependence of Rashba Spin-orbit Coupling in Quantum Wells

    Authors: P. S. Eldridge, W. J. H. Leyland, P. G. Lagoudakis, O. Z. Karimov, M. Henini, D. Taylor, R. T. Phillips, R. T. Harley

    Abstract: We perform an all-optical spin-dynamic measurement of the Rashba spin-orbit interaction in (110)-oriented GaAs/AlGaAs quantum wells. The crystallographic direction of quantum confinement allows us to disentangle the contributions to spin-orbit coupling from the structural inversion asymmetry (Rashba term) and the bulk inversion asymmetry. We observe an unexpected temperature dependence of the Ra… ▽ More

    Submitted 30 July, 2007; originally announced July 2007.

  19. Oscillatory D'yakonov-Perel' spin dynamics in two dimensional electron gases

    Authors: W. J. H. Leyland, R. T. Harley, M. Henini, D. Taylor, A. J. Shields, I. Farrer, D. A. Ritchie

    Abstract: Optical pump-probe measurements of spin-dynamics at temperatures down to 1.5K are described for a series of (001)-oriented GaAs/AlGaAs quantum well samples containing high mobility two-dimensional electron gases (2DEGs). For well widths ranging from 5 nm to 20 nm and 2DEG sheet densities from 1.75x1011cm-2 to 3.5x1011cm-2 the evolution of a small injected spin population is found to be a damped… ▽ More

    Submitted 27 July, 2007; originally announced July 2007.

  20. arXiv:cond-mat/0610587  [pdf, ps, other

    cond-mat.mes-hall cond-mat.mtrl-sci cond-mat.other

    Enhanced spin relaxation time due to electron-electron scattering in semiconductors

    Authors: W. J. H. Leyland, G. H. John, R. T. Harley, M. M. Glazov, E. L. Ivchenko, D. A. Ritchie, A. J. Shields, M. Henini

    Abstract: We present a detailed experimental and theoretical analysis of the spin dynamics of two-dimensional electron gases (2DEGs) in a series of n-doped GaAs/AlGaAs quantum wells. Picosecond-resolution polarized pump-probe reflection techniques were applied in order to study in detail the temperature-, concentration- and quantum-well-width- dependencies of the spin relaxation rate of a small photoexcit… ▽ More

    Submitted 21 October, 2006; originally announced October 2006.

    Comments: 8 pages, 5 figures

  21. High temperature gate control of quantum well spin memory

    Authors: O. Z. Karimov, G. H. John, R. T. Harley, W. H. Lau, M. E. Flatte, M. Henini, R. Airey

    Abstract: Time-resolved optical measurements in (110)-oriented GaAs/AlGaAs quantum wells show a ten-fold increase of the spin-relaxation rate as a function of applied electric field from 20 to 80 kV cm-1 at 170 K and indicate a similar variation at 300 K, in agreement with calculations based on the Rashba effect. Spin relaxation is almost field-independent below 20 kV cm-1 reflecting quantum well interfac… ▽ More

    Submitted 16 May, 2003; originally announced May 2003.

    Comments: 4 pages, 4 EPS figures, REVTeX

  22. arXiv:cond-mat/0305260  [pdf, ps, other

    cond-mat.mes-hall

    D'yakonov-Perel' spin relaxation under electron-electron collisions in n-type QWs

    Authors: M. M. Glazov, E. L. Ivchenko, M. A. Brand, O. Z. Karimov, R. T. Harley

    Abstract: The D'yakonov-Perel' spin relaxation mechanism in n-doped GaAs/AlGaAs quantum wells (QWs) has been studied both theoretically and experimentally. The temperature dependence of the spin relaxation time has been calculated for arbitrary degeneracy of the 2D electron gas. The comparison between theory and experiment shows that, in high-mobility n-doped QWs, the studied spin decoherence is controlle… ▽ More

    Submitted 12 May, 2003; originally announced May 2003.

    Comments: 3 pages, 1 figure, accepted to Proc. Int. Symp. "Nanostructures: Physics and Technology" (St-Petersburg, Russia, 2003)

  23. Dynamic Nuclear Overhauser Shifts in Larmor beats from a quantum well

    Authors: A. Malinowski, R. T. Harley

    Abstract: The significance of nuclear spin polarisation in time-resolved optical studies of III-V semiconductors is addressed. Electron Larmor beats in pump-probe reflectivity from a GaAs/AlGaAs quantum well show Overhauser shift of 0.7 T due to accumulated nuclear polarisation <I>/I=0.065. This leads to precision values of electron g-factor, elucidates nuclear spin pumping and diffusion mechanisms in qua… ▽ More

    Submitted 28 October, 1999; originally announced October 1999.

    Comments: 9 pages, 2 figures