Zum Hauptinhalt springen

Showing 1–24 of 24 results for author: Hunt, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2210.01542  [pdf, other

    cs.LG cs.AI

    Hyperbolic Deep Reinforcement Learning

    Authors: Edoardo Cetin, Benjamin Chamberlain, Michael Bronstein, Jonathan J Hunt

    Abstract: We propose a new class of deep reinforcement learning (RL) algorithms that model latent representations in hyperbolic space. Sequential decision-making requires reasoning about the possible future consequences of current behavior. Consequently, capturing the relationship between key evolving features for a given task is conducive to recovering effective policies. To this end, hyperbolic geometry p… ▽ More

    Submitted 4 October, 2022; originally announced October 2022.

    Comments: Preprint

  2. arXiv:2208.06193  [pdf, other

    cs.LG stat.ML

    Diffusion Policies as an Expressive Policy Class for Offline Reinforcement Learning

    Authors: Zhendong Wang, Jonathan J Hunt, Mingyuan Zhou

    Abstract: Offline reinforcement learning (RL), which aims to learn an optimal policy using a previously collected static dataset, is an important paradigm of RL. Standard RL methods often perform poorly in this regime due to the function approximation errors on out-of-distribution actions. While a variety of regularization methods have been proposed to mitigate this issue, they are often constrained by poli… ▽ More

    Submitted 25 August, 2023; v1 submitted 12 August, 2022; originally announced August 2022.

    Comments: ICLR 2023

  3. arXiv:2202.08812  [pdf, other

    cs.IR cs.LG

    Should I send this notification? Optimizing push notifications decision making by modeling the future

    Authors: Conor O'Brien, Huasen Wu, Shaodan Zhai, Dalin Guo, Wenzhe Shi, Jonathan J Hunt

    Abstract: Most recommender systems are myopic, that is they optimize based on the immediate response of the user. This may be misaligned with the true objective, such as creating long term user satisfaction. In this work we focus on mobile push notifications, where the long term effects of recommender system decisions can be particularly strong. For example, sending too many or irrelevant notifications may… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

  4. arXiv:2201.12666  [pdf, other

    cs.LG cs.CR cs.IR

    Challenges and approaches to privacy preserving post-click conversion prediction

    Authors: Conor O'Brien, Arvind Thiagarajan, Sourav Das, Rafael Barreto, Chetan Verma, Tim Hsu, James Neufield, Jonathan J Hunt

    Abstract: Online advertising has typically been more personalized than offline advertising, through the use of machine learning models and real-time auctions for ad targeting. One specific task, predicting the likelihood of conversion (i.e.\ the probability a user will purchase the advertised product), is crucial to the advertising ecosystem for both targeting and pricing ads. Currently, these models are of… ▽ More

    Submitted 29 January, 2022; originally announced January 2022.

  5. arXiv:2201.07681  [pdf, ps, other

    cs.IR cs.LG

    Learning to Rank For Push Notifications Using Pairwise Expected Regret

    Authors: Yuguang Yue, Yuanpu Xie, Huasen Wu, Haofeng Jia, Shaodan Zhai, Wenzhe Shi, Jonathan J Hunt

    Abstract: Listwise ranking losses have been widely studied in recommender systems. However, new paradigms of content consumption present new challenges for ranking methods. In this work we contribute an analysis of learning to rank for personalized mobile push notifications and discuss the unique challenges this presents compared to traditional ranking problems. To address these challenges, we introduce a n… ▽ More

    Submitted 19 January, 2022; originally announced January 2022.

  6. arXiv:2111.00917  [pdf, other

    cs.LG physics.comp-ph physics.flu-dyn

    Adaptive Modeling Powers Fast Multi-parameter Fitting of CARS Spectra

    Authors: Gregory J. Hunt, Cody R. Ground, Andrew D. Cutler

    Abstract: Coherent anti-Stokes Raman Spectroscopy (CARS) is a laser-based measurement technique widely applied across many science and engineering disciplines to perform non-intrusive gas diagnostics. CARS is often used to study combustion, where the measured spectra can be used to simultaneously recover multiple flow parameters from the reacting gas such as temperature and relative species mole fractions.… ▽ More

    Submitted 26 October, 2021; originally announced November 2021.

    Comments: 14 pages, 6 figures

  7. arXiv:2110.01398  [pdf

    cs.DC cs.CR

    Enabling Blockchain Scalability and Interoperability with Mobile Computing through LayerOne.X

    Authors: Kevin Coutinho, Ponnie Clark, Ferdinand Azis, Norman Lip, Josh Hunt

    Abstract: Interoperability and scalability are currently the bottlenecks preventing mass adoption of blockchain technology. Development of an interoperable and scalable network that promotes a truly decentralised, permissionless and secure blockchain as well as one that enables micro validation is the main goal of this project. Layer-One.X, a truly decentralised ledger which utilises para-sharding, Directed… ▽ More

    Submitted 30 September, 2021; originally announced October 2021.

    Comments: 40 pages

  8. arXiv:2109.08245  [pdf, other

    cs.SI

    The 2021 RecSys Challenge Dataset: Fairness is not optional

    Authors: Luca Belli, Alykhan Tejani, Frank Portman, Alexandre Lung-Yut-Fong, Ben Chamberlain, Yuanpu Xie, Kristian Lum, Jonathan Hunt, Michael Bronstein, Vito Walter Anelli, Saikishore Kalloori, Bruce Ferwerda, Wenzhe Shi

    Abstract: After the success the RecSys 2020 Challenge, we are describing a novel and bigger dataset that was released in conjunction with the ACM RecSys Challenge 2021. This year's dataset is not only bigger (~ 1B data points, a 5 fold increase), but for the first time it take into consideration fairness aspects of the challenge. Unlike many static datsets, a lot of effort went into making sure that the dat… ▽ More

    Submitted 21 September, 2021; v1 submitted 16 September, 2021; originally announced September 2021.

  9. An Analysis Of Entire Space Multi-Task Models For Post-Click Conversion Prediction

    Authors: Conor O'Brien, Kin Sum Liu, James Neufeld, Rafael Barreto, Jonathan J Hunt

    Abstract: Industrial recommender systems are frequently tasked with approximating probabilities for multiple, often closely related, user actions. For example, predicting if a user will click on an advertisement and if they will then purchase the advertised product. The conceptual similarity between these tasks has promoted the use of multi-task learning: a class of algorithms that aim to bring positive ind… ▽ More

    Submitted 18 August, 2021; originally announced August 2021.

    Comments: RecSys 21 Late Breaking Results

  10. arXiv:2106.13105  [pdf, other

    cs.AI cs.LG

    The Option Keyboard: Combining Skills in Reinforcement Learning

    Authors: André Barreto, Diana Borsa, Shaobo Hou, Gheorghe Comanici, Eser Aygün, Philippe Hamel, Daniel Toyama, Jonathan Hunt, Shibl Mourad, David Silver, Doina Precup

    Abstract: The ability to combine known skills to create new ones may be crucial in the solution of complex reinforcement learning problems that unfold over extended periods. We argue that a robust way of combining skills is to define and manipulate them in the space of pseudo-rewards (or "cumulants"). Based on this premise, we propose a framework for combining skills using the formalism of options. We show… ▽ More

    Submitted 24 June, 2021; originally announced June 2021.

    Comments: Published at NeurIPS 2019

  11. arXiv:2010.13778  [pdf

    physics.ed-ph cs.ET cs.GL quant-ph

    Achieving a quantum smart workforce

    Authors: Clarice D. Aiello, D. D. Awschalom, Hannes Bernien, Tina Brower-Thomas, Kenneth R. Brown, Todd A. Brun, Justin R. Caram, Eric Chitambar, Rosa Di Felice, Michael F. J. Fox, Stephan Haas, Alexander W. Holleitner, Eric R. Hudson, Jeffrey H. Hunt, Robert Joynt, Scott Koziol, H. J. Lewandowski, Douglas T. McClure, Jens Palsberg, Gina Passante, Kristen L. Pudenz, Christopher J. K. Richardson, Jessica L. Rosenberg, R. S. Ross, Mark Saffman , et al. (7 additional authors not shown)

    Abstract: Interest in building dedicated Quantum Information Science and Engineering (QISE) education programs has greatly expanded in recent years. These programs are inherently convergent, complex, often resource intensive and likely require collaboration with a broad variety of stakeholders. In order to address this combination of challenges, we have captured ideas from many members in the community. Thi… ▽ More

    Submitted 23 October, 2020; originally announced October 2020.

    Comments: 18 pages, 2 figures, 1 table

    Journal ref: Quantum Sci. Technol. 6 030501 (2021)

  12. arXiv:2009.05524  [pdf, other

    cs.AI cs.LG

    Physically Embedded Planning Problems: New Challenges for Reinforcement Learning

    Authors: Mehdi Mirza, Andrew Jaegle, Jonathan J. Hunt, Arthur Guez, Saran Tunyasuvunakool, Alistair Muldal, Théophane Weber, Peter Karkus, Sébastien Racanière, Lars Buesing, Timothy Lillicrap, Nicolas Heess

    Abstract: Recent work in deep reinforcement learning (RL) has produced algorithms capable of mastering challenging games such as Go, chess, or shogi. In these works the RL agent directly observes the natural state of the game and controls that state directly with its actions. However, when humans play such games, they do not just reason about the moves but also interact with their physical environment. They… ▽ More

    Submitted 29 October, 2020; v1 submitted 11 September, 2020; originally announced September 2020.

    Comments: 17 pages + appendix. Updated text and references

  13. arXiv:2006.10495  [pdf, other

    physics.soc-ph cs.SI

    A Modified Epidemiological Model to Understand the Uneven Impact of COVID-19 on Vulnerable Individuals and the Approaches Required to Help them Emerge from Lockdown

    Authors: Dario Ortega Anderez, Eiman Kanjo, Ganna Pogrebna, Shane Johnson, John Alan Hunt

    Abstract: COVID-19 has shown a relatively low mortality rate in young healthy individuals, with the majority of this group being asymptomatic or having mild symptoms, while the severity of the disease among individuals with underlying health conditions has caused signiffcant mortality rates worldwide. Understanding these differences in mortality amongst different sectors of society and modelling this will e… ▽ More

    Submitted 19 June, 2020; v1 submitted 25 May, 2020; originally announced June 2020.

  14. arXiv:2005.09146  [pdf

    physics.med-ph cs.CV cs.HC

    3D Augmented Reality-Assisted CT-Guided Interventions: System Design and Preclinical Trial on an Abdominal Phantom using HoloLens 2

    Authors: Brian J. Park, Stephen J. Hunt, Gregory J. Nadolski, Terence P. Gade

    Abstract: Background: Out-of-plane lesions pose challenges for CT-guided interventions. Augmented reality (AR) headset devices have evolved and are readily capable to provide virtual 3D guidance to improve CT-guided targeting. Purpose: To describe the design of a three-dimensional (3D) AR-assisted navigation system using HoloLens 2 and evaluate its performance through CT-guided simulations. Materials an… ▽ More

    Submitted 18 May, 2020; originally announced May 2020.

    Comments: 16 pages, 6 figures, 2 tables

  15. arXiv:2004.10931  [pdf

    stat.ML cs.LG stat.AP

    Active Learning for Gaussian Process Considering Uncertainties with Application to Shape Control of Composite Fuselage

    Authors: Xiaowei Yue, Yuchen Wen, Jeffrey H. Hunt, Jianjun Shi

    Abstract: In the machine learning domain, active learning is an iterative data selection algorithm for maximizing information acquisition and improving model performance with limited training samples. It is very useful, especially for the industrial applications where training samples are expensive, time-consuming, or difficult to obtain. Existing methods mainly focus on active learning for classification,… ▽ More

    Submitted 22 April, 2020; originally announced April 2020.

  16. arXiv:1907.07713  [pdf, other

    eess.IV cs.LG q-bio.QM stat.ML

    An AI-Augmented Lesion Detection Framework For Liver Metastases With Model Interpretability

    Authors: Xin J. Hunt, Ralph Abbey, Ricky Tharrington, Joost Huiskens, Nina Wesdorp

    Abstract: Colorectal cancer (CRC) is the third most common cancer and the second leading cause of cancer-related deaths worldwide. Most CRC deaths are the result of progression of metastases. The assessment of metastases is done using the RECIST criterion, which is time consuming and subjective, as clinicians need to manually measure anatomical tumor sizes. AI has many successes in image object detection, b… ▽ More

    Submitted 17 July, 2019; originally announced July 2019.

    Comments: 4 pages, 2 figures, 2019 KDD Workshop on Applied Data Science for Healthcare

  17. arXiv:1812.02216  [pdf, other

    cs.LG stat.ML

    Composing Entropic Policies using Divergence Correction

    Authors: Jonathan J Hunt, Andre Barreto, Timothy P Lillicrap, Nicolas Heess

    Abstract: Composing previously mastered skills to solve novel tasks promises dramatic improvements in the data efficiency of reinforcement learning. Here, we analyze two recent works composing behaviors represented in the form of action-value functions and show that they perform poorly in some situations. As part of this analysis, we extend an important generalization of policy improvement to the maximum en… ▽ More

    Submitted 5 July, 2019; v1 submitted 5 December, 2018; originally announced December 2018.

  18. arXiv:1807.02442  [pdf, other

    stat.ML cs.LG

    Multi-Task Learning with Incomplete Data for Healthcare

    Authors: Xin J. Hunt, Saba Emrani, Ilknur Kaynar Kabul, Jorge Silva

    Abstract: Multi-task learning is a type of transfer learning that trains multiple tasks simultaneously and leverages the shared information between related tasks to improve the generalization performance. However, missing features in the input matrix is a much more difficult problem which needs to be carefully addressed. Removing records with missing values can significantly reduce the sample size, which is… ▽ More

    Submitted 6 July, 2018; originally announced July 2018.

    Comments: 4 pages, 3 figures, 1 table, 2018 KDD Workshop on Machine Learning for Medicine and Healthcare

  19. arXiv:1610.09027  [pdf, other

    cs.LG

    Scaling Memory-Augmented Neural Networks with Sparse Reads and Writes

    Authors: Jack W Rae, Jonathan J Hunt, Tim Harley, Ivo Danihelka, Andrew Senior, Greg Wayne, Alex Graves, Timothy P Lillicrap

    Abstract: Neural networks augmented with external memory have the ability to learn algorithmic solutions to complex tasks. These models appear promising for applications such as language modeling and machine translation. However, they scale poorly in both space and time as the amount of memory grows --- limiting their applicability to real-world domains. Here, we present an end-to-end differentiable memory… ▽ More

    Submitted 27 October, 2016; originally announced October 2016.

    Comments: in 30th Conference on Neural Information Processing Systems (NIPS 2016), Barcelona, Spain

  20. arXiv:1606.05312  [pdf, other

    cs.AI

    Successor Features for Transfer in Reinforcement Learning

    Authors: André Barreto, Will Dabney, Rémi Munos, Jonathan J. Hunt, Tom Schaul, Hado van Hasselt, David Silver

    Abstract: Transfer in reinforcement learning refers to the notion that generalization should occur not only within a task but also across tasks. We propose a transfer framework for the scenario where the reward function changes between tasks but the environment's dynamics remain the same. Our approach rests on two key ideas: "successor features", a value function representation that decouples the dynamics o… ▽ More

    Submitted 12 April, 2018; v1 submitted 16 June, 2016; originally announced June 2016.

    Comments: Published at NIPS 2017

  21. arXiv:1512.07679  [pdf, other

    cs.AI cs.LG cs.NE stat.ML

    Deep Reinforcement Learning in Large Discrete Action Spaces

    Authors: Gabriel Dulac-Arnold, Richard Evans, Hado van Hasselt, Peter Sunehag, Timothy Lillicrap, Jonathan Hunt, Timothy Mann, Theophane Weber, Thomas Degris, Ben Coppin

    Abstract: Being able to reason in an environment with a large number of discrete actions is essential to bringing reinforcement learning to a larger class of problems. Recommender systems, industrial plants and language models are only some of the many real-world tasks involving large numbers of discrete actions for which current methods are difficult or even often impossible to apply. An ability to general… ▽ More

    Submitted 4 April, 2016; v1 submitted 23 December, 2015; originally announced December 2015.

  22. arXiv:1512.04455  [pdf, other

    cs.LG

    Memory-based control with recurrent neural networks

    Authors: Nicolas Heess, Jonathan J Hunt, Timothy P Lillicrap, David Silver

    Abstract: Partially observed control problems are a challenging aspect of reinforcement learning. We extend two related, model-free algorithms for continuous control -- deterministic policy gradient and stochastic value gradient -- to solve partially observed domains using recurrent neural networks trained with backpropagation through time. We demonstrate that this approach, coupled with long-short term m… ▽ More

    Submitted 14 December, 2015; originally announced December 2015.

    Comments: NIPS Deep Reinforcement Learning Workshop 2015

  23. arXiv:1509.02971  [pdf, other

    cs.LG stat.ML

    Continuous control with deep reinforcement learning

    Authors: Timothy P. Lillicrap, Jonathan J. Hunt, Alexander Pritzel, Nicolas Heess, Tom Erez, Yuval Tassa, David Silver, Daan Wierstra

    Abstract: We adapt the ideas underlying the success of Deep Q-Learning to the continuous action domain. We present an actor-critic, model-free algorithm based on the deterministic policy gradient that can operate over continuous action spaces. Using the same learning algorithm, network architecture and hyper-parameters, our algorithm robustly solves more than 20 simulated physics tasks, including classic pr… ▽ More

    Submitted 5 July, 2019; v1 submitted 9 September, 2015; originally announced September 2015.

    Comments: 10 pages + supplementary

  24. arXiv:0908.3022  [pdf

    cs.HC cs.SE

    An approach for the automated risk assessment of structural differences between spreadsheets (DiffXL)

    Authors: John Hunt

    Abstract: This paper outlines an approach to manage and quantify the risks associated with changes made to spreadsheets. The methodology focuses on structural differences between spreadsheets and suggests a technique by which a risk analysis can be achieved in an automated environment. The paper offers an example that demonstrates how contiguous ranges of data can be mapped into a generic list of formulae… ▽ More

    Submitted 20 August, 2009; originally announced August 2009.

    Comments: 10 Pages, Numerous Colour Diagrams & Screenshots

    Journal ref: Proc. European Spreadsheet Risks Int. Grp. (EuSpRIG) 2009 ISBN 978-1-905617-89-0