Zum Hauptinhalt springen

Showing 1–10 of 10 results for author: Markowitz, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2311.18684  [pdf, other

    cs.LG

    Handling Cost and Constraints with Off-Policy Deep Reinforcement Learning

    Authors: Jared Markowitz, Jesse Silverberg, Gary Collins

    Abstract: By reusing data throughout training, off-policy deep reinforcement learning algorithms offer improved sample efficiency relative to on-policy approaches. For continuous action spaces, the most popular methods for off-policy learning include policy improvement steps where a learned state-action ($Q$) value function is maximized over selected batches of data. These updates are often paired with regu… ▽ More

    Submitted 30 November, 2023; originally announced November 2023.

    Comments: 22 pages, 16 figures

  2. arXiv:2311.05846  [pdf, other

    cs.LG

    Clipped-Objective Policy Gradients for Pessimistic Policy Optimization

    Authors: Jared Markowitz, Edward W. Staley

    Abstract: To facilitate efficient learning, policy gradient approaches to deep reinforcement learning (RL) are typically paired with variance reduction measures and strategies for making large but safe policy changes based on a batch of experiences. Natural policy gradient methods, including Trust Region Policy Optimization (TRPO), seek to produce monotonic improvement through bounded changes in policy outp… ▽ More

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: 12 pages, 8 figures

  3. A Risk-Sensitive Approach to Policy Optimization

    Authors: Jared Markowitz, Ryan W. Gardner, Ashley Llorens, Raman Arora, I-Jeng Wang

    Abstract: Standard deep reinforcement learning (DRL) aims to maximize expected reward, considering collected experiences equally in formulating a policy. This differs from human decision-making, where gains and losses are valued differently and outlying outcomes are given increased consideration. It also fails to capitalize on opportunities to improve safety and/or performance through the incorporation of d… ▽ More

    Submitted 15 November, 2023; v1 submitted 18 August, 2022; originally announced August 2022.

    Comments: 16 pages, 13 figures. AAAI 2023 (Special Track on Safe and Robust AI)

  4. arXiv:2205.01235  [pdf, other

    cs.LG cs.NE

    Triangular Dropout: Variable Network Width without Retraining

    Authors: Edward W. Staley, Jared Markowitz

    Abstract: One of the most fundamental design choices in neural networks is layer width: it affects the capacity of what a network can learn and determines the complexity of the solution. This latter property is often exploited when introducing information bottlenecks, forcing a network to learn compressed representations. However, such an architecture decision is typically immutable once training begins; sw… ▽ More

    Submitted 2 May, 2022; originally announced May 2022.

  5. arXiv:2112.00583  [pdf, other

    cs.LG

    Meta Arcade: A Configurable Environment Suite for Meta-Learning

    Authors: Edward W. Staley, Chace Ashcraft, Benjamin Stoler, Jared Markowitz, Gautam Vallabha, Christopher Ratto, Kapil D. Katyal

    Abstract: Most approaches to deep reinforcement learning (DRL) attempt to solve a single task at a time. As a result, most existing research benchmarks consist of individual games or suites of games that have common interfaces but little overlap in their perceptual features, objectives, or reward structures. To facilitate research into knowledge transfer among trained agents (e.g. via multi-task and meta-le… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

    Comments: 17 pages, 6 figures, 6 tables, extended version of an accepted paper to NeurIPS DRL Workshop 2021

  6. Mixture Model Framework for Traumatic Brain Injury Prognosis Using Heterogeneous Clinical and Outcome Data

    Authors: Alan D. Kaplan, Qi Cheng, K. Aditya Mohan, Lindsay D. Nelson, Sonia Jain, Harvey Levin, Abel Torres-Espin, Austin Chou, J. Russell Huie, Adam R. Ferguson, Michael McCrea, Joseph Giacino, Shivshankar Sundaram, Amy J. Markowitz, Geoffrey T. Manley

    Abstract: Prognoses of Traumatic Brain Injury (TBI) outcomes are neither easily nor accurately determined from clinical indicators. This is due in part to the heterogeneity of damage inflicted to the brain, ultimately resulting in diverse and complex outcomes. Using a data-driven approach on many distinct data elements may be necessary to describe this large set of outcomes and thereby robustly depict the n… ▽ More

    Submitted 20 July, 2021; v1 submitted 22 December, 2020; originally announced December 2020.

    Comments: 12 pages, 5 figures

  7. arXiv:2012.12291  [pdf, other

    cs.RO cs.HC cs.LG

    Learning a Group-Aware Policy for Robot Navigation

    Authors: Kapil Katyal, Yuxiang Gao, Jared Markowitz, Sara Pohland, Corban Rivera, I-Jeng Wang, Chien-Ming Huang

    Abstract: Human-aware robot navigation promises a range of applications in which mobile robots bring versatile assistance to people in common human environments. While prior research has mostly focused on modeling pedestrians as independent, intentional individuals, people move in groups; consequently, it is imperative for mobile robots to respect human groups when navigating around people. This paper explo… ▽ More

    Submitted 29 July, 2022; v1 submitted 22 December, 2020; originally announced December 2020.

    Comments: 8 pages, 4 figures

  8. arXiv:2012.06509  [pdf, other

    cs.CV cs.AI

    Addressing Visual Search in Open and Closed Set Settings

    Authors: Nathan Drenkow, Philippe Burlina, Neil Fendley, Onyekachi Odoemene, Jared Markowitz

    Abstract: Searching for small objects in large images is a task that is both challenging for current deep learning systems and important in numerous real-world applications, such as remote sensing and medical imaging. Thorough scanning of very large images is computationally expensive, particularly at resolutions sufficient to capture small objects. The smaller an object of interest, the more likely it is t… ▽ More

    Submitted 14 April, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

  9. arXiv:1811.03119  [pdf, other

    cs.AI

    On the Complexity of Reconnaissance Blind Chess

    Authors: Jared Markowitz, Ryan W. Gardner, Ashley J. Llorens

    Abstract: This paper provides a complexity analysis for the game of reconnaissance blind chess (RBC), a recently-introduced variant of chess where each player does not know the positions of the opponent's pieces a priori but may reveal a subset of them through chosen, private sensing actions. In contrast to many commonly studied imperfect information games like poker, an RBC player does not know what the op… ▽ More

    Submitted 1 March, 2019; v1 submitted 7 November, 2018; originally announced November 2018.

  10. arXiv:1712.03151  [pdf, other

    cs.CV

    Combining Deep Universal Features, Semantic Attributes, and Hierarchical Classification for Zero-Shot Learning

    Authors: Jared Markowitz, Aurora C. Schmidt, Philippe M. Burlina, I-Jeng Wang

    Abstract: We address zero-shot (ZS) learning, building upon prior work in hierarchical classification by combining it with approaches based on semantic attribute estimation. For both non-novel and novel image classes we compare multiple formulations of the problem, starting with deep universal features in each case. We investigate the effect of using different posterior probabilities as inputs to the hierar… ▽ More

    Submitted 8 December, 2017; originally announced December 2017.

    Comments: 17 pages, 4 figures, extension to work published in conference proceedings of 2017 IAPR MVA Conference