
Showing 1–3 of 3 results for author: Howe, H

  1. arXiv:2404.18934  [pdf]

    cs.CV cs.HC

    The Visual Experience Dataset: Over 200 Recorded Hours of Integrated Eye Movement, Odometry, and Egocentric Video

    Authors: Michelle R. Greene, Benjamin J. Balas, Mark D. Lescroart, Paul R. MacNeilage, Jennifer A. Hart, Kamran Binaee, Peter A. Hausamann, Ronald Mezile, Bharath Shankar, Christian B. Sinnott, Kaylie Capurro, Savannah Halow, Hunter Howe, Mariam Josyula, Annie Li, Abraham Mieses, Amina Mohamed, Ilya Nudnou, Ezra Parkhill, Peter Riley, Brett Schmidt, Matthew W. Shinkle, Wentao Si, Brian Szekely, Joaquin M. Torres, et al. (1 additional author not shown)

    Abstract: We introduce the Visual Experience Dataset (VEDB), a compilation of over 240 hours of egocentric video combined with gaze- and head-tracking data that offers an unprecedented view of the visual world as experienced by human observers. The dataset consists of 717 sessions, recorded by 58 observers ranging from 6-49 years old. This paper outlines the data collection, processing, and labeling protoco…

    Submitted 13 August, 2024; v1 submitted 15 February, 2024; originally announced April 2024.

    Comments: 40 pages, 1 table, 9 figures

  2. arXiv:2209.13085  [pdf, other]

    cs.LG stat.ML

    Defining and Characterizing Reward Hacking

    Authors: Joar Skalse, Nikolaus H. R. Howe, Dmitrii Krasheninnikov, David Krueger

    Abstract: We provide the first formal definition of reward hacking, a phenomenon where optimizing an imperfect proxy reward function, $\mathcal{\tilde{R}}$, leads to poor performance according to the true reward function, $\mathcal{R}$. We say that a proxy is unhackable if increasing the expected proxy return can never decrease the expected true return. Intuitively, it might be possible to create an unhacka…

    Submitted 26 September, 2022; originally announced September 2022.

  3. arXiv:2202.10600  [pdf, other]

    cs.LG cs.AI eess.SY stat.ML

    Myriad: a real-world testbed to bridge trajectory optimization and deep learning

    Authors: Nikolaus H. R. Howe, Simon Dufort-Labbé, Nitarshan Rajkumar, Pierre-Luc Bacon

    Abstract: We present Myriad, a testbed written in JAX for learning and planning in real-world continuous environments. The primary contributions of Myriad are threefold. First, Myriad provides machine learning practitioners access to trajectory optimization techniques for application within a typical automatic differentiation workflow. Second, Myriad presents many real-world optimal control problems, rangin…

    Submitted 26 January, 2023; v1 submitted 21 February, 2022; originally announced February 2022.

    Comments: Updated to match version accepted at NeurIPS 2022