Zum Hauptinhalt springen

Showing 1–38 of 38 results for author: Ross, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2304.10302  [pdf, ps, other

    stat.ML cs.LG

    Optimal Activation of Halting Multi-Armed Bandit Models

    Authors: Wesley Cowan, Michael N. Katehakis, Sheldon M. Ross

    Abstract: We study new types of dynamic allocation problems the {\sl Halting Bandit} models. As an application, we obtain new proofs for the classic Gittins index decomposition result and recent results of the authors in `Multi-armed bandits under general depreciation and commitment.'

    Submitted 20 April, 2023; originally announced April 2023.

    MSC Class: 68T05; 68Q32; 62L10 ACM Class: G.3

  2. arXiv:2304.09299  [pdf, other

    cs.HC

    Virtual Fidgets: Opportunities and Design Principles for Bringing Fidgeting to Online Learning

    Authors: Sam Ross, Nicole Sullivan, Jina Yoon

    Abstract: We present design guidelines for incorporating fidgeting into the virtual world as a tool for students in online lectures. Fidgeting is associated with increased attention and self-regulation, and has the potential to help students focus. Currently there are no fidgets, physical or virtual, designed for preserving attention specifically in online learning environments, and no heuristics for design… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: 6 pages, 3 figures, CHI LBW 2023

  3. The Programmer's Assistant: Conversational Interaction with a Large Language Model for Software Development

    Authors: Steven I. Ross, Fernando Martinez, Stephanie Houde, Michael Muller, Justin D. Weisz

    Abstract: Large language models (LLMs) have recently been applied in software engineering to perform tasks such as translating code between programming languages, generating code from natural language, and autocompleting code as it is being written. When used within development tools, these systems typically treat each model invocation independently from all previous invocations, and only a specific limited… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

    Comments: 43 pages, 3 figures. To be published in IUI 2023

  4. arXiv:2301.10016  [pdf, other

    cs.CY cs.AI cs.HC

    A Case Study in Engineering a Conversational Programming Assistant's Persona

    Authors: Steven I. Ross, Michael Muller, Fernando Martinez, Stephanie Houde, Justin D. Weisz

    Abstract: The Programmer's Assistant is an experimental prototype software development environment that integrates a chatbot with a code editor. Conversational capability was achieved by using an existing code-fluent Large Language Model and providing it with a prompt that establishes a conversational interaction pattern, a set of conventions, and a style of interaction appropriate for the application. A di… ▽ More

    Submitted 13 January, 2023; originally announced January 2023.

    Comments: 11 pages. Submitted to the 4th Workshop on Human-AI Co-Creation with Generative Models (HAI-GEN) at IUI 2023

  5. arXiv:2208.14007  [pdf, other

    cs.LG eess.SP q-bio.NC

    Finding neural signatures for obesity through feature selection on source-localized EEG

    Authors: Yuan Yue, Dirk De Ridder, Patrick Manning, Samantha Ross, Jeremiah D. Deng

    Abstract: Obesity is a serious issue in the modern society and is often associated to significantly reduced quality of life. Current research conducted to explore obesity-related neurological evidences using electroencephalography (EEG) data are limited to traditional approaches. In this study, we developed a novel machine learning model to identify brain networks of obese females using alpha band functiona… ▽ More

    Submitted 21 June, 2023; v1 submitted 30 August, 2022; originally announced August 2022.

    Comments: 4 pages, 3 figures, conference submission

  6. Better Together? An Evaluation of AI-Supported Code Translation

    Authors: Justin D. Weisz, Michael Muller, Steven I. Ross, Fernando Martinez, Stephanie Houde, Mayank Agarwal, Kartik Talamadupula, John T. Richards

    Abstract: Generative machine learning models have recently been applied to source code, for use cases including translating code between programming languages, creating documentation from code, and auto-completing methods. Yet, state-of-the-art models often produce code that is erroneous or incomplete. In a controlled study with 32 software engineers, we examined whether such imperfect outputs are helpful i… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

    Comments: 35 pages, 3 figures. To be published in IUI 2022

  7. arXiv:2110.05423  [pdf, other

    cs.CL

    Using Document Similarity Methods to create Parallel Datasets for Code Translation

    Authors: Mayank Agarwal, Kartik Talamadupula, Fernando Martinez, Stephanie Houde, Michael Muller, John Richards, Steven I Ross, Justin D. Weisz

    Abstract: Translating source code from one programming language to another is a critical, time-consuming task in modernizing legacy applications and codebases. Recent work in this space has drawn inspiration from the software naturalness hypothesis by applying natural language processing techniques towards automating the code translation task. However, due to the paucity of parallel data in this domain, sup… ▽ More

    Submitted 11 October, 2021; originally announced October 2021.

  8. arXiv:2109.11043  [pdf, other

    cs.LG

    Learning Predictive and Interpretable Timeseries Summaries from ICU Data

    Authors: Nari Johnson, Sonali Parbhoo, Andrew Slavin Ross, Finale Doshi-Velez

    Abstract: Machine learning models that utilize patient data across time (rather than just the most recent measurements) have increased performance for many risk stratification tasks in the intensive care unit. However, many of these models and their learned representations are complex and therefore difficult for clinicians to interpret, creating challenges for validation. Our work proposes a new procedure t… ▽ More

    Submitted 22 September, 2021; originally announced September 2021.

    Comments: 10 pages, 3 figures, AMIA 2021 Annual Symposium

  9. arXiv:2108.05295  [pdf, other

    math.CO cs.DM

    Linear Bounds for Cycle-free Saturation Games

    Authors: Sean English, Tomáš Masařík, Grace McCourt, Erin Meger, Michael S. Ross, Sam Spiro

    Abstract: Given a family of graphs $\mathcal{F}$, we define the $\mathcal{F}$-saturation game as follows. Two players alternate adding edges to an initially empty graph on $n$ vertices, with the only constraint being that neither player can add an edge that creates a subgraph in $\mathcal{F}$. The game ends when no more edges can be added to the graph. One of the players wishes to end the game as quickly as… ▽ More

    Submitted 11 August, 2021; originally announced August 2021.

    Comments: 18 pages, 2 figures

    MSC Class: 05C57

    Journal ref: The Electronic Journal of Combinatorics 29(3), 5:1-5:21, 2022

  10. arXiv:2106.06848  [pdf, other

    cs.LG

    Guaranteed Fixed-Confidence Best Arm Identification in Multi-Armed Bandits: Simple Sequential Elimination Algorithms

    Authors: MohammadJavad Azizi, Sheldon M Ross, Zhengyu Zhang

    Abstract: We consider the problem of finding, through adaptive sampling, which of $n$ options (arms) has the largest mean. Our objective is to determine a rule which identifies the best arm with a fixed minimum confidence using as few observations as possible, i.e. this is a fixed-confidence (FC) best arm identification (BAI) in multi-armed bandits. We study such problems under the Bayesian setting with bot… ▽ More

    Submitted 15 March, 2022; v1 submitted 12 June, 2021; originally announced June 2021.

  11. arXiv:2105.08486  [pdf, other

    cs.LG

    Univariate Long-Term Municipal Water Demand Forecasting

    Authors: Blake VanBerlo, Matthew A. S. Ross, Daniel Hsia

    Abstract: This study describes an investigation into the modelling of citywide water consumption in London, Canada. Multiple modelling techniques were evaluated for the task of univariate time series forecasting with water consumption, including linear regression, Facebook's Prophet method, recurrent neural networks, and convolutional neural networks. Prophet was identified as the model of choice, having ac… ▽ More

    Submitted 18 May, 2021; originally announced May 2021.

    Comments: 11 pages, 6 figures

  12. Perfection Not Required? Human-AI Partnerships in Code Translation

    Authors: Justin D. Weisz, Michael Muller, Stephanie Houde, John Richards, Steven I. Ross, Fernando Martinez, Mayank Agarwal, Kartik Talamadupula

    Abstract: Generative models have become adept at producing artifacts such as images, videos, and prose at human-like levels of proficiency. New generative techniques, such as unsupervised neural machine translation (NMT), have recently been applied to the task of generating source code, translating it from one programming language to another. The artifacts produced in this way may contain imperfections, suc… ▽ More

    Submitted 8 April, 2021; originally announced April 2021.

    Comments: 18 pages, 1 figure. To be published in IUI 2021

  13. arXiv:2102.05185  [pdf, other

    cs.LG cs.AI

    Benchmarks, Algorithms, and Metrics for Hierarchical Disentanglement

    Authors: Andrew Slavin Ross, Finale Doshi-Velez

    Abstract: In representation learning, there has been recent interest in developing algorithms to disentangle the ground-truth generative factors behind a dataset, and metrics to quantify how fully this occurs. However, these algorithms and metrics often assume that both representations and ground-truth factors are flat, continuous, and factorized, whereas many real-world generative processes involve rich hi… ▽ More

    Submitted 8 April, 2022; v1 submitted 9 February, 2021; originally announced February 2021.

    Comments: ICML 2021 paper, fixed incorrect version upload

  14. arXiv:2102.01264  [pdf, other

    cs.LG cs.AI cs.HC

    Evaluating the Interpretability of Generative Models by Interactive Reconstruction

    Authors: Andrew Slavin Ross, Nina Chen, Elisa Zhao Hang, Elena L. Glassman, Finale Doshi-Velez

    Abstract: For machine learning models to be most useful in numerous sociotechnical systems, many have argued that they must be human-interpretable. However, despite increasing interest in interpretability, there remains no firm consensus on how to measure it. This is especially true in representation learning, where interpretability research has focused on "disentanglement" measures only applicable to synth… ▽ More

    Submitted 1 February, 2021; originally announced February 2021.

    Comments: CHI 2021 accepted paper

  15. arXiv:2012.07581  [pdf, other

    cs.SE cs.PL

    Quality Estimation & Interpretability for Code Translation

    Authors: Mayank Agarwal, Kartik Talamadupula, Stephanie Houde, Fernando Martinez, Michael Muller, John Richards, Steven Ross, Justin D. Weisz

    Abstract: Recently, the automated translation of source code from one programming language to another by using automatic approaches inspired by Neural Machine Translation (NMT) methods for natural languages has come under study. However, such approaches suffer from the same problem as previous NMT approaches on natural languages, viz. the lack of an ability to estimate and evaluate the quality of the transl… ▽ More

    Submitted 26 April, 2021; v1 submitted 4 December, 2020; originally announced December 2020.

    Comments: NeurIPS 2020 Workshop on Computer-Assisted Programming

  16. arXiv:2010.13778  [pdf

    physics.ed-ph cs.ET cs.GL quant-ph

    Achieving a quantum smart workforce

    Authors: Clarice D. Aiello, D. D. Awschalom, Hannes Bernien, Tina Brower-Thomas, Kenneth R. Brown, Todd A. Brun, Justin R. Caram, Eric Chitambar, Rosa Di Felice, Michael F. J. Fox, Stephan Haas, Alexander W. Holleitner, Eric R. Hudson, Jeffrey H. Hunt, Robert Joynt, Scott Koziol, H. J. Lewandowski, Douglas T. McClure, Jens Palsberg, Gina Passante, Kristen L. Pudenz, Christopher J. K. Richardson, Jessica L. Rosenberg, R. S. Ross, Mark Saffman , et al. (7 additional authors not shown)

    Abstract: Interest in building dedicated Quantum Information Science and Engineering (QISE) education programs has greatly expanded in recent years. These programs are inherently convergent, complex, often resource intensive and likely require collaboration with a broad variety of stakeholders. In order to address this combination of challenges, we have captured ideas from many members in the community. Thi… ▽ More

    Submitted 23 October, 2020; originally announced October 2020.

    Comments: 18 pages, 2 figures, 1 table

    Journal ref: Quantum Sci. Technol. 6 030501 (2021)

  17. arXiv:2009.09086  [pdf, other

    cs.CY cs.AI

    Focused Clinical Query Understanding and Retrieval of Medical Snippets powered through a Healthcare Knowledge Graph

    Authors: Maulik R. Kamdar, Michael Carroll, Will Dowling, Linda Wogulis, Cailey Fitzgerald, Matt Corkum, Danielle Walsh, David Conrad, Craig E. Stanley, Jr., Steve Ross, Dru Henke, Mevan Samarasinghe

    Abstract: Clinicians face several significant barriers to search and synthesize accurate, succinct, updated, and trustworthy medical information from several literature sources during the practice of medicine and patient care. In this talk, we will be presenting our research behind the development of a Focused Clinical Search Service, powered by a Healthcare Knowledge Graph, to interpret the query intent be… ▽ More

    Submitted 17 September, 2020; originally announced September 2020.

    Comments: Under Review as a Podium Talk at the AMIA Informatics Summit 2021

  18. arXiv:2009.09072  [pdf, other

    cs.CY cs.AI cs.LG

    Interpretable Machine Learning Approaches to Prediction of Chronic Homelessness

    Authors: Blake VanBerlo, Matthew A. S. Ross, Jonathan Rivard, Ryan Booker

    Abstract: We introduce a machine learning approach to predict chronic homelessness from de-identified client shelter records drawn from a commonly used Canadian homelessness management information system. Using a 30-day time step, a dataset for 6521 individuals was generated. Our model, HIFIS-RNN-MLP, incorporates both static and dynamic features of a client's history to forecast chronic homelessness 6 mont… ▽ More

    Submitted 12 September, 2020; originally announced September 2020.

    Comments: 14 pages, 7 figures, submitted to Engineering Applications of Artificial Intelligence

  19. arXiv:1911.01291  [pdf, other

    cs.LG stat.ML

    Ensembles of Locally Independent Prediction Models

    Authors: Andrew Slavin Ross, Weiwei Pan, Leo Anthony Celi, Finale Doshi-Velez

    Abstract: Ensembles depend on diversity for improved performance. Many ensemble training methods, therefore, attempt to optimize for diversity, which they almost always define in terms of differences in training set predictions. In this paper, however, we demonstrate the diversity of predictions on the training set does not necessarily imply diversity under mild covariate shift, which can harm generalizatio… ▽ More

    Submitted 7 February, 2020; v1 submitted 4 November, 2019; originally announced November 2019.

    Comments: This is an expansion of arXiv:1806.08716 with different applications and focus, accepted to AAAI 2020. Latest update clarifies a derivation

  20. arXiv:1909.08792  [pdf, other

    cs.RO cs.AI cs.LG

    Agent Prioritization for Autonomous Navigation

    Authors: Khaled S. Refaat, Kai Ding, Natalia Ponomareva, Stéphane Ross

    Abstract: In autonomous navigation, a planning system reasons about other agents to plan a safe and plausible trajectory. Before planning starts, agents are typically processed with computationally intensive models for recognition, tracking, motion estimation and prediction. With limited computational resources and a large number of agents to process in real time, it becomes important to efficiently rank ag… ▽ More

    Submitted 18 September, 2019; originally announced September 2019.

    Comments: 8 pages, accepted to IEEE/RSJ International Conference on Robots and Systems (IROS) 2019

  21. arXiv:1906.05433  [pdf, other

    cs.CY cs.AI cs.LG stat.ML

    Tackling Climate Change with Machine Learning

    Authors: David Rolnick, Priya L. Donti, Lynn H. Kaack, Kelly Kochanski, Alexandre Lacoste, Kris Sankaran, Andrew Slavin Ross, Nikola Milojevic-Dupont, Natasha Jaques, Anna Waldman-Brown, Alexandra Luccioni, Tegan Maharaj, Evan D. Sherwin, S. Karthik Mukkavilli, Konrad P. Kording, Carla Gomes, Andrew Y. Ng, Demis Hassabis, John C. Platt, Felix Creutzig, Jennifer Chayes, Yoshua Bengio

    Abstract: Climate change is one of the greatest challenges facing humanity, and we, as machine learning experts, may wonder how we can help. Here we describe how machine learning can be a powerful tool in reducing greenhouse gas emissions and helping society adapt to a changing climate. From smart grids to disaster management, we identify high impact problems where existing gaps can be filled by machine lea… ▽ More

    Submitted 5 November, 2019; v1 submitted 10 June, 2019; originally announced June 2019.

    Comments: For additional resources, please visit the website that accompanies this paper: https://www.climatechange.ai/

  22. arXiv:1810.00869  [pdf, other

    cs.LG cs.AI stat.ML

    Training Machine Learning Models by Regularizing their Explanations

    Authors: Andrew Slavin Ross

    Abstract: Neural networks are among the most accurate supervised learning methods in use today. However, their opacity makes them difficult to trust in critical applications, especially when conditions in training may differ from those in practice. Recent efforts to develop explanations for neural networks and machine learning models more generally have produced tools to shed light on the implicit rules beh… ▽ More

    Submitted 29 September, 2018; originally announced October 2018.

    Comments: Harvard CSE master's thesis; includes portions of arxiv:1703.03717 and arxiv:1711.09404

  23. arXiv:1806.08716  [pdf, other

    cs.LG stat.ML

    Learning Qualitatively Diverse and Interpretable Rules for Classification

    Authors: Andrew Slavin Ross, Weiwei Pan, Finale Doshi-Velez

    Abstract: There has been growing interest in developing accurate models that can also be explained to humans. Unfortunately, if there exist multiple distinct but accurate models for some dataset, current machine learning methods are unlikely to find them: standard techniques will likely recover a complex model that combines them. In this work, we introduce a way to identify a maximal set of distinct but acc… ▽ More

    Submitted 19 July, 2018; v1 submitted 22 June, 2018; originally announced June 2018.

    Comments: Presented at 2018 ICML Workshop on Human Interpretability in Machine Learning (WHI 2018), Stockholm, Sweden (revision fixes minor issues)

  24. arXiv:1805.11571  [pdf, other

    stat.ML cs.LG

    Human-in-the-Loop Interpretability Prior

    Authors: Isaac Lage, Andrew Slavin Ross, Been Kim, Samuel J. Gershman, Finale Doshi-Velez

    Abstract: We often desire our models to be interpretable as well as accurate. Prior work on optimizing models for interpretability has relied on easy-to-quantify proxies for interpretability, such as sparsity or the number of operations required. In this work, we optimize for interpretability by directly including humans in the optimization loop. We develop an algorithm that minimizes the number of user stu… ▽ More

    Submitted 30 October, 2018; v1 submitted 29 May, 2018; originally announced May 2018.

    Comments: To appear at NIPS 2018, selected for a spotlight. 13 pages (incl references and appendix)

  25. arXiv:1711.09404  [pdf, other

    cs.LG cs.CR cs.CV

    Improving the Adversarial Robustness and Interpretability of Deep Neural Networks by Regularizing their Input Gradients

    Authors: Andrew Slavin Ross, Finale Doshi-Velez

    Abstract: Deep neural networks have proven remarkably effective at solving many classification problems, but have been criticized recently for two major weaknesses: the reasons behind their predictions are uninterpretable, and the predictions themselves can often be fooled by small adversarial perturbations. These problems pose major obstacles for the adoption of neural networks in domains that require secu… ▽ More

    Submitted 26 November, 2017; originally announced November 2017.

    Comments: To appear in AAAI 2018

  26. arXiv:1703.03717  [pdf, other

    cs.LG cs.AI stat.ML

    Right for the Right Reasons: Training Differentiable Models by Constraining their Explanations

    Authors: Andrew Slavin Ross, Michael C. Hughes, Finale Doshi-Velez

    Abstract: Neural networks are among the most accurate supervised learning methods in use today, but their opacity makes them difficult to trust in critical applications, especially when conditions in training differ from those in test. Recent work on explanations for black-box models has produced tools (e.g. LIME) to show the implicit rules behind predictions, which can help us identify when models are righ… ▽ More

    Submitted 25 May, 2017; v1 submitted 10 March, 2017; originally announced March 2017.

  27. arXiv:1408.2065  [pdf

    cs.LG stat.ML

    Normalized Online Learning

    Authors: Stephane Ross, Paul Mineiro, John Langford

    Abstract: We introduce online learning algorithms which are independent of feature scales, proving regret bounds dependent on the ratio of scales existent in the data rather than the absolute scale. This has several useful effects: there is no need to pre-normalize data, the test-time and test-space complexity are reduced, and the algorithms are more robust.

    Submitted 9 August, 2014; originally announced August 2014.

    Comments: Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)

    Report number: UAI-P-2013-PG-537-545

  28. arXiv:1406.5979  [pdf, ps, other

    cs.LG stat.ML

    Reinforcement and Imitation Learning via Interactive No-Regret Learning

    Authors: Stephane Ross, J. Andrew Bagnell

    Abstract: Recent work has demonstrated that problems-- particularly imitation learning and structured prediction-- where a learner's predictions influence the input-distribution it is tested on can be naturally addressed by an interactive approach and analyzed using no-regret online learning. These approaches to imitation learning, however, neither require nor benefit from information about the cost of acti… ▽ More

    Submitted 23 June, 2014; originally announced June 2014.

    Comments: 14 pages. Under review for NIPS 2014 conference

  29. arXiv:1406.1837  [pdf, other

    cs.LG

    A Credit Assignment Compiler for Joint Prediction

    Authors: Kai-Wei Chang, He He, Hal Daumé III, John Langford, Stephane Ross

    Abstract: Many machine learning applications involve jointly predicting multiple mutually dependent output variables. Learning to search is a family of methods where the complex decision problem is cast into a sequence of decisions via a search space. Although these methods have shown promise both in theory and in practice, implementing them has been burdensomely awkward. In this paper, we show the search s… ▽ More

    Submitted 1 June, 2016; v1 submitted 6 June, 2014; originally announced June 2014.

  30. Online Planning Algorithms for POMDPs

    Authors: Stéphane Ross, Joelle Pineau, Sébastien Paquet, Brahim Chaib-draa

    Abstract: Partially Observable Markov Decision Processes (POMDPs) provide a rich framework for sequential decision-making under uncertainty in stochastic domains. However, solving a POMDP is often intractable except for small problems due to their complexity. Here, we focus on online approaches that alleviate the computational complexity by computing good local policies at each decision step during the exec… ▽ More

    Submitted 14 January, 2014; originally announced January 2014.

    Journal ref: Journal Of Artificial Intelligence Research, Volume 32, pages 663-704, 2008

  31. arXiv:1308.3541  [pdf, ps, other

    cs.LG

    Knapsack Constrained Contextual Submodular List Prediction with Application to Multi-document Summarization

    Authors: Jiaji Zhou, Stephane Ross, Yisong Yue, Debadeepta Dey, J. Andrew Bagnell

    Abstract: We study the problem of predicting a set or list of options under knapsack constraint. The quality of such lists are evaluated by a submodular reward function that measures both quality and diversity. Similar to DAgger (Ross et al., 2010), by a reduction to online learning, we show how to adapt two sequence prediction models to imitate greedy maximization under knapsack constraint problems: CONSEQ… ▽ More

    Submitted 15 March, 2014; v1 submitted 15 August, 2013; originally announced August 2013.

    Comments: 8 pages, ICML 2013 Workshop on Inferning: Interactions between Inference and Learning

  32. arXiv:1305.6646  [pdf, other

    cs.LG stat.ML

    Normalized Online Learning

    Authors: Stephane Ross, Paul Mineiro, John Langford

    Abstract: We introduce online learning algorithms which are independent of feature scales, proving regret bounds dependent on the ratio of scales existent in the data rather than the absolute scale. This has several useful effects: there is no need to pre-normalize data, the test-time and test-space complexity are reduced, and the algorithms are more robust.

    Submitted 28 May, 2013; originally announced May 2013.

    Comments: Appears in Proceedings of the Twenty-Ninth Conference on Uncertainty in Artificial Intelligence (UAI2013)

  33. arXiv:1305.2532  [pdf, other

    cs.LG stat.ML

    Learning Policies for Contextual Submodular Prediction

    Authors: Stephane Ross, Jiaji Zhou, Yisong Yue, Debadeepta Dey, J. Andrew Bagnell

    Abstract: Many prediction domains, such as ad placement, recommendation, trajectory prediction, and document summarization, require predicting a set or list of options. Such lists are often evaluated using submodular reward functions that measure both quality and diversity. We propose a simple, efficient, and provably near-optimal approach to optimizing such prediction problems based on no-regret learning.… ▽ More

    Submitted 11 May, 2013; originally announced May 2013.

    Comments: 13 pages. To appear in proceedings of the International Conference on Machine Learning (ICML), 2013

  34. arXiv:1211.1690  [pdf, other

    cs.RO cs.CV cs.LG eess.SY

    Learning Monocular Reactive UAV Control in Cluttered Natural Environments

    Authors: Stephane Ross, Narek Melik-Barkhudarov, Kumar Shaurya Shankar, Andreas Wendel, Debadeepta Dey, J. Andrew Bagnell, Martial Hebert

    Abstract: Autonomous navigation for large Unmanned Aerial Vehicles (UAVs) is fairly straight-forward, as expensive sensors and monitoring devices can be employed. In contrast, obstacle avoidance remains a challenging task for Micro Aerial Vehicles (MAVs) which operate at low altitude in cluttered environments. Unlike large vehicles, MAVs can only carry very light sensors, such as cameras, making autonomous… ▽ More

    Submitted 7 November, 2012; originally announced November 2012.

    Comments: 8 pages, 10 figures

  35. arXiv:1206.3281  [pdf

    cs.AI

    Model-Based Bayesian Reinforcement Learning in Large Structured Domains

    Authors: Stephane Ross, Joelle Pineau

    Abstract: Model-based Bayesian reinforcement learning has generated significant interest in the AI community as it provides an elegant solution to the optimal exploration-exploitation tradeoff in classical reinforcement learning. Unfortunately, the applicability of this type of approach has been limited to small domains due to the high complexity of reasoning about the joint posterior over model parameters.… ▽ More

    Submitted 13 June, 2012; originally announced June 2012.

    Comments: Appears in Proceedings of the Twenty-Fourth Conference on Uncertainty in Artificial Intelligence (UAI2008)

    Report number: UAI-P-2008-PG-476-483

  36. arXiv:1203.1007  [pdf, other

    cs.LG cs.AI eess.SY stat.ML

    Agnostic System Identification for Model-Based Reinforcement Learning

    Authors: Stephane Ross, J. Andrew Bagnell

    Abstract: A fundamental problem in control is to learn a model of a system from observations that is useful for controller synthesis. To provide good performance guarantees, existing methods must assume that the real system is in the class of models considered during learning. We present an iterative method with strong guarantees even in the agnostic case where the system is not in the class. In particular,… ▽ More

    Submitted 3 July, 2012; v1 submitted 5 March, 2012; originally announced March 2012.

    Comments: 8 pages, published in ICML 2012

  37. arXiv:1108.3154  [pdf, ps, other

    cs.LG stat.ML

    Stability Conditions for Online Learnability

    Authors: Stephane Ross, J. Andrew Bagnell

    Abstract: Stability is a general notion that quantifies the sensitivity of a learning algorithm's output to small change in the training dataset (e.g. deletion or replacement of a single training sample). Such conditions have recently been shown to be more powerful to characterize learnability in the general learning setting under i.i.d. samples where uniform convergence is not necessary for learnability, b… ▽ More

    Submitted 17 August, 2011; v1 submitted 16 August, 2011; originally announced August 2011.

    Comments: 16 pages. Earlier version of this work submitted (but rejected) to COLT 2011

  38. arXiv:1011.0686  [pdf, other

    cs.LG cs.AI stat.ML

    A Reduction of Imitation Learning and Structured Prediction to No-Regret Online Learning

    Authors: Stephane Ross, Geoffrey J. Gordon, J. Andrew Bagnell

    Abstract: Sequential prediction problems such as imitation learning, where future observations depend on previous predictions (actions), violate the common i.i.d. assumptions made in statistical learning. This leads to poor performance in theory and often in practice. Some recent approaches provide stronger guarantees in this setting, but remain somewhat unsatisfactory as they train either non-stationary or… ▽ More

    Submitted 16 March, 2011; v1 submitted 2 November, 2010; originally announced November 2010.

    Comments: Appearing in the 14th International Conference on Artificial Intelligence and Statistics (AISTATS 2011)