Zum Hauptinhalt springen

Showing 1–13 of 13 results for author: Powell, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.18322  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    The Need for Guardrails with Large Language Models in Medical Safety-Critical Settings: An Artificial Intelligence Application in the Pharmacovigilance Ecosystem

    Authors: Joe B Hakim, Jeffery L Painter, Darmendra Ramcharran, Vijay Kara, Greg Powell, Paulina Sobczak, Chiho Sato, Andrew Bate, Andrew Beam

    Abstract: Large language models (LLMs) are useful tools with the capacity for performing specific types of knowledge work at an effective scale. However, LLM deployments in high-risk and safety-critical domains pose unique challenges, notably the issue of ``hallucination,'' where LLMs can generate fabricated information. This is particularly concerning in settings such as drug safety, where inaccuracies cou… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

    Comments: 27 pages, 6 figures, 4 tables and supplementary material provided

    ACM Class: I.2.1; I.2.7; I.7.1

  2. arXiv:2406.03431  [pdf, other

    cs.CV

    CattleFace-RGBT: RGB-T Cattle Facial Landmark Benchmark

    Authors: Ethan Coffman, Reagan Clark, Nhat-Tan Bui, Trong Thang Pham, Beth Kegley, Jeremy G. Powell, Jiangchao Zhao, Ngan Le

    Abstract: To address this challenge, we introduce CattleFace-RGBT, a RGB-T Cattle Facial Landmark dataset consisting of 2,300 RGB-T image pairs, a total of 4,600 images. Creating a landmark dataset is time-consuming, but AI-assisted annotation can help. However, applying AI to thermal images is challenging due to suboptimal results from direct thermal training and infeasible RGB-thermal alignment due to dif… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  3. arXiv:2401.07694  [pdf, other

    math.OC cs.LG stat.ML

    Stochastic optimization with arbitrary recurrent data sampling

    Authors: William G. Powell, Hanbaek Lyu

    Abstract: For obtaining optimal first-order convergence guarantee for stochastic optimization, it is necessary to use a recurrent data sampling algorithm that samples every data point with sufficient frequency. Most commonly used data sampling algorithms (e.g., i.i.d., MCMC, random reshuffling) are indeed recurrent under mild assumptions. In this work, we show that for a particular class of stochastic optim… ▽ More

    Submitted 20 July, 2024; v1 submitted 15 January, 2024; originally announced January 2024.

    Comments: 39 pages, 4 figures, 1 table

    Journal ref: International Conference on Machine Learning, 2024

  4. arXiv:2304.02491  [pdf, other

    cs.HC cs.AI

    "It's Weird That it Knows What I Want": Usability and Interactions with Copilot for Novice Programmers

    Authors: James Prather, Brent N. Reeves, Paul Denny, Brett A. Becker, Juho Leinonen, Andrew Luxton-Reilly, Garrett Powell, James Finnie-Ansley, Eddie Antonio Santos

    Abstract: Recent developments in deep learning have resulted in code-generation models that produce source code from natural language and code-based prompts with high accuracy. This is likely to have profound effects in the classroom, where novices learning to code can now use free tools to automatically suggest solutions to programming exercises and assignments. However, little is currently known about how… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: 26 pages, 2 figures, TOCHI

  5. arXiv:2211.11602  [pdf, other

    cs.LG cs.HC cs.MA

    Improving Multimodal Interactive Agents with Reinforcement Learning from Human Feedback

    Authors: Josh Abramson, Arun Ahuja, Federico Carnevale, Petko Georgiev, Alex Goldin, Alden Hung, Jessica Landon, Jirka Lhotka, Timothy Lillicrap, Alistair Muldal, George Powell, Adam Santoro, Guy Scully, Sanjana Srivastava, Tamara von Glehn, Greg Wayne, Nathaniel Wong, Chen Yan, Rui Zhu

    Abstract: An important goal in artificial intelligence is to create agents that can both interact naturally with humans and learn from their feedback. Here we demonstrate how to use reinforcement learning from human feedback (RLHF) to improve upon simulated, embodied agents trained to a base level of competency with imitation learning. First, we collected data of humans interacting with agents in a simulate… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

  6. arXiv:2105.01238  [pdf, other

    cs.LG q-bio.QM

    Supervised multi-specialist topic model with applications on large-scale electronic health record data

    Authors: Ziyang Song, Xavier Sumba Toral, Yixin Xu, Aihua Liu, Liming Guo, Guido Powell, Aman Verma, David Buckeridge, Ariane Marelli, Yue Li

    Abstract: Motivation: Electronic health record (EHR) data provides a new venue to elucidate disease comorbidities and latent phenotypes for precision medicine. To fully exploit its potential, a realistic data generative process of the EHR data needs to be modelled. We present MixEHR-S to jointly infer specialist-disease topics from the EHR data. As the key contribution, we model the specialist assignments a… ▽ More

    Submitted 3 May, 2021; originally announced May 2021.

  7. arXiv:1910.07113  [pdf, other

    cs.LG cs.AI cs.CV cs.RO stat.ML

    Solving Rubik's Cube with a Robot Hand

    Authors: OpenAI, Ilge Akkaya, Marcin Andrychowicz, Maciek Chociej, Mateusz Litwin, Bob McGrew, Arthur Petron, Alex Paino, Matthias Plappert, Glenn Powell, Raphael Ribas, Jonas Schneider, Nikolas Tezak, Jerry Tworek, Peter Welinder, Lilian Weng, Qiming Yuan, Wojciech Zaremba, Lei Zhang

    Abstract: We demonstrate that models trained only in simulation can be used to solve a manipulation problem of unprecedented complexity on a real robot. This is made possible by two key components: a novel algorithm, which we call automatic domain randomization (ADR) and a robot platform built for machine learning. ADR automatically generates a distribution over randomized environments of ever-increasing di… ▽ More

    Submitted 15 October, 2019; originally announced October 2019.

  8. arXiv:1909.07528  [pdf, other

    cs.LG cs.AI cs.MA stat.ML

    Emergent Tool Use From Multi-Agent Autocurricula

    Authors: Bowen Baker, Ingmar Kanitscheider, Todor Markov, Yi Wu, Glenn Powell, Bob McGrew, Igor Mordatch

    Abstract: Through multi-agent competition, the simple objective of hide-and-seek, and standard reinforcement learning algorithms at scale, we find that agents create a self-supervised autocurriculum inducing multiple distinct rounds of emergent strategy, many of which require sophisticated tool use and coordination. We find clear evidence of six emergent phases in agent strategy in our environment, each of… ▽ More

    Submitted 10 February, 2020; v1 submitted 16 September, 2019; originally announced September 2019.

  9. arXiv:1812.00528  [pdf, ps, other

    cs.LG q-bio.PE stat.ML

    Modeling disease progression in longitudinal EHR data using continuous-time hidden Markov models

    Authors: Aman Verma, Guido Powell, Yu Luo, David Stephens, David L. Buckeridge

    Abstract: Modeling disease progression in healthcare administrative databases is complicated by the fact that patients are observed only at irregular intervals when they seek healthcare services. In a longitudinal cohort of 76,888 patients with chronic obstructive pulmonary disease (COPD), we used a continuous-time hidden Markov model with a generalized linear model to model healthcare utilization events. W… ▽ More

    Submitted 2 December, 2018; originally announced December 2018.

    Comments: Machine Learning for Health (ML4H) Workshop at NeurIPS 2018 arXiv:1811.07216

    Report number: ML4H/2018/145

  10. arXiv:1808.00177  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Learning Dexterous In-Hand Manipulation

    Authors: OpenAI, Marcin Andrychowicz, Bowen Baker, Maciek Chociej, Rafal Jozefowicz, Bob McGrew, Jakub Pachocki, Arthur Petron, Matthias Plappert, Glenn Powell, Alex Ray, Jonas Schneider, Szymon Sidor, Josh Tobin, Peter Welinder, Lilian Weng, Wojciech Zaremba

    Abstract: We use reinforcement learning (RL) to learn dexterous in-hand manipulation policies which can perform vision-based object reorientation on a physical Shadow Dexterous Hand. The training is performed in a simulated environment in which we randomize many of the physical properties of the system like friction coefficients and an object's appearance. Our policies transfer to the physical robot despite… ▽ More

    Submitted 18 January, 2019; v1 submitted 1 August, 2018; originally announced August 2018.

    Comments: Making OpenAI the first author. We wish this paper to be cited as "Learning Dexterous In-Hand Manipulation" by OpenAI et al. We are replicating the approach from the physics community: arXiv:1812.06489

  11. arXiv:1802.09464  [pdf, other

    cs.LG cs.AI cs.RO

    Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research

    Authors: Matthias Plappert, Marcin Andrychowicz, Alex Ray, Bob McGrew, Bowen Baker, Glenn Powell, Jonas Schneider, Josh Tobin, Maciek Chociej, Peter Welinder, Vikash Kumar, Wojciech Zaremba

    Abstract: The purpose of this technical report is two-fold. First of all, it introduces a suite of challenging continuous control tasks (integrated with OpenAI Gym) based on currently existing robotics hardware. The tasks include pushing, sliding and pick & place with a Fetch robotic arm as well as in-hand object manipulation with a Shadow Dexterous Hand. All tasks have sparse binary rewards and follow a Mu… ▽ More

    Submitted 10 March, 2018; v1 submitted 26 February, 2018; originally announced February 2018.

  12. arXiv:0803.0168  [pdf

    cs.HC

    Comparison of Characteristics and Practices amongst Spreadsheet Users with Different Levels of Experience

    Authors: Kenneth R. Baker, Stephen G. Powell, Barry Lawson, Lynn Foster-Johnson

    Abstract: We developed an internet-based questionnaire on spreadsheet use that we administered to a large number of users in several companies and organizations to document how spreadsheets are currently being developed and used in business. In this paper, we discuss the results drawn from of a comparison of responses from individuals with the most experience and expertise with those from individuals with… ▽ More

    Submitted 2 March, 2008; originally announced March 2008.

    Comments: 16 Pages, 11 Tables

    ACM Class: J.1; H.4.1; K.6.4; D.2.5; D.2.9; K.8.1

    Journal ref: Proc. European Spreadsheet Risks Int. Grp. (EuSpRIG) 2006 205-219 ISBN:1-905617-08-9

  13. arXiv:0801.0715  [pdf

    cs.CY

    Impact of Errors in Operational Spreadsheets

    Authors: Stephen G. Powell, Barry Lawson, Kenneth R. Baker

    Abstract: All users of spreadsheets struggle with the problem of errors. Errors are thought to be prevalent in spreadsheets, and in some instances they have cost organizations millions of dollars. In a previous study of 50 operational spreadsheets we found errors in 0.8% to 1.8% of all formula cells, depending on how errors are defined. In the current study we estimate the quantitative impacts of errors i… ▽ More

    Submitted 4 January, 2008; originally announced January 2008.

    Comments: 12 pages including references

    ACM Class: J.1; H.4.1; K.6.4; D.2.5; D.2.9; K.8.1

    Journal ref: Proc. European Spreadsheet Risks Int. Grp. 2007 57-68 ISBN 978-905617-58-6