Showing 1–10 of 10 results for author: Austin, D

Searching in archive cs.
  1. arXiv:2407.12865  [pdf, other]

    cs.CL cs.AI

    GRAD-SUM: Leveraging Gradient Summarization for Optimal Prompt Engineering

    Authors: Derek Austin, Elliott Chartock

    Abstract: Prompt engineering for large language models (LLMs) is often a manual time-intensive process that involves generating, evaluating, and refining prompts iteratively to ensure high-quality outputs. While there has been work on automating prompt engineering, the solutions generally are either tuned to specific tasks with given answers or are quite costly. We introduce GRAD-SUM, a scalable and flexibl…

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 15 pages, 2 figures

  2. Bayesian Optimization with LLM-Based Acquisition Functions for Natural Language Preference Elicitation

    Authors: David Eric Austin, Anton Korikov, Armin Toroghi, Scott Sanner

    Abstract: Designing preference elicitation (PE) methodologies that can quickly ascertain a user's top item preferences in a cold-start setting is a key challenge for building effective and personalized conversational recommendation (ConvRec) systems. While large language models (LLMs) enable fully natural language (NL) PE dialogues, we hypothesize that monolithic LLM NL-PE approaches lack the multi-turn, de…

    Submitted 19 August, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

  3. arXiv:2403.12894  [pdf, other]

    cs.CV

    MEDBind: Unifying Language and Multimodal Medical Data Embeddings

    Authors: Yuan Gao, Sangwook Kim, David E Austin, Chris McIntosh

    Abstract: Medical vision-language pretraining models (VLPM) have achieved remarkable progress in fusing chest X-rays (CXR) with clinical texts, introducing image-text data binding approaches that enable zero-shot learning and downstream clinical tasks. However, the current landscape lacks the holistic integration of additional medical modalities, such as electrocardiograms (ECG). We present MEDBind (Medical…

    Submitted 20 March, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

  4. arXiv:2307.01492  [pdf, other]

    cs.CV cs.RO

    FB-OCC: 3D Occupancy Prediction based on Forward-Backward View Transformation

    Authors: Zhiqi Li, Zhiding Yu, David Austin, Mingsheng Fang, Shiyi Lan, Jan Kautz, Jose M. Alvarez

    Abstract: This technical report summarizes the winning solution for the 3D Occupancy Prediction Challenge, which is held in conjunction with the CVPR 2023 Workshop on End-to-End Autonomous Driving and CVPR 23 Workshop on Vision-Centric Autonomous Driving Workshop. Our proposed solution FB-OCC builds upon FB-BEV, a cutting-edge camera-based bird's-eye view perception design using forward-backward projection.…

    Submitted 4 July, 2023; originally announced July 2023.

    Comments: Outstanding Champion and Innovation Award in the 3D Occupancy Prediction Challenge (CVPR23)

  5. arXiv:2305.07152  [pdf, other]

    cs.CV

    Surgical tool classification and localization: results and methods from the MICCAI 2022 SurgToolLoc challenge

    Authors: Aneeq Zia, Kiran Bhattacharyya, Xi Liu, Max Berniker, Ziheng Wang, Rogerio Nespolo, Satoshi Kondo, Satoshi Kasai, Kousuke Hirasawa, Bo Liu, David Austin, Yiheng Wang, Michal Futrega, Jean-Francois Puget, Zhenqiang Li, Yoichi Sato, Ryo Fujii, Ryo Hachiuma, Mana Masuda, Hideo Saito, An Wang, Mengya Xu, Mobarakol Islam, Long Bai, Winnie Pang, et al. (46 additional authors not shown)

    Abstract: The ability to automatically detect and track surgical instruments in endoscopic videos can enable transformational interventions. Assessing surgical performance and efficiency, identifying skilled tool use and choreography, and planning operational and logistical aspects of OR resources are just a few of the applications that could benefit. Unfortunately, obtaining the annotations needed to train…

    Submitted 31 May, 2023; v1 submitted 11 May, 2023; originally announced May 2023.

  6. arXiv:2206.05442  [pdf, ps, other]

    cs.LG

    From Human Days to Machine Seconds: Automatically Answering and Generating Machine Learning Final Exams

    Authors: Iddo Drori, Sarah J. Zhang, Reece Shuttleworth, Sarah Zhang, Keith Tyser, Zad Chin, Pedro Lantigua, Saisamrit Surbehera, Gregory Hunter, Derek Austin, Leonard Tang, Yann Hicke, Sage Simhon, Sathwik Karnik, Darnell Granberry, Madeleine Udell

    Abstract: A final exam in machine learning at a top institution such as MIT, Harvard, or Cornell typically takes faculty days to write, and students hours to solve. We demonstrate that large language models pass machine learning finals at a human level, on finals available online after the models were trained, and automatically generate new human-quality final exam questions in seconds. Previous work has de…

    Submitted 28 June, 2023; v1 submitted 11 June, 2022; originally announced June 2022.

    Comments: 9 pages

  7. arXiv:2108.09751  [pdf]

    cs.CY

    Connecting the Dots in Nutritional Rehabilitation: A Qualitative Study on ICT and Community Based Care

    Authors: Deepa Austin, Amit Prakash

    Abstract: 'Fragmentation in care' continuum is often considered as a shortcoming of Health system whereas, 'Integration of care' is widely acclaimed as a viable solution to fragmentation. In last two decades, Information and communication technologies (ICTs), by virtue of their ability to integrate information for action, has been extensively used in addressing many public health problems like malnutrition.…

    Submitted 22 August, 2021; originally announced August 2021.

    Comments: In proceedings of the 1st Virtual Conference on Implications of Information and Digital Technologies for Development, 2021

  8. arXiv:1910.11631  [pdf, other]

    cs.CV

    Learning to Localize Temporal Events in Large-scale Video Data

    Authors: Mikel Bober-Irizar, Miha Skalic, David Austin

    Abstract: We address temporal localization of events in large-scale video data, in the context of the Youtube-8M Segments dataset. This emerging field within video recognition can enable applications to identify the precise time a specified event occurs in a video, which has broad implications for video search. To address this we present two separate approaches: (1) a gradient boosted decision tree model on…

    Submitted 25 October, 2019; originally announced October 2019.

    Comments: ICCV 2019, 3rd Youtube-8M Workshop

  9. arXiv:1701.03978  [pdf, ps, other]

    cs.CE physics.chem-ph

    Computer-aided molecular design: An introduction and review of tools, applications, and solution techniques

    Authors: Nick D. Austin, Nikolaos V. Sahinidis, Daniel W. Trahan

    Abstract: This article provides an introduction to and review of the field of computer-aided molecular design (CAMD). It is intended to be approachable for the absolute beginner as well as useful to the seasoned CAMD practitioner. We begin by discussing various quantitative structure-property relationships (QSPRs) which have been demonstrated to work well with CAMD problems. The methods discussed in this ar…

    Submitted 14 January, 2017; originally announced January 2017.

    Comments: 38 pages, 13 figures, 3 tables, 173 references

    Journal ref: Chemical Engineering Research and Design, 116, 2-26, 2016

  10. arXiv:1310.4880  [pdf, other]

    cs.OH

    Gait Velocity Estimation using time interleaved between Consecutive Passive IR Sensor Activations

    Authors: Rajib Rana, Daniel Austin, Peter G. Jacobs, Mohanraj Karunanithi, Jeffrey Kaye

    Abstract: Gait velocity has been consistently shown to be an important indicator and predictor of health status, especially in older adults. It is often assessed clinically, but the assessments occur infrequently and do not allow optimal detection of key health changes when they occur. In this paper, we show that the time gap between activations of a pair of Passive Infrared (PIR) motion sensors installed i…

    Submitted 30 November, 2015; v1 submitted 17 October, 2013; originally announced October 2013.