Zum Hauptinhalt springen

Showing 1–28 of 28 results for author: Allen, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.18105  [pdf, other

    eess.IV cs.AI cs.CV

    Multi-Resolution Histopathology Patch Graphs for Ovarian Cancer Subtyping

    Authors: Jack Breen, Katie Allen, Kieran Zucker, Nicolas M. Orsi, Nishant Ravikumar

    Abstract: Computer vision models are increasingly capable of classifying ovarian epithelial cancer subtypes, but they differ from pathologists by processing small tissue patches at a single resolution. Multi-resolution graph models leverage the spatial relationships of patches at multiple magnifications, learning the context for each patch. In this study, we conduct the most thorough validation of a graph m… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: Initially submitted version of a paper which has been accepted in the GRAIL workshop at MICCAI 2024

  2. arXiv:2406.09292  [pdf, other

    cs.CV cs.AI cs.LG

    Neural Assets: 3D-Aware Multi-Object Scene Synthesis with Image Diffusion Models

    Authors: Ziyi Wu, Yulia Rubanova, Rishabh Kabra, Drew A. Hudson, Igor Gilitschenski, Yusuf Aytar, Sjoerd van Steenkiste, Kelsey R. Allen, Thomas Kipf

    Abstract: We address the problem of multi-object 3D pose control in image diffusion models. Instead of conditioning on a sequence of text tokens, we propose to use a set of per-object representations, Neural Assets, to control the 3D pose of individual objects in a scene. Neural Assets are obtained by pooling visual representations of objects from a reference image, such as a frame in a video, and are train… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Additional details and video results are available at https://neural-assets-paper.github.io/

  3. arXiv:2405.14045  [pdf, other

    cs.LG cs.CV

    Learning rigid-body simulators over implicit shapes for large-scale scenes and vision

    Authors: Yulia Rubanova, Tatiana Lopez-Guevara, Kelsey R. Allen, William F. Whitney, Kimberly Stachenfeld, Tobias Pfaff

    Abstract: Simulating large scenes with many rigid objects is crucial for a variety of applications, such as robotics, engineering, film and video games. Rigid interactions are notoriously hard to model: small changes to the initial state or the simulation parameters can lead to large changes in the final state. Recently, learned simulators based on graph networks (GNNs) were developed as an alternative to h… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  4. arXiv:2405.09990  [pdf, other

    eess.IV cs.AI cs.CV

    Histopathology Foundation Models Enable Accurate Ovarian Cancer Subtype Classification

    Authors: Jack Breen, Katie Allen, Kieran Zucker, Lucy Godson, Nicolas M. Orsi, Nishant Ravikumar

    Abstract: Large pretrained transformers are increasingly being developed as generalised foundation models which can underpin powerful task-specific artificial intelligence models. Histopathology foundation models show promise across many tasks, but analyses have been limited by arbitrary hyperparameters that were not tuned to the specific task/dataset. We report the most rigorous single-task validation cond… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

  5. arXiv:2401.11985  [pdf, other

    cs.LG cs.CV cs.RO

    Scaling Face Interaction Graph Networks to Real World Scenes

    Authors: Tatiana Lopez-Guevara, Yulia Rubanova, William F. Whitney, Tobias Pfaff, Kimberly Stachenfeld, Kelsey R. Allen

    Abstract: Accurately simulating real world object dynamics is essential for various applications such as robotics, engineering, graphics, and design. To better capture complex real dynamics such as contact and friction, learned simulators based on graph networks have recently shown great promise. However, applying these learned simulators to real scenes comes with two major challenges: first, scaling learne… ▽ More

    Submitted 22 January, 2024; originally announced January 2024.

    Comments: 16 pages, 12 figures

  6. arXiv:2312.05359  [pdf, other

    cs.LG

    Learning 3D Particle-based Simulators from RGB-D Videos

    Authors: William F. Whitney, Tatiana Lopez-Guevara, Tobias Pfaff, Yulia Rubanova, Thomas Kipf, Kimberly Stachenfeld, Kelsey R. Allen

    Abstract: Realistic simulation is critical for applications ranging from robotics to animation. Traditional analytic simulators sometimes struggle to capture sufficiently realistic simulation which can lead to problems including the well known "sim-to-real" gap in robotics. Learned simulators have emerged as an alternative for better capturing real-world physical dynamics, but require access to privileged g… ▽ More

    Submitted 8 December, 2023; originally announced December 2023.

  7. arXiv:2311.10284  [pdf, other

    cs.RO cs.HC

    From "Thumbs Up" to "10 out of 10": Reconsidering Scalar Feedback in Interactive Reinforcement Learning

    Authors: Hang Yu, Reuben M. Aronson, Katherine H. Allen, Elaine Schaertl Short

    Abstract: Learning from human feedback is an effective way to improve robotic learning in exploration-heavy tasks. Compared to the wide application of binary human feedback, scalar human feedback has been used less because it is believed to be noisy and unstable. In this paper, we compare scalar and binary feedback, and demonstrate that scalar feedback benefits learning when properly handled. We collected b… ▽ More

    Submitted 16 November, 2023; originally announced November 2023.

    Journal ref: IROS 2023

  8. arXiv:2310.12866  [pdf, other

    eess.IV cs.AI cs.CV

    Predicting Ovarian Cancer Treatment Response in Histopathology using Hierarchical Vision Transformers and Multiple Instance Learning

    Authors: Jack Breen, Katie Allen, Kieran Zucker, Geoff Hall, Nishant Ravikumar, Nicolas M. Orsi

    Abstract: For many patients, current ovarian cancer treatments offer limited clinical benefit. For some therapies, it is not possible to predict patients' responses, potentially exposing them to the adverse effects of treatment without any therapeutic benefit. As part of the automated prediction of treatment effectiveness in ovarian cancer using histopathological images (ATEC23) challenge, we evaluated the… ▽ More

    Submitted 19 October, 2023; originally announced October 2023.

    Comments: Submission to ATEC23 challenge at MICCAI 2023 conference

  9. arXiv:2309.17403  [pdf, other

    math.NA cs.LG

    Maximal Volume Matrix Cross Approximation for Image Compression and Least Squares Solution

    Authors: Kenneth Allen, Ming-Jun Lai, Zhaiming Shen

    Abstract: We study the classic matrix cross approximation based on the maximal volume submatrices. Our main results consist of an improvement of the classic estimate for matrix cross approximation and a greedy approach for finding the maximal volume submatrices. More precisely, we present a new proof of the classic estimate of the inequality with an improved constant. Also, we present a family of greedy max… ▽ More

    Submitted 6 August, 2024; v1 submitted 29 September, 2023; originally announced September 2023.

  10. arXiv:2309.02040  [pdf, other

    cs.LG cs.AI

    Diffusion Generative Inverse Design

    Authors: Marin Vlastelica, Tatiana López-Guevara, Kelsey Allen, Peter Battaglia, Arnaud Doucet, Kimberley Stachenfeld

    Abstract: Inverse design refers to the problem of optimizing the input of an objective function in order to enact a target outcome. For many real-world engineering problems, the objective function takes the form of a simulator that predicts how the system state will evolve over time, and the design challenge is to optimize the initial conditions that lead to a target outcome. Recent developments in learned… ▽ More

    Submitted 18 September, 2023; v1 submitted 5 September, 2023; originally announced September 2023.

    Comments: ICML workshop on Structured Probabilistic Inference & Generative Modeling

  11. arXiv:2308.02851  [pdf, other

    eess.IV cs.CV cs.LG

    Generative Adversarial Networks for Stain Normalisation in Histopathology

    Authors: Jack Breen, Kieran Zucker, Katie Allen, Nishant Ravikumar, Nicolas M. Orsi

    Abstract: The rapid growth of digital pathology in recent years has provided an ideal opportunity for the development of artificial intelligence-based tools to improve the accuracy and efficiency of clinical diagnoses. One of the significant roadblocks to current research is the high level of visual variability across digital pathology images, causing models to generalise poorly to unseen data. Stain normal… ▽ More

    Submitted 6 March, 2024; v1 submitted 5 August, 2023; originally announced August 2023.

    Comments: Updated to add link to full publication at https://doi.org/10.1007/978-3-031-46238-2_11

  12. arXiv:2303.18005  [pdf, other

    eess.IV cs.AI cs.LG

    Artificial Intelligence in Ovarian Cancer Histopathology: A Systematic Review

    Authors: Jack Breen, Katie Allen, Kieran Zucker, Pratik Adusumilli, Andy Scarsbrook, Geoff Hall, Nicolas M. Orsi, Nishant Ravikumar

    Abstract: Purpose - To characterise and assess the quality of published research evaluating artificial intelligence (AI) methods for ovarian cancer diagnosis or prognosis using histopathology data. Methods - A search of PubMed, Scopus, Web of Science, CENTRAL, and WHO-ICTRP was conducted up to 19/05/2023. The inclusion criteria required that research evaluated AI on histopathology images for diagnostic or p… ▽ More

    Submitted 16 June, 2023; v1 submitted 31 March, 2023; originally announced March 2023.

  13. arXiv:2303.15311  [pdf, other

    cs.LG

    Improving Dual-Encoder Training through Dynamic Indexes for Negative Mining

    Authors: Nicholas Monath, Manzil Zaheer, Kelsey Allen, Andrew McCallum

    Abstract: Dual encoder models are ubiquitous in modern classification and retrieval. Crucial for training such dual encoders is an accurate estimation of gradients from the partition function of the softmax over the large output space; this requires finding negative targets that contribute most significantly ("hard negatives"). Since dual encoder model parameters change during training, the use of tradition… ▽ More

    Submitted 27 March, 2023; originally announced March 2023.

    Comments: To appear at AISTATS 2023

  14. arXiv:2302.08867  [pdf, other

    eess.IV cs.CV

    Efficient subtyping of ovarian cancer histopathology whole slide images using active sampling in multiple instance learning

    Authors: Jack Breen, Katie Allen, Kieran Zucker, Geoff Hall, Nicolas M. Orsi, Nishant Ravikumar

    Abstract: Weakly-supervised classification of histopathology slides is a computationally intensive task, with a typical whole slide image (WSI) containing billions of pixels to process. We propose Discriminative Region Active Sampling for Multiple Instance Learning (DRAS-MIL), a computationally efficient slide classification method using attention scores to focus sampling on highly discriminative regions. W… ▽ More

    Submitted 21 February, 2023; v1 submitted 17 February, 2023; originally announced February 2023.

  15. arXiv:2212.03574  [pdf, other

    cs.LG

    Learning rigid dynamics with face interaction graph networks

    Authors: Kelsey R. Allen, Yulia Rubanova, Tatiana Lopez-Guevara, William Whitney, Alvaro Sanchez-Gonzalez, Peter Battaglia, Tobias Pfaff

    Abstract: Simulating rigid collisions among arbitrary shapes is notoriously difficult due to complex geometry and the strong non-linearity of the interactions. While graph neural network (GNN)-based models are effective at learning to simulate complex physical dynamics, such as fluids, cloth and articulated bodies, they have been less effective and efficient on rigid-body physics, except with very simple sh… ▽ More

    Submitted 7 December, 2022; originally announced December 2022.

  16. arXiv:2202.00728  [pdf, other

    cs.LG

    Physical Design using Differentiable Learned Simulators

    Authors: Kelsey R. Allen, Tatiana Lopez-Guevara, Kimberly Stachenfeld, Alvaro Sanchez-Gonzalez, Peter Battaglia, Jessica Hamrick, Tobias Pfaff

    Abstract: Designing physical artifacts that serve a purpose - such as tools and other functional structures - is central to engineering as well as everyday human behavior. Though automating design has tremendous promise, general-purpose methods do not yet exist. Here we explore a simple, fast, and robust approach to inverse design which combines learned forward simulators based on graph neural networks with… ▽ More

    Submitted 1 February, 2022; originally announced February 2022.

    Comments: First three authors contributed equally

  17. arXiv:2110.00717  [pdf, other

    cs.RO

    Mobile Manipulation Leveraging Multiple Views

    Authors: David Watkins, Peter K Allen, Henrique Maia, Madhavan Seshadri, Jonathan Sanabria, Nicholas Waytowich, Jacob Varley

    Abstract: While both navigation and manipulation are challenging topics in isolation, many tasks require the ability to both navigate and manipulate in concert. To this end, we propose a mobile manipulation system that leverages novel navigation and shape completion methods to manipulate an object with a mobile robot. Our system utilizes uncertainty in the initial estimation of a manipulation target to calc… ▽ More

    Submitted 7 March, 2022; v1 submitted 1 October, 2021; originally announced October 2021.

    Comments: 6 pages, 2 pages of references, 5 figures, 5 tables

  18. arXiv:2103.10562  [pdf, other

    cs.RO

    Dynamic Grasping with Reachability and Motion Awareness

    Authors: Iretiayo Akinola, Jingxi Xu, Shuran Song, Peter K. Allen

    Abstract: Grasping in dynamic environments presents a unique set of challenges. A stable and reachable grasp can become unreachable and unstable as the target object moves, motion planning needs to be adaptive and in real time, the delay in computation makes prediction necessary. In this paper, we present a dynamic grasping framework that is reachability-aware and motion-aware. Specifically, we model the re… ▽ More

    Submitted 18 March, 2021; originally announced March 2021.

  19. arXiv:2004.14554  [pdf, other

    cs.CL cs.CY

    Indirect Identification of Psychosocial Risks from Natural Language

    Authors: Kristen C. Allen, Alex Davis, Tamar Krishnamurti

    Abstract: During the perinatal period, psychosocial health risks, including depression and intimate partner violence, are associated with serious adverse health outcomes for parents and children. To appropriately intervene, healthcare professionals must first identify those at risk, yet stigma often prevents people from directly disclosing the information needed to prompt an assessment. We examine indirect… ▽ More

    Submitted 29 April, 2020; originally announced April 2020.

    Comments: 12 pages, 4 figures

    ACM Class: J.3; J.4; H.5.2

  20. arXiv:2003.04956  [pdf, other

    cs.RO cs.LG

    SQUIRL: Robust and Efficient Learning from Video Demonstration of Long-Horizon Robotic Manipulation Tasks

    Authors: Bohan Wu, Feng Xu, Zhanpeng He, Abhi Gupta, Peter K. Allen

    Abstract: Recent advances in deep reinforcement learning (RL) have demonstrated its potential to learn complex robotic manipulation tasks. However, RL still requires the robot to collect a large amount of real-world experience. To address this problem, recent works have proposed learning from expert demonstrations (LfD), particularly via inverse reinforcement learning (IRL), given its ability to achieve rob… ▽ More

    Submitted 10 March, 2020; originally announced March 2020.

    Comments: 8 pages

  21. arXiv:1907.09620  [pdf, other

    cs.AI cs.LG cs.RO

    Rapid trial-and-error learning with simulation supports flexible tool use and physical reasoning

    Authors: Kelsey R. Allen, Kevin A. Smith, Joshua B. Tenenbaum

    Abstract: Many animals, and an increasing number of artificial agents, display sophisticated capabilities to perceive and manipulate objects. But human beings remain distinctive in their capacity for flexible, creative tool use -- using objects in new ways to act on the world, achieve a goal, or solve a problem. To study this type of general physical problem solving, we introduce the Virtual Tools game. In… ▽ More

    Submitted 29 June, 2020; v1 submitted 22 July, 2019; originally announced July 2019.

    Comments: This manuscript is in press at PNAS. It is an extended version of a paper "Rapid Trial-and-Error Learning in Physical Problem Solving" accepted for oral presentation at the 41st Annual Meeting of the Cognitive Science Society (2019). It represents ongoing work on the part of the authors

  22. arXiv:1904.06317  [pdf, other

    cs.AI

    Few-Shot Bayesian Imitation Learning with Logical Program Policies

    Authors: Tom Silver, Kelsey R. Allen, Alex K. Lew, Leslie Pack Kaelbling, Josh Tenenbaum

    Abstract: Humans can learn many novel tasks from a very small number (1--5) of demonstrations, in stark contrast to the data requirements of nearly tabula rasa deep learning methods. We propose an expressive class of policies, a strong but general prior, and a learning algorithm that, together, can learn interesting policies from very few examples. We represent policies as logical combinations of programs d… ▽ More

    Submitted 16 November, 2019; v1 submitted 12 April, 2019; originally announced April 2019.

    Comments: AAAI 2020

  23. arXiv:1903.03227  [pdf, other

    cs.RO cs.AI cs.LG

    Pixel-Attentive Policy Gradient for Multi-Fingered Grasping in Cluttered Scenes

    Authors: Bohan Wu, Iretiayo Akinola, Peter K. Allen

    Abstract: Recent advances in on-policy reinforcement learning (RL) methods enabled learning agents in virtual environments to master complex tasks with high-dimensional and continuous observation and action spaces. However, leveraging this family of algorithms in multi-fingered robotic grasping remains a challenge due to large sim-to-real fidelity gaps and the high sample complexity of on-policy RL algorith… ▽ More

    Submitted 21 September, 2019; v1 submitted 7 March, 2019; originally announced March 2019.

    Comments: Accepted at IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2019)

  24. arXiv:1902.04552  [pdf, other

    cs.LG stat.ML

    Infinite Mixture Prototypes for Few-Shot Learning

    Authors: Kelsey R. Allen, Evan Shelhamer, Hanul Shin, Joshua B. Tenenbaum

    Abstract: We propose infinite mixture prototypes to adaptively represent both simple and complex data distributions for few-shot learning. Our infinite mixture prototypes represent each class by a set of clusters, unlike existing prototypical methods that represent each class by a single cluster. By inferring the number of clusters, infinite mixture prototypes interpolate between nearest neighbor and protot… ▽ More

    Submitted 12 February, 2019; originally announced February 2019.

  25. arXiv:1812.06298  [pdf, other

    cs.RO cs.LG

    Residual Policy Learning

    Authors: Tom Silver, Kelsey Allen, Josh Tenenbaum, Leslie Kaelbling

    Abstract: We present Residual Policy Learning (RPL): a simple method for improving nondifferentiable policies using model-free deep reinforcement learning. RPL thrives in complex robotic manipulation tasks where good but imperfect controllers are available. In these tasks, reinforcement learning from scratch remains data-inefficient or intractable, but learning a residual on top of the initial controller ca… ▽ More

    Submitted 3 January, 2019; v1 submitted 15 December, 2018; originally announced December 2018.

  26. arXiv:1806.11402  [pdf, other

    cs.RO

    Workspace Aware Online Grasp Planning

    Authors: Iretiayo Akinola, Jacob Varley, Boyuan Chen, Peter K. Allen

    Abstract: This work provides a framework for a workspace aware online grasp planner. This framework greatly improves the performance of standard online grasp planning algorithms by incorporating a notion of reachability into the online grasp planning process. Offline, a database of hundreds of thousands of unique end-effector poses were queried for feasability. At runtime, our grasp planner uses this databa… ▽ More

    Submitted 29 June, 2018; originally announced June 2018.

    Comments: 8 pages, Submitted to IROS 2018

  27. arXiv:1806.01261  [pdf, other

    cs.LG cs.AI stat.ML

    Relational inductive biases, deep learning, and graph networks

    Authors: Peter W. Battaglia, Jessica B. Hamrick, Victor Bapst, Alvaro Sanchez-Gonzalez, Vinicius Zambaldi, Mateusz Malinowski, Andrea Tacchetti, David Raposo, Adam Santoro, Ryan Faulkner, Caglar Gulcehre, Francis Song, Andrew Ballard, Justin Gilmer, George Dahl, Ashish Vaswani, Kelsey Allen, Charles Nash, Victoria Langston, Chris Dyer, Nicolas Heess, Daan Wierstra, Pushmeet Kohli, Matt Botvinick, Oriol Vinyals , et al. (2 additional authors not shown)

    Abstract: Artificial intelligence (AI) has undergone a renaissance recently, making major progress in key domains such as vision, language, control, and decision-making. This has been due, in part, to cheap data and cheap compute resources, which have fit the natural strengths of deep learning. However, many defining characteristics of human intelligence, which developed under much different pressures, rema… ▽ More

    Submitted 17 October, 2018; v1 submitted 4 June, 2018; originally announced June 2018.

  28. arXiv:1806.01203  [pdf, other

    cs.LG cs.AI stat.ML

    Relational inductive bias for physical construction in humans and machines

    Authors: Jessica B. Hamrick, Kelsey R. Allen, Victor Bapst, Tina Zhu, Kevin R. McKee, Joshua B. Tenenbaum, Peter W. Battaglia

    Abstract: While current deep learning systems excel at tasks such as object classification, language processing, and gameplay, few can construct or modify a complex system such as a tower of blocks. We hypothesize that what these systems lack is a "relational inductive bias": a capacity for reasoning about inter-object relations and making choices over a structured description of a scene. To test this hypot… ▽ More

    Submitted 4 June, 2018; originally announced June 2018.

    Comments: In Proceedings of the Annual Meeting of the Cognitive Science Society (CogSci 2018)