Showing 1–46 of 46 results for author: Nori, A

Searching in archive cs.
  1. arXiv:2408.08210  [pdf, other]

    cs.LG

    Does Reasoning Emerge? Examining the Probabilities of Causation in Large Language Models

    Authors: Javier González, Aditya V. Nori

    Abstract: Recent advances in AI have been significantly driven by the capabilities of large language models (LLMs) to solve complex problems in ways that resemble human thinking. However, there is an ongoing debate about the extent to which LLMs are capable of actual reasoning. Central to this debate are two key probabilistic concepts that are essential for connecting causes to their effects: the probabilit…

    Submitted 15 August, 2024; originally announced August 2024.

  2. arXiv:2407.00004  [pdf, other]

    q-bio.BM cs.AI cs.LG q-bio.QM

    Multi-objective generative AI for designing novel brain-targeting small molecules

    Authors: Ayush Noori, Iñaki Arango, William E. Byrd, Nada Amin

    Abstract: The strict selectivity of the blood-brain barrier (BBB) represents one of the most formidable challenges to successful central nervous system (CNS) drug delivery. Computational methods to generate BBB permeable drugs in silico may be valuable tools in the CNS drug design pipeline. However, in real-world applications, BBB penetration alone is insufficient; rather, after transiting the BBB, molecule…

    Submitted 16 April, 2024; originally announced July 2024.

    Comments: 20 pages, 4 figures, Generative and Experimental Perspectives for Biomolecular Design Workshop at the 12th International Conference on Learning Representations

  3. arXiv:2406.18786  [pdf, other]

    cs.AR

    Constable: Improving Performance and Power Efficiency by Safely Eliminating Load Instruction Execution

    Authors: Rahul Bera, Adithya Ranganathan, Joydeep Rakshit, Sujit Mahto, Anant V. Nori, Jayesh Gaur, Ataberk Olgun, Konstantinos Kanellopoulos, Mohammad Sadrosadati, Sreenivas Subramoney, Onur Mutlu

    Abstract: Load instructions often limit instruction-level parallelism (ILP) in modern processors due to data and resource dependences they cause. Prior techniques like Load Value Prediction (LVP) and Memory Renaming (MRN) mitigate load data dependence by predicting the data value of a load instruction. However, they fail to mitigate load resource dependence as the predicted load instruction gets executed no…

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: To appear in the proceedings of 51st International Symposium on Computer Architecture (ISCA)

  4. arXiv:2405.05299  [pdf, other]

    cs.HC cs.AI

    Challenges for Responsible AI Design and Workflow Integration in Healthcare: A Case Study of Automatic Feeding Tube Qualification in Radiology

    Authors: Anja Thieme, Abhijith Rajamohan, Benjamin Cooper, Heather Groombridge, Robert Simister, Barney Wong, Nicholas Woznitza, Mark Ames Pinnock, Maria Teodora Wetscherek, Cecily Morrison, Hannah Richardson, Fernando Pérez-García, Stephanie L. Hyland, Shruthi Bannur, Daniel C. Castro, Kenza Bouzid, Anton Schwaighofer, Mercy Ranjit, Harshita Sharma, Matthew P. Lungren, Ozan Oktay, Javier Alvarez-Valle, Aditya Nori, Stephen Harris, Joseph Jacob

    Abstract: Nasogastric tubes (NGTs) are feeding tubes that are inserted through the nose into the stomach to deliver nutrition or medication. If not placed correctly, they can cause serious harm, even death to patients. Recent AI developments demonstrate the feasibility of robustly detecting NGT placement from Chest X-ray images to reduce risks of sub-optimally or critically placed NGTs being missed or delay…

    Submitted 8 May, 2024; originally announced May 2024.

    ACM Class: H.5.m; I.2.m

  5. arXiv:2404.02831  [pdf, other]

    cs.AI

    Empowering Biomedical Discovery with AI Agents

    Authors: Shanghua Gao, Ada Fang, Yepeng Huang, Valentina Giunchiglia, Ayush Noori, Jonathan Richard Schwarz, Yasha Ektefaie, Jovana Kondic, Marinka Zitnik

    Abstract: We envision "AI scientists" as systems capable of skeptical learning and reasoning that empower biomedical research through collaborative agents that integrate AI models and biomedical tools with experimental platforms. Rather than taking humans out of the discovery process, biomedical AI agents combine human creativity and expertise with AI's ability to analyze large datasets, navigate hypothesis…

    Submitted 24 July, 2024; v1 submitted 3 April, 2024; originally announced April 2024.

  6. Multimodal Healthcare AI: Identifying and Designing Clinically Relevant Vision-Language Applications for Radiology

    Authors: Nur Yildirim, Hannah Richardson, Maria T. Wetscherek, Junaid Bajwa, Joseph Jacob, Mark A. Pinnock, Stephen Harris, Daniel Coelho de Castro, Shruthi Bannur, Stephanie L. Hyland, Pratik Ghosh, Mercy Ranjit, Kenza Bouzid, Anton Schwaighofer, Fernando Pérez-García, Harshita Sharma, Ozan Oktay, Matthew Lungren, Javier Alvarez-Valle, Aditya Nori, Anja Thieme

    Abstract: Recent advances in AI combine large language models (LLMs) with vision encoders that bring forward unprecedented technical capabilities to leverage for a wide range of healthcare applications. Focusing on the domain of radiology, vision-language models (VLMs) achieve good performance results for tasks such as generating radiology findings based on a patient's medical image, or answering visual que…

    Submitted 21 February, 2024; originally announced February 2024.

    Comments: to appear at CHI 2024

  7. arXiv:2312.12865  [pdf, other]

    cs.CV cs.AI

    RadEdit: stress-testing biomedical vision models via diffusion image editing

    Authors: Fernando Pérez-García, Sam Bond-Taylor, Pedro P. Sanchez, Boris van Breugel, Daniel C. Castro, Harshita Sharma, Valentina Salvatelli, Maria T. A. Wetscherek, Hannah Richardson, Matthew P. Lungren, Aditya Nori, Javier Alvarez-Valle, Ozan Oktay, Maximilian Ilse

    Abstract: Biomedical imaging datasets are often small and biased, meaning that real-world performance of predictive models can be substantially lower than expected from internal testing. This work proposes using generative image editing to simulate dataset shifts and diagnose failure modes of biomedical vision models; this can be used in advance of deployment to assess readiness, potentially reducing cost a…

    Submitted 3 April, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

  8. arXiv:2311.03033  [pdf, ps, other]

    cs.LG cs.AI

    Beyond Words: A Mathematical Framework for Interpreting Large Language Models

    Authors: Javier González, Aditya V. Nori

    Abstract: Large language models (LLMs) are powerful AI tools that can generate and comprehend natural language text and other complex information. However, the field lacks a mathematical framework to systematically describe, compare and improve LLMs. We propose Hex, a framework that clarifies key terms and concepts in LLM research, such as hallucinations, alignment, self-verification and chain-of-thought rea…

    Submitted 6 November, 2023; originally announced November 2023.

    Comments: 4 figures, 18 pages

  9. arXiv:2311.01301  [pdf, other]

    cs.LG cs.AI stat.ME

    TRIALSCOPE: A Unifying Causal Framework for Scaling Real-World Evidence Generation with Biomedical Language Models

    Authors: Javier González, Cliff Wong, Zelalem Gero, Jass Bagga, Risa Ueno, Isabel Chien, Eduard Oravkin, Emre Kiciman, Aditya Nori, Roshanthi Weerasinghe, Rom S. Leidner, Brian Piening, Tristan Naumann, Carlo Bifulco, Hoifung Poon

    Abstract: The rapid digitization of real-world data offers an unprecedented opportunity for optimizing healthcare delivery and accelerating biomedical discovery. In practice, however, such data is most abundantly available in unstructured forms, such as clinical notes in electronic medical records (EMRs), and it is generally plagued by confounders. In this paper, we present TRIALSCOPE, a unifying framework…

    Submitted 6 November, 2023; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: 6 Figures, 22 Pages, 3 Tables

  10. arXiv:2310.14573  [pdf, other]

    cs.CL

    Exploring the Boundaries of GPT-4 in Radiology

    Authors: Qianchu Liu, Stephanie Hyland, Shruthi Bannur, Kenza Bouzid, Daniel C. Castro, Maria Teodora Wetscherek, Robert Tinn, Harshita Sharma, Fernando Pérez-García, Anton Schwaighofer, Pranav Rajpurkar, Sameer Tajdin Khanna, Hoifung Poon, Naoto Usuyama, Anja Thieme, Aditya V. Nori, Matthew P. Lungren, Ozan Oktay, Javier Alvarez-Valle

    Abstract: The recent success of general-domain large language models (LLMs) has significantly changed the natural language processing paradigm towards a unified foundation model across domains and applications. In this paper, we focus on assessing the performance of GPT-4, the most capable LLM so far, on the text-based applications for radiology reports, comparing against state-of-the-art (SOTA) radiology-s…

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 main

  11. arXiv:2310.13767  [pdf, other]

    cs.LG

    Graph AI in Medicine

    Authors: Ruth Johnson, Michelle M. Li, Ayush Noori, Owen Queen, Marinka Zitnik

    Abstract: In clinical artificial intelligence (AI), graph representation learning, mainly through graph neural networks (GNNs), stands out for its capability to capture intricate relationships within structured clinical datasets. With diverse data -- from patient records to imaging -- GNNs process data holistically by viewing modalities as nodes interconnected by their relationships. Graph AI facilitates mo…

    Submitted 11 December, 2023; v1 submitted 20 October, 2023; originally announced October 2023.

  12. arXiv:2310.12155  [pdf]

    cs.NE

    Balancing exploration and exploitation phases in whale optimization algorithm: an insightful and empirical analysis

    Authors: Aram M. Ahmed, Tarik A. Rashid, Bryar A. Hassan, Jaffer Majidpour, Kaniaw A. Noori, Chnoor Maheadeen Rahman, Mohmad Hussein Abdalla, Shko M. Qader, Noor Tayfor, Naufel B Mohammed

    Abstract: Agents of any metaheuristic algorithms are moving in two modes, namely exploration and exploitation. Obtaining robust results in any algorithm is strongly dependent on how to balance between these two modes. Whale optimization algorithm as a robust and well recognized metaheuristic algorithm in the literature, has proposed a novel scheme to achieve this balance. It has also shown superior results…

    Submitted 3 September, 2023; originally announced October 2023.

    Comments: 11 pages

  13. arXiv:2310.07723  [pdf]

    cs.NE

    Equitable and Fair Performance Evaluation of Whale Optimization Algorithm

    Authors: Bryar A. Hassan, Tarik A. Rashid, Aram Ahmed, Shko M. Qader, Jaffer Majidpour, Mohmad Hussein Abdalla, Noor Tayfor, Hozan K. Hamarashid, Haval Sidqi, Kaniaw A. Noori

    Abstract: It is essential that all algorithms are exhaustively, somewhat, and intelligently evaluated. Nonetheless, evaluating the effectiveness of optimization algorithms equitably and fairly is not an easy process for various reasons. Choosing and initializing essential parameters, such as the size issues of the search area for each method and the number of iterations required to reduce the issues, might…

    Submitted 4 September, 2023; originally announced October 2023.

    Comments: 21 pages

    Journal ref: 2023

  14. arXiv:2303.13386  [pdf, other]

    cs.CL cs.LG

    Compositional Zero-Shot Domain Transfer with Text-to-Text Models

    Authors: Fangyu Liu, Qianchu Liu, Shruthi Bannur, Fernando Pérez-García, Naoto Usuyama, Sheng Zhang, Tristan Naumann, Aditya Nori, Hoifung Poon, Javier Alvarez-Valle, Ozan Oktay, Stephanie L. Hyland

    Abstract: Label scarcity is a bottleneck for improving task performance in specialised domains. We propose a novel compositional transfer learning framework (DoT5 - domain compositional zero-shot T5) for zero-shot domain transfer. Without access to in-domain labels, DoT5 jointly learns domain knowledge (from MLM of unlabelled in-domain free text) and task knowledge (from task training on more readily availa…

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: Accepted at TACL, pre-MIT Press publication version. 16 pages, 4 figures

  15. arXiv:2301.04558  [pdf, other]

    cs.CV cs.CL

    Learning to Exploit Temporal Structure for Biomedical Vision-Language Processing

    Authors: Shruthi Bannur, Stephanie Hyland, Qianchu Liu, Fernando Pérez-García, Maximilian Ilse, Daniel C. Castro, Benedikt Boecking, Harshita Sharma, Kenza Bouzid, Anja Thieme, Anton Schwaighofer, Maria Wetscherek, Matthew P. Lungren, Aditya Nori, Javier Alvarez-Valle, Ozan Oktay

    Abstract: Self-supervised learning in vision-language processing exploits semantic alignment between imaging and text modalities. Prior work in biomedical VLP has mostly relied on the alignment of single image and report pairs even though clinical notes commonly refer to prior images. This does not only introduce poor alignment between the modalities but also a missed opportunity to exploit rich self-superv…

    Submitted 16 March, 2023; v1 submitted 11 January, 2023; originally announced January 2023.

    Comments: To appear in CVPR 2023

  16. arXiv:2209.03299  [pdf, other]

    cs.LG cs.AI

    Multimodal learning with graphs

    Authors: Yasha Ektefaie, George Dasoulas, Ayush Noori, Maha Farhat, Marinka Zitnik

    Abstract: Artificial intelligence for graphs has achieved remarkable success in modeling complex systems, ranging from dynamic networks in biology to interacting particle systems in physics. However, the increasingly heterogeneous graph datasets call for multimodal methods that can combine different inductive biases: the set of assumptions that algorithms use to make predictions for inputs they have not enc…

    Submitted 23 January, 2023; v1 submitted 7 September, 2022; originally announced September 2022.

    Comments: 27 pages, 5 figures, 2 boxes

  17. arXiv:2207.04806  [pdf, other]

    cs.LG

    Repairing Neural Networks by Leaving the Right Past Behind

    Authors: Ryutaro Tanno, Melanie F. Pradier, Aditya Nori, Yingzhen Li

    Abstract: Prediction failures of machine learning models often arise from deficiencies in training data, such as incorrect labels, outliers, and selection biases. However, such data points that are responsible for a given failure mode are generally not known a priori, let alone a mechanism for repairing the failure. This work draws on the Bayesian view of continual learning, and develops a generic framework…

    Submitted 9 November, 2022; v1 submitted 11 July, 2022; originally announced July 2022.

    Comments: 24 pages, 12 figures

  18. arXiv:2206.01658  [pdf]

    cs.CV

    Identification via Retinal Vessels Combining LBP and HOG

    Authors: Ali Noori

    Abstract: With development of information technology and necessity for high security, using different identification methods has become very important. Each biometric feature has its own advantages and disadvantages and choosing each of them depends on our usage. Retinal scanning is a bio scale method for identification. The retina is composed of vessels and optical disk. The vessels distribution pattern is…

    Submitted 3 June, 2022; originally announced June 2022.

  19. arXiv:2205.14778  [pdf, other]

    cs.AR cs.LG

    TransforMAP: Transformer for Memory Access Prediction

    Authors: Pengmiao Zhang, Ajitesh Srivastava, Anant V. Nori, Rajgopal Kannan, Viktor K. Prasanna

    Abstract: Data Prefetching is a technique that can hide memory latency by fetching data before it is needed by a program. Prefetching relies on accurate memory access prediction, to which task machine learning based methods are increasingly applied. Unlike previous approaches that learn from deltas or offsets and perform one access prediction, we develop TransforMAP, based on the powerful Transformer model,…

    Submitted 29 May, 2022; originally announced May 2022.

  20. Fine-Grained Address Segmentation for Attention-Based Variable-Degree Prefetching

    Authors: Pengmiao Zhang, Ajitesh Srivastava, Anant V. Nori, Rajgopal Kannan, Viktor K. Prasanna

    Abstract: Machine learning algorithms have shown potential to improve prefetching performance by accurately predicting future memory accesses. Existing approaches are based on the modeling of text prediction, considering prefetching as a classification problem for sequence prediction. However, the vast and sparse memory address space leads to large vocabulary, which makes this modeling impractical. The numb…

    Submitted 1 May, 2022; originally announced May 2022.

  21. Making the Most of Text Semantics to Improve Biomedical Vision--Language Processing

    Authors: Benedikt Boecking, Naoto Usuyama, Shruthi Bannur, Daniel C. Castro, Anton Schwaighofer, Stephanie Hyland, Maria Wetscherek, Tristan Naumann, Aditya Nori, Javier Alvarez-Valle, Hoifung Poon, Ozan Oktay

    Abstract: Multi-modal data abounds in biomedicine, such as radiology images and reports. Interpreting this data at scale is essential for improving clinical care and accelerating clinical research. Biomedical text with its complex semantics poses additional challenges in vision--language modelling compared to the general domain, and previous work has used insufficiently adapted models that lack domain-speci…

    Submitted 21 July, 2022; v1 submitted 20 April, 2022; originally announced April 2022.

    Comments: To appear in ECCV 2022. Code: https://aka.ms/biovil-code Dataset: https://aka.ms/ms-cxr Demo Notebook: https://aka.ms/biovil-demo-notebook

    Journal ref: Computer Vision - ECCV 2022, LNCS vol 13696, pp 1-21

  22. arXiv:2202.00478  [pdf]

    cs.CL

    NeuraHealth: An Automated Screening Pipeline to Detect Undiagnosed Cognitive Impairment in Electronic Health Records with Deep Learning and Natural Language Processing

    Authors: Tanish Tyagi, Colin G. Magdamo, Ayush Noori, Zhaozhi Li, Xiao Liu, Mayuresh Deodhar, Zhuoqiao Hong, Wendong Ge, Elissa M. Ye, Yi-han Sheu, Haitham Alabsi, Laura Brenner, Gregory K. Robbins, Sahar Zafar, Nicole Benson, Lidia Moura, John Hsu, Alberto Serrano-Pozo, Dimitry Prokopenko, Rudolph E. Tanzi, Bradley T. Hyman, Deborah Blacker, Shibani S. Mukerji, M. Brandon Westover, Sudeshna Das

    Abstract: Dementia related cognitive impairment (CI) is a neurodegenerative disorder, affecting over 55 million people worldwide and growing rapidly at the rate of one new case every 3 seconds. 75% cases go undiagnosed globally with up to 90% in low-and-middle-income countries, leading to an estimated annual worldwide cost of USD 1.3 trillion, forecasted to reach 2.8 trillion by 2030. With no cure, a recurr…

    Submitted 20 June, 2022; v1 submitted 12 January, 2022; originally announced February 2022.

  23. arXiv:2111.09115  [pdf, other]

    cs.CL cs.LG

    Using Deep Learning to Identify Patients with Cognitive Impairment in Electronic Health Records

    Authors: Tanish Tyagi, Colin G. Magdamo, Ayush Noori, Zhaozhi Li, Xiao Liu, Mayuresh Deodhar, Zhuoqiao Hong, Wendong Ge, Elissa M. Ye, Yi-han Sheu, Haitham Alabsi, Laura Brenner, Gregory K. Robbins, Sahar Zafar, Nicole Benson, Lidia Moura, John Hsu, Alberto Serrano-Pozo, Dimitry Prokopenko, Rudolph E. Tanzi, Bradley T. Hyman, Deborah Blacker, Shibani S. Mukerji, M. Brandon Westover, Sudeshna Das

    Abstract: Dementia is a neurodegenerative disorder that causes cognitive decline and affects more than 50 million people worldwide. Dementia is under-diagnosed by healthcare professionals - only one in four people who suffer from dementia are diagnosed. Even when a diagnosis is made, it may not be entered as a structured International Classification of Diseases (ICD) diagnosis code in a patient's charts. In…

    Submitted 12 November, 2021; originally announced November 2021.

    Comments: Machine Learning for Health (ML4H) - Extended Abstract

  24. Pythia: A Customizable Hardware Prefetching Framework Using Online Reinforcement Learning

    Authors: Rahul Bera, Konstantinos Kanellopoulos, Anant V. Nori, Taha Shahroodi, Sreenivas Subramoney, Onur Mutlu

    Abstract: Past research has proposed numerous hardware prefetching techniques, most of which rely on exploiting one specific type of program context information (e.g., program counter, cacheline address) to predict future memory accesses. These techniques either completely neglect a prefetcher's undesirable effects (e.g., memory bandwidth usage) on the overall system, or incorporate system-level feedback as…

    Submitted 6 April, 2023; v1 submitted 24 September, 2021; originally announced September 2021.

    ACM Class: C.1.2

  25. Active label cleaning for improved dataset quality under resource constraints

    Authors: Melanie Bernhardt, Daniel C. Castro, Ryutaro Tanno, Anton Schwaighofer, Kerem C. Tezcan, Miguel Monteiro, Shruthi Bannur, Matthew Lungren, Aditya Nori, Ben Glocker, Javier Alvarez-Valle, Ozan Oktay

    Abstract: Imperfections in data annotation, known as label noise, are detrimental to the training of machine learning models and have an often-overlooked confounding effect on the assessment of model performance. Nevertheless, employing experts to remove label noise by fully re-annotating large datasets is infeasible in resource-constrained settings, such as healthcare. This work advocates for a data-driven…

    Submitted 10 February, 2022; v1 submitted 1 September, 2021; originally announced September 2021.

    Comments: Accepted for publication in Nature Communications

    Journal ref: Nature Communications 13 (2022) 1161

  26. arXiv:2107.06618  [pdf, other]

    eess.IV cs.CV cs.LG

    Hierarchical Analysis of Visual COVID-19 Features from Chest Radiographs

    Authors: Shruthi Bannur, Ozan Oktay, Melanie Bernhardt, Anton Schwaighofer, Rajesh Jena, Besmira Nushi, Sharan Wadhwani, Aditya Nori, Kal Natarajan, Shazad Ashraf, Javier Alvarez-Valle, Daniel C. Castro

    Abstract: Chest radiography has been a recommended procedure for patient triaging and resource management in intensive care units (ICUs) throughout the COVID-19 pandemic. The machine learning efforts to augment this workflow have been long challenged due to deficiencies in reporting, model evaluation, and failure mode analysis. To address some of those shortcomings, we model radiological features with a hum…

    Submitted 14 July, 2021; originally announced July 2021.

    Comments: Presented at ICML 2021 Workshop on Interpretable Machine Learning in Healthcare

  27. pLUTo: Enabling Massively Parallel Computation in DRAM via Lookup Tables

    Authors: João Dinis Ferreira, Gabriel Falcao, Juan Gómez-Luna, Mohammed Alser, Lois Orosa, Mohammad Sadrosadati, Jeremie S. Kim, Geraldo F. Oliveira, Taha Shahroodi, Anant Nori, Onur Mutlu

    Abstract: Data movement between the main memory and the processor is a key contributor to execution time and energy consumption in memory-intensive applications. This data movement bottleneck can be alleviated using Processing-in-Memory (PiM). One category of PiM is Processing-using-Memory (PuM), in which computation takes place inside the memory array by exploiting intrinsic analog properties of the memory…

    Submitted 3 October, 2022; v1 submitted 15 April, 2021; originally announced April 2021.

    ACM Class: B.3.1; C.1.3

    Journal ref: IEEE/ACM International Symposium on Microarchitecture (MICRO), 2022, 900-919

  28. arXiv:2012.05064  [pdf, other]

    cs.CR

    Secure Medical Image Analysis with CrypTFlow

    Authors: Javier Alvarez-Valle, Pratik Bhatu, Nishanth Chandran, Divya Gupta, Aditya Nori, Aseem Rastogi, Mayank Rathee, Rahul Sharma, Shubham Ugare

    Abstract: We present CRYPTFLOW, a system that converts TensorFlow inference code into Secure Multi-party Computation (MPC) protocols at the push of a button. To do this, we build two components. Our first component is an end-to-end compiler from TensorFlow to a variety of MPC protocols. The second component is an improved semi-honest 3-party protocol that provides significant speedups for inference. We empi…

    Submitted 9 December, 2020; originally announced December 2020.

    Comments: 6 pages. PPML NeurIPS 2020 Workshop, Vancouver, Canada. arXiv admin note: substantial text overlap with arXiv:1909.07814

  29. arXiv:2011.11695  [pdf, other]

    cs.AR

    Proximu$: Efficiently Scaling DNN Inference in Multi-core CPUs through Near-Cache Compute

    Authors: Anant V. Nori, Rahul Bera, Shankar Balachandran, Joydeep Rakshit, Om J. Omer, Avishaii Abuhatzera, Belliappa Kuttanna, Sreenivas Subramoney

    Abstract: Deep Neural Network (DNN) inference is emerging as the fundamental bedrock for a multitude of utilities and services. CPUs continue to scale up their raw compute capabilities for DNN inference along with mature high performance libraries to extract optimal performance. While general purpose CPUs offer unique attractive advantages for DNN inference at both datacenter and edge, they have primarily e…

    Submitted 2 December, 2020; v1 submitted 23 November, 2020; originally announced November 2020.

    Comments: 18 pages, 21 figures

  30. arXiv:2011.07555  [pdf, other]

    cs.CR cs.CY

    Towards Compliant Data Management Systems for Healthcare ML

    Authors: Goutham Ramakrishnan, Aditya Nori, Hannah Murfet, Pashmina Cameron

    Abstract: The increasing popularity of machine learning approaches and the rising awareness of data protection and data privacy presents an opportunity to build truly secure and trustworthy healthcare systems. Regulations such as GDPR and HIPAA present broad guidelines and frameworks, but the implementation can present technical challenges. Compliant data management systems require enforcement of a number o…

    Submitted 15 November, 2020; originally announced November 2020.

  31. arXiv:2011.06489  [pdf, other]

    cs.CL

    Natural Language Processing to Detect Cognitive Concerns in Electronic Health Records Using Deep Learning

    Authors: Zhuoqiao Hong, Colin G. Magdamo, Yi-han Sheu, Prathamesh Mohite, Ayush Noori, Elissa M. Ye, Wendong Ge, Haoqi Sun, Laura Brenner, Gregory Robbins, Shibani Mukerji, Sahar Zafar, Nicole Benson, Lidia Moura, John Hsu, Bradley T. Hyman, Michael B. Westover, Deborah Blacker, Sudeshna Das

    Abstract: Dementia is under-recognized in the community, under-diagnosed by healthcare professionals, and under-coded in claims data. Information on cognitive dysfunction, however, is often found in unstructured clinician notes within medical records but manual review by experts is time consuming and often prone to errors. Automated mining of these notes presents a potential opportunity to label patients wi…

    Submitted 12 November, 2020; originally announced November 2020.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2020 - Extended Abstract

    MSC Class: I.2.7

  32. arXiv:2009.07692  [pdf, other]

    cs.AR q-bio.GN

    GenASM: A High-Performance, Low-Power Approximate String Matching Acceleration Framework for Genome Sequence Analysis

    Authors: Damla Senol Cali, Gurpreet S. Kalsi, Zülal Bingöl, Can Firtina, Lavanya Subramanian, Jeremie S. Kim, Rachata Ausavarungnirun, Mohammed Alser, Juan Gomez-Luna, Amirali Boroumand, Anant Nori, Allison Scibisz, Sreenivas Subramoney, Can Alkan, Saugata Ghose, Onur Mutlu

    Abstract: Genome sequence analysis has enabled significant advancements in medical and scientific areas such as personalized medicine, outbreak tracing, and the understanding of evolution. Unfortunately, it is currently bottlenecked by the computational power and memory bandwidth limitations of existing systems, as many of the steps in genome sequence analysis must process a large amount of data. A major co…

    Submitted 16 September, 2020; originally announced September 2020.

    Comments: To appear in MICRO 2020

  33. DSPatch: Dual Spatial Pattern Prefetcher

    Authors: Rahul Bera, Anant V. Nori, Onur Mutlu, Sreenivas Subramoney

    Abstract: High main memory latency continues to limit performance of modern high-performance out-of-order cores. While DRAM latency has remained nearly the same over many generations, DRAM bandwidth has grown significantly due to higher frequencies, newer architectures (DDR4, LPDDR4, GDDR5) and 3D-stacked memory packaging (HBM). Current state-of-the-art prefetchers do not do well in extracting higher perfor…

    Submitted 7 October, 2019; originally announced October 2019.

    Comments: This work is to appear in MICRO 2019

  34. arXiv:1909.12732  [pdf, other]

    cs.LG stat.ML

    Alleviating Privacy Attacks via Causal Learning

    Authors: Shruti Tople, Amit Sharma, Aditya Nori

    Abstract: Machine learning models, especially deep neural networks have been shown to be susceptible to privacy attacks such as membership inference where an adversary can detect whether a data point was used for training a black-box model. Such privacy risks are exacerbated when a model's predictions are used on an unseen data distribution. To alleviate privacy attacks, we demonstrate the benefit of predic…

    Submitted 17 July, 2020; v1 submitted 27 September, 2019; originally announced September 2019.

    Comments: Accepted at International Conference on Machine Learning, 2020

  35. arXiv:1905.07457  [pdf, other]

    cs.PL cs.LG

    Overfitting in Synthesis: Theory and Practice (Extended Version)

    Authors: Saswat Padhi, Todd Millstein, Aditya Nori, Rahul Sharma

    Abstract: In syntax-guided synthesis (SyGuS), a synthesizer's goal is to automatically generate a program belonging to a grammar of possible implementations that meets a logical specification. We investigate a common limitation across state-of-the-art SyGuS tools that perform counterexample-guided inductive synthesis (CEGIS). We empirically observe that as the expressiveness of the provided grammar increase…

    Submitted 7 June, 2019; v1 submitted 17 May, 2019; originally announced May 2019.

    Comments: 24 pages (5 pages of appendices), 7 figures, includes proofs of theorems

  36. arXiv:1902.05983  [pdf, other]

    cs.LG cs.PL cs.SE stat.ML

    Robustness of Neural Networks: A Probabilistic and Practical Approach

    Authors: Ravi Mangal, Aditya V. Nori, Alessandro Orso

    Abstract: Neural networks are becoming increasingly prevalent in software, and it is therefore important to be able to verify their behavior. Because verifying the correctness of neural networks is extremely challenging, it is common to focus on the verification of other properties of these systems. One important property, in particular, is robustness. Most existing definitions of robustness, however, focus…

    Submitted 15 February, 2019; originally announced February 2019.

    Comments: Accepted for publication at ICSE-NIER 2019

  37. arXiv:1807.06699  [pdf, other]

    cs.NE cs.CV cs.LG stat.ML

    Adaptive Neural Trees

    Authors: Ryutaro Tanno, Kai Arulkumaran, Daniel C. Alexander, Antonio Criminisi, Aditya Nori

    Abstract: Deep neural networks and decision trees operate on largely separate paradigms; typically, the former performs representation learning with pre-specified architectures, while the latter is characterised by learning hierarchies over pre-specified features with data-driven architectures. We unite the two via adaptive neural trees (ANTs) that incorporates representation learning into edges, routing fu…

    Submitted 9 June, 2019; v1 submitted 17 July, 2018; originally announced July 2018.

    Comments: International Conference on Machine Learning 2019

  38. arXiv:1806.02679  [pdf, other]

    cs.LG cs.CV cs.NE stat.ML

    Semi-Supervised Learning via Compact Latent Space Clustering

    Authors: Konstantinos Kamnitsas, Daniel C. Castro, Loic Le Folgoc, Ian Walker, Ryutaro Tanno, Daniel Rueckert, Ben Glocker, Antonio Criminisi, Aditya Nori

    Abstract: We present a novel cost function for semi-supervised learning of neural networks that encourages compact clustering of the latent space to facilitate separation. The key idea is to dynamically create a graph over embeddings of labeled and unlabeled samples of a training batch to capture underlying structure in feature space, and use label propagation to estimate its high and low density regions. W…

    Submitted 29 July, 2018; v1 submitted 7 June, 2018; originally announced June 2018.

    Comments: Presented as a long oral in ICML 2018. Post-conference camera ready

  39. arXiv:1805.08403  [pdf, other]

    cs.CV

    Autofocus Layer for Semantic Segmentation

    Authors: Yao Qin, Konstantinos Kamnitsas, Siddharth Ancha, Jay Nanavati, Garrison Cottrell, Antonio Criminisi, Aditya Nori

    Abstract: We propose the autofocus convolutional layer for semantic segmentation with the objective of enhancing the capabilities of neural networks for multi-scale processing. Autofocus layers adaptively change the size of the effective receptive field based on the processed context to generate more powerful features. This is achieved by parallelising multiple convolutional layers with different dilation r…

    Submitted 11 June, 2018; v1 submitted 22 May, 2018; originally announced May 2018.

    Comments: Published at MICCAI 2018

  40. arXiv:1702.05437  [pdf, other]

    cs.PL cs.AI

    Quantifying Program Bias

    Authors: Aws Albarghouthi, Loris D'Antoni, Samuel Drews, Aditya Nori

    Abstract: With the range and sensitivity of algorithmic decisions expanding at a break-neck speed, it is imperative that we aggressively investigate whether programs are biased. We propose a novel probabilistic program analysis technique and apply it to quantifying bias in decision-making programs. Specifically, we (i) present a sound and complete automated verification technique for proving quantitative pr…

    Submitted 6 March, 2017; v1 submitted 17 February, 2017; originally announced February 2017.

  41. arXiv:1612.08894  [pdf, other]

    cs.CV

    Unsupervised domain adaptation in brain lesion segmentation with adversarial networks

    Authors: Konstantinos Kamnitsas, Christian Baumgartner, Christian Ledig, Virginia F. J. Newcombe, Joanna P. Simpson, Andrew D. Kane, David K. Menon, Aditya Nori, Antonio Criminisi, Daniel Rueckert, Ben Glocker

    Abstract: Significant advances have been made towards building accurate automatic segmentation systems for a variety of biomedical applications using machine learning. However, the performance of these systems often degrades when they are applied on new data that differ from the training data, for example, due to variations in imaging protocols. Manually annotating new data for each test domain is not a fea…

    Submitted 28 December, 2016; originally announced December 2016.

  42. arXiv:1610.06067  [pdf, other]

    cs.PL cs.AI

    Fairness as a Program Property

    Authors: Aws Albarghouthi, Loris D'Antoni, Samuel Drews, Aditya Nori

    Abstract: We explore the following question: Is a decision-making program fair, for some useful definition of fairness? First, we describe how several algorithmic fairness questions can be phrased as program verification problems. Second, we discuss an automated verification technique for proving or disproving fairness of decision-making programs with respect to a probabilistic model of the population.

    Submitted 19 October, 2016; originally announced October 2016.

  43. arXiv:1605.07262  [pdf, other]

    cs.LG cs.CV cs.NE

    Measuring Neural Net Robustness with Constraints

    Authors: Osbert Bastani, Yani Ioannou, Leonidas Lampropoulos, Dimitrios Vytiniotis, Aditya Nori, Antonio Criminisi

    Abstract: Despite having high accuracy, neural nets have been shown to be susceptible to adversarial examples, where a small perturbation to an input can cause it to become mislabeled. We propose metrics for measuring the robustness of a neural net and devise a novel algorithm for approximating these metrics based on an encoding of robustness as a linear program. We show how our metrics can be used to evalu…

    Submitted 16 June, 2017; v1 submitted 23 May, 2016; originally announced May 2016.

  44. arXiv:1603.07292  [pdf, other]

    cs.LG cs.AI cs.PL stat.ML

    Debugging Machine Learning Tasks

    Authors: Aleksandar Chakarov, Aditya Nori, Sriram Rajamani, Shayak Sen, Deepak Vijaykeerthy

    Abstract: Unlike traditional programs (such as operating systems or word processors) which have large amounts of code, machine learning tasks use programs with relatively small amounts of code (written in machine learning libraries), but voluminous amounts of data. Just like developers of traditional programs debug errors in their code, developers of machine learning tasks debug and fix errors in their data…

    Submitted 23 March, 2016; originally announced March 2016.

    ACM Class: D.2.5; I.2.3

  45. arXiv:1208.1743   

    cs.AI

    Hybrid systems modeling for gas transmission network

    Authors: Amir Noori, Mohammad Bagher Menhaj, Masoud Shafiee

    Abstract: Gas Transmission Networks are large-scale complex systems, and corresponding design and control problems are challenging. In this paper, we consider the problem of control and management of these systems in crisis situations. We present these networks by a hybrid systems framework that provides required analysis models. Further, we discuss decision-making using computational discrete and hybrid op…

    Submitted 28 September, 2012; v1 submitted 8 August, 2012; originally announced August 2012.

    Comments: This paper has been withdrawn by the author due to a crucial citation error in the introduction section

    Journal ref: The 4th IFAC Conference on Management and Control of Production and Logistics, 2007

  46. arXiv:1208.1740  [pdf]

    eess.SY cs.SI math.OC

    On the Relation between Centrality Measures and Consensus Algorithms

    Authors: Amir Noori

    Abstract: This paper introduces some tools from graph theory and distributed consensus algorithms to construct an optimal, yet robust, hierarchical information sharing structure for large-scale decision making and control problems. The proposed method is motivated by the robustness and optimality of leaf-venation patterns. We introduce a new class of centrality measures which are built based on the degree d…

    Submitted 8 August, 2012; originally announced August 2012.

    Comments: 2011 International Conference on High Performance Computing and Simulation (HPCS)