Zum Hauptinhalt springen

Showing 1–24 of 24 results for author: Nori, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.07707  [pdf, other

    cs.CV cs.LG

    A Deep Learning Approach to Detect Complete Safety Equipment For Construction Workers Based On YOLOv7

    Authors: Md. Shariful Islam, SM Shaqib, Shahriar Sultan Ramit, Shahrun Akter Khushbu, Abdus Sattar, Sheak Rashed Haider Noori

    Abstract: In the construction sector, ensuring worker safety is of the utmost significance. In this study, a deep learning-based technique is presented for identifying safety gear worn by construction workers, such as helmets, goggles, jackets, gloves, and footwears. The recommended approach uses the YOLO v7 (You Only Look Once) object detection algorithm to precisely locate these safety items. The dataset… ▽ More

    Submitted 13 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

  2. arXiv:2404.15168  [pdf, other

    eess.AS cs.HC cs.LG cs.SD

    Artificial Neural Networks to Recognize Speakers Division from Continuous Bengali Speech

    Authors: Hasmot Ali, Md. Fahad Hossain, Md. Mehedi Hasan, Sheikh Abujar, Sheak Rashed Haider Noori

    Abstract: Voice based applications are ruling over the era of automation because speech has a lot of factors that determine a speakers information as well as speech. Modern Automatic Speech Recognition (ASR) is a blessing in the field of Human-Computer Interaction (HCI) for efficient communication among humans and devices using Artificial Intelligence technology. Speech is one of the easiest mediums of comm… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  3. arXiv:2404.06209  [pdf, other

    cs.LG cs.AI cs.CL

    Elephants Never Forget: Memorization and Learning of Tabular Data in Large Language Models

    Authors: Sebastian Bordt, Harsha Nori, Vanessa Rodrigues, Besmira Nushi, Rich Caruana

    Abstract: While many have shown how Large Language Models (LLMs) can be applied to a diverse set of tasks, the critical issues of data contamination and memorization are often glossed over. In this work, we address this concern for tabular data. Specifically, we introduce a variety of different techniques to assess whether a language model has seen a tabular dataset during training. This investigation revea… ▽ More

    Submitted 20 August, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: COLM camera ready

  4. arXiv:2403.06644  [pdf, other

    cs.LG cs.CL

    Elephants Never Forget: Testing Language Models for Memorization of Tabular Data

    Authors: Sebastian Bordt, Harsha Nori, Rich Caruana

    Abstract: While many have shown how Large Language Models (LLMs) can be applied to a diverse set of tasks, the critical issues of data contamination and memorization are often glossed over. In this work, we address this concern for tabular data. Starting with simple qualitative tests for whether an LLM knows the names and values of features, we introduce a variety of different techniques to assess the degre… ▽ More

    Submitted 11 March, 2024; originally announced March 2024.

    Comments: Table Representation Learning Workshop at NeurIPS 2023

  5. arXiv:2403.01749  [pdf, other

    cs.CL

    Differentially Private Synthetic Data via Foundation Model APIs 2: Text

    Authors: Chulin Xie, Zinan Lin, Arturs Backurs, Sivakanth Gopi, Da Yu, Huseyin A Inan, Harsha Nori, Haotian Jiang, Huishuai Zhang, Yin Tat Lee, Bo Li, Sergey Yekhanin

    Abstract: Text data has become extremely valuable due to the emergence of machine learning algorithms that learn from it. A lot of high-quality text data generated in the real world is private and therefore cannot be shared or used freely due to privacy concerns. Generating synthetic replicas of private text data with a formal privacy guarantee, i.e., differential privacy (DP), offers a promising and scalab… ▽ More

    Submitted 23 July, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: ICML'24 Spotlight

  6. arXiv:2402.14474  [pdf, other

    cs.LG cs.CL

    Data Science with LLMs and Interpretable Models

    Authors: Sebastian Bordt, Ben Lengerich, Harsha Nori, Rich Caruana

    Abstract: Recent years have seen important advances in the building of interpretable models, machine learning models that are designed to be easily understood by humans. In this work, we show that large language models (LLMs) are remarkably good at working with interpretable models, too. In particular, we show that LLMs can describe, interpret, and debug Generalized Additive Models (GAMs). Combining the fle… ▽ More

    Submitted 22 February, 2024; originally announced February 2024.

    Comments: XAI4Sci Workshop at AAAI-24

  7. arXiv:2311.16452  [pdf, other

    cs.CL

    Can Generalist Foundation Models Outcompete Special-Purpose Tuning? Case Study in Medicine

    Authors: Harsha Nori, Yin Tat Lee, Sheng Zhang, Dean Carignan, Richard Edgar, Nicolo Fusi, Nicholas King, Jonathan Larson, Yuanzhi Li, Weishung Liu, Renqian Luo, Scott Mayer McKinney, Robert Osazuwa Ness, Hoifung Poon, Tao Qin, Naoto Usuyama, Chris White, Eric Horvitz

    Abstract: Generalist foundation models such as GPT-4 have displayed surprising capabilities in a wide variety of domains and tasks. Yet, there is a prevalent assumption that they cannot match specialist capabilities of fine-tuned models. For example, most explorations to date on medical competency benchmarks have leveraged domain-specific training, as exemplified by efforts on BioGPT and Med-PaLM. We build… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 21 pages, 7 figures

    ACM Class: I.2.7

  8. Interpretable Predictive Models to Understand Risk Factors for Maternal and Fetal Outcomes

    Authors: Tomas M. Bosschieter, Zifei Xu, Hui Lan, Benjamin J. Lengerich, Harsha Nori, Ian Painter, Vivienne Souter, Rich Caruana

    Abstract: Although most pregnancies result in a good outcome, complications are not uncommon and can be associated with serious implications for mothers and babies. Predictive modeling has the potential to improve outcomes through better understanding of risk factors, heightened surveillance for high risk patients, and more timely and appropriate interventions, thereby helping obstetricians deliver better c… ▽ More

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 25 pages (including appendix and references), 12 figures, 2 tables. J Healthc Inform Res (2023)

  9. arXiv:2309.13069  [pdf

    cs.CL cs.LG

    Machine Learning Technique Based Fake News Detection

    Authors: Biplob Kumar Sutradhar, Md. Zonaid, Nushrat Jahan Ria, Sheak Rashed Haider Noori

    Abstract: False news has received attention from both the general public and the scholarly world. Such false information has the ability to affect public perception, giving nefarious groups the chance to influence the results of public events like elections. Anyone can share fake news or facts about anyone or anything for their personal gain or to cause someone trouble. Also, information varies depending on… ▽ More

    Submitted 18 September, 2023; originally announced September 2023.

  10. arXiv:2308.10783  [pdf, other

    cs.CL cs.LG

    Zero- and Few-Shot Prompting with LLMs: A Comparative Study with Fine-tuned Models for Bangla Sentiment Analysis

    Authors: Md. Arid Hasan, Shudipta Das, Afiyat Anjum, Firoj Alam, Anika Anjum, Avijit Sarker, Sheak Rashed Haider Noori

    Abstract: The rapid expansion of the digital world has propelled sentiment analysis into a critical tool across diverse sectors such as marketing, politics, customer service, and healthcare. While there have been significant advancements in sentiment analysis for widely spoken languages, low-resource languages, such as Bangla, remain largely under-researched due to resource constraints. Furthermore, the rec… ▽ More

    Submitted 4 April, 2024; v1 submitted 21 August, 2023; originally announced August 2023.

    Comments: Accepted at LREC-COLING 2024. Zero-Shot Prompting, Few-Shot Prompting, LLMs, Comparative Study, Fine-tuned Models, Bangla, Sentiment Analysis

    MSC Class: 68T50 ACM Class: I.2.7

  11. arXiv:2308.01157  [pdf, other

    stat.ML cs.AI cs.LG

    LLMs Understand Glass-Box Models, Discover Surprises, and Suggest Repairs

    Authors: Benjamin J. Lengerich, Sebastian Bordt, Harsha Nori, Mark E. Nunnally, Yin Aphinyanaphongs, Manolis Kellis, Rich Caruana

    Abstract: We show that large language models (LLMs) are remarkably good at working with interpretable models that decompose complex outcomes into univariate graph-represented components. By adopting a hierarchical approach to reasoning, LLMs can provide comprehensive model-level summaries without ever requiring the entire model to fit in context. This approach enables LLMs to apply their extensive backgroun… ▽ More

    Submitted 7 August, 2023; v1 submitted 2 August, 2023; originally announced August 2023.

  12. arXiv:2305.15560  [pdf, other

    cs.CV cs.CR cs.LG

    Differentially Private Synthetic Data via Foundation Model APIs 1: Images

    Authors: Zinan Lin, Sivakanth Gopi, Janardhan Kulkarni, Harsha Nori, Sergey Yekhanin

    Abstract: Generating differentially private (DP) synthetic data that closely resembles the original private data is a scalable way to mitigate privacy concerns in the current data-driven world. In contrast to current practices that train customized models for this task, we aim to generate DP Synthetic Data via APIs (DPSDA), where we treat foundation models as blackboxes and only utilize their inference APIs… ▽ More

    Submitted 29 February, 2024; v1 submitted 24 May, 2023; originally announced May 2023.

    Comments: 49 pages, 42 figures

  13. arXiv:2304.09991  [pdf, other

    cs.HC cs.AI cs.CL

    Supporting Human-AI Collaboration in Auditing LLMs with LLMs

    Authors: Charvi Rastogi, Marco Tulio Ribeiro, Nicholas King, Harsha Nori, Saleema Amershi

    Abstract: Large language models are becoming increasingly pervasive and ubiquitous in society via deployment in sociotechnical systems. Yet these language models, be it for classification or generation, have been shown to be biased and behave irresponsibly, causing harm to people at scale. It is crucial to audit these language models rigorously. Existing auditing tools leverage either or both humans and AI… ▽ More

    Submitted 30 November, 2023; v1 submitted 19 April, 2023; originally announced April 2023.

    Comments: 21 pages, 3 figures

    Journal ref: In Proceedings of the 2023 AAAI and ACM Conference on AI, Ethics, and Society. Association for Computing Machinery, New York, NY, USA, 913-926

  14. arXiv:2303.13375  [pdf, other

    cs.CL cs.AI

    Capabilities of GPT-4 on Medical Challenge Problems

    Authors: Harsha Nori, Nicholas King, Scott Mayer McKinney, Dean Carignan, Eric Horvitz

    Abstract: Large language models (LLMs) have demonstrated remarkable capabilities in natural language understanding and generation across various domains, including medicine. We present a comprehensive evaluation of GPT-4, a state-of-the-art LLM, on medical competency examinations and benchmark datasets. GPT-4 is a general-purpose model that is not specialized for medical problems through training or enginee… ▽ More

    Submitted 12 April, 2023; v1 submitted 20 March, 2023; originally announced March 2023.

    Comments: 35 pages, 15 figures; added GPT-4-base model results and discussion

  15. arXiv:2303.12712  [pdf, other

    cs.CL cs.AI

    Sparks of Artificial General Intelligence: Early experiments with GPT-4

    Authors: Sébastien Bubeck, Varun Chandrasekaran, Ronen Eldan, Johannes Gehrke, Eric Horvitz, Ece Kamar, Peter Lee, Yin Tat Lee, Yuanzhi Li, Scott Lundberg, Harsha Nori, Hamid Palangi, Marco Tulio Ribeiro, Yi Zhang

    Abstract: Artificial intelligence (AI) researchers have been developing and refining large language models (LLMs) that exhibit remarkable capabilities across a variety of domains and tasks, challenging our understanding of learning and cognition. The latest model developed by OpenAI, GPT-4, was trained using an unprecedented scale of compute and data. In this paper, we report on our investigation of an earl… ▽ More

    Submitted 13 April, 2023; v1 submitted 22 March, 2023; originally announced March 2023.

  16. arXiv:2207.07308  [pdf, other

    cs.CL cs.LG

    Z-Index at CheckThat! Lab 2022: Check-Worthiness Identification on Tweet Text

    Authors: Prerona Tarannum, Firoj Alam, Md. Arid Hasan, Sheak Rashed Haider Noori

    Abstract: The wide use of social media and digital technologies facilitates sharing various news and information about events and activities. Despite sharing positive information misleading and false information is also spreading on social media. There have been efforts in identifying such misleading information both manually by human experts and automatic tools. Manual effort does not scale well due to the… ▽ More

    Submitted 15 July, 2022; originally announced July 2022.

    Comments: Accepted in CLEF 2022

    ACM Class: I.2.7

  17. arXiv:2207.05322  [pdf, other

    cs.LG stat.AP

    Using Interpretable Machine Learning to Predict Maternal and Fetal Outcomes

    Authors: Tomas M. Bosschieter, Zifei Xu, Hui Lan, Benjamin J. Lengerich, Harsha Nori, Kristin Sitcov, Vivienne Souter, Rich Caruana

    Abstract: Most pregnancies and births result in a good outcome, but complications are not uncommon and when they do occur, they can be associated with serious implications for mothers and babies. Predictive modeling has the potential to improve outcomes through better understanding of risk factors, heightened surveillance, and more timely and appropriate interventions, thereby helping obstetricians deliver… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

    Comments: DSHealth at SIGKDD 2022, 5 pages, 3 figures

  18. arXiv:2206.15465  [pdf, other

    cs.LG cs.AI cs.HC

    Interpretability, Then What? Editing Machine Learning Models to Reflect Human Knowledge and Values

    Authors: Zijie J. Wang, Alex Kale, Harsha Nori, Peter Stella, Mark E. Nunnally, Duen Horng Chau, Mihaela Vorvoreanu, Jennifer Wortman Vaughan, Rich Caruana

    Abstract: Machine learning (ML) interpretability techniques can reveal undesirable patterns in data that models exploit to make predictions--potentially causing harms once deployed. However, how to take action to address these patterns is not always clear. In a collaboration between ML and human-computer interaction researchers, physicians, and data scientists, we develop GAM Changer, the first interactive… ▽ More

    Submitted 30 June, 2022; originally announced June 2022.

    Comments: Accepted at KDD 2022. 11 pages, 19 figures. For a demo video, see https://youtu.be/D6whtfInqTc. For a live demo, visit https://interpret.ml/gam-changer

  19. arXiv:2202.11043  [pdf, other

    stat.ML cs.CR cs.LG econ.EM

    Differentially Private Estimation of Heterogeneous Causal Effects

    Authors: Fengshi Niu, Harsha Nori, Brian Quistorff, Rich Caruana, Donald Ngwe, Aadharsh Kannan

    Abstract: Estimating heterogeneous treatment effects in domains such as healthcare or social science often involves sensitive data where protecting privacy is important. We introduce a general meta-algorithm for estimating conditional average treatment effects (CATE) with differential privacy (DP) guarantees. Our meta-algorithm can work with simple, single-stage CATE estimators such as S-learner and more co… ▽ More

    Submitted 22 February, 2022; originally announced February 2022.

  20. arXiv:2112.03245  [pdf, other

    cs.LG cs.AI cs.HC

    GAM Changer: Editing Generalized Additive Models with Interactive Visualization

    Authors: Zijie J. Wang, Alex Kale, Harsha Nori, Peter Stella, Mark Nunnally, Duen Horng Chau, Mihaela Vorvoreanu, Jennifer Wortman Vaughan, Rich Caruana

    Abstract: Recent strides in interpretable machine learning (ML) research reveal that models exploit undesirable patterns in the data to make predictions, which potentially causes harms in deployment. However, it is unclear how we can fix these models. We present our ongoing work, GAM Changer, an open-source interactive system to help data scientists and domain experts easily and responsibly edit their Gener… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: 7 pages, 15 figures, accepted to the Research2Clinics workshop at NeurIPS 2021. For a demo video, see https://youtu.be/2gVSoPoSeJ8. For a live demo, visit https://interpret.ml/gam-changer/

  21. arXiv:2106.09680  [pdf, other

    cs.LG cs.CR

    Accuracy, Interpretability, and Differential Privacy via Explainable Boosting

    Authors: Harsha Nori, Rich Caruana, Zhiqi Bu, Judy Hanwen Shen, Janardhan Kulkarni

    Abstract: We show that adding differential privacy to Explainable Boosting Machines (EBMs), a recent method for training interpretable ML models, yields state-of-the-art accuracy while protecting privacy. Our experiments on multiple classification and regression datasets show that DP-EBM models suffer surprisingly little accuracy loss even with strong differential privacy guarantees. In addition to high acc… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: To be published in ICML 2021. 12 pages, 6 figures

  22. arXiv:1909.09223  [pdf, other

    cs.LG stat.ML

    InterpretML: A Unified Framework for Machine Learning Interpretability

    Authors: Harsha Nori, Samuel Jenkins, Paul Koch, Rich Caruana

    Abstract: InterpretML is an open-source Python package which exposes machine learning interpretability algorithms to practitioners and researchers. InterpretML exposes two types of interpretability - glassbox models, which are machine learning models designed for interpretability (ex: linear models, rule lists, generalized additive models), and blackbox explainability techniques for explaining existing syst… ▽ More

    Submitted 19 September, 2019; originally announced September 2019.

  23. arXiv:1807.00736  [pdf, other

    cs.CR cs.DS

    An Algorithmic Framework For Differentially Private Data Analysis on Trusted Processors

    Authors: Joshua Allen, Bolin Ding, Janardhan Kulkarni, Harsha Nori, Olga Ohrimenko, Sergey Yekhanin

    Abstract: Differential privacy has emerged as the main definition for private data analysis and machine learning. The {\em global} model of differential privacy, which assumes that users trust the data collector, provides strong privacy guarantees and introduces small errors in the output. In contrast, applications of differential privacy in commercial systems by Apple, Google, and Microsoft, use the {\em l… ▽ More

    Submitted 26 October, 2019; v1 submitted 2 July, 2018; originally announced July 2018.

    Comments: Accepted at NeurIPS 2019

  24. arXiv:1803.09027  [pdf, other

    cs.CR math.ST

    Comparing Population Means under Local Differential Privacy: with Significance and Power

    Authors: Bolin Ding, Harsha Nori, Paul Li, Joshua Allen

    Abstract: A statistical hypothesis test determines whether a hypothesis should be rejected based on samples from populations. In particular, randomized controlled experiments (or A/B testing) that compare population means using, e.g., t-tests, have been widely deployed in technology companies to aid in making data-driven decisions. Samples used in these tests are collected from users and may contain sensiti… ▽ More

    Submitted 23 March, 2018; originally announced March 2018.

    Comments: Full version of an AAAI 2018 conference paper