Showing 1–23 of 23 results for author: Datta, D

Searching in archive cs.
  1. arXiv:2403.13313  [pdf, other]

    cs.AI cs.CL

    Polaris: A Safety-focused LLM Constellation Architecture for Healthcare

    Authors: Subhabrata Mukherjee, Paul Gamble, Markel Sanz Ausin, Neel Kant, Kriti Aggarwal, Neha Manjunath, Debajyoti Datta, Zhengliang Liu, Jiayuan Ding, Sophia Busacca, Cezanne Bianco, Swapnil Sharma, Rae Lasko, Michelle Voisard, Sanchay Harneja, Darya Filippova, Gerry Meixiong, Kevin Cha, Amir Youssefi, Meyhaa Buvanesh, Howard Weingram, Sebastian Bierman-Lytle, Harpreet Singh Mangat, Kim Parikh, Saad Godil, et al. (1 additional author not shown)

    Abstract: We develop Polaris, the first safety-focused LLM constellation for real-time patient-AI healthcare conversations. Unlike prior LLM works in healthcare focusing on tasks like question answering, our work specifically focuses on long multi-turn voice conversations. Our one-trillion parameter constellation system is composed of several multibillion parameter LLMs as co-operative agents: a stateful pr…

    Submitted 20 March, 2024; originally announced March 2024.

  2. arXiv:2310.18600  [pdf, other]

    cs.CL cs.AI

    MILDSum: A Novel Benchmark Dataset for Multilingual Summarization of Indian Legal Case Judgments

    Authors: Debtanu Datta, Shubham Soni, Rajdeep Mukherjee, Saptarshi Ghosh

    Abstract: Automatic summarization of legal case judgments is a practically important problem that has attracted substantial research efforts in many countries. In the context of the Indian judiciary, there is an additional complexity -- Indian legal case judgments are mostly written in complex English, but a significant portion of India's population lacks command of the English language. Hence, it is crucia…

    Submitted 28 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP 2023 (Main Conference)

  3. arXiv:2310.09765  [pdf, other]

    cs.CL cs.AI

    Improving Access to Justice for the Indian Population: A Benchmark for Evaluating Translation of Legal Text to Indian Languages

    Authors: Sayan Mahapatra, Debtanu Datta, Shubham Soni, Adrijit Goswami, Saptarshi Ghosh

    Abstract: Most legal text in the Indian judiciary is written in complex English due to historical reasons. However, only about 10% of the Indian population is comfortable in reading English. Hence legal text needs to be made available in various Indian languages, possibly by translating the available legal text from English. Though there has been a lot of research on translation to and between Indian langua…

    Submitted 15 October, 2023; originally announced October 2023.

  4. arXiv:2309.01954  [pdf]

    cs.CE

    Electro-Chemo-Mechanical Modeling of Multiscale Active Materials for Next-Generation Energy Storage: Opportunities and Challenges

    Authors: Dibakar Datta

    Abstract: The recent geopolitical crisis resulted in a gas price surge. Although lithium-ion batteries represent the best available rechargeable battery technology, a significant energy and power density gap exists between LIBs and petrol/gasoline. The battery electrodes comprise a mixture of active materials particles, conductive carbon, and binder additives deposited onto a current collector. Although thi…

    Submitted 5 September, 2023; originally announced September 2023.

    Comments: 33 pages, 17 figures

  5. arXiv:2304.01293  [pdf, other]

    cs.CY eess.SP

    Wearable Sensor-based Multimodal Physiological Responses of Socially Anxious Individuals across Social Contexts

    Authors: Emma R. Toner, Mark Rucker, Zhiyuan Wang, Maria A. Larrazabal, Lihua Cai, Debajyoti Datta, Elizabeth Thompson, Haroon Lone, Mehdi Boukhechba, Bethany A. Teachman, Laura E. Barnes

    Abstract: Correctly identifying an individual's social context from passively worn sensors holds promise for delivering just-in-time adaptive interventions (JITAIs) to treat social anxiety disorder. In this study, we present results using passively collected data from a within-subject experiment that assessed physiological response across different social contexts (i.e., alone vs. with others), social phases…

    Submitted 3 April, 2023; originally announced April 2023.

  6. arXiv:2211.05100  [pdf, other]

    cs.CL

    BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

    Authors: BigScience Workshop: Teven Le Scao, Angela Fan, Christopher Akiki, Ellie Pavlick, Suzana Ilić, Daniel Hesslow, Roman Castagné, Alexandra Sasha Luccioni, François Yvon, Matthias Gallé, Jonathan Tow, Alexander M. Rush, Stella Biderman, Albert Webson, Pawan Sasanka Ammanamanchi, Thomas Wang, Benoît Sagot, Niklas Muennighoff, Albert Villanova del Moral, Olatunji Ruwase, Rachel Bawden, Stas Bekman, Angelina McMillan-Major, et al. (369 additional authors not shown)

    Abstract: Large language models (LLMs) have been shown to be able to perform new tasks based on a few demonstrations or natural language instructions. While these capabilities have led to widespread adoption, most LLMs are developed by resource-rich organizations and are frequently kept from the public. As a step towards democratizing this powerful technology, we present BLOOM, a 176B-parameter open-access…

    Submitted 27 June, 2023; v1 submitted 9 November, 2022; originally announced November 2022.

  7. arXiv:2209.04710  [pdf, other]

    cs.LG

    Shape Analysis for Pediatric Upper Body Motor Function Assessment

    Authors: Shashwat Kumar, Robert Gutierez, Debajyoti Datta, Sarah Tolman, Allison McCrady, Silvia Blemker, Rebecca J. Scharf, Laura Barnes

    Abstract: Neuromuscular disorders, such as Spinal Muscular Atrophy (SMA) and Duchenne Muscular Dystrophy (DMD), cause progressive muscular degeneration and loss of motor function for 1 in 6,000 children. Traditional upper limb motor function assessments do not quantitatively measure patient-performed motions, which makes it difficult to track progress for incremental changes. Assessing motor function in chi…

    Submitted 10 September, 2022; originally announced September 2022.

    Comments: ISWC 22

  8. arXiv:2208.00493  [pdf, other]

    cs.LG cs.AI

    Scrutinizing Shipment Records To Thwart Illegal Timber Trade

    Authors: Debanjan Datta, Sathappan Muthiah, John Simeone, Amelia Meadows, Naren Ramakrishnan

    Abstract: Timber and forest products made from wood, like furniture, are valuable commodities, and like the global trade of many highly-valued natural resources, face challenges of corruption, fraud, and illegal harvesting. These grey and black market activities in the wood and forest products sector are not limited to the countries where the wood was harvested, but extend throughout the global supply chain…

    Submitted 31 July, 2022; originally announced August 2022.

    Comments: Accepted in Proceedings of 6th Outlier Detection and Description Workshop, ACM SigKDD 2021 https://oddworkshop.github.io/assets/papers/7.pdf. arXiv admin note: substantial text overlap with arXiv:2104.01156

  9. arXiv:2206.15076  [pdf, other]

    cs.CL

    BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing

    Authors: Jason Alan Fries, Leon Weber, Natasha Seelam, Gabriel Altay, Debajyoti Datta, Samuele Garda, Myungsun Kang, Ruisi Su, Wojciech Kusa, Samuel Cahyawijaya, Fabio Barth, Simon Ott, Matthias Samwald, Stephen Bach, Stella Biderman, Mario Sänger, Bo Wang, Alison Callahan, Daniel León Periñán, Théo Gigant, Patrick Haller, Jenny Chim, Jose David Posada, John Michael Giorgi, Karthik Rangasai Sivaraman, et al. (18 additional authors not shown)

    Abstract: Training and evaluating language models increasingly requires the construction of meta-datasets -- diverse collections of curated data with clear provenance. Natural language prompting has recently led to improved zero-shot generalization by transforming existing, supervised datasets into a diversity of novel pretraining tasks, highlighting the benefits of meta-dataset curation. While successful i…

    Submitted 30 June, 2022; originally announced June 2022.

    Comments: Submitted to NeurIPS 2022 Datasets and Benchmarks Track

  10. arXiv:2206.14384  [pdf, other]

    cs.LG cs.AI stat.ME

    Framing Algorithmic Recourse for Anomaly Detection

    Authors: Debanjan Datta, Feng Chen, Naren Ramakrishnan

    Abstract: The problem of algorithmic recourse has been explored for supervised machine learning models, to provide more interpretable, transparent and robust outcomes from decision support systems. An unexplored area is that of algorithmic recourse for anomaly detection, specifically for tabular data with only discrete feature values. Here the problem is to present a set of counterfactuals that are deemed n…

    Submitted 28 June, 2022; originally announced June 2022.

    Comments: ACM SigKDD 2022, Research Track

  11. arXiv:2206.04615  [pdf, other]

    cs.CL cs.AI cs.CY cs.LG stat.ML

    Beyond the Imitation Game: Quantifying and extrapolating the capabilities of language models

    Authors: Aarohi Srivastava, Abhinav Rastogi, Abhishek Rao, Abu Awal Md Shoeb, Abubakar Abid, Adam Fisch, Adam R. Brown, Adam Santoro, Aditya Gupta, Adrià Garriga-Alonso, Agnieszka Kluska, Aitor Lewkowycz, Akshat Agarwal, Alethea Power, Alex Ray, Alex Warstadt, Alexander W. Kocurek, Ali Safaya, Ali Tazarv, Alice Xiang, Alicia Parrish, Allen Nie, Aman Hussain, Amanda Askell, Amanda Dsouza, et al. (426 additional authors not shown)

    Abstract: Language models demonstrate both quantitative improvement and new qualitative capabilities with increasing scale. Despite their potentially transformative impact, these new capabilities are as yet poorly characterized. In order to inform future research, prepare for disruptive new model capabilities, and ameliorate socially harmful effects, it is vital that we understand the present and near-futur…

    Submitted 12 June, 2023; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: 27 pages, 17 figures + references and appendices, repo: https://github.com/google/BIG-bench

    Journal ref: Transactions on Machine Learning Research, May/2022, https://openreview.net/forum?id=uyTL5Bvosj

  12. arXiv:2112.01537  [pdf, other]

    cs.HC cs.AI cs.LG

    Improving mathematical questioning in teacher training

    Authors: Debajyoti Datta, Maria Phillips, James P Bywater, Jennifer Chiu, Ginger S. Watson, Laura E. Barnes, Donald E Brown

    Abstract: High-fidelity, AI-based simulated classroom systems enable teachers to rehearse effective teaching strategies. However, dialogue-oriented open-ended conversations such as teaching a student about scale factors can be difficult to model. This paper builds a text-based interactive conversational agent to help teachers practice mathematical questioning skills based on the well-known Instructional Qua…

    Submitted 6 December, 2021; v1 submitted 2 December, 2021; originally announced December 2021.

    Comments: Accepted to appear at the NeurIPS 2021 Human Centered AI Workshop (HCAI). Data collection process for this data is described here arXiv:2112.00985

  13. arXiv:2112.00985  [pdf, other]

    cs.AI cs.HC cs.LG

    Evaluation of mathematical questioning strategies using data collected through weak supervision

    Authors: Debajyoti Datta, Maria Phillips, James P Bywater, Jennifer Chiu, Ginger S. Watson, Laura E. Barnes, Donald E Brown

    Abstract: A large body of research demonstrates how teachers' questioning strategies can improve student learning outcomes. However, developing new scenarios is challenging because of the lack of training data for a specific scenario and the costs associated with labeling. This paper presents a high-fidelity, AI-based classroom simulator to help teachers rehearse research-based mathematical questioning skil…

    Submitted 2 December, 2021; originally announced December 2021.

    Comments: Accepted to appear at the NeurIPS 2021 Workshop on Math AI for Education (MATHAI4ED)

  14. arXiv:2110.08207  [pdf, other]

    cs.LG cs.CL

    Multitask Prompted Training Enables Zero-Shot Task Generalization

    Authors: Victor Sanh, Albert Webson, Colin Raffel, Stephen H. Bach, Lintang Sutawika, Zaid Alyafeai, Antoine Chaffin, Arnaud Stiegler, Teven Le Scao, Arun Raja, Manan Dey, M Saiful Bari, Canwen Xu, Urmish Thakker, Shanya Sharma Sharma, Eliza Szczechla, Taewoon Kim, Gunjan Chhablani, Nihal Nayak, Debajyoti Datta, Jonathan Chang, Mike Tian-Jian Jiang, Han Wang, Matteo Manica, Sheng Shen, et al. (16 additional authors not shown)

    Abstract: Large language models have recently been shown to attain reasonable zero-shot generalization on a diverse set of tasks (Brown et al., 2020). It has been hypothesized that this is a consequence of implicit multitask learning in language models' pretraining (Radford et al., 2019). Can zero-shot generalization instead be directly induced by explicit multitask learning? To test this question at scale,…

    Submitted 17 March, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: ICLR 2022 Spotlight (with extended discussion)

  15. arXiv:2104.01156  [pdf, other]

    cs.LG

    Detecting Anomalies Through Contrast in Heterogeneous Data

    Authors: Debanjan Datta, Sathappan Muthiah, Naren Ramakrishnan

    Abstract: Detecting anomalies has been a fundamental approach in detecting potentially fraudulent activities. Tasked with detection of illegal timber trade that threatens ecosystems and economies and association with other illegal activities, we formulate our problem as one of anomaly detection. Among other challenges annotations are unavailable for our large-scale trade data with heterogeneous features (ca…

    Submitted 2 April, 2021; originally announced April 2021.

  16. arXiv:2011.05801  [pdf, ps, other]

    cs.SI cs.LG

    A Small Survey On Event Detection Using Twitter

    Authors: Debanjan Datta

    Abstract: A small survey on event detection using Twitter. This work first defines the problem statement, and then summarizes and collates the different research works towards solving the problem.

    Submitted 30 July, 2022; v1 submitted 8 November, 2020; originally announced November 2020.

  17. arXiv:2010.12710  [pdf, other]

    cs.CL cs.CY cs.LG

    Improving Classification through Weak Supervision in Context-specific Conversational Agent Development for Teacher Education

    Authors: Debajyoti Datta, Maria Phillips, Jennifer Chiu, Ginger S. Watson, James P. Bywater, Laura Barnes, Donald Brown

    Abstract: Machine learning techniques applied to the Natural Language Processing (NLP) component of conversational agent development show promising results for improved accuracy and quality of feedback that a conversational agent can provide. The effort required to develop an educational scenario specific conversational agent is time consuming as it requires domain experts to label and annotate noisy data s…

    Submitted 23 October, 2020; originally announced October 2020.

    Comments: Preprint: Under Review

    ACM Class: I.2.7

  18. arXiv:2010.07212  [pdf, other]

    cs.CL stat.ML

    Geometry matters: Exploring language examples at the decision boundary

    Authors: Debajyoti Datta, Shashwat Kumar, Laura Barnes, Tom Fletcher

    Abstract: A growing body of recent evidence has highlighted the limitations of natural language processing (NLP) datasets and classifiers. These include the presence of annotation artifacts in datasets, classifiers relying on shallow features like a single word (e.g., if a movie review has the word "romantic", the review tends to be positive), or unnecessary words (e.g., learning a proper noun to classify a…

    Submitted 28 October, 2021; v1 submitted 14 October, 2020; originally announced October 2020.

    Comments: Ongoing Work, Presented at TADA 2021

    ACM Class: F.2.0; I.2.7

  19. arXiv:2008.12284  [pdf, ps, other]

    cs.LG cs.CV cs.RO stat.ML

    learn2learn: A Library for Meta-Learning Research

    Authors: Sébastien M. R. Arnold, Praateek Mahajan, Debajyoti Datta, Ian Bunner, Konstantinos Saitas Zarkias

    Abstract: Meta-learning researchers face two fundamental issues in their empirical work: prototyping and reproducibility. Researchers are prone to make mistakes when prototyping new algorithms and tasks because modern meta-learning methods rely on unconventional functionalities of machine learning frameworks. In turn, reproducing existing results becomes a tedious endeavour -- a situation exacerbated by the…

    Submitted 27 August, 2020; v1 submitted 27 August, 2020; originally announced August 2020.

    Comments: Software available at: https://github.com/learnables/learn2learn

  20. arXiv:2005.06943  [pdf, ps, other]

    cs.CL

    NIT-Agartala-NLP-Team at SemEval-2020 Task 8: Building Multimodal Classifiers to tackle Internet Humor

    Authors: Steve Durairaj Swamy, Shubham Laddha, Basil Abdussalam, Debayan Datta, Anupam Jamatia

    Abstract: The paper describes the systems submitted to SemEval-2020 Task 8: Memotion by the 'NIT-Agartala-NLP-Team'. A dataset of 8879 memes was made available by the task organizers to train and test our models. Our systems include a Logistic Regression baseline, a BiLSTM + Attention-based learner and a transfer learning approach with BERT. For the three sub-tasks A, B and C, we attained ranks 24/33, 11/29…

    Submitted 16 May, 2020; v1 submitted 14 May, 2020; originally announced May 2020.

    Comments: Submitted to International Workshop on Semantic Evaluation (SemEval)-2020 Task 8: Memotion Analysis, http://alt.qcri.org/semeval2020/index.php?id=tasks

  21. arXiv:1910.07784  [pdf, other]

    cs.IR cs.HC

    Indoor Information Retrieval using Lifelog Data

    Authors: Deepanwita Datta

    Abstract: Studying human behaviour through lifelogging has seen an increase in attention from researchers over the past decade. The opportunities that lifelogging offers are based on the fact that a lifelog, as a "black box" of our lives, offers rich contextual information, which has been an Achilles heel of information discovery. While lifelog data has been put to use in various contexts, its application t…

    Submitted 17 October, 2019; originally announced October 2019.

  22. arXiv:1603.03938  [pdf, ps, other]

    cs.NI

    Multimedia Channel Allocation in Cognitive Radio Networks using FDM-FDMA and OFDM-FDMA

    Authors: Ansuman Bhattacharya, Rabindranath Ghosh, Koushik Sinha, Debasish Datta, Bhabani P. Sinha

    Abstract: In conventional wireless systems, unless a contiguous frequency band with width at least equal to the required bandwidth is obtained, multimedia communication cannot be effected with the desired Quality of Service. We propose here a novel channel allocation technique to overcome this limitation in a Cognitive Radio Network which is based on utilizing several non-contiguous channels, each of width…

    Submitted 12 March, 2016; originally announced March 2016.

  23. arXiv:1209.3869  [pdf]

    cs.AI

    Hybrid technique for effective knowledge representation & a comparative study

    Authors: Poonam Tanwar, T. V. Prasad, Dr. Kamlesh Datta

    Abstract: Knowledge representation (KR) and inference mechanisms are the most desirable things for making a system intelligent. A system is known to be intelligent if its intelligence is equivalent to the intelligence of a human being for a particular domain or in general. Because of incomplete, ambiguous, and uncertain information, the task of making an intelligent system is very difficult. The objective of this paper is to p…

    Submitted 18 September, 2012; originally announced September 2012.

    Comments: 15 pages, 9 figures, 1 table. Published in IJCSES, International Journal of Computer Science & Engineering Survey, Vol. 3, No. 4, August 2012

    Journal ref: IJCSES, International Journal of Computer Science & Engineering Survey, Vol. 3, No. 4, August 2012