Zum Hauptinhalt springen

Showing 1–45 of 45 results for author: Banerjee, I

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.19675  [pdf, other

    cs.CV

    Knowledge-grounded Adaptation Strategy for Vision-language Models: Building Unique Case-set for Screening Mammograms for Residents Training

    Authors: Aisha Urooj Khan, John Garrett, Tyler Bradshaw, Lonie Salkowski, Jiwoong Jason Jeong, Amara Tariq, Imon Banerjee

    Abstract: A visual-language model (VLM) pre-trained on natural images and text pairs poses a significant barrier when applied to medical contexts due to domain shift. Yet, adapting or fine-tuning these VLMs for medical use presents considerable hurdles, including domain misalignment, limited access to extensive datasets, and high-class imbalances. Hence, there is a pressing need for strategies to effectivel… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  2. arXiv:2405.16402  [pdf, other

    cs.CL cs.AI

    Assessing Empathy in Large Language Models with Real-World Physician-Patient Interactions

    Authors: Man Luo, Christopher J. Warren, Lu Cheng, Haidar M. Abdul-Muhsin, Imon Banerjee

    Abstract: The integration of Large Language Models (LLMs) into the healthcare domain has the potential to significantly enhance patient care and support through the development of empathetic, patient-facing chatbots. This study investigates an intriguing question Can ChatGPT respond with a greater degree of empathy than those typically offered by physicians? To answer this question, we collect a de-identifi… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  3. arXiv:2312.12442  [pdf

    cs.CV cs.AI

    Hierarchical Classification System for Breast Cancer Specimen Report (HCSBC) -- an end-to-end model for characterizing severity and diagnosis

    Authors: Thiago Santos, Harish Kamath, Christopher R. McAdams, Mary S. Newell, Marina Mosunjac, Gabriela Oprea-Ilies, Geoffrey Smith, Constance Lehman, Judy Gichoya, Imon Banerjee, Hari Trivedi

    Abstract: Automated classification of cancer pathology reports can extract information from unstructured reports and categorize each report into structured diagnosis and severity categories. Thus, such system can reduce the burden for populating tumor registries, help registration for clinical trial as well as developing large dataset for deep learning model development using true pathologic ground truth. H… ▽ More

    Submitted 2 November, 2023; originally announced December 2023.

  4. arXiv:2305.04422  [pdf

    eess.IV cs.CV cs.CY cs.LG

    Multivariate Analysis on Performance Gaps of Artificial Intelligence Models in Screening Mammography

    Authors: Linglin Zhang, Beatrice Brown-Mulry, Vineela Nalla, InChan Hwang, Judy Wawira Gichoya, Aimilia Gastounioti, Imon Banerjee, Laleh Seyyed-Kalantari, MinJae Woo, Hari Trivedi

    Abstract: Although deep learning models for abnormality classification can perform well in screening mammography, the demographic, imaging, and clinical characteristics associated with increased risk of model failure remain unclear. This retrospective study uses the Emory BrEast Imaging Dataset(EMBED) containing mammograms from 115931 patients imaged at Emory Healthcare between 2013-2020, with BI-RADS asses… ▽ More

    Submitted 19 October, 2023; v1 submitted 7 May, 2023; originally announced May 2023.

    Comments: 29 pages, 6 tables, 7 figures, 2 supplemental tables

  5. arXiv:2302.01061  [pdf

    cs.AI

    MLOps with enhanced performance control and observability

    Authors: Indradumna Banerjee, Dinesh Ghanta, Girish Nautiyal, Pradeep Sanchana, Prateek Katageri, Atin Modi

    Abstract: The explosion of data and its ever increasing complexity in the last few years, has made MLOps systems more prone to failure, and new tools need to be embedded in such systems to avoid such failure. In this demo, we will introduce crucial tools in the observability module of a MLOps system that target difficult issues like data drfit and model version control for optimum model selection. We believ… ▽ More

    Submitted 2 February, 2023; originally announced February 2023.

    Comments: SECOND INTERNATIONAL CONFERENCE ON AI-ML SYSTEMS

  6. arXiv:2302.00651  [pdf

    cs.IR

    Ngram-LSTM Open Rate Prediction Model (NLORP) and Error_accuracy@C metric: Simple effective, and easy to implement approach to predict open rates for marketing email

    Authors: Shubham Joshi, Indradumna Banerjee

    Abstract: Our generation has seen an exponential increase in digital tools adoption. One of the unique areas where digital tools have made an exponential foray is in the sphere of digital marketing, where goods and services have been extensively promoted through the use of digital advertisements. Following this growth, multiple companies have leveraged multiple apps and channels to display their brand ident… ▽ More

    Submitted 14 February, 2023; v1 submitted 25 January, 2023; originally announced February 2023.

  7. arXiv:2212.12454  [pdf

    cs.CL

    Generalizable Natural Language Processing Framework for Migraine Reporting from Social Media

    Authors: Yuting Guo, Swati Rajwal, Sahithi Lakamana, Chia-Chun Chiang, Paul C. Menell, Adnan H. Shahid, Yi-Chieh Chen, Nikita Chhabra, Wan-Ju Chao, Chieh-Ju Chao, Todd J. Schwedt, Imon Banerjee, Abeed Sarker

    Abstract: Migraine is a high-prevalence and disabling neurological disorder. However, information migraine management in real-world settings could be limited to traditional health information sources. In this paper, we (i) verify that there is substantial migraine-related chatter available on social media (Twitter and Reddit), self-reported by migraine sufferers; (ii) develop a platform-independent text cla… ▽ More

    Submitted 23 December, 2022; originally announced December 2022.

    Comments: Accepted by AMIA 2023 Informatics Summit

  8. arXiv:2211.07092  [pdf, ps, other

    stat.ML cs.LG math.ST

    Offline Estimation of Controlled Markov Chains: Minimaxity and Sample Complexity

    Authors: Imon Banerjee, Harsha Honnappa, Vinayak Rao

    Abstract: In this work, we study a natural nonparametric estimator of the transition probability matrices of a finite controlled Markov chain. We consider an offline setting with a fixed dataset, collected using a so-called logging policy. We develop sample complexity bounds for the estimator and establish conditions for minimaxity. Our statistical bounds depend on the logging policy through its mixing prop… ▽ More

    Submitted 26 January, 2024; v1 submitted 13 November, 2022; originally announced November 2022.

    Comments: 71 pages, 23 main

  9. arXiv:2208.08938  [pdf, other

    stat.ML cs.LG

    Meta Sparse Principal Component Analysis

    Authors: Imon Banerjee, Jean Honorio

    Abstract: We study the meta-learning for support (i.e. the set of non-zero entries) recovery in high-dimensional Principal Component Analysis. We reduce the sufficient sample complexity in a novel task with the information that is learned from auxiliary tasks. We assume each task to be a different random Principal Component (PC) matrix with a possibly different support and that the support union of the PC m… ▽ More

    Submitted 19 August, 2022; v1 submitted 18 August, 2022; originally announced August 2022.

    Comments: 29 pages, 7 figures

  10. arXiv:2208.00475  [pdf, other

    cs.CV

    Augmenting Vision Language Pretraining by Learning Codebook with Visual Semantics

    Authors: Xiaoyuan Guo, Jiali Duan, C. -C. Jay Kuo, Judy Wawira Gichoya, Imon Banerjee

    Abstract: Language modality within the vision language pretraining framework is innately discretized, endowing each word in the language vocabulary a semantic meaning. In contrast, visual modality is inherently continuous and high-dimensional, which potentially prohibits the alignment as well as fusion between vision and language modalities. We therefore propose to "discretize" the visual representation by… ▽ More

    Submitted 31 July, 2022; originally announced August 2022.

    Comments: 7 pages, 4 figures, ICPR2022. arXiv admin note: text overlap with arXiv:2203.00048

  11. arXiv:2207.04846  [pdf

    cs.NE

    Fitness Dependent Optimizer for IoT Healthcare using Adapted Parameters: A Case Study Implementation

    Authors: Aso M. Aladdin, Jaza M. Abdullah, Kazhan Othman Mohammed Salih, Tarik A. Rashid, Rafid Sagban, Abeer Alsaddon, Nebojsa Bacanin, Amit Chhabra, S. Vimal, Indradip Banerjee

    Abstract: This discusses a case study on Fitness Dependent Optimizer or so-called FDO and adapting its parameters to the Internet of Things (IoT) healthcare. The reproductive way is sparked by the bee swarm and the collaborative decision-making of FDO. As opposed to the honey bee or artificial bee colony algorithms, this algorithm has no connection to them. In FDO, the search agent's position is updated usi… ▽ More

    Submitted 18 May, 2022; originally announced July 2022.

    Comments: 17 pages

    Journal ref: -

  12. arXiv:2207.00066  [pdf

    cs.LG cs.AI math.NA

    Advances in Prediction of Readmission Rates Using Long Term Short Term Memory Networks on Healthcare Insurance Data

    Authors: Shuja Khalid, Francisco Matos, Ayman Abunimer, Joel Bartlett, Richard Duszak, Michal Horny, Judy Gichoya, Imon Banerjee, Hari Trivedi

    Abstract: 30-day hospital readmission is a long standing medical problem that affects patients' morbidity and mortality and costs billions of dollars annually. Recently, machine learning models have been created to predict risk of inpatient readmission for patients with specific diseases, however no model exists to predict this risk across all patients. We developed a bi-directional Long Short Term Memory (… ▽ More

    Submitted 30 June, 2022; originally announced July 2022.

    Comments: 7 pages, 3 figures, 3 tables

  13. arXiv:2205.06885  [pdf

    cs.CL

    PathologyBERT -- Pre-trained Vs. A New Transformer Language Model for Pathology Domain

    Authors: Thiago Santos, Amara Tariq, Susmita Das, Kavyasree Vayalpati, Geoffrey H. Smith, Hari Trivedi, Imon Banerjee

    Abstract: Pathology text mining is a challenging task given the reporting variability and constant new findings in cancer sub-type definitions. However, successful text mining of a large pathology database can play a critical role to advance 'big data' cancer research like similarity-based treatment selection, case identification, prognostication, surveillance, clinical trial screening, risk stratification,… ▽ More

    Submitted 13 May, 2022; originally announced May 2022.

    Comments: submitted to "American Medical Informatics Association (AMIA)" 2022 Annual Symposium

  14. Multimodal spatiotemporal graph neural networks for improved prediction of 30-day all-cause hospital readmission

    Authors: Siyi Tang, Amara Tariq, Jared Dunnmon, Umesh Sharma, Praneetha Elugunti, Daniel Rubin, Bhavik N. Patel, Imon Banerjee

    Abstract: Measures to predict 30-day readmission are considered an important quality factor for hospitals as accurate predictions can reduce the overall cost of care by identifying high risk patients before they are discharged. While recent deep learning-based studies have shown promising empirical results on readmission prediction, several limitations exist that may hinder widespread clinical utility, such… ▽ More

    Submitted 14 April, 2022; originally announced April 2022.

    Journal ref: IEEE Journal of Biomedical and Health Informatics, vol. 27, no. 4, pp. 2071-2082, April 2023

  15. arXiv:2204.03074  [pdf, other

    cs.CV

    OSCARS: An Outlier-Sensitive Content-Based Radiography Retrieval System

    Authors: Xiaoyuan Guo, Jiali Duan, Saptarshi Purkayastha, Hari Trivedi, Judy Wawira Gichoya, Imon Banerjee

    Abstract: Improving the retrieval relevance on noisy datasets is an emerging need for the curation of a large-scale clean dataset in the medical domain. While existing methods can be applied for class-wise retrieval (aka. inter-class), they cannot distinguish the granularity of likeness within the same class (aka. intra-class). The problem is exacerbated on medical external datasets, where noisy samples of… ▽ More

    Submitted 6 April, 2022; originally announced April 2022.

    Comments: 12 pages, 6 figures, 2 tables

  16. arXiv:2202.04073  [pdf

    eess.IV cs.CV cs.LG

    The EMory BrEast imaging Dataset (EMBED): A Racially Diverse, Granular Dataset of 3.5M Screening and Diagnostic Mammograms

    Authors: Jiwoong J. Jeong, Brianna L. Vey, Ananth Reddy, Thomas Kim, Thiago Santos, Ramon Correa, Raman Dutt, Marina Mosunjac, Gabriela Oprea-Ilies, Geoffrey Smith, Minjae Woo, Christopher R. McAdams, Mary S. Newell, Imon Banerjee, Judy Gichoya, Hari Trivedi

    Abstract: Developing and validating artificial intelligence models in medical imaging requires datasets that are large, granular, and diverse. To date, the majority of publicly available breast imaging datasets lack in one or more of these areas. Models trained on these data may therefore underperform on patient populations or pathologies that have not previously been encountered. The EMory BrEast imaging D… ▽ More

    Submitted 8 February, 2022; originally announced February 2022.

  17. arXiv:2112.13885  [pdf, other

    eess.IV cs.CV

    MedShift: identifying shift data for medical dataset curation

    Authors: Xiaoyuan Guo, Judy Wawira Gichoya, Hari Trivedi, Saptarshi Purkayastha, Imon Banerjee

    Abstract: To curate a high-quality dataset, identifying data variance between the internal and external sources is a fundamental and crucial step. However, methods to detect shift or variance in data have not been significantly researched. Challenges to this are the lack of effective approaches to learn dense representation of a dataset and difficulties of sharing private data across medical institutions. T… ▽ More

    Submitted 27 December, 2021; originally announced December 2021.

    Comments: 35 pages, 28 figures, 2 tables

  18. arXiv:2111.11665  [pdf, other

    eess.IV cs.CV

    RadFusion: Benchmarking Performance and Fairness for Multimodal Pulmonary Embolism Detection from CT and EHR

    Authors: Yuyin Zhou, Shih-Cheng Huang, Jason Alan Fries, Alaa Youssef, Timothy J. Amrhein, Marcello Chang, Imon Banerjee, Daniel Rubin, Lei Xing, Nigam Shah, Matthew P. Lungren

    Abstract: Despite the routine use of electronic health record (EHR) data by radiologists to contextualize clinical history and inform image interpretation, the majority of deep learning architectures for medical imaging are unimodal, i.e., they only learn features from pixel-level information. Recent research revealing how race can be recovered from pixel data alone highlights the potential for serious bias… ▽ More

    Submitted 26 November, 2021; v1 submitted 23 November, 2021; originally announced November 2021.

    Comments: RadFusion dataset: https://stanfordaimi.azurewebsites.net/datasets/3a7548a4-8f65-4ab7-85fa-3d68c9efc1bd

  19. arXiv:2111.08711  [pdf, other

    eess.IV cs.CV cs.LG

    Two-step adversarial debiasing with partial learning -- medical image case-studies

    Authors: Ramon Correa, Jiwoong Jason Jeong, Bhavik Patel, Hari Trivedi, Judy W. Gichoya, Imon Banerjee

    Abstract: The use of artificial intelligence (AI) in healthcare has become a very active research area in the last few years. While significant progress has been made in image classification tasks, only a few AI methods are actually being deployed in hospitals. A major hurdle in actively using clinical AI models currently is the trustworthiness of these models. More often than not, these complex models are… ▽ More

    Submitted 16 November, 2021; originally announced November 2021.

  20. arXiv:2110.15811  [pdf, other

    eess.IV cs.CV

    CVAD: A generic medical anomaly detector based on Cascade VAE

    Authors: Xiaoyuan Guo, Judy Wawira Gichoya, Saptarshi Purkayastha, Imon Banerjee

    Abstract: Detecting out-of-distribution (OOD) samples in medical imaging plays an important role for downstream medical diagnosis. However, existing OOD detectors are demonstrated on natural images composed of inter-classes and have difficulty generalizing to medical images. The key issue is the granularity of OOD data in the medical domain, where intra-class OOD samples are predominant. We focus on the gen… ▽ More

    Submitted 26 January, 2022; v1 submitted 29 October, 2021; originally announced October 2021.

    Comments: 6 pages, 4 figures, 4 tables

  21. arXiv:2108.00117  [pdf, other

    cs.CV

    Margin-Aware Intra-Class Novelty Identification for Medical Images

    Authors: Xiaoyuan Guo, Judy Wawira Gichoya, Saptarshi Purkayastha, Imon Banerjee

    Abstract: Traditional anomaly detection methods focus on detecting inter-class variations while medical image novelty identification is inherently an intra-class detection problem. For example, a machine learning model trained with normal chest X-ray and common lung abnormalities, is expected to discover and flag idiopathic pulmonary fibrosis which a rare lung disease and unseen by the model during training… ▽ More

    Submitted 22 January, 2022; v1 submitted 30 July, 2021; originally announced August 2021.

    Comments: 35 pages, 8 figures

    Journal ref: Journal of Medical Imaging 2022

  22. arXiv:2107.10356  [pdf

    cs.CV cs.CY eess.IV

    Reading Race: AI Recognises Patient's Racial Identity In Medical Images

    Authors: Imon Banerjee, Ananth Reddy Bhimireddy, John L. Burns, Leo Anthony Celi, Li-Ching Chen, Ramon Correa, Natalie Dullerud, Marzyeh Ghassemi, Shih-Cheng Huang, Po-Chih Kuo, Matthew P Lungren, Lyle Palmer, Brandon J Price, Saptarshi Purkayastha, Ayis Pyrros, Luke Oakden-Rayner, Chima Okechukwu, Laleh Seyyed-Kalantari, Hari Trivedi, Ryan Wang, Zachary Zaiman, Haoran Zhang, Judy W Gichoya

    Abstract: Background: In medical imaging, prior studies have demonstrated disparate AI performance by race, yet there is no known correlation for race on medical imaging that would be obvious to the human expert interpreting the images. Methods: Using private and public datasets we evaluate: A) performance quantification of deep learning models to detect race from medical images, including the ability of… ▽ More

    Submitted 21 July, 2021; originally announced July 2021.

    MSC Class: 68-XX ACM Class: I.2

  23. An Improved Simulation Model for Pedestrian Crowd Evacuation

    Authors: Danial A. Muhammed, Tarik A. Rashid, Abeer Alsadoon, Nebojsa Bacanin, Polla Fattah, Mokhtar Mohammadi, Indradip Banerjee

    Abstract: This paper works on one of the most recent pedestrian crowd evacuation models, i.e., "a simulation model for pedestrian crowd evacuation based on various AI techniques", developed in late 2019. This study adds a new feature to the developed model by proposing a new method and integrating it with the model. This method enables the developed model to find a more appropriate evacuation area design, a… ▽ More

    Submitted 4 December, 2020; originally announced December 2020.

    Comments: 15 pages, accepted in Mathematics, MDPI, 2020

  24. arXiv:2007.05786  [pdf, other

    cs.CV cs.LG

    Generalization of Deep Convolutional Neural Networks -- A Case-study on Open-source Chest Radiographs

    Authors: Nazanin Mashhaditafreshi, Amara Tariq, Judy Wawira Gichoya, Imon Banerjee

    Abstract: Deep Convolutional Neural Networks (DCNNs) have attracted extensive attention and been applied in many areas, including medical image analysis and clinical diagnosis. One major challenge is to conceive a DCNN model with remarkable performance on both internal and external data. We demonstrate that DCNNs may not generalize to new data, but increasing the quality and heterogeneity of the training da… ▽ More

    Submitted 11 July, 2020; originally announced July 2020.

  25. arXiv:2006.13262  [pdf

    eess.IV cs.CV cs.LG

    Was there COVID-19 back in 2012? Challenge for AI in Diagnosis with Similar Indications

    Authors: Imon Banerjee, Priyanshu Sinha, Saptarshi Purkayastha, Nazanin Mashhaditafreshi, Amara Tariq, Jiwoong Jeong, Hari Trivedi, Judy W. Gichoya

    Abstract: Purpose: Since the recent COVID-19 outbreak, there has been an avalanche of research papers applying deep learning based image processing to chest radiographs for detection of the disease. To test the performance of the two top models for CXR COVID-19 diagnosis on external datasets to assess model generalizability. Methods: In this paper, we present our argument regarding the efficiency and applic… ▽ More

    Submitted 23 June, 2020; originally announced June 2020.

  26. arXiv:2006.02825  [pdf, other

    cs.CY cond-mat.dis-nn nlin.AO

    SOS -- Self-Organization for Survival: Introducing fairness in emergency communication to save lives

    Authors: Indushree Banerjee, Martijn Warnier, Frances M. T. Brazier, Dirk Helbing

    Abstract: Communication is crucial when disasters isolate communities of people and rescue is delayed. Such delays force citizens to be first responders and form small rescue teams. Rescue teams require reliable communication, particularly in the first 72 hours, which is challenging due to damaged infrastructure and electrical blackouts. We design a peer-to-peer communication network that meets these challe… ▽ More

    Submitted 4 June, 2020; originally announced June 2020.

  27. arXiv:2004.07965  [pdf, other

    eess.IV cs.CV cs.LG

    A DICOM Framework for Machine Learning Pipelines against Real-Time Radiology Images

    Authors: Pradeeban Kathiravelu, Puneet Sharma, Ashish Sharma, Imon Banerjee, Hari Trivedi, Saptarshi Purkayastha, Priyanshu Sinha, Alexandre Cadrin-Chenevert, Nabile Safdar, Judy Wawira Gichoya

    Abstract: Executing machine learning (ML) pipelines in real-time on radiology images is hard due to the limited computing resources in clinical environments and the lack of efficient data transfer capabilities to run them on research clusters. We propose Niffler, an integrated framework that enables the execution of ML pipelines at research clusters by efficiently querying and retrieving radiology images fr… ▽ More

    Submitted 5 August, 2020; v1 submitted 16 April, 2020; originally announced April 2020.

    Comments: Preprint

    Journal ref: Journal of Digital Imaging (JDI), 2021

  28. arXiv:1902.10700  [pdf

    q-bio.QM cs.LG

    A Deep-learning Approach for Prognosis of Age-Related Macular Degeneration Disease using SD-OCT Imaging Biomarkers

    Authors: Imon Banerjee, Luis de Sisternes, Joelle Hallak, Theodore Leng, Aaron Osborne, Mary Durbin, Daniel Rubin

    Abstract: We propose a hybrid sequential deep learning model to predict the risk of AMD progression in non-exudative AMD eyes at multiple timepoints, starting from short-term progression (3-months) up to long-term progression (21-months). Proposed model combines radiomics and deep learning to handle challenges related to imperfect ratio of OCT scan dimension and training cohort size. We considered a retrosp… ▽ More

    Submitted 27 February, 2019; originally announced February 2019.

  29. arXiv:1806.07346  [pdf

    cs.CL cs.AI

    A Scalable Machine Learning Approach for Inferring Probabilistic US-LI-RADS Categorization

    Authors: Imon Banerjee, Hailey H. Choi, Terry Desser, Daniel L. Rubin

    Abstract: We propose a scalable computerized approach for large-scale inference of Liver Imaging Reporting and Data System (LI-RADS) final assessment categories in narrative ultrasound (US) reports. Although our model was trained on reports created using a LI-RADS template, it was also able to infer LI-RADS scoring for unstructured reports that were created before the LI-RADS guidelines were established. No… ▽ More

    Submitted 15 June, 2018; originally announced June 2018.

    Comments: AMIA Annual Symposium 2018 (accepted)

  30. arXiv:1801.03058  [pdf

    cs.AI

    Abstract: Probabilistic Prognostic Estimates of Survival in Metastatic Cancer Patients

    Authors: Imon Banerjee, Michael Francis Gensheimer, Douglas J. Wood, Solomon Henry, Daniel Chang, Daniel L. Rubin

    Abstract: We propose a deep learning model - Probabilistic Prognostic Estimates of Survival in Metastatic Cancer Patients (PPES-Met) for estimating short-term life expectancy (3 months) of the patients by analyzing free-text clinical notes in the electronic medical record, while maintaining the temporal visit sequence. In a single framework, we integrated semantic data mapping and neural embedding technique… ▽ More

    Submitted 13 July, 2018; v1 submitted 9 January, 2018; originally announced January 2018.

    Journal ref: AMIA Informatics Conference 2018

  31. arXiv:1711.06968  [pdf, other

    cs.IR cs.CL

    Intelligent Word Embeddings of Free-Text Radiology Reports

    Authors: Imon Banerjee, Sriraman Madhavan, Roger Eric Goldman, Daniel L. Rubin

    Abstract: Radiology reports are a rich resource for advancing deep learning applications in medicine by leveraging the large volume of data continuously being updated, integrated, and shared. However, there are significant challenges as well, largely due to the ambiguity and subtlety of natural language. We propose a hybrid strategy that combines semantic-dictionary mapping and word2vec modeling for creatin… ▽ More

    Submitted 19 November, 2017; originally announced November 2017.

    Comments: AMIA Annual Symposium 2017

  32. arXiv:1709.02477  [pdf, other

    cs.LG cs.AI stat.ML

    Inferring Generative Model Structure with Static Analysis

    Authors: Paroma Varma, Bryan He, Payal Bajaj, Imon Banerjee, Nishith Khandwala, Daniel L. Rubin, Christopher Ré

    Abstract: Obtaining enough labeled data to robustly train complex discriminative models is a major bottleneck in the machine learning pipeline. A popular solution is combining multiple sources of weak supervision using generative models. The structure of these models affects training label quality, but is difficult to learn without any ground truth labels. We instead rely on these weak supervision sources h… ▽ More

    Submitted 7 September, 2017; originally announced September 2017.

    Comments: NIPS 2017

  33. arXiv:1706.09355  [pdf, other

    cs.DS

    New Results On Routing Via Matchings On Graphs

    Authors: Indranil Banerjee, Dana Richards

    Abstract: In this paper we present some new complexity results on the routing time of a graph under the \textit{routing via matching} model. This is a parallel routing model which was introduced by Alon et al\cite{alon1994routing}. The model can be viewed as a communication scheme on a distributed network. The nodes in the network can communicate via matchings (a step), where a node exchanges data (pebbles)… ▽ More

    Submitted 18 March, 2022; v1 submitted 28 June, 2017; originally announced June 2017.

    Comments: 15 Pages, 5 Figures , 21st International Symposium on Fundamentals of Computation Theory. arXiv admin note: text overlap with arXiv:1604.04978

  34. arXiv:1612.08178  [pdf

    cs.IR cs.CL

    JU_KS_Group@FIRE 2016: Consumer Health Information Search

    Authors: Kamal Sarkar, Debanjan Das, Indra Banerjee, Mamta Kumari, Prasenjit Biswas

    Abstract: In this paper, we describe the methodology used and the results obtained by us for completing the tasks given under the shared task on Consumer Health Information Search (CHIS) collocated with the Forum for Information Retrieval Evaluation (FIRE) 2016, ISI Kolkata. The shared task consists of two sub-tasks - (1) task1: given a query and a document/set of documents associated with that query, the t… ▽ More

    Submitted 24 December, 2016; originally announced December 2016.

    Comments: 8th meeting of Forum for Information Retrieval Evaluation 2016, 2016

  35. arXiv:1612.06473  [pdf, other

    cs.DS

    Sorting Networks On Restricted Topologies

    Authors: Indranil Banerjee, Dana Richards, Igor Shinkar

    Abstract: The sorting number of a graph with $n$ vertices is the minimum depth of a sorting network with $n$ inputs and outputs that uses only the edges of the graph to perform comparisons. Many known results on sorting networks can be stated in terms of sorting numbers of different classes of graphs. In this paper we show the following general results about the sorting number of graphs. Any $n$-vertex gr… ▽ More

    Submitted 18 March, 2022; v1 submitted 19 December, 2016; originally announced December 2016.

    Comments: 16 pages, 3 figures

  36. arXiv:1612.03361  [pdf, ps, other

    cs.ET

    An Energy-Efficient VCO-Based Matrix Multiplier Block to Support On-Chip Image Analysis

    Authors: Imon Banerjee, Arindam Sanyal

    Abstract: Images typically are represented as uniformly sampled data in the form of matrix of pixels/voxels. Therefore, matrix multiply-and-accumulate (MAC) forms the core of most state-of-the-art image analysis algorithms. While digital implementation of MAC has generally been the preferred approach, high power consumption is an impediment to adopting it for medical image analysis. In this work, we present… ▽ More

    Submitted 10 December, 2016; originally announced December 2016.

  37. arXiv:1612.00408  [pdf, other

    cs.CV

    Computerized Multiparametric MR image Analysis for Prostate Cancer Aggressiveness-Assessment

    Authors: Imon Banerjee, Lewis Hahn, Geoffrey Sonn, Richard Fan, Daniel L. Rubin

    Abstract: We propose an automated method for detecting aggressive prostate cancer(CaP) (Gleason score >=7) based on a comprehensive analysis of the lesion and the surrounding normal prostate tissue which has been simultaneously captured in T2-weighted MR images, diffusion-weighted images (DWI) and apparent diffusion coefficient maps (ADC). The proposed methodology was tested on a dataset of 79 patients (40… ▽ More

    Submitted 1 December, 2016; originally announced December 2016.

    Comments: NIPS 2016 Workshop on Machine Learning for Health (NIPS ML4HC)

  38. arXiv:1611.07933  [pdf, other

    cs.DM

    Routing Number Of A Pyramid

    Authors: Indranil Banerjee, Dana Richards

    Abstract: In this short note we give the routing number of pyramid graph under the \textit{routing via matching} model introduced by Alon et al\cite{5}. This model can be viewed as a communication scheme on a distributed network. The nodes in the network can communicate via matchings (a step), where a node exchanges data with its partner. Formally, given a connected graph $G$ with vertices labeled from… ▽ More

    Submitted 23 November, 2016; originally announced November 2016.

    Comments: 3 pages, 2 figures

  39. arXiv:1604.04978  [pdf, other

    cs.DM

    Routing and Sorting Via Matchings On Graphs

    Authors: Indranil Banerjee, Dana Richards

    Abstract: The paper is divided in to two parts. In the first part we present some new results for the \textit{routing via matching} model introduced by Alon et al\cite{5}. This model can be viewed as a communication scheme on a distributed network. The nodes in the network can communicate via matchings (a step), where a node exchanges data with its partner. Formally, given a connected graph $G$ with vertice… ▽ More

    Submitted 27 April, 2016; v1 submitted 17 April, 2016; originally announced April 2016.

    Comments: 14 pages, submitted to ESA 2016

  40. arXiv:1508.03698  [pdf, ps, other

    cs.DS

    Sorting Under 1-$\infty$ Cost Model

    Authors: Indranil Banerjee, Dana Richards

    Abstract: In this paper we study the problem of sorting under non-uniform comparison costs, where costs are either 1 or $\infty$. If comparing a pair has an associated cost of $\infty$ then we say that such a pair cannot be compared (forbidden pairs). Along with the set of elements $V$ the input to our problem is a graph $G(V, E)$, whose edges represents the pairs that we can compare incurring an unit of co… ▽ More

    Submitted 10 November, 2015; v1 submitted 15 August, 2015; originally announced August 2015.

    Comments: 12 pages, 1 figure, submitted to STOC 2016

  41. arXiv:1508.02477  [pdf, ps, other

    cs.CG cs.DS

    Computing Maximal Layers Of Points in $E^{f(n)}$

    Authors: Indranil Banerjee, Dana Richards

    Abstract: In this paper we present a randomized algorithm for computing the collection of maximal layers for a point set in $E^{k}$ ($k = f(n)$). The input to our algorithm is a point set $P = \{p_1,...,p_n\}$ with $p_i \in E^{k}$. The proposed algorithm achieves a runtime of $O\left(kn^{2 - {1 \over \log{k}} + \log_k{\left(1 + {2 \over {k+1}}\right)}}\log{n}\right)$ when $P$ is a random order and a runtime… ▽ More

    Submitted 10 November, 2015; v1 submitted 10 August, 2015; originally announced August 2015.

    Comments: 13 pages, submitted to LATIN 2016

  42. arXiv:1305.7103  [pdf

    cs.NI

    Fault-tolerant multipath routing scheme for energy efficient wireless sensor networks

    Authors: Prasenjit Chanak, Tuhina Samanta, Indrajit Banerjee

    Abstract: The main challenge in wireless sensor network is to improve the fault tolerance of each node and also provide an energy efficient fast data routing service. In this paper we propose an energy efficient node fault diagnosis and recovery for wireless sensor networks referred as fault tolerant multipath routing scheme for energy efficient wireless sensor network (FTMRS).The FTMRS is based on multipat… ▽ More

    Submitted 30 May, 2013; originally announced May 2013.

    Journal ref: International Journal of Wireless & Mobile Networks (IJWMN) Vol. 5, No. 2, April 2013

  43. arXiv:1209.0286  [pdf

    cs.CR

    CAWS - Security Algorithms for Wireless Sensor Networks: A Cellular Automata Based Approach

    Authors: Nilanjan Sen, Indrajit Banerjee

    Abstract: Security in the Wireless Sensor Networks (WSN) is a very challenging task because of their dissimilarities with the conventional wireless networks. The related works so far have been done have tried to solve the problem keeping in the mind the constraints of WSNs. In this paper we have proposed a set of cellular automata based security algorithms (CAWS) which consists of CAKD, a Cellular Automata… ▽ More

    Submitted 3 September, 2012; originally announced September 2012.

    Comments: Proceedings of "All India Seminar on Role of ICT in Improving Quality of Life" on March 26-27, 2010 organized by The Institution of Engineers (India) and Bengal Engineering and Science University, Shibpur

    Journal ref: Proceedings of "All India Seminar on Role of ICT in Improving Quality of Life", Dated on March 26-27, 2010; pp: 81-88

  44. arXiv:1205.4928  [pdf, ps, other

    cs.SE

    Grey-box GUI Testing: Efficient Generation of Event Sequences

    Authors: Stephan Arlt, Ishan Banerjee, Cristiano Bertolini, Atif M. Memon, Martin Schäf

    Abstract: Graphical user interfaces (GUIs), due to their event driven nature, present a potentially unbounded space of all possible ways to interact with software. During testing it becomes necessary to effectively sample this space. In this paper we develop algorithms that sample the GUI's input space by only generating sequences that (1) are allowed by the GUI's structure, and (2) chain together only thos… ▽ More

    Submitted 22 May, 2012; originally announced May 2012.

    Comments: 11 pages

    MSC Class: 68N30

  45. arXiv:1109.2430  [pdf

    cs.NI

    CCABC: Cyclic Cellular Automata Based Clustering For Energy Conservation in Sensor Networks

    Authors: Indrajit Banerjee, Prasenjit Chanak, Hafizur Rahaman

    Abstract: Sensor network has been recognized as the most significant technology for next century. Despites of its potential application, wireless sensor network encounters resource restriction such as low power, reduced bandwidth and specially limited power sources. This work proposes an efficient technique for the conservation of energy in a wireless sensor network (WSN) by forming an effective cluster of… ▽ More

    Submitted 12 September, 2011; originally announced September 2011.