Showing 1–50 of 76 results for author: Mehta, D

Searching in archive cs.

  1. arXiv:2408.10340  [pdf, other]

    stat.ML cs.LG q-fin.ST stat.AP

    Can an unsupervised clustering algorithm reproduce a categorization system?

    Authors: Nathalia Castellanos, Dhruv Desai, Sebastian Frank, Stefano Pasquali, Dhagash Mehta

    Abstract: Peer analysis is a critical component of investment management, often relying on expert-provided categorization systems. These systems' consistency is questioned when they do not align with cohorts from unsupervised clustering algorithms optimized for various metrics. We investigate whether unsupervised clustering can reproduce ground truth classes in a labeled dataset, showing that success depend…

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: 9 pages, 4 tables, 28 figures

  2. arXiv:2408.06679  [pdf, other]

    cs.LG q-fin.ST stat.ML

    Case-based Explainability for Random Forest: Prototypes, Critics, Counter-factuals and Semi-factuals

    Authors: Gregory Yampolsky, Dhruv Desai, Mingshu Li, Stefano Pasquali, Dhagash Mehta

    Abstract: The explainability of black-box machine learning algorithms, commonly known as Explainable Artificial Intelligence (XAI), has become crucial for financial and other regulated industrial applications due to regulatory requirements and the need for transparency in business practices. Among the various paradigms of XAI, Explainable Case-Based Reasoning (XCBR) stands out as a pragmatic approach that e…

    Submitted 13 August, 2024; originally announced August 2024.

    Comments: 8 pages, 2 figures, 5 tables

  3. arXiv:2408.04948  [pdf, other]

    cs.CL cs.LG q-fin.ST stat.AP stat.ML

    HybridRAG: Integrating Knowledge Graphs and Vector Retrieval Augmented Generation for Efficient Information Extraction

    Authors: Bhaskarjit Sarmah, Benika Hall, Rohan Rao, Sunil Patel, Stefano Pasquali, Dhagash Mehta

    Abstract: Extraction and interpretation of intricate information from unstructured text data arising in financial applications, such as earnings call transcripts, present substantial challenges to large language models (LLMs) even using the current best practices to use Retrieval Augmented Generation (RAG) (referred to as VectorRAG techniques which utilize vector databases for information retrieval) due to…

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: 9 pages, 2 figures, 5 tables

  4. arXiv:2408.02684  [pdf]

    cs.LG stat.ML

    Open Set Recognition for Random Forest

    Authors: Guanchao Feng, Dhruv Desai, Stefano Pasquali, Dhagash Mehta

    Abstract: In many real-world classification or recognition tasks, it is often difficult to collect training examples that exhaust all possible classes due to, for example, incomplete knowledge during training or ever changing regimes. Therefore, samples from unknown/novel classes may be encountered in testing/deployment. In such scenarios, the classifiers should be able to i) perform classification on known…

    Submitted 1 August, 2024; originally announced August 2024.

  5. arXiv:2408.02355  [pdf, other]

    stat.ML cs.LG q-fin.ST q-fin.TR

    Quantile Regression using Random Forest Proximities

    Authors: Mingshu Li, Bhaskarjit Sarmah, Dhruv Desai, Joshua Rosaler, Snigdha Bhagat, Philip Sommer, Dhagash Mehta

    Abstract: Due to the dynamic nature of financial markets, maintaining models that produce precise predictions over time is difficult. Often the goal isn't just point prediction but determining uncertainty. Quantifying uncertainty, especially the aleatoric uncertainty due to the unpredictable nature of market drivers, helps investors understand varying risk levels. Recently, quantile regression forests (QRF)…

    Submitted 5 August, 2024; originally announced August 2024.

    Comments: 9 pages, 5 figures, 3 tables

  6. arXiv:2408.02078  [pdf, other]

    cs.CV

    LDFaceNet: Latent Diffusion-based Network for High-Fidelity Deepfake Generation

    Authors: Dwij Mehta, Aditya Mehta, Pratik Narang

    Abstract: Over the past decade, there has been tremendous progress in the domain of synthetic media generation. This is mainly due to the powerful methods based on generative adversarial networks (GANs). Very recently, diffusion probabilistic models, which are inspired by non-equilibrium thermodynamics, have taken the spotlight. In the realm of image generation, diffusion models (DMs) have exhibited remarka…

    Submitted 4 August, 2024; originally announced August 2024.

  7. arXiv:2404.01585  [pdf, other]

    cs.DB cs.PF

    FLEXIS: FLEXible Frequent Subgraph Mining using Maximal Independent Sets

    Authors: Akshit Sharma, Sam Reinher, Dinesh Mehta, Bo Wu

    Abstract: Frequent Subgraph Mining (FSM) is the process of identifying common subgraph patterns that surpass a predefined frequency threshold. While FSM is widely applicable in fields like bioinformatics, chemical analysis, and social network anomaly detection, its execution remains time-consuming and complex. This complexity stems from the need to recognize high-frequency subgraphs and ascertain if they ex…

    Submitted 1 April, 2024; originally announced April 2024.

  8. arXiv:2402.04297  [pdf, other]

    cs.CV

    Road Surface Defect Detection -- From Image-based to Non-image-based: A Survey

    Authors: Jongmin Yu, Jiaqi Jiang, Sebastiano Fichera, Paolo Paoletti, Lisa Layzell, Devansh Mehta, Shan Luo

    Abstract: Ensuring traffic safety is crucial, which necessitates the detection and prevention of road surface defects. As a result, there has been a growing interest in the literature on the subject, leading to the development of various road surface defect detection methods. The methods for detecting road defects can be categorised in various ways depending on the input data types or training methodologies…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: Survey papers

  9. arXiv:2402.04064  [pdf, other]

    cs.CV cs.AI

    Multi-class Road Defect Detection and Segmentation using Spatial and Channel-wise Attention for Autonomous Road Repairing

    Authors: Jongmin Yu, Chen Bene Chi, Sebastiano Fichera, Paolo Paoletti, Devansh Mehta, Shan Luo

    Abstract: Road pavement detection and segmentation are critical for developing autonomous road repair systems. However, developing an instance segmentation method that simultaneously performs multi-class defect detection and segmentation is challenging due to the textural simplicity of road pavement image, the diversity of defect geometries, and the morphological ambiguity between classes. We propose a nove…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: Accepted to ICRA 2024

  10. arXiv:2401.17671  [pdf, other]

    cs.CL cs.AI q-bio.NC

    Contextual Feature Extraction Hierarchies Converge in Large Language Models and the Brain

    Authors: Gavin Mischler, Yinghao Aaron Li, Stephan Bickel, Ashesh D. Mehta, Nima Mesgarani

    Abstract: Recent advancements in artificial intelligence have sparked interest in the parallels between large language models (LLMs) and human neural processing, particularly in language comprehension. While prior research has established similarities in the representation of LLMs and the brain, the underlying computational principles that cause this convergence, especially in the context of evolving LLMs,…

    Submitted 31 January, 2024; originally announced January 2024.

    Comments: 19 pages, 5 figures and 4 supplementary figures

  11. arXiv:2312.13225  [pdf, other]

    cs.SE

    Automated DevOps Pipeline Generation for Code Repositories using Large Language Models

    Authors: Deep Mehta, Kartik Rawool, Subodh Gujar, Bowen Xu

    Abstract: Automating software development processes through the orchestration of GitHub Action workflows has revolutionized the efficiency and agility of software delivery pipelines. This paper presents a detailed investigation into the use of Large Language Models (LLMs) specifically, GPT 3.5 and GPT 4 to generate and evaluate GitHub Action workflows for DevOps tasks. Our methodology involves data collecti…

    Submitted 20 December, 2023; originally announced December 2023.

  12. arXiv:2311.01009  [pdf, other]

    cs.CV cs.AI

    Revamping AI Models in Dermatology: Overcoming Critical Challenges for Enhanced Skin Lesion Diagnosis

    Authors: Deval Mehta, Brigid Betz-Stablein, Toan D Nguyen, Yaniv Gal, Adrian Bowling, Martin Haskett, Maithili Sashindranath, Paul Bonnington, Victoria Mar, H Peter Soyer, Zongyuan Ge

    Abstract: The surge in developing deep learning models for diagnosing skin lesions through image analysis is notable, yet their clinical black faces challenges. Current dermatology AI models have limitations: limited number of possible diagnostic outputs, lack of real-world testing on uncommon skin lesions, inability to detect out-of-distribution images, and over-reliance on dermoscopic images. To address t…

    Submitted 2 November, 2023; originally announced November 2023.

  13. arXiv:2310.12428  [pdf, other]

    stat.ML cs.AI cs.LG q-fin.ST stat.ME

    Enhanced Local Explainability and Trust Scores with Random Forest Proximities

    Authors: Joshua Rosaler, Dhruv Desai, Bhaskarjit Sarmah, Dimitrios Vamvourellis, Deran Onay, Dhagash Mehta, Stefano Pasquali

    Abstract: We initiate a novel approach to explain the predictions and out of sample performance of random forest (RF) regression and classification models by exploiting the fact that any RF can be mathematically formulated as an adaptive weighted K nearest-neighbors model. Specifically, we employ a recent result that, for both regression and classification tasks, any RF prediction can be rewritten exactly a…

    Submitted 5 August, 2024; v1 submitted 18 October, 2023; originally announced October 2023.

    Comments: 5 pages, 6 figures

  14. arXiv:2310.10760  [pdf, other]

    cs.CL q-fin.PM q-fin.ST stat.AP

    Towards reducing hallucination in extracting information from financial reports using Large Language Models

    Authors: Bhaskarjit Sarmah, Tianjie Zhu, Dhagash Mehta, Stefano Pasquali

    Abstract: For a financial analyst, the question and answer (Q&A) segment of the company financial report is a crucial piece of information for various analysis and investment decisions. However, extracting valuable insights from the Q&A section has posed considerable challenges as the conventional methods such as detailed reading and note-taking lack scalability and are susceptible to human errors, and Op…

    Submitted 16 October, 2023; originally announced October 2023.

    Comments: 4 pages + references. Accepted for publication in Workshop on Generative AI at the 3rd International Conference on AI-ML Systems 2023, Bengaluru, India

  15. arXiv:2309.08794  [pdf, other]

    cs.AI cs.CV

    Privacy-preserving Early Detection of Epileptic Seizures in Videos

    Authors: Deval Mehta, Shobi Sivathamboo, Hugh Simpson, Patrick Kwan, Terence O'Brien, Zongyuan Ge

    Abstract: In this work, we contribute towards the development of video-based epileptic seizure classification by introducing a novel framework (SETR-PKD), which could achieve privacy-preserved early detection of seizures in videos. Specifically, our framework has two significant components - (1) It is built upon optical flow features extracted from the video of a seizure, which encodes the seizure motion se…

    Submitted 15 September, 2023; originally announced September 2023.

    Comments: Accepted to MICCAI 2023

  16. arXiv:2309.06884  [pdf, other]

    cs.CV

    Autoencoder-Based Visual Anomaly Localization for Manufacturing Quality Control

    Authors: Devang Mehta, Noah Klarmann

    Abstract: Manufacturing industries require efficient and voluminous production of high-quality finished goods. In the context of Industry 4.0, visual anomaly detection poses an optimistic solution for automatically controlled product quality with high precision. In general, automation based on computer vision is a promising solution to prevent bottlenecks at the product quality checkpoint. We considered rec…

    Submitted 3 November, 2023; v1 submitted 13 September, 2023; originally announced September 2023.

  17. Toward Generalizable Machine Learning Models in Speech, Language, and Hearing Sciences: Estimating Sample Size and Reducing Overfitting

    Authors: Hamzeh Ghasemzadeh, Robert E. Hillman, Daryush D. Mehta

    Abstract: This study's first purpose is to provide quantitative evidence that would incentivize researchers to instead use the more robust method of nested cross-validation. The second purpose is to present methods and MATLAB codes for doing power analysis for ML-based analysis during the design of a study. Monte Carlo simulations were used to quantify the interactions between the employed cross-validation…

    Submitted 22 December, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

    Comments: Accepted at JSLHR

    Journal ref: Journal of Speech, Language, and Hearing Research (JSLHR), Volume 67, Issue 3, March 2024, Pages 753-781

  18. arXiv:2308.06882  [pdf, other]

    q-fin.ST cs.LG q-fin.CP stat.AP

    Quantifying Outlierness of Funds from their Categories using Supervised Similarity

    Authors: Dhruv Desai, Ashmita Dhiman, Tushar Sharma, Deepika Sharma, Dhagash Mehta, Stefano Pasquali

    Abstract: Mutual fund categorization has become a standard tool for the investment management industry and is extensively used by allocators for portfolio construction and manager selection, as well as by fund managers for peer analysis and competitive positioning. As a result, a (unintended) miscategorization or lack of precision can significantly impact allocation decisions and investment fund managers. H…

    Submitted 13 August, 2023; originally announced August 2023.

    Comments: 8 pages, 5 tables, 8 figures

  19. arXiv:2305.18703  [pdf, other]

    cs.CL cs.AI

    Domain Specialization as the Key to Make Large Language Models Disruptive: A Comprehensive Survey

    Authors: Chen Ling, Xujiang Zhao, Jiaying Lu, Chengyuan Deng, Can Zheng, Junxiang Wang, Tanmoy Chowdhury, Yun Li, Hejie Cui, Xuchao Zhang, Tianjiao Zhao, Amit Panalkar, Dhagash Mehta, Stefano Pasquali, Wei Cheng, Haoyu Wang, Yanchi Liu, Zhengzhang Chen, Haifeng Chen, Chris White, Quanquan Gu, Jian Pei, Carl Yang, Liang Zhao

    Abstract: Large language models (LLMs) have significantly advanced the field of natural language processing (NLP), providing a highly useful, task-agnostic foundation for a wide range of applications. However, directly applying LLMs to solve sophisticated problems in specific domains meets many hurdles, caused by the heterogeneity of domain data, the sophistication of domain knowledge, the uniqueness of dom…

    Submitted 29 March, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

  20. arXiv:2305.00696  [pdf, other]

    cs.CV

    TPMIL: Trainable Prototype Enhanced Multiple Instance Learning for Whole Slide Image Classification

    Authors: Litao Yang, Deval Mehta, Sidong Liu, Dwarikanath Mahapatra, Antonio Di Ieva, Zongyuan Ge

    Abstract: Digital pathology based on whole slide images (WSIs) plays a key role in cancer diagnosis and clinical practice. Due to the high resolution of the WSI and the unavailability of patch-level annotations, WSI classification is usually formulated as a weakly supervised problem, which relies on multiple instance learning (MIL) based on patches of a WSI. In this paper, we aim to learn an optimal patch-l…

    Submitted 1 May, 2023; originally announced May 2023.

    Comments: Accepted for MIDL 2023

  21. arXiv:2304.14497  [pdf]

    cs.CV cs.RO eess.IV

    Vehicle Safety Management System

    Authors: Chanthini Bhaskar, Bharath Manoj Nair, Dev Mehta

    Abstract: Overtaking is a critical maneuver in driving that requires accurate information about the location and distance of other vehicles on the road. This study suggests a real-time overtaking assistance system that uses a combination of the You Only Look Once (YOLO) object detection algorithm and stereo vision techniques to accurately identify and locate vehicles in front of the driver, and estimate the…

    Submitted 16 April, 2023; originally announced April 2023.

  22. arXiv:2211.16172  [pdf, other]

    cs.CL cs.CY

    Learnings from Technological Interventions in a Low Resource Language: Enhancing Information Access in Gondi

    Authors: Devansh Mehta, Harshita Diddee, Ananya Saxena, Anurag Shukla, Sebastin Santy, Ramaravind Kommiya Mothilal, Brij Mohan Lal Srivastava, Alok Sharma, Vishnu Prasad, Venkanna U, Kalika Bali

    Abstract: The primary obstacle to developing technologies for low-resource languages is the lack of representative, usable data. In this paper, we report the deployment of technology-driven data collection methods for creating a corpus of more than 60,000 translations from Hindi to Gondi, a low-resource vulnerable language spoken by around 2.3 million tribal people in south and central India. During this pr…

    Submitted 29 November, 2022; originally announced November 2022.

    Comments: In Submission (Revised) to Language Resources and Evaluation Journal. arXiv admin note: text overlap with arXiv:2004.10270

  23. arXiv:2209.04406  [pdf, other]

    q-bio.NC cs.SD eess.AS

    Longitudinal Acoustic Speech Tracking Following Pediatric Traumatic Brain Injury

    Authors: Camille Noufi, Adam C. Lammert, Daryush D. Mehta, James R. Williamson, Gregory Ciccarelli, Douglas Sturim, Jordan R. Green, Thomas F. Quatieri, Thomas F. Campbell

    Abstract: Recommendations for common outcome measures following pediatric traumatic brain injury (TBI) support the integration of instrumental measurements alongside perceptual assessment in recovery and treatment plans. A comprehensive set of sensitive, robust and non-invasive measurements is therefore essential in assessing variations in speech characteristics over time following pediatric TBI. In this ar…

    Submitted 9 September, 2022; originally announced September 2022.

  24. arXiv:2208.13318  [pdf, other]

    cs.CY cs.AI cs.SI

    Multi-dimensional Racism Classification during COVID-19: Stigmatization, Offensiveness, Blame, and Exclusion

    Authors: Xin Pei, Deval Mehta

    Abstract: Transcending the binary categorization of racist texts, our study takes cues from social science theories to develop a multi-dimensional model for racism detection, namely stigmatization, offensiveness, blame, and exclusion. With the aid of BERT and topic modeling, this categorical detection enables insights into the underlying subtlety of racist discussion on digital platforms during COVID-19. Ou…

    Submitted 28 August, 2022; originally announced August 2022.

    Comments: Social Network Analysis and Mining (accepted, 2022). arXiv admin note: substantial text overlap with arXiv:2107.08347

  25. arXiv:2208.10639  [pdf, other]

    cs.HC

    Evaluating Cardiovascular Surgical Planning in Mobile Augmented Reality

    Authors: Haoyang Yang, Pratham Darrpan Mehta, Jonathan Leo, Zhiyan Zhou, Megan Dass, Anish Upadhayay, Timothy C. Slesnick, Fawwaz Shaw, Amanda Randles, Duen Horng Chau

    Abstract: Advanced surgical procedures for congenital heart diseases (CHDs) require precise planning before the surgeries. The conventional approach utilizes 3D-printing and cutting physical heart models, which is a time and resource intensive process. While rapid advances in augmented reality (AR) technologies have the potential to streamline surgical planning, there is limited research that evaluates such…

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: IEEE VIS 2022. 2 pages, 1 figure

  26. arXiv:2208.08331  [pdf, other]

    eess.IV cs.CV cs.LG

    Leukocyte Classification using Multimodal Architecture Enhanced by Knowledge Distillation

    Authors: Litao Yang, Deval Mehta, Dwarikanath Mahapatra, Zongyuan Ge

    Abstract: Recently, a lot of automated white blood cells (WBC) or leukocyte classification techniques have been developed. However, all of these methods only utilize a single modality microscopic image i.e. either blood smear or fluorescence based, thus missing the potential of a better learning from multimodal images. In this work, we develop an efficient multimodal architecture based on a first of its kin…

    Submitted 17 August, 2022; originally announced August 2022.

    Comments: Accepted to MICCAI 2022 workshop - MOVI2022

  27. arXiv:2206.15186  [pdf, other]

    cs.CV cs.AI cs.LG

    Out-of-Distribution Detection for Long-tailed and Fine-grained Skin Lesion Images

    Authors: Deval Mehta, Yaniv Gal, Adrian Bowling, Paul Bonnington, Zongyuan Ge

    Abstract: Recent years have witnessed a rapid development of automated methods for skin lesion diagnosis and classification. Due to an increasing deployment of such systems in clinics, it has become important to develop a more robust system towards various Out-of-Distribution (OOD) samples (unknown skin lesions and conditions). However, the current deep learning models trained for skin lesion classification…

    Submitted 30 June, 2022; originally announced June 2022.

    Comments: Accepted to MICCAI 2022 (top 13% paper; early accept)

  28. arXiv:2206.08236  [pdf, other]

    cs.CV cs.LG eess.IV

    Simple and Efficient Architectures for Semantic Segmentation

    Authors: Dushyant Mehta, Andrii Skliar, Haitam Ben Yahia, Shubhankar Borse, Fatih Porikli, Amirhossein Habibian, Tijmen Blankevoort

    Abstract: Though the state-of-the-art architectures for semantic segmentation, such as HRNet, demonstrate impressive accuracy, the complexity arising from their salient design choices hinders a range of model acceleration tools, and further they make use of operations that are inefficient on current hardware. This paper demonstrates that a simple encoder-decoder architecture with a ResNet-like backbone and a sm…

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: To be presented at Efficient Deep Learning for Computer Vision Workshop at CVPR 2022

  29. arXiv:2206.08009  [pdf, other]

    cs.CV cs.LG

    Balancing Discriminability and Transferability for Source-Free Domain Adaptation

    Authors: Jogendra Nath Kundu, Akshay Kulkarni, Suvaansh Bhambri, Deepesh Mehta, Shreyas Kulkarni, Varun Jampani, R. Venkatesh Babu

    Abstract: Conventional domain adaptation (DA) techniques aim to improve domain transferability by learning domain-invariant representations; while concurrently preserving the task-discriminability knowledge gathered from the labeled source data. However, the requirement of simultaneous access to labeled source and unlabeled target renders them unsuitable for the challenging source-free DA setting. The trivi…

    Submitted 16 June, 2022; originally announced June 2022.

    Comments: ICML 2022. Project page: https://sites.google.com/view/mixup-sfda

  30. arXiv:2111.15629  [pdf, other]

    cs.SI cs.CL cs.IR cs.LG

    DiPD: Disruptive event Prediction Dataset from Twitter

    Authors: Sanskar Soni, Dev Mehta, Vinush Vishwanath, Aditi Seetha, Satyendra Singh Chouhan

    Abstract: Riots and protests, if gone out of control, can cause havoc in a country. We have seen examples of this, such as the BLM movement, climate strikes, CAA Movement, and many more, which caused disruption to a large extent. Our motive behind creating this dataset was to use it to develop machine learning systems that can give its users insight into the trending events going on and alert them about the…

    Submitted 25 November, 2021; originally announced November 2021.

  31. arXiv:2107.08347  [pdf, other]

    cs.SI cs.CL

    Beyond a binary of (non)racist tweets: A four-dimensional categorical detection and analysis of racist and xenophobic opinions on Twitter in early Covid-19

    Authors: Xin Pei, Deval Mehta

    Abstract: Transcending the binary categorization of racist and xenophobic texts, this research takes cues from social science theories to develop a four dimensional category for racism and xenophobia detection, namely stigmatization, offensiveness, blame, and exclusion. With the aid of deep learning techniques, this categorical detection enables insights into the nuances of emergent topics reflected in raci…

    Submitted 17 July, 2021; originally announced July 2021.

  32. arXiv:2107.07452  [pdf, other]

    cs.RO cs.AI

    GI-NNet & RGI-NNet: Development of Robotic Grasp Pose Models, Trainable with Large as well as Limited Labelled Training Datasets, under supervised and semi supervised paradigms

    Authors: Priya Shukla, Nilotpal Pramanik, Deepesh Mehta, G. C. Nandi

    Abstract: Our way of grasping objects is challenging for efficient, intelligent and optimal grasp by COBOTs. To streamline the process, here we use deep learning techniques to help robots learn to generate and execute appropriate grasps quickly. We developed a Generative Inception Neural Network (GI-NNet) model, capable of generating antipodal robotic grasps on seen as well as unseen objects. It is trained…

    Submitted 15 July, 2021; originally announced July 2021.

  33. arXiv:2106.12987  [pdf, other]

    q-fin.ST cs.LG q-fin.CP stat.AP

    Fund2Vec: Mutual Funds Similarity using Graph Learning

    Authors: Vipul Satone, Dhruv Desai, Dhagash Mehta

    Abstract: Identifying similar mutual funds with respect to the underlying portfolios has found many applications in financial services ranging from fund recommender systems, competitors analysis, portfolio analytics, marketing and sales, etc. The traditional methods are either qualitative, and hence prone to biases and often not reproducible, or, are known not to capture all the nuances (non-linearities) am…

    Submitted 24 June, 2021; originally announced June 2021.

    Comments: 2 column format, 8 pages, 8 figures, 5 tables

  34. An Adaptive Synaptic Array using Fowler-Nordheim Dynamic Analog Memory

    Authors: Darshit Mehta, Kenji Aono, Shantanu Chakrabartty

    Abstract: In this paper we present a synaptic array that uses dynamical states to implement an analog memory for energy-efficient training of machine learning (ML) systems. Each of the analog memory elements is a micro-dynamical system that is driven by the physics of Fowler-Nordheim (FN) quantum tunneling, whereas the system level learning modulates the state trajectory of the memory ensembles towards the…

    Submitted 13 April, 2021; originally announced April 2021.

    Comments: 22 pages (incl. 7 supplementary pages), 11 figures (incl. 6 supplementary figures)

  35. arXiv:2104.04650  [pdf, other]

    cs.CV cs.AI

    Towards Automated and Marker-less Parkinson Disease Assessment: Predicting UPDRS Scores using Sit-stand videos

    Authors: Deval Mehta, Umar Asif, Tian Hao, Erhan Bilal, Stefan Von Cavallar, Stefan Harrer, Jeffrey Rogers

    Abstract: This paper presents a novel deep learning enabled, video based analysis framework for assessing the Unified Parkinson's Disease Rating Scale (UPDRS) that can be used in the clinic or at home. We report results from comparing the performance of the framework to that of trained clinicians on a population of 32 Parkinson's disease (PD) patients. In-person clinical assessments by trained neurologists ar…

    Submitted 9 April, 2021; originally announced April 2021.

    Comments: Accepted by CVPR Workshops 2021

  36. arXiv:2102.06837  [pdf, other]

    cs.CV

    Learning Speech-driven 3D Conversational Gestures from Video

    Authors: Ikhsanul Habibie, Weipeng Xu, Dushyant Mehta, Lingjie Liu, Hans-Peter Seidel, Gerard Pons-Moll, Mohamed Elgharib, Christian Theobalt

    Abstract: We propose the first approach to automatically and jointly synthesize both the synchronous 3D conversational body and hand gestures, as well as 3D face and head animations, of a virtual character from speech input. Our algorithm uses a CNN architecture that leverages the inherent correlation between facial expression and hand gestures. Synthesis of conversational body gestures is a multi-modal pro…

    Submitted 12 February, 2021; originally announced February 2021.

  37. arXiv:2101.04104  [pdf, other]

    cs.CV

    Neural Re-Rendering of Humans from a Single Image

    Authors: Kripasindhu Sarkar, Dushyant Mehta, Weipeng Xu, Vladislav Golyanik, Christian Theobalt

    Abstract: Human re-rendering from a single image is a starkly under-constrained problem, and state-of-the-art algorithms often exhibit undesired artefacts, such as over-smoothing, unrealistic distortions of the body parts and garments, or implausible changes of the texture. To address these challenges, we propose a new method for neural re-rendering of a human under a novel user-defined pose and viewpoint,…

    Submitted 11 January, 2021; originally announced January 2021.

    Comments: Published in ECCV 2020

  38. arXiv:2012.08859  [pdf, other]

    cs.LG cs.AI cs.CV cs.NE stat.ML

    Distilling Optimal Neural Networks: Rapid Search in Diverse Spaces

    Authors: Bert Moons, Parham Noorzad, Andrii Skliar, Giovanni Mariani, Dushyant Mehta, Chris Lott, Tijmen Blankevoort

    Abstract: Current state-of-the-art Neural Architecture Search (NAS) methods neither efficiently scale to multiple hardware platforms, nor handle diverse architectural search-spaces. To remedy this, we present DONNA (Distilling Optimal Neural Network Architectures), a novel pipeline for rapid, scalable and diverse NAS, that scales to many user scenarios. DONNA consists of three phases. First, an accuracy pre…

    Submitted 27 August, 2021; v1 submitted 16 December, 2020; originally announced December 2020.

    Comments: Accepted at ICCV 2021. Main text 9 pages, Full text 21 pages, 18 figures

  39. arXiv:2011.06557  [pdf, other]

    stat.ML cs.LG stat.ME

    A partition-based similarity for classification distributions

    Authors: Hayden S. Helm, Ronak D. Mehta, Brandon Duderstadt, Weiwei Yang, Christoper M. White, Ali Geisa, Joshua T. Vogelstein, Carey E. Priebe

    Abstract: Herein we define a measure of similarity between classification distributions that is both principled from the perspective of statistical pattern recognition and useful from the perspective of machine learning practitioners. In particular, we propose a novel similarity on classification distributions, dubbed task similarity, that quantifies how an optimally-transformed optimal representation for a…

    Submitted 12 November, 2020; originally announced November 2020.

  40. arXiv:2009.09818  [pdf, other]

    cs.CV

    DeepActsNet: Spatial and Motion features from Face, Hands, and Body Combined with Convolutional and Graph Networks for Improved Action Recognition

    Authors: Umar Asif, Deval Mehta, Stefan von Cavallar, Jianbin Tang, Stefan Harrer

    Abstract: Existing action recognition methods mainly focus on joint and bone information in human body skeleton data due to its robustness to complex backgrounds and dynamic characteristics of the environments. In this paper, we combine body skeleton data with spatial and motion features from face and two hands, and present "Deep Action Stamps (DeepActs)", a novel data representation to encode actions from…

    Submitted 4 June, 2021; v1 submitted 21 September, 2020; originally announced September 2020.

  41. Study on State-of-the-art Cloud Services Integration Capabilities with Autonomous Ground Vehicles

    Authors: Praveen Damacharla, Dhwani Mehta, Ahmad Y Javaid, Vijay K. Devabhaktuni

    Abstract: Computing and intelligence are substantial requirements for the accurate performance of autonomous ground vehicles (AGVs). In this context, the use of cloud services in addition to onboard computers enhances computing and intelligence capabilities of AGVs. In addition, the vast amount of data processed in a cloud system contributes to overall performance and capabilities of the onboard system. Thi…

    Submitted 11 August, 2020; originally announced August 2020.

    Journal ref: 2018 IEEE 88th Vehicular Technology Conference (VTC-Fall), Chicago, IL, USA, 2018, pp. 1-5

  42. arXiv:2006.14078  [pdf, other]

    stat.ML cs.LG cs.SC math.AG stat.AP

    Machine learning the real discriminant locus

    Authors: Edgar A. Bernal, Jonathan D. Hauenstein, Dhagash Mehta, Margaret H. Regan, Tingting Tang

    Abstract: Parameterized systems of polynomial equations arise in many applications in science and engineering with the real solutions describing, for example, equilibria of a dynamical system, linkages satisfying design constraints, and scene reconstruction in computer vision. Since different parameter values can have a different number of real solutions, the parameter space is decomposed into regions whose…

    Submitted 8 August, 2022; v1 submitted 24 June, 2020; originally announced June 2020.

    Comments: 22 pages, 14 figures

  43. arXiv:2006.00123  [pdf, other]

    q-fin.ST cs.LG q-fin.CP stat.ML

    Machine Learning Fund Categorizations

    Authors: Dhagash Mehta, Dhruv Desai, Jithin Pradeep

    Abstract: Given the surge in popularity of mutual funds (including exchange-traded funds (ETFs)) as a diversified financial investment, a vast variety of mutual funds from various investment management firms and diversification strategies have become available in the market. Identifying similar mutual funds among such a wide landscape of mutual funds has become more important than ever because of many appli…

    Submitted 29 May, 2020; originally announced June 2020.

    Comments: 8 pages, 2-column format, 5 figures

  44. arXiv:2005.08224  [pdf]

    cs.SI cs.CL

    #Coronavirus or #Chinesevirus?!: Understanding the negative sentiment reflected in Tweets with racist hashtags across the development of COVID-19

    Authors: Xin Pei, Deval Mehta

    Abstract: Situated in the global outbreak of COVID-19, our study enriches the discussion concerning the emergent racism and xenophobia on social media. With big data extracted from Twitter, we focus on the analysis of negative sentiment reflected in tweets marked with racist hashtags, as racism and xenophobia are more likely to be delivered via the negative sentiment. Especially, we propose a stage-based ap…

    Submitted 17 May, 2020; originally announced May 2020.

  45. arXiv:2005.00116  [pdf, other]

    cs.CV cs.LG

    Sequence Information Channel Concatenation for Improving Camera Trap Image Burst Classification

    Authors: Bhuvan Malladihalli Shashidhara, Darshan Mehta, Yash Kale, Dan Morris, Megan Hazen

    Abstract: Camera Traps are extensively used to observe wildlife in their natural habitat without disturbing the ecosystem. This could help in the early detection of natural or human threats to animals, and help towards ecological conservation. Currently, a massive number of such camera traps have been deployed at various ecological conservation areas around the world, collecting data for decades, thereby re…

    Submitted 5 June, 2020; v1 submitted 30 April, 2020; originally announced May 2020.

    Comments: 8 pages, 4 figures, 2 tables. Git repository can be found at: https://github.com/bhuvi3/camera_trap_animal_classification

    ACM Class: I.4.9; I.4.10; I.2.10

  46. arXiv:2004.12908  [pdf, other]

    cs.AI cs.LG stat.ML

    A Simple Lifelong Learning Approach

    Authors: Joshua T. Vogelstein, Jayanta Dey, Hayden S. Helm, Will LeVine, Ronak D. Mehta, Tyler M. Tomita, Haoyin Xu, Ali Geisa, Qingyang Wang, Gido M. van de Ven, Chenyu Gao, Weiwei Yang, Bryan Tower, Jonathan Larson, Christopher M. White, Carey E. Priebe

    Abstract: In lifelong learning, data are used to improve performance not only on the present task, but also on past and future (unencountered) tasks. While typical transfer learning algorithms can improve performance on future tasks, their performance on prior tasks degrades upon learning new tasks (called forgetting). Many recent approaches for continual or lifelong learning have attempted to maintain perf…

    Submitted 11 June, 2024; v1 submitted 27 April, 2020; originally announced April 2020.

  47. arXiv:2004.10270  [pdf, other]

    cs.CL cs.CY

    Learnings from Technological Interventions in a Low Resource Language: A Case-Study on Gondi

    Authors: Devansh Mehta, Sebastin Santy, Ramaravind Kommiya Mothilal, Brij Mohan Lal Srivastava, Alok Sharma, Anurag Shukla, Vishnu Prasad, Venkanna U, Amit Sharma, Kalika Bali

    Abstract: The primary obstacle to developing technologies for low-resource languages is the lack of usable data. In this paper, we report the adoption and deployment of 4 technology-driven methods of data collection for Gondi, a low-resource vulnerable language spoken by around 2.3 million tribal people in south and central India. In the process of data collection, we also help in its revival by expanding a…

    Submitted 26 January, 2021; v1 submitted 21 April, 2020; originally announced April 2020.

    Comments: Accepted at LREC 2020 (7 pages). D.M. and S.S. contributed equally

  48. XNect: Real-time Multi-Person 3D Motion Capture with a Single RGB Camera

    Authors: Dushyant Mehta, Oleksandr Sotnychenko, Franziska Mueller, Weipeng Xu, Mohamed Elgharib, Pascal Fua, Hans-Peter Seidel, Helge Rhodin, Gerard Pons-Moll, Christian Theobalt

    Abstract: We present a real-time approach for multi-person 3D motion capture at over 30 fps using a single RGB camera. It operates successfully in generic scenes which may contain occlusions by objects and by other people. Our method operates in subsequent stages. The first stage is a convolutional neural network (CNN) that estimates 2D and 3D pose features along with identity assignments for all visible jo…

    Submitted 30 April, 2020; v1 submitted 1 July, 2019; originally announced July 2019.

    Comments: To appear in ACM Transactions on Graphics (SIGGRAPH) 2020

  49. arXiv:1907.00199  [pdf, other]

    cs.CR

    Incidents Are Meant for Learning, Not Repeating: Sharing Knowledge About Security Incidents in Cyber-Physical Systems

    Authors: Faeq Alrimawi, Liliana Pasquale, Deepak Mehta, Nobukazu Yoshioka, Bashar Nuseibeh

    Abstract: Cyber-physical systems (CPSs) are part of most critical infrastructures such as industrial automation and transportation systems. Thus, security incidents targeting CPSs can have disruptive consequences to assets and people. As prior incidents tend to re-occur, sharing knowledge about these incidents can help organizations be more prepared to prevent, mitigate or investigate future incidents. This…

    Submitted 29 June, 2019; originally announced July 2019.

  50. arXiv:1905.07628  [pdf, other]

    cs.LG cs.AI cs.NE stat.ML

    Evolving Rewards to Automate Reinforcement Learning

    Authors: Aleksandra Faust, Anthony Francis, Dar Mehta

    Abstract: Many continuous control tasks have easily formulated objectives, yet using them directly as a reward in reinforcement learning (RL) leads to suboptimal policies. Therefore, many classical control tasks guide RL training using complex rewards, which require tedious hand-tuning. We automate the reward search with AutoRL, an evolutionary layer over standard RL that treats reward tuning as hyperparame…

    Submitted 18 May, 2019; originally announced May 2019.

    Comments: Accepted to 6th AutoML@ICML