Zum Hauptinhalt springen

Showing 1–50 of 52 results for author: Babu, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.11039  [pdf, other

    cs.AI cs.CV

    Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

    Authors: Chunting Zhou, Lili Yu, Arun Babu, Kushal Tirumala, Michihiro Yasunaga, Leonid Shamis, Jacob Kahn, Xuezhe Ma, Luke Zettlemoyer, Omer Levy

    Abstract: We introduce Transfusion, a recipe for training a multi-modal model over discrete and continuous data. Transfusion combines the language modeling loss function (next token prediction) with diffusion to train a single transformer over mixed-modality sequences. We pretrain multiple Transfusion models up to 7B parameters from scratch on a mixture of text and image data, establishing scaling laws with… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 23 pages

  2. arXiv:2408.07841  [pdf

    cs.LG cs.AI eess.SY

    SustainDC -- Benchmarking for Sustainable Data Center Control

    Authors: Avisek Naug, Antonio Guillen, Ricardo Luna, Vineet Gundecha, Desik Rengarajan, Sahand Ghorbanpour, Sajad Mousavi, Ashwin Ramesh Babu, Dejan Markovikj, Lekhapriya D Kashyap, Soumyendu Sarkar

    Abstract: Machine learning has driven an exponential increase in computational demand, leading to massive data centers that consume significant amounts of energy and contribute to climate change. This makes sustainable data center control a priority. In this paper, we introduce SustainDC, a set of Python environments for benchmarking multi-agent reinforcement learning (MARL) algorithms for data centers (DC)… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: Under review at Advances in Neural Information Processing Systems 2024 (NeurIPS 2024)

  3. arXiv:2407.21674  [pdf, other

    cs.CV cs.AI

    Synthetic Simplicity: Unveiling Bias in Medical Data Augmentation

    Authors: Krishan Agyakari Raja Babu, Rachana Sathish, Mrunal Pattanaik, Rahul Venkataramani

    Abstract: Synthetic data is becoming increasingly integral in data-scarce fields such as medical imaging, serving as a substitute for real data. However, its inherent statistical characteristics can significantly impact downstream tasks, potentially compromising deployment performance. In this study, we empirically investigate this issue and uncover a critical phenomenon: downstream neural networks often ex… ▽ More

    Submitted 31 July, 2024; originally announced July 2024.

  4. arXiv:2407.08270  [pdf, other

    cond-mat.mtrl-sci cs.AI cs.LG physics.app-ph

    SciQu: Accelerating Materials Properties Prediction with Automated Literature Mining for Self-Driving Laboratories

    Authors: Anand Babu

    Abstract: Assessing different material properties to predict specific attributes, such as band gap, resistivity, young modulus, work function, and refractive index, is a fundamental requirement for materials science-based applications. However, the process is time-consuming and often requires extensive literature reviews and numerous experiments. Our study addresses these challenges by leveraging machine le… ▽ More

    Submitted 11 July, 2024; originally announced July 2024.

  5. arXiv:2404.12498  [pdf

    cs.LG cs.AI eess.SY

    A Configurable Pythonic Data Center Model for Sustainable Cooling and ML Integration

    Authors: Avisek Naug, Antonio Guillen, Ricardo Luna Gutierrez, Vineet Gundecha, Sahand Ghorbanpour, Sajad Mousavi, Ashwin Ramesh Babu, Soumyendu Sarkar

    Abstract: There have been growing discussions on estimating and subsequently reducing the operational carbon footprint of enterprise data centers. The design and intelligent control for data centers have an important impact on data center carbon footprint. In this paper, we showcase PyDCM, a Python library that enables extremely fast prototyping of data center design and applies reinforcement learning-enabl… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: NeurIPS 2023 Workshop on Tackling Climate Change with Machine Learning https://www.climatechange.ai/papers/neurips2023/15. arXiv admin note: substantial text overlap with arXiv:2310.03906

  6. arXiv:2404.10991  [pdf

    cs.AI cs.LG eess.SY

    Function Approximation for Reinforcement Learning Controller for Energy from Spread Waves

    Authors: Soumyendu Sarkar, Vineet Gundecha, Sahand Ghorbanpour, Alexander Shmakov, Ashwin Ramesh Babu, Avisek Naug, Alexandre Pichard, Mathieu Cocho

    Abstract: The industrial multi-generator Wave Energy Converters (WEC) must handle multiple simultaneous waves coming from different directions called spread waves. These complex devices in challenging circumstances need controllers with multiple objectives of energy capture efficiency, reduction of structural stress to limit maintenance, and proactive protection against high waves. The Multi-Agent Reinforce… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: IJCAI 2023, Proceedings of the Thirty-Second International Joint Conference on Artificial IntelligenceAugust 2023

    Journal ref: IJCAI 2023, Proceedings of the Thirty-Second International Joint Conference on Artificial IntelligenceAugust 2023, Article No 688, Pages 6201 to 6209

  7. arXiv:2404.10786  [pdf

    cs.DC cs.AI cs.LG cs.MA eess.SY

    Sustainability of Data Center Digital Twins with Reinforcement Learning

    Authors: Soumyendu Sarkar, Avisek Naug, Antonio Guillen, Ricardo Luna, Vineet Gundecha, Ashwin Ramesh Babu, Sajad Mousavi

    Abstract: The rapid growth of machine learning (ML) has led to an increased demand for computational power, resulting in larger data centers (DCs) and higher energy consumption. To address this issue and reduce carbon emissions, intelligent design and control of DC components such as IT servers, cabinets, HVAC cooling, flexible load shifting, and battery energy storage are essential. However, the complexity… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 2024 Proceedings of the AAAI Conference on Artificial Intelligence

    Journal ref: Proceedings of the AAAI Conference on Artificial Intelligence, vol. 38, no. 20, pp. 22322-22330, Mar. 2024

  8. arXiv:2403.18985  [pdf

    cs.LG cs.AI cs.CR cs.CV cs.MA

    Robustness and Visual Explanation for Black Box Image, Video, and ECG Signal Classification with Reinforcement Learning

    Authors: Soumyendu Sarkar, Ashwin Ramesh Babu, Sajad Mousavi, Vineet Gundecha, Avisek Naug, Sahand Ghorbanpour

    Abstract: We present a generic Reinforcement Learning (RL) framework optimized for crafting adversarial attacks on different model types spanning from ECG signal analysis (1D), image classification (2D), and video classification (3D). The framework focuses on identifying sensitive regions and inducing misclassifications with minimal distortions and various distortion types. The novel RL method outperforms s… ▽ More

    Submitted 22 April, 2024; v1 submitted 27 March, 2024; originally announced March 2024.

    Comments: AAAI Proceedings reference: https://ojs.aaai.org/index.php/AAAI/article/view/30579

    Journal ref: 2024 Proceedings of the AAAI Conference on Artificial Intelligence

  9. arXiv:2403.14092  [pdf

    cs.LG cs.AI cs.MA eess.SY

    Carbon Footprint Reduction for Sustainable Data Centers in Real-Time

    Authors: Soumyendu Sarkar, Avisek Naug, Ricardo Luna, Antonio Guillen, Vineet Gundecha, Sahand Ghorbanpour, Sajad Mousavi, Dejan Markovikj, Ashwin Ramesh Babu

    Abstract: As machine learning workloads significantly increase energy consumption, sustainable data centers with low carbon emissions are becoming a top priority for governments and corporations worldwide. This requires a paradigm shift in optimizing power consumption in cooling and IT loads, shifting flexible loads based on the availability of renewable energy in the power grid, and leveraging battery stor… ▽ More

    Submitted 25 March, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Journal ref: 2024 Proceedings of the AAAI Conference on Artificial Intelligence

  10. arXiv:2310.18679  [pdf

    cs.CL cs.AI cs.LG

    N-Critics: Self-Refinement of Large Language Models with Ensemble of Critics

    Authors: Sajad Mousavi, Ricardo Luna Gutiérrez, Desik Rengarajan, Vineet Gundecha, Ashwin Ramesh Babu, Avisek Naug, Antonio Guillen, Soumyendu Sarkar

    Abstract: We propose a self-correction mechanism for Large Language Models (LLMs) to mitigate issues such as toxicity and fact hallucination. This method involves refining model outputs through an ensemble of critics and the model's own feedback. Drawing inspiration from human behavior, we explore whether LLMs can emulate the self-correction process observed in humans who often engage in self-reflection and… ▽ More

    Submitted 8 November, 2023; v1 submitted 28 October, 2023; originally announced October 2023.

    Journal ref: NeurIPS 2023 Workshop on Robustness of Few-shot and Zero-shot Learning in Foundation Models 2023(NeurIPS 2023)

  11. arXiv:2310.18626  [pdf

    cs.CV cs.AI cs.LG

    Benchmark Generation Framework with Customizable Distortions for Image Classifier Robustness

    Authors: Soumyendu Sarkar, Ashwin Ramesh Babu, Sajad Mousavi, Zachariah Carmichael, Vineet Gundecha, Sahand Ghorbanpour, Ricardo Luna, Gutierrez Antonio Guillen, Avisek Naug

    Abstract: We present a novel framework for generating adversarial benchmarks to evaluate the robustness of image classification models. Our framework allows users to customize the types of distortions to be optimally applied to images, which helps address the specific distortions relevant to their deployment. The benchmark can generate datasets at various distortion levels to assess the robustness of differ… ▽ More

    Submitted 8 November, 2023; v1 submitted 28 October, 2023; originally announced October 2023.

    Comments: 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV)

  12. arXiv:2310.08715  [pdf, other

    cs.CL cs.AI cs.SD eess.AS

    Toward Joint Language Modeling for Speech Units and Text

    Authors: Ju-Chieh Chou, Chung-Ming Chien, Wei-Ning Hsu, Karen Livescu, Arun Babu, Alexis Conneau, Alexei Baevski, Michael Auli

    Abstract: Speech and text are two major forms of human language. The research community has been focusing on mapping speech to text or vice versa for many years. However, in the field of language modeling, very little effort has been made to model them jointly. In light of this, we explore joint language modeling for speech units and text. Specifically, we compare different speech tokenizers to transform co… ▽ More

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: EMNLP findings 2023

  13. RTDK-BO: High Dimensional Bayesian Optimization with Reinforced Transformer Deep kernels

    Authors: Alexander Shmakov, Avisek Naug, Vineet Gundecha, Sahand Ghorbanpour, Ricardo Luna Gutierrez, Ashwin Ramesh Babu, Antonio Guillen, Soumyendu Sarkar

    Abstract: Bayesian Optimization (BO), guided by Gaussian process (GP) surrogates, has proven to be an invaluable technique for efficient, high-dimensional, black-box optimization, a critical problem inherent to many applications such as industrial design and scientific computing. Recent contributions have introduced reinforcement learning (RL) to improve the optimization performance on both single function… ▽ More

    Submitted 8 November, 2023; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: 2023 IEEE 19th International Conference on Automation Science and Engineering (CASE)

  14. PyDCM: Custom Data Center Models with Reinforcement Learning for Sustainability

    Authors: Avisek Naug, Antonio Guillen, Ricardo Luna Gutiérrez, Vineet Gundecha, Dejan Markovikj, Lekhapriya Dheeraj Kashyap, Lorenz Krause, Sahand Ghorbanpour, Sajad Mousavi, Ashwin Ramesh Babu, Soumyendu Sarkar

    Abstract: The increasing global emphasis on sustainability and reducing carbon emissions is pushing governments and corporations to rethink their approach to data center design and operation. Given their high energy consumption and exponentially large computational workloads, data centers are prime candidates for optimizing power consumption, especially in areas such as cooling and IT energy usage. A signif… ▽ More

    Submitted 26 March, 2024; v1 submitted 5 October, 2023; originally announced October 2023.

    Comments: The 10th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation (BuildSys '23), November 15-16, 2023, Istanbul, Turkey

    Journal ref: 2023 BuildSys '23: Proceedings of the 10th ACM International Conference on Systems for Energy-Efficient Buildings, Cities, and Transportation

  15. arXiv:2309.02591  [pdf, other

    cs.LG cs.CL cs.CV

    Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning

    Authors: Lili Yu, Bowen Shi, Ramakanth Pasunuru, Benjamin Muller, Olga Golovneva, Tianlu Wang, Arun Babu, Binh Tang, Brian Karrer, Shelly Sheynin, Candace Ross, Adam Polyak, Russell Howes, Vasu Sharma, Puxin Xu, Hovhannes Tamoyan, Oron Ashual, Uriel Singer, Shang-Wen Li, Susan Zhang, Richard James, Gargi Ghosh, Yaniv Taigman, Maryam Fazel-Zarandi, Asli Celikyilmaz , et al. (2 additional authors not shown)

    Abstract: We present CM3Leon (pronounced "Chameleon"), a retrieval-augmented, token-based, decoder-only multi-modal language model capable of generating and infilling both text and images. CM3Leon uses the CM3 multi-modal architecture but additionally shows the extreme benefits of scaling up and tuning on more diverse instruction-style data. It is the first multi-modal model trained with a recipe adapted fr… ▽ More

    Submitted 5 September, 2023; originally announced September 2023.

  16. arXiv:2308.11814  [pdf, other

    cs.LG cs.CE physics.ao-ph physics.geo-ph

    Evaluation of Deep Neural Operator Models toward Ocean Forecasting

    Authors: Ellery Rajagopal, Anantha N. S. Babu, Tony Ryu, Patrick J. Haley Jr., Chris Mirabito, Pierre F. J. Lermusiaux

    Abstract: Data-driven, deep-learning modeling frameworks have been recently developed for forecasting time series data. Such machine learning models may be useful in multiple domains including the atmospheric and oceanic ones, and in general, the larger fluids community. The present work investigates the possible effectiveness of such deep neural operator models for reproducing and predicting classic fluid… ▽ More

    Submitted 22 August, 2023; originally announced August 2023.

    Comments: Rajagopal, E., A.N.S. Babu, T. Ryu, P.J. Haley, Jr., C. Mirabito, and P.F.J. Lermusiaux, 2023. Evaluation of Deep Neural Operator Models toward Ocean Forecasting. In OCEANS' 23 IEEE/MTS Gulf Coast, 25-28 September 2023, in press

    MSC Class: 76U60; 86A05; 86-08; 86A10; 86A08; 68T01; 68T07; 68T37 ACM Class: J.2; I.2; I.6

  17. arXiv:2305.13516  [pdf, other

    cs.CL cs.SD eess.AS

    Scaling Speech Technology to 1,000+ Languages

    Authors: Vineel Pratap, Andros Tjandra, Bowen Shi, Paden Tomasello, Arun Babu, Sayani Kundu, Ali Elkahky, Zhaoheng Ni, Apoorv Vyas, Maryam Fazel-Zarandi, Alexei Baevski, Yossi Adi, Xiaohui Zhang, Wei-Ning Hsu, Alexis Conneau, Michael Auli

    Abstract: Expanding the language coverage of speech technology has the potential to improve access to information for many more people. However, current speech technology is restricted to about one hundred languages which is a small fraction of the over 7,000 languages spoken around the world. The Massively Multilingual Speech (MMS) project increases the number of supported languages by 10-40x, depending on… ▽ More

    Submitted 22 May, 2023; originally announced May 2023.

  18. arXiv:2304.04297  [pdf, other

    cs.CV cs.DC eess.IV

    AI-assisted Automated Workflow for Real-time X-ray Ptychography Data Analysis via Federated Resources

    Authors: Anakha V Babu, Tekin Bicer, Saugat Kandel, Tao Zhou, Daniel J. Ching, Steven Henke, Siniša Veseli, Ryan Chard, Antonino Miceli, Mathew Joseph Cherukara

    Abstract: We present an end-to-end automated workflow that uses large-scale remote compute resources and an embedded GPU platform at the edge to enable AI/ML-accelerated real-time analysis of data collected for x-ray ptychography. Ptychography is a lensless method that is being used to image samples through a simultaneous numerical inversion of a large number of diffraction patterns from adjacent overlappin… ▽ More

    Submitted 9 April, 2023; originally announced April 2023.

    Comments: 7 pages, 1 figure, to be published in High Performance Computing for Imaging Conference, Electronic Imaging (HPCI 2023)

  19. arXiv:2212.07525  [pdf, other

    cs.LG cs.CL cs.SD eess.AS

    Efficient Self-supervised Learning with Contextualized Target Representations for Vision, Speech and Language

    Authors: Alexei Baevski, Arun Babu, Wei-Ning Hsu, Michael Auli

    Abstract: Current self-supervised learning algorithms are often modality-specific and require large amounts of computational resources. To address these issues, we increase the training efficiency of data2vec, a learning objective that generalizes across several modalities. We do not encode masked tokens, use a fast convolutional decoder and amortize the effort to build teacher representations. data2vec 2.0… ▽ More

    Submitted 15 June, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

  20. arXiv:2210.11546  [pdf, other

    cs.CR cs.NI

    Proof of Backhaul: Trustfree Measurement of Broadband Bandwidth

    Authors: Peiyao Sheng, Nikita Yadav, Vishal Sevani, Arun Babu, SVR Anand, Himanshu Tyagi, Pramod Viswanath

    Abstract: Recent years have seen the emergence of decentralized wireless networks consisting of nodes hosted by many individuals and small enterprises, reawakening the decades-old dream of open networking. These networks have been deployed in an organic, distributed manner and are driven by new economic models resting on tokenized incentives. A critical requirement for the incentives to scale is the ability… ▽ More

    Submitted 20 October, 2022; originally announced October 2022.

  21. arXiv:2209.09408  [pdf, other

    cs.LG eess.IV

    Deep learning at the edge enables real-time streaming ptychographic imaging

    Authors: Anakha V Babu, Tao Zhou, Saugat Kandel, Tekin Bicer, Zhengchun Liu, William Judge, Daniel J. Ching, Yi Jiang, Sinisa Veseli, Steven Henke, Ryan Chard, Yudong Yao, Ekaterina Sirazitdinova, Geetika Gupta, Martin V. Holt, Ian T. Foster, Antonino Miceli, Mathew J. Cherukara

    Abstract: Coherent microscopy techniques provide an unparalleled multi-scale view of materials across scientific and technological fields, from structural materials to quantum devices, from integrated circuits to biological cells. Driven by the construction of brighter sources and high-rate detectors, coherent X-ray microscopy methods like ptychography are poised to revolutionize nanoscale materials charact… ▽ More

    Submitted 19 September, 2022; originally announced September 2022.

  22. Skip Training for Multi-Agent Reinforcement Learning Controller for Industrial Wave Energy Converters

    Authors: Soumyendu Sarkar, Vineet Gundecha, Sahand Ghorbanpour, Alexander Shmakov, Ashwin Ramesh Babu, Alexandre Pichard, Mathieu Cocho

    Abstract: Recent Wave Energy Converters (WEC) are equipped with multiple legs and generators to maximize energy generation. Traditional controllers have shown limitations to capture complex wave patterns and the controllers must efficiently maximize the energy capture. This paper introduces a Multi-Agent Reinforcement Learning controller (MARL), which outperforms the traditionally used spring damper control… ▽ More

    Submitted 12 September, 2022; originally announced September 2022.

    Comments: 2022 IEEE 18th International Conference on Automation Science and Engineering (CASE) August 20-24, 2022

    Report number: 02

    Journal ref: 2022 IEEE 18th International Conference on Automation Science and Engineering (CASE)

  23. arXiv:2202.03555  [pdf, other

    cs.LG

    data2vec: A General Framework for Self-supervised Learning in Speech, Vision and Language

    Authors: Alexei Baevski, Wei-Ning Hsu, Qiantong Xu, Arun Babu, Jiatao Gu, Michael Auli

    Abstract: While the general idea of self-supervised learning is identical across modalities, the actual algorithms and objectives differ widely because they were developed with a single modality in mind. To get us closer to general self-supervised learning, we present data2vec, a framework that uses the same learning method for either speech, NLP or computer vision. The core idea is to predict latent repres… ▽ More

    Submitted 25 October, 2022; v1 submitted 7 February, 2022; originally announced February 2022.

  24. arXiv:2111.09296  [pdf, other

    cs.CL cs.SD eess.AS

    XLS-R: Self-supervised Cross-lingual Speech Representation Learning at Scale

    Authors: Arun Babu, Changhan Wang, Andros Tjandra, Kushal Lakhotia, Qiantong Xu, Naman Goyal, Kritika Singh, Patrick von Platen, Yatharth Saraf, Juan Pino, Alexei Baevski, Alexis Conneau, Michael Auli

    Abstract: This paper presents XLS-R, a large-scale model for cross-lingual speech representation learning based on wav2vec 2.0. We train models with up to 2B parameters on nearly half a million hours of publicly available speech audio in 128 languages, an order of magnitude more public data than the largest known prior work. Our evaluation covers a wide range of tasks, domains, data regimes and languages, b… ▽ More

    Submitted 16 December, 2021; v1 submitted 17 November, 2021; originally announced November 2021.

  25. arXiv:2109.06262  [pdf, other

    cs.CL

    Evaluating Multiway Multilingual NMT in the Turkic Languages

    Authors: Jamshidbek Mirzakhalov, Anoop Babu, Aigiz Kunafin, Ahsan Wahab, Behzod Moydinboyev, Sardana Ivanova, Mokhiyakhon Uzokova, Shaxnoza Pulatova, Duygu Ataman, Julia Kreutzer, Francis Tyers, Orhan Firat, John Licato, Sriram Chellappan

    Abstract: Despite the increasing number of large and comprehensive machine translation (MT) systems, evaluation of these methods in various languages has been restrained by the lack of high-quality parallel corpora as well as engagement with the people that speak these languages. In this study, we present an evaluation of state-of-the-art approaches to training and evaluating MT systems in 22 languages from… ▽ More

    Submitted 13 September, 2021; originally announced September 2021.

    Comments: 9 pages, 3 figures, 7 tables. To be presented at WMT 2021

  26. arXiv:2109.04593  [pdf, other

    cs.CL cs.LG

    A Large-Scale Study of Machine Translation in the Turkic Languages

    Authors: Jamshidbek Mirzakhalov, Anoop Babu, Duygu Ataman, Sherzod Kariev, Francis Tyers, Otabek Abduraufov, Mammad Hajili, Sardana Ivanova, Abror Khaytbaev, Antonio Laverghetta Jr., Behzodbek Moydinboyev, Esra Onal, Shaxnoza Pulatova, Ahsan Wahab, Orhan Firat, Sriram Chellappan

    Abstract: Recent advances in neural machine translation (NMT) have pushed the quality of machine translation systems to the point where they are becoming widely adopted to build competitive systems. However, there is still a large number of languages that are yet to reap the benefits of NMT. In this paper, we provide the first large-scale case study of the practical application of MT in the Turkic language… ▽ More

    Submitted 9 September, 2021; originally announced September 2021.

    Comments: 9 pages, 1 figure, 8 tables. Main proceedings of EMNLP 2021

  27. arXiv:2106.15009  [pdf, other

    cs.CV

    Understanding Cognitive Fatigue from fMRI Scans with Self-supervised Learning

    Authors: Ashish Jaiswal, Ashwin Ramesh Babu, Mohammad Zaki Zadeh, Fillia Makedon, Glenn Wylie

    Abstract: Functional magnetic resonance imaging (fMRI) is a neuroimaging technique that records neural activations in the brain by capturing the blood oxygen level in different regions based on the task performed by a subject. Given fMRI data, the problem of predicting the state of cognitive fatigue in a person has not been investigated to its full extent. This paper proposes tackling this issue as a multi-… ▽ More

    Submitted 19 September, 2021; v1 submitted 28 June, 2021; originally announced June 2021.

    Comments: 8 pages, 5 figures, 2 tables

  28. arXiv:2106.11890  [pdf, other

    cs.LG

    Latency-Aware Neural Architecture Search with Multi-Objective Bayesian Optimization

    Authors: David Eriksson, Pierce I-Jen Chuang, Samuel Daulton, Peng Xia, Akshat Shrivastava, Arun Babu, Shicong Zhao, Ahmed Aly, Ganesh Venkatesh, Maximilian Balandat

    Abstract: When tuning the architecture and hyperparameters of large machine learning models for on-device deployment, it is desirable to understand the optimal trade-offs between on-device latency and model accuracy. In this work, we leverage recent methodological advances in Bayesian optimization over high-dimensional search spaces and multi-objective Bayesian optimization to efficiently explore these trad… ▽ More

    Submitted 25 June, 2021; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: To Appear at the 8th ICML Workshop on Automated Machine Learning, ICML 2021

  29. arXiv:2104.07275  [pdf, other

    cs.CL

    Span Pointer Networks for Non-Autoregressive Task-Oriented Semantic Parsing

    Authors: Akshat Shrivastava, Pierce Chuang, Arun Babu, Shrey Desai, Abhinav Arora, Alexander Zotov, Ahmed Aly

    Abstract: An effective recipe for building seq2seq, non-autoregressive, task-oriented parsers to map utterances to semantic frames proceeds in three steps: encoding an utterance $x$, predicting a frame's length |y|, and decoding a |y|-sized frame with utterance and ontology tokens. Though empirically strong, these models are typically bottlenecked by length prediction, as even small inaccuracies change the… ▽ More

    Submitted 14 September, 2021; v1 submitted 15 April, 2021; originally announced April 2021.

  30. arXiv:2104.04923  [pdf, other

    cs.CL cs.LG

    Non-Autoregressive Semantic Parsing for Compositional Task-Oriented Dialog

    Authors: Arun Babu, Akshat Shrivastava, Armen Aghajanyan, Ahmed Aly, Angela Fan, Marjan Ghazvininejad

    Abstract: Semantic parsing using sequence-to-sequence models allows parsing of deeper representations compared to traditional word tagging based models. In spite of these advantages, widespread adoption of these models for real-time conversational use cases has been stymied by higher compute requirements and thus higher latency. In this work, we propose a non-autoregressive approach to predict semantic pars… ▽ More

    Submitted 11 April, 2021; originally announced April 2021.

  31. Automated system to measure Tandem Gait to assess executive functions in children

    Authors: Mohammad Zaki Zadeh, Ashwin Ramesh Babu, Ashish Jaiswal, Maria Kyrarini, Morris Bell, Fillia Makedon

    Abstract: As mobile technologies have become ubiquitous in recent years, computer-based cognitive tests have become more popular and efficient. In this work, we focus on assessing motor function in children by analyzing their gait movements. Although there has been a lot of research on designing automated assessment systems for gait analysis, most of these efforts use obtrusive wearable sensors for measurin… ▽ More

    Submitted 28 December, 2020; v1 submitted 15 December, 2020; originally announced December 2020.

  32. arXiv:2011.00362  [pdf, other

    cs.CV

    A Survey on Contrastive Self-supervised Learning

    Authors: Ashish Jaiswal, Ashwin Ramesh Babu, Mohammad Zaki Zadeh, Debapriya Banerjee, Fillia Makedon

    Abstract: Self-supervised learning has gained popularity because of its ability to avoid the cost of annotating large-scale datasets. It is capable of adopting self-defined pseudo labels as supervision and use the learned representations for several downstream tasks. Specifically, contrastive learning has recently become a dominant component in self-supervised learning methods for computer vision, natural l… ▽ More

    Submitted 7 February, 2021; v1 submitted 31 October, 2020; originally announced November 2020.

    Comments: 20 pages, 18 figures, 6 tables

  33. Self-Supervised Human Activity Recognition by Augmenting Generative Adversarial Networks

    Authors: Mohammad Zaki Zadeh, Ashwin Ramesh Babu, Ashish Jaiswal, Fillia Makedon

    Abstract: This article proposes a novel approach for augmenting generative adversarial network (GAN) with a self-supervised task in order to improve its ability for encoding video representations that are useful in downstream tasks such as human activity recognition. In the proposed method, input video frames are randomly transformed by different spatial transformations, such as rotation, translation and sh… ▽ More

    Submitted 28 December, 2020; v1 submitted 26 August, 2020; originally announced August 2020.

  34. arXiv:2008.02189  [pdf, other

    cs.NE cs.AR cs.ET

    SpinAPS: A High-Performance Spintronic Accelerator for Probabilistic Spiking Neural Networks

    Authors: Anakha V Babu, Osvaldo Simeone, Bipin Rajendran

    Abstract: We discuss a high-performance and high-throughput hardware accelerator for probabilistic Spiking Neural Networks (SNNs) based on Generalized Linear Model (GLM) neurons, that uses binary STT-RAM devices as synapses and digital CMOS logic for neurons. The inference accelerator, termed "SpinAPS" for Spintronic Accelerator for Probabilistic SNNs, implements a principled direct learning rule for first-… ▽ More

    Submitted 5 August, 2020; originally announced August 2020.

    Comments: 25 pages, 10 figures, Submitted to Elsevier Neural Networks for review

  35. arXiv:2004.10746  [pdf, other

    cs.LG cs.AI

    Chip Placement with Deep Reinforcement Learning

    Authors: Azalia Mirhoseini, Anna Goldie, Mustafa Yazgan, Joe Jiang, Ebrahim Songhori, Shen Wang, Young-Joon Lee, Eric Johnson, Omkar Pathak, Sungmin Bae, Azade Nazi, Jiwoo Pak, Andy Tong, Kavya Srinivasa, William Hang, Emre Tuncer, Anand Babu, Quoc V. Le, James Laudon, Richard Ho, Roger Carpenter, Jeff Dean

    Abstract: In this work, we present a learning-based approach to chip placement, one of the most complex and time-consuming stages of the chip design process. Unlike prior methods, our approach has the ability to learn from past experience and improve over time. In particular, as we train over a greater number of chip blocks, our method becomes better at rapidly generating optimized placements for previously… ▽ More

    Submitted 22 April, 2020; originally announced April 2020.

  36. arXiv:2004.02810  [pdf

    cs.CV

    Computer Vision and Abnormal Patient Gait Assessment a Comparison of Machine Learning Models

    Authors: Jasmin Hundall, Benson A. Babu

    Abstract: Abnormal gait, its associated falls and complications have high patient morbidity, mortality. Computer vision detects, predicts patient gait abnormalities, assesses fall risk and serves as clinical decision support tool for physicians. This paper performs a systematic review of how computer vision, machine learning models perform an abnormal patient's gait assessment. Computer vision is beneficial… ▽ More

    Submitted 21 March, 2020; originally announced April 2020.

    Comments: 2 tables

  37. arXiv:2002.01535  [pdf, ps, other

    cs.CL cs.LG

    Lightweight Convolutional Representations for On-Device Natural Language Processing

    Authors: Shrey Desai, Geoffrey Goh, Arun Babu, Ahmed Aly

    Abstract: The increasing computational and memory complexities of deep neural networks have made it difficult to deploy them on low-resource electronic devices (e.g., mobile phones, tablets, wearables). Practitioners have developed numerous model compression methods to address these concerns, but few have condensed input representations themselves. In this work, we propose a fast, accurate, and lightweight… ▽ More

    Submitted 4 February, 2020; originally announced February 2020.

    Comments: Accepted to MLSys 2020

  38. arXiv:1905.10979  [pdf, other

    cs.LG stat.ML

    Scalable K-Medoids via True Error Bound and Familywise Bandits

    Authors: Aravindakshan Babu, Saurabh Agarwal, Sudarshan Babu, Hariharan Chandrasekaran

    Abstract: K-Medoids(KM) is a standard clustering method, used extensively on semi-metric data.Error analyses of KM have traditionally used an in-sample notion of error,which can be far from the true error and suffer from generalization gap. We formalize the true K-Medoid error based on the underlying data distribution.We decompose the true error into fundamental statistical problems of: minimum estimation (… ▽ More

    Submitted 29 October, 2019; v1 submitted 27 May, 2019; originally announced May 2019.

  39. arXiv:1804.01174  [pdf, other

    cs.CV

    Towards Deep Learning based Hand Keypoints Detection for Rapid Sequential Movements from RGB Images

    Authors: Srujana Gattupalli, Ashwin Ramesh Babu, James Robert Brady, Fillia Makedon, Vassilis Athitsos

    Abstract: Hand keypoints detection and pose estimation has numerous applications in computer vision, but it is still an unsolved problem in many aspects. An application of hand keypoints detection is in performing cognitive assessments of a subject by observing the performance of that subject in physical tasks involving rapid finger motion. As a part of this work, we introduce a novel hand key-points benchm… ▽ More

    Submitted 3 April, 2018; originally announced April 2018.

  40. arXiv:1802.03201  [pdf, other

    cs.CR

    Freestyle, a randomized version of ChaCha for resisting offline brute-force and dictionary attacks

    Authors: P. Arun Babu, Jithin Jose Thomas

    Abstract: This paper introduces Freestyle, a randomized and variable round version of the ChaCha cipher. Freestyle uses the concept of hash based halting condition where a decryption attempt with an incorrect key is likely to take longer time to halt. This makes Freestyle resistant to key-guessing attacks i.e. brute-force and dictionary based attacks. Freestyle demonstrates a novel approach for ciphertext r… ▽ More

    Submitted 19 February, 2018; v1 submitted 9 February, 2018; originally announced February 2018.

  41. arXiv:1711.03640  [pdf, other

    stat.ML cs.AI cs.LG cs.NE

    Stochastic Deep Learning in Memristive Networks

    Authors: Anakha V Babu, Bipin Rajendran

    Abstract: We study the performance of stochastically trained deep neural networks (DNNs) whose synaptic weights are implemented using emerging memristive devices that exhibit limited dynamic range, resolution, and variability in their programming characteristics. We show that a key device parameter to optimize the learning efficiency of DNNs is the variability in its programming characteristics. DNNs with s… ▽ More

    Submitted 9 November, 2017; originally announced November 2017.

    Comments: 4 pages, 5 figures, accepted at ICECS 2017

  42. Question Analysis for Arabic Question Answering Systems

    Authors: Waheeb Ahmed, Dr. Anto P Babu

    Abstract: The first step of processing a question in Question Answering(QA) Systems is to carry out a detailed analysis of the question for the purpose of determining what it is asking for and how to perfectly approach answering it. Our Question analysis uses several techniques to analyze any question given in natural language: a Stanford POS Tagger & parser for Arabic language, a named entity recognizer, t… ▽ More

    Submitted 11 January, 2017; originally announced January 2017.

    Comments: 10 pages, 3 figures, published article in IJNLC

  43. arXiv:1509.00693  [pdf

    cs.DB cs.IR

    A Fuzzy Clustering Based Approach for Mining Usage Profiles from Web Log Data

    Authors: Zahid Ansari, Mohammad Fazle Azeem, A. Vinaya Babu, Waseem Ahmed

    Abstract: The World Wide Web continues to grow at an amazing rate in both the size and complexity of Web sites and is well on its way to being the main reservoir of information and data. Due to this increase in growth and complexity of WWW, web site publishers are facing increasing difficulty in attracting and retaining users. To design popular and attractive websites publishers must understand their users… ▽ More

    Submitted 1 September, 2015; originally announced September 2015.

    Journal ref: International Journal of Computer Science and Information Security, pp. 70-79 Vol. 9, No. 6, June 2011. (ISSN 1947-5500, IJCSIS Publications, United State)

  44. arXiv:1509.00692  [pdf

    cs.DB cs.IR cs.LG

    Discovery of Web Usage Profiles Using Various Clustering Techniques

    Authors: Zahid Ansari, Waseem Ahmed, M. F. Azeem, A. Vinaya Babu

    Abstract: The explosive growth of World Wide Web (WWW) has necessitated the development of Web personalization systems in order to understand the user preferences to dynamically serve customized content to individual users. To reveal information about user preferences from Web usage data, Web Usage Mining (WUM) techniques are extensively being applied to the Web log data. Clustering techniques are widely us… ▽ More

    Submitted 1 September, 2015; originally announced September 2015.

    Comments: arXiv admin note: substantial text overlap with arXiv:1507.03340

    Journal ref: International Journal of Computer Information Systems, pp. 18-27 Vol. 1, No. 3, July 2011. (ISSN 2229-5208, Silicon Valley Publishers, United Kingdom)

  45. arXiv:1509.00690  [pdf

    cs.DB cs.AI cs.IR

    A Fuzzy Approach for Feature Evaluation and Dimensionality Reduction to Improve the Quality of Web Usage Mining Results

    Authors: Zahid Ansari, M. F. Azeem, A. Vinaya Babu, Waseem Ahmed

    Abstract: Web Usage Mining is the application of data mining techniques to web usage log repositories in order to discover the usage patterns that can be used to analyze the users navigational behavior. During the preprocessing stage, raw web log data is transformed into a set of user profiles. Each user profile captures a set of URLs representing a user session. Clustering can be applied to this sessionize… ▽ More

    Submitted 1 September, 2015; originally announced September 2015.

    Journal ref: International Journal on Advanced Science Engineering and Information Technology, pp. 67-73 Vol. 2 No. 6, 2012. (ISSN: 2088-5334, INSIGHT Publishers, Indonesia)

  46. arXiv:1507.03340  [pdf

    cs.LG cs.SI

    Quantitative Evaluation of Performance and Validity Indices for Clustering the Web Navigational Sessions

    Authors: Zahid Ansari, M. F. Azeem, Waseem Ahmed, A. Vinaya Babu

    Abstract: Clustering techniques are widely used in Web Usage Mining to capture similar interests and trends among users accessing a Web site. For this purpose, web access logs generated at a particular web site are preprocessed to discover the user navigational sessions. Clustering techniques are then applied to group the user session data into user session clusters, where intercluster similarities are mini… ▽ More

    Submitted 13 July, 2015; originally announced July 2015.

    Journal ref: World of Computer Science and Information Technology Journal pp. 217-226, Vol. 1, No. 5, June 2011. (ISSN: 2221- 0741, WCSIT Publisher, Unites States)

  47. arXiv:1104.0848  [pdf, ps, other

    cs.DS cs.FL

    Streaming algorithms for language recognition problems

    Authors: Ajesh Babu, Nutan Limaye, Jaikumar Radhakrishnan, Girish Varma

    Abstract: We study the complexity of the following problems in the streaming model. Membership testing for \DLIN We show that every language in \DLIN\ can be recognised by a randomized one-pass $O(\log n)$ space algorithm with inverse polynomial one-sided error, and by a deterministic p-pass $O(n/p)$ space algorithm. We show that these algorithms are optimal. Membership testing for \LL$(k)$ For language… ▽ More

    Submitted 5 April, 2011; originally announced April 2011.

  48. arXiv:1011.1058  [pdf, ps, other

    cs.DM

    An entropy based proof of the Moore bound for irregular graphs

    Authors: S. Ajesh Babu, Jaikumar Radhakrishnan

    Abstract: We provide proofs of the following theorems by considering the entropy of random walks: Theorem 1.(Alon, Hoory and Linial) Let G be an undirected simple graph with n vertices, girth g, minimum degree at least 2 and average degree d: Odd girth: If g=2r+1,then n \geq 1 + d*(\Sum_{i=0}^{r-1}(d-1)^i) Even girth: If g=2r,then n \geq 2*(\Sum_{i=0}^{r-1} (d-1)^i) Theorem 2.(Hoory) Let G = (V_L,V_R,E) be… ▽ More

    Submitted 3 November, 2010; originally announced November 2010.

    Comments: 6 pages

  49. arXiv:1010.3862  [pdf

    cs.CR

    A New Non Linear, Time Stamped & Feed Back Model Based Encryption Mechanism with Acknowledgement Support

    Authors: A. V. N. Krishna, A. Vinaya Babu

    Abstract: In this work a model is going to be used which develops data distributed over a identified value which is used as nonce (IV). The model considers an equilibrium equation which is a function of non linear relationships, time variant and nonce variant values and takes the feed back of earlier round as input to the present round. The process is repeated for different timings which are used as time st… ▽ More

    Submitted 19 October, 2010; originally announced October 2010.

    Comments: 6 pages

    Journal ref: IJoAT Vol 1, No 2 (October 2010)

  50. arXiv:1007.0411  [pdf

    cs.NI

    Role of Statistical tests in Estimation of the Security of a New Encryption Algorithm

    Authors: Addepalli V. N Krishna, A Vinay Babu

    Abstract: Encryption study basically deals with three levels of algorithms. The first algorithm deals with encryption mechanism, second deals with decryption Mechanism and the third discusses about the generation of keys and sub keys used in the encryption study. In the given study, a new algorithm is discussed. The algorithm executes a series of steps and generates a sequence. This sequence is being used a… ▽ More

    Submitted 1 July, 2010; originally announced July 2010.

    Comments: http://ijict.org/index.php/ijoat/article/view/statistical-tests-for-encryption-algorithm

    Journal ref: International Journal of Advancements in Technology, Vol 1, No 1 (2010)