Zum Hauptinhalt springen

Showing 1–46 of 46 results for author: Singhal, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2403.03185  [pdf, other

    cs.LG cs.AI

    Preventing Reward Hacking with Occupancy Measure Regularization

    Authors: Cassidy Laidlaw, Shivam Singhal, Anca Dragan

    Abstract: Reward hacking occurs when an agent performs very well with respect to a "proxy" reward function (which may be hand-specified or learned), but poorly with respect to the unknown true reward. Since ensuring good alignment between the proxy and true reward is extremely difficult, one approach to prevent reward hacking is optimizing the proxy conservatively. Prior work has particularly focused on enf… ▽ More

    Submitted 5 March, 2024; originally announced March 2024.

  2. arXiv:2311.14948  [pdf, other

    cs.LG cs.AI cs.CV

    Effective Backdoor Mitigation Depends on the Pre-training Objective

    Authors: Sahil Verma, Gantavya Bhatt, Avi Schwarzschild, Soumye Singhal, Arnav Mohanty Das, Chirag Shah, John P Dickerson, Jeff Bilmes

    Abstract: Despite the advanced capabilities of contemporary machine learning (ML) models, they remain vulnerable to adversarial and backdoor attacks. This vulnerability is particularly concerning in real-world deployments, where compromised models may exhibit unpredictable behavior in critical scenarios. Such risks are heightened by the prevalent practice of collecting massive, internet-sourced datasets for… ▽ More

    Submitted 5 December, 2023; v1 submitted 25 November, 2023; originally announced November 2023.

    Comments: Accepted for oral presentation at BUGS workshop @ NeurIPS 2023 (https://neurips2023-bugs.github.io/)

  3. arXiv:2311.04603  [pdf, other

    cs.GT

    Navigating Resource Conflicts: Co-opetition and Fairness

    Authors: Shiksha Singhal

    Abstract: In today's dynamic and interconnected world, resource constraints pose significant challenges across various domains, ranging from networks, logistics and manufacturing to project management and optimization, etc. Resource-constrained problems (RCPs) represent a class of complex computational problems that require efficient allocation and utilization of limited resources to achieve optimal outcome… ▽ More

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: PhD thesis

  4. arXiv:2310.01780  [pdf, other

    cs.IT

    Social Optimal Freshness in Multi-Source, Multi-Channel Systems via MDP

    Authors: Shiksha Singhal, Veeraruna Kavitha, Vidya Shankar

    Abstract: Many systems necessitate frequent and consistent updates of a specific information. Often this information is updated regularly, where an old packet becomes completely obsolete in the presence of a new packet. In this context, we consider a system with multiple sources, each equipped with a storage buffer of size one, communicating to a common destination via d orthogonal channels. In each slot, t… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 8 pages, 9 figures

  5. arXiv:2304.12902  [pdf, other

    cs.GT

    On the ubiquity of duopolies in constant sum congestion games

    Authors: Shiksha Singhal, Veeraruna Kavitha, Jayakrishnan Nair

    Abstract: We analyse a coalition formation game between strategic service providers of a congestible service. The key novelty of our formulation is that it is a constant sum game, i.e., the total payoff across all service providers (or coalitions of providers) is fixed, and dictated by the size of the market. The game thus captures the tension between resource pooling (to benefit from the resulting statisti… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: arXiv admin note: text overlap with arXiv:2109.12840

  6. arXiv:2304.03518  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    SSS at SemEval-2023 Task 10: Explainable Detection of Online Sexism using Majority Voted Fine-Tuned Transformers

    Authors: Sriya Rallabandi, Sanchit Singhal, Pratinav Seth

    Abstract: This paper describes our submission to Task 10 at SemEval 2023-Explainable Detection of Online Sexism (EDOS), divided into three subtasks. The recent rise in social media platforms has seen an increase in disproportionate levels of sexism experienced by women on social media platforms. This has made detecting and explaining online sexist content more important than ever to make social media safer… ▽ More

    Submitted 23 April, 2023; v1 submitted 7 April, 2023; originally announced April 2023.

    Comments: Accepted at The 17th International Workshop on Semantic Evaluation, ACL 2023

  7. CoReFusion: Contrastive Regularized Fusion for Guided Thermal Super-Resolution

    Authors: Aditya Kasliwal, Pratinav Seth, Sriya Rallabandi, Sanchit Singhal

    Abstract: Thermal imaging has numerous advantages over regular visible-range imaging since it performs well in low-light circumstances. Super-Resolution approaches can broaden their usefulness by replicating accurate high-resolution thermal pictures using measurements from low-cost, low-resolution thermal sensors. Because of the spectral range mismatch between the images, Guided Super-Resolution of thermal… ▽ More

    Submitted 24 April, 2023; v1 submitted 3 April, 2023; originally announced April 2023.

    Comments: Accepted at 19th IEEE Workshop on Perception Beyond the Visible Spectrum,CVPR 2023

  8. arXiv:2302.14045  [pdf, other

    cs.CL cs.CV

    Language Is Not All You Need: Aligning Perception with Language Models

    Authors: Shaohan Huang, Li Dong, Wenhui Wang, Yaru Hao, Saksham Singhal, Shuming Ma, Tengchao Lv, Lei Cui, Owais Khan Mohammed, Barun Patra, Qiang Liu, Kriti Aggarwal, Zewen Chi, Johan Bjorck, Vishrav Chaudhary, Subhojit Som, Xia Song, Furu Wei

    Abstract: A big convergence of language, multimodal perception, action, and world modeling is a key step toward artificial general intelligence. In this work, we introduce Kosmos-1, a Multimodal Large Language Model (MLLM) that can perceive general modalities, learn in context (i.e., few-shot), and follow instructions (i.e., zero-shot). Specifically, we train Kosmos-1 from scratch on web-scale multimodal co… ▽ More

    Submitted 1 March, 2023; v1 submitted 27 February, 2023; originally announced February 2023.

  9. arXiv:2211.14851  [pdf, other

    cs.CV cs.LG

    Performance evaluation of deep segmentation models for Contrails detection

    Authors: Akshat Bhandari, Sriya Rallabandi, Sanchit Singhal, Aditya Kasliwal, Pratinav Seth

    Abstract: Contrails, short for condensation trails, are line-shaped ice clouds produced by aircraft engine exhaust when they fly through cold and humid air. They generate a greenhouse effect by absorbing or directing back to Earth approximately 33% of emitted outgoing longwave radiation. They account for over half of the climate change resulting from aviation activities. Avoiding contrails and adjusting fli… ▽ More

    Submitted 4 November, 2023; v1 submitted 27 November, 2022; originally announced November 2022.

    Comments: Accepted to Tackling Climate Change with Machine Learning: workshop at NeurIPS 2022

  10. arXiv:2211.09061  [pdf, other

    cs.LG

    Squeeze flow of micro-droplets: convolutional neural network with trainable and tunable refinement

    Authors: Aryan Mehboudi, Shrawan Singhal, S. V. Sreenivasan

    Abstract: We propose a platform based on neural networks to solve the image-to-image translation problem in the context of squeeze flow of micro-droplets. In the first part of this paper, we present the governing partial differential equations to lay out the underlying physics of the problem. We also discuss our developed Python package, sqflow, which can potentially serve as free, flexible, and scalable st… ▽ More

    Submitted 16 November, 2022; originally announced November 2022.

    Comments: 27 pages, 18 figures

    MSC Class: 68T07; 68T10; 68T20; 68P25; 94A08; ACM Class: I.2.6; I.2.10; I.4.2; I.4.6; I.4.8; I.4.9; I.4.10; I.5.1; I.5.2; I.5.3; I.5.4; I.6.5; J.2

  11. arXiv:2210.14867  [pdf, other

    cs.CL cs.LG

    Beyond English-Centric Bitexts for Better Multilingual Language Representation Learning

    Authors: Barun Patra, Saksham Singhal, Shaohan Huang, Zewen Chi, Li Dong, Furu Wei, Vishrav Chaudhary, Xia Song

    Abstract: In this paper, we elaborate upon recipes for building multilingual representation models that are not only competitive with existing state-of-the-art models but are also more parameter efficient, thereby promoting better adoption in resource-constrained scenarios and practical applications. We show that going beyond English-centric bitexts, coupled with a novel sampling strategy aimed at reducing… ▽ More

    Submitted 26 October, 2022; originally announced October 2022.

    Comments: Work in progress

  12. arXiv:2210.06423  [pdf, other

    cs.LG cs.CL cs.CV

    Foundation Transformers

    Authors: Hongyu Wang, Shuming Ma, Shaohan Huang, Li Dong, Wenhui Wang, Zhiliang Peng, Yu Wu, Payal Bajaj, Saksham Singhal, Alon Benhaim, Barun Patra, Zhun Liu, Vishrav Chaudhary, Xia Song, Furu Wei

    Abstract: A big convergence of model architectures across language, vision, speech, and multimodal is emerging. However, under the same name "Transformers", the above areas use different implementations for better performance, e.g., Post-LayerNorm for BERT, and Pre-LayerNorm for GPT and vision Transformers. We call for the development of Foundation Transformer for true general-purpose modeling, which serves… ▽ More

    Submitted 19 October, 2022; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: Work in progress

  13. arXiv:2209.08743  [pdf, other

    cs.DC cs.DB

    DINOMO: An Elastic, Scalable, High-Performance Key-Value Store for Disaggregated Persistent Memory (Extended Version)

    Authors: Sekwon Lee, Soujanya Ponnapalli, Sharad Singhal, Marcos K. Aguilera, Kimberly Keeton, Vijay Chidambaram

    Abstract: We present Dinomo, a novel key-value store for disaggregated persistent memory (DPM). Dinomo is the first key-value store for DPM that simultaneously achieves high common-case performance, scalability, and lightweight online reconfiguration. We observe that previously proposed key-value stores for DPM had architectural limitations that prevent them from achieving all three goals simultaneously. Di… ▽ More

    Submitted 18 September, 2022; originally announced September 2022.

    Comments: This is an extended version of the full paper to appear in PVLDB 15.13 (VLDB 2023)

  14. arXiv:2208.10442  [pdf, other

    cs.CV cs.CL

    Image as a Foreign Language: BEiT Pretraining for All Vision and Vision-Language Tasks

    Authors: Wenhui Wang, Hangbo Bao, Li Dong, Johan Bjorck, Zhiliang Peng, Qiang Liu, Kriti Aggarwal, Owais Khan Mohammed, Saksham Singhal, Subhojit Som, Furu Wei

    Abstract: A big convergence of language, vision, and multimodal pretraining is emerging. In this work, we introduce a general-purpose multimodal foundation model BEiT-3, which achieves state-of-the-art transfer performance on both vision and vision-language tasks. Specifically, we advance the big convergence from three aspects: backbone architecture, pretraining task, and model scaling up. We introduce Mult… ▽ More

    Submitted 30 August, 2022; v1 submitted 22 August, 2022; originally announced August 2022.

    Comments: 18 pages

  15. arXiv:2205.10152  [pdf

    cs.ET

    Investigating the impact of BTI, HCI and time-zero variability on neuromorphic spike event generation circuits

    Authors: Shaik Jani Babu, Rohit Singh, Siona Menezes Picardo, Nilesh Goel, Sonal Singhal

    Abstract: Neuromorphic computing refers to brain-inspired computers, that differentiate it from von Neumann architecture. Analog VLSI based neuromorphic circuits is a current research interest. Two simpler spiking integrate and fire neuron model namely axon-Hillock (AH) and voltage integrate, and fire (VIF) circuits are commonly used for generating spike events. This paper discusses the impact of reliabilit… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: 4 pages, 4 figures, IWPSD 2019

  16. arXiv:2205.09519  [pdf

    eess.SY cs.NE

    Design and Mathematical Modelling of Inter Spike Interval of Temporal Neuromorphic Encoder for Image Recognition

    Authors: Aadhitiya VS, Jani Babu Shaik, Sonal Singhal, Siona Menezes Picardo, Nilesh Goel

    Abstract: Neuromorphic computing systems emulate the electrophysiological behavior of the biological nervous system using mixed-mode analog or digital VLSI circuits. These systems show superior accuracy and power efficiency in carrying out cognitive tasks. The neural network architecture used in neuromorphic computing systems is spiking neural networks (SNNs) analogous to the biological nervous system. SNN… ▽ More

    Submitted 19 May, 2022; originally announced May 2022.

    Comments: 4 pages, 6 figures, one table, IEEE ICEE 2020 conference proceeding

  17. arXiv:2204.09179  [pdf, other

    cs.CL cs.LG

    On the Representation Collapse of Sparse Mixture of Experts

    Authors: Zewen Chi, Li Dong, Shaohan Huang, Damai Dai, Shuming Ma, Barun Patra, Saksham Singhal, Payal Bajaj, Xia Song, Xian-Ling Mao, Heyan Huang, Furu Wei

    Abstract: Sparse mixture of experts provides larger model capacity while requiring a constant computational overhead. It employs the routing mechanism to distribute input tokens to the best-matched experts according to their hidden representations. However, learning such a routing mechanism encourages token clustering around expert centroids, implying a trend toward representation collapse. In this work, we… ▽ More

    Submitted 12 October, 2022; v1 submitted 19 April, 2022; originally announced April 2022.

    Comments: NeurIPS 2022

  18. arXiv:2202.07848  [pdf, other

    cs.DC cs.AI

    Singularity: Planet-Scale, Preemptive and Elastic Scheduling of AI Workloads

    Authors: Dharma Shukla, Muthian Sivathanu, Srinidhi Viswanatha, Bhargav Gulavani, Rimma Nehme, Amey Agrawal, Chen Chen, Nipun Kwatra, Ramachandran Ramjee, Pankaj Sharma, Atul Katiyar, Vipul Modi, Vaibhav Sharma, Abhishek Singh, Shreshth Singhal, Kaustubh Welankar, Lu Xun, Ravi Anupindi, Karthik Elangovan, Hasibur Rahman, Zhou Lin, Rahul Seetharaman, Cheng Xu, Eddie Ailijiang, Suresh Krishnappa , et al. (1 additional authors not shown)

    Abstract: Lowering costs by driving high utilization across deep learning workloads is a crucial lever for cloud providers. We present Singularity, Microsoft's globally distributed scheduling service for highly-efficient and reliable execution of deep learning training and inference workloads. At the heart of Singularity is a novel, workload-aware scheduler that can transparently preempt and elastically sca… ▽ More

    Submitted 21 February, 2022; v1 submitted 15 February, 2022; originally announced February 2022.

    Comments: Revision: Fixed some typos

  19. Discrete Simulation Optimization for Tuning Machine Learning Method Hyperparameters

    Authors: Varun Ramamohan, Shobhit Singhal, Aditya Raj Gupta, Nomesh Bhojkumar Bolia

    Abstract: Machine learning (ML) methods are used in most technical areas such as image recognition, product recommendation, financial analysis, medical diagnosis, and predictive maintenance. An important aspect of implementing ML methods involves controlling the learning process for the ML method so as to maximize the performance of the method under consideration. Hyperparameter tuning is the process of sel… ▽ More

    Submitted 20 June, 2023; v1 submitted 16 January, 2022; originally announced January 2022.

    Journal ref: Journal of Simulation (2023)

  20. arXiv:2111.12172  [pdf, other

    cs.CV cs.AI cs.LG

    Multi-label Iterated Learning for Image Classification with Label Ambiguity

    Authors: Sai Rajeswar, Pau Rodriguez, Soumye Singhal, David Vazquez, Aaron Courville

    Abstract: Transfer learning from large-scale pre-trained models has become essential for many computer vision tasks. Recent studies have shown that datasets like ImageNet are weakly labeled since images with multiple object classes present are assigned a single label. This ambiguity biases models towards a single prediction, which could result in the suppression of classes that tend to co-occur in the data.… ▽ More

    Submitted 23 November, 2021; originally announced November 2021.

  21. arXiv:2111.02086  [pdf, other

    cs.CL

    Multilingual Machine Translation Systems from Microsoft for WMT21 Shared Task

    Authors: Jian Yang, Shuming Ma, Haoyang Huang, Dongdong Zhang, Li Dong, Shaohan Huang, Alexandre Muzio, Saksham Singhal, Hany Hassan Awadalla, Xia Song, Furu Wei

    Abstract: This report describes Microsoft's machine translation systems for the WMT21 shared task on large-scale multilingual machine translation. We participated in all three evaluation tracks including Large Track and two Small Tracks where the former one is unconstrained and the latter two are fully constrained. Our model submissions to the shared task were initialized with DeltaLM\footnote{\url{https://… ▽ More

    Submitted 3 November, 2021; originally announced November 2021.

    Comments: WMT21

  22. arXiv:2109.12840  [pdf, other

    cs.GT math.OC

    Coalition Formation in Constant Sum Queueing Games

    Authors: Shiksha Singhal, Veeraruna Kavitha, Jayakrishnan Nair

    Abstract: We analyse a coalition formation game between strategic service providers of a congestible service. The key novelty of our formulation is that it is a constant sum game, i.e., the total payoff across all service providers (or coalitions of providers) is fixed, and dictated by the total size of the market. The game thus captures the tension between resource pooling (to benefit from the resulting st… ▽ More

    Submitted 27 September, 2021; originally announced September 2021.

    Comments: 15 pages, 3 figures

  23. arXiv:2109.07306  [pdf, other

    cs.CL

    Allocating Large Vocabulary Capacity for Cross-lingual Language Model Pre-training

    Authors: Bo Zheng, Li Dong, Shaohan Huang, Saksham Singhal, Wanxiang Che, Ting Liu, Xia Song, Furu Wei

    Abstract: Compared to monolingual models, cross-lingual models usually require a more expressive vocabulary to represent all languages adequately. We find that many languages are under-represented in recent cross-lingual language models due to the limited vocabulary capacity. To this end, we propose an algorithm VoCap to determine the desired vocabulary capacity of each language. However, increasing the voc… ▽ More

    Submitted 15 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021

  24. arXiv:2109.05329  [pdf, other

    cs.DC

    MODC: Resilience for disaggregated memory architectures using task-based programming

    Authors: Kimberly Keeton, Sharad Singhal, Haris Volos, Yupu Zhang, Ramesh Chandra Chaurasiya, Clarete Riana Crasta, Sherin T George, Nagaraju K N, Mashood Abdulla K, Kavitha Natarajan, Porno Shome, Sanish Suresh

    Abstract: Disaggregated memory architectures provide benefits to applications beyond traditional scale out environments, such as independent scaling of compute and memory resources. They also provide an independent failure model, where computations or the compute nodes they run on may fail independently of the disaggregated memory; thus, data that's resident in the disaggregated memory is unaffected by the… ▽ More

    Submitted 11 September, 2021; originally announced September 2021.

    Comments: 9 pages, 4 figures

    ACM Class: D.4.1; D.4.5; D.4.7; C.1.4; E.1

    Journal ref: Proceedings of 2nd Workshop on Resource Disaggregation and Serverless (WORDS'21), Co-located with ASPLOS'21, April 2021

  25. Desk Organization: Effect of Multimodal Inputs on Spatial Relational Learning

    Authors: Ryan Rowe, Shivam Singhal, Daqing Yi, Tapomayukh Bhattacharjee, Siddhartha S. Srinivasa

    Abstract: For robots to operate in a three dimensional world and interact with humans, learning spatial relationships among objects in the surrounding is necessary. Reasoning about the state of the world requires inputs from many different sensory modalities including vision ($V$) and haptics ($H$). We examine the problem of desk organization: learning how humans spatially position different objects on a pl… ▽ More

    Submitted 2 August, 2021; originally announced August 2021.

    Comments: 8 pages, 7 figures

    ACM Class: I.2.9

    Journal ref: 2019 28th IEEE International Conference on Robot and Human Interactive Communication (RO-MAN) (pp. 1-8). IEEE

  26. arXiv:2106.16138  [pdf, other

    cs.CL

    XLM-E: Cross-lingual Language Model Pre-training via ELECTRA

    Authors: Zewen Chi, Shaohan Huang, Li Dong, Shuming Ma, Bo Zheng, Saksham Singhal, Payal Bajaj, Xia Song, Xian-Ling Mao, Heyan Huang, Furu Wei

    Abstract: In this paper, we introduce ELECTRA-style tasks to cross-lingual language model pre-training. Specifically, we present two pre-training tasks, namely multilingual replaced token detection, and translation replaced token detection. Besides, we pretrain the model, named as XLM-E, on both multilingual and parallel corpora. Our model outperforms the baseline models on various cross-lingual understandi… ▽ More

    Submitted 19 April, 2022; v1 submitted 30 June, 2021; originally announced June 2021.

    Comments: ACL-2022

  27. arXiv:2106.13736  [pdf, other

    cs.CL

    DeltaLM: Encoder-Decoder Pre-training for Language Generation and Translation by Augmenting Pretrained Multilingual Encoders

    Authors: Shuming Ma, Li Dong, Shaohan Huang, Dongdong Zhang, Alexandre Muzio, Saksham Singhal, Hany Hassan Awadalla, Xia Song, Furu Wei

    Abstract: While pretrained encoders have achieved success in various natural language understanding (NLU) tasks, there is a gap between these pretrained encoders and natural language generation (NLG). NLG tasks are often based on the encoder-decoder framework, where the pretrained encoders can only benefit part of it. To reduce this gap, we introduce DeltaLM, a pretrained multilingual encoder-decoder model… ▽ More

    Submitted 17 August, 2021; v1 submitted 25 June, 2021; originally announced June 2021.

    Comments: Work in progress

  28. arXiv:2106.08226  [pdf, other

    cs.CL

    Consistency Regularization for Cross-Lingual Fine-Tuning

    Authors: Bo Zheng, Li Dong, Shaohan Huang, Wenhui Wang, Zewen Chi, Saksham Singhal, Wanxiang Che, Ting Liu, Xia Song, Furu Wei

    Abstract: Fine-tuning pre-trained cross-lingual language models can transfer task-specific supervision from one language to the others. In this work, we propose to improve cross-lingual fine-tuning with consistency regularization. Specifically, we use example consistency regularization to penalize the prediction sensitivity to four types of data augmentations, i.e., subword sampling, Gaussian noise, code-sw… ▽ More

    Submitted 15 June, 2021; originally announced June 2021.

    Comments: ACL-2021

  29. arXiv:2103.03096  [pdf, other

    cs.CV cs.AI

    Towards Designing Computer Vision-based Explainable-AI Solution: A Use Case of Livestock Mart Industry

    Authors: Devam Dave, Het Naik, Smiti Singhal, Rudresh Dwivedi, Pankesh Patel

    Abstract: The objective of an online Mart is to match buyers and sellers, to weigh animals and to oversee their sale. A reliable pricing method can be developed by ML models that can read through historical sales data. However, when AI models suggest or recommend a price, that in itself does not reveal too much (i.e., it acts like a black box) about the qualities and the abilities of an animal. An intereste… ▽ More

    Submitted 8 February, 2021; originally announced March 2021.

    Comments: 8 pages, 5 figures

  30. arXiv:2102.11276  [pdf, other

    cs.CL cs.CY

    Factorization of Fact-Checks for Low Resource Indian Languages

    Authors: Shivangi Singhal, Rajiv Ratn Shah, Ponnurangam Kumaraguru

    Abstract: The advancement in technology and accessibility of internet to each individual is revolutionizing the real time information. The liberty to express your thoughts without passing through any credibility check is leading to dissemination of fake content in the ecosystem. It can have disastrous effects on both individuals and society as a whole. The amplification of fake news is becoming rampant in I… ▽ More

    Submitted 23 February, 2021; originally announced February 2021.

    Comments: 15 pages, 6 figures

  31. arXiv:2012.15547  [pdf, other

    cs.CL

    XLM-T: Scaling up Multilingual Machine Translation with Pretrained Cross-lingual Transformer Encoders

    Authors: Shuming Ma, Jian Yang, Haoyang Huang, Zewen Chi, Li Dong, Dongdong Zhang, Hany Hassan Awadalla, Alexandre Muzio, Akiko Eriguchi, Saksham Singhal, Xia Song, Arul Menezes, Furu Wei

    Abstract: Multilingual machine translation enables a single model to translate between different languages. Most existing multilingual machine translation systems adopt a randomly initialized Transformer backbone. In this work, inspired by the recent success of language model pre-training, we present XLM-T, which initializes the model with an off-the-shelf pretrained cross-lingual Transformer encoder and fi… ▽ More

    Submitted 31 December, 2020; originally announced December 2020.

  32. arXiv:2012.13548  [pdf, other

    cs.DC cs.PF

    Graph500 from OCaml-Multicore Perspective

    Authors: Shubhendra Pal Singhal

    Abstract: OCaml is an industrial-strength, multi-paradigm programming language, widely used in industry and academia. OCaml was developed for solving numerical and scientific problems involving large scale data-intensive operations and one such classic application set is Graph Algorithms, which are a core part of most analytics workloads. In this paper, we aim to implement the graph benchmarks along with th… ▽ More

    Submitted 25 December, 2020; originally announced December 2020.

    Comments: 6 pages

  33. arXiv:2012.02960  [pdf, ps, other

    cs.GT math.OC

    Cooperative Ressource Sharing With Adamant Player

    Authors: Shiksha Singhal, Veeraruna Kavitha

    Abstract: Cooperative game theory deals with systems where players want to cooperate to improve their payoffs. But players may choose coalitions in a non-cooperative manner, leading to a coalition-formation game. We consider such a game with several players (willing to cooperate) and an adamant player (unwilling to cooperate) involved in resource-sharing. Here, the strategy of a player is the set of players… ▽ More

    Submitted 5 December, 2020; originally announced December 2020.

  34. arXiv:2011.03195  [pdf, other

    cs.LG cs.AI

    Explainable AI meets Healthcare: A Study on Heart Disease Dataset

    Authors: Devam Dave, Het Naik, Smiti Singhal, Pankesh Patel

    Abstract: With the increasing availability of structured and unstructured data and the swift progress of analytical techniques, Artificial Intelligence (AI) is bringing a revolution to the healthcare industry. With the increasingly indispensable role of AI in healthcare, there are growing concerns over the lack of transparency and explainability in addition to potential bias encountered by predictions of th… ▽ More

    Submitted 6 November, 2020; originally announced November 2020.

    Comments: 23

  35. arXiv:2010.02975  [pdf, other

    cs.CL

    Supervised Seeded Iterated Learning for Interactive Language Learning

    Authors: Yuchen Lu, Soumye Singhal, Florian Strub, Olivier Pietquin, Aaron Courville

    Abstract: Language drift has been one of the major obstacles to train language models through interaction. When word-based conversational agents are trained towards completing a task, they tend to invent their language rather than leveraging natural language. In recent literature, two general methods partially counter this phenomenon: Supervised Selfplay (S2P) and Seeded Iterated Learning (SIL). While S2P j… ▽ More

    Submitted 6 October, 2020; originally announced October 2020.

  36. arXiv:2007.07834  [pdf, other

    cs.CL

    InfoXLM: An Information-Theoretic Framework for Cross-Lingual Language Model Pre-Training

    Authors: Zewen Chi, Li Dong, Furu Wei, Nan Yang, Saksham Singhal, Wenhui Wang, Xia Song, Xian-Ling Mao, Heyan Huang, Ming Zhou

    Abstract: In this work, we present an information-theoretic framework that formulates cross-lingual language model pre-training as maximizing mutual information between multilingual-multi-granularity texts. The unified view helps us to better understand the existing methods for learning cross-lingual representations. More importantly, inspired by the framework, we propose a new pre-training task based on co… ▽ More

    Submitted 7 April, 2021; v1 submitted 15 July, 2020; originally announced July 2020.

    Comments: NAACL 2021

  37. arXiv:2003.12694  [pdf, other

    cs.AI cs.CL

    Countering Language Drift with Seeded Iterated Learning

    Authors: Yuchen Lu, Soumye Singhal, Florian Strub, Olivier Pietquin, Aaron Courville

    Abstract: Pretraining on human corpus and then finetuning in a simulator has become a standard pipeline for training a goal-oriented dialogue agent. Nevertheless, as soon as the agents are finetuned to maximize task completion, they suffer from the so-called language drift phenomenon: they slowly lose syntactic and semantic properties of language as they only focus on solving the task. In this paper, we pro… ▽ More

    Submitted 24 August, 2020; v1 submitted 27 March, 2020; originally announced March 2020.

  38. arXiv:2002.00372  [pdf, other

    cs.AI cs.LG stat.ML

    Interpretability of Blackbox Machine Learning Models through Dataview Extraction and Shadow Model creation

    Authors: Rupam Patir, Shubham Singhal, C. Anantaram, Vikram Goyal

    Abstract: Deep learning models trained using massive amounts of data tend to capture one view of the data and its associated mapping. Different deep learning models built on the same training data may capture different views of the data based on the underlying techniques used. For explaining the decisions arrived by blackbox deep learning models, we argue that it is essential to reproduce that model's view… ▽ More

    Submitted 2 February, 2020; originally announced February 2020.

    Comments: 13 pages, 3 figures, 7 tables

    ACM Class: I.2; I.2.6

  39. arXiv:1912.07991  [pdf, other

    cs.LG cs.CV stat.ML

    Jointly Trained Image and Video Generation using Residual Vectors

    Authors: Yatin Dandi, Aniket Das, Soumye Singhal, Vinay P. Namboodiri, Piyush Rai

    Abstract: In this work, we propose a modeling technique for jointly training image and video generation models by simultaneously learning to map latent variables with a fixed prior onto real images and interpolate over images to generate videos. The proposed approach models the variations in representations using residual vectors encoding the change at each time step over a summary vector for the entire vid… ▽ More

    Submitted 17 December, 2019; originally announced December 2019.

    Comments: Accepted in 2020 Winter Conference on Applications of Computer Vision (WACV '20)

  40. arXiv:1909.13058  [pdf, other

    cs.PL cs.SE

    Profiling minisat based on user defined execution time -- GPROF

    Authors: Shubhendra Pal Singhal, Sandeep Gupta, Pierluigi Nuzzo

    Abstract: This paper focuses on the explanation of the architecture of profilers particularly gprof and how to profile a program according to the user defined input of execution time . Gprof is a profiler available open source in the package of binutils. Gprof records the flow of the program including the callee and caller information and their respective execution time. This information is represented in t… ▽ More

    Submitted 28 September, 2019; originally announced September 2019.

    Comments: 10 figures, 13 pages

  41. arXiv:1909.10012  [pdf, other

    cs.CL

    Is change the only constant? Profile change perspective on #LokSabhaElections2019

    Authors: Kumari Neha, Shashank Srikanth, Sonali Singhal, Shwetanshu Singh, Arun Balaji Buduru, Ponnurangam Kumaraguru

    Abstract: Users on Twitter are identified with the help of their profile attributes that consists of username, display name, profile image, to name a few. The profile attributes that users adopt can reflect their interests, belief, or thematic inclinations. Literature has proposed the implications and significance of profile attribute change for a random population of users. However, the use of profile attr… ▽ More

    Submitted 22 September, 2019; originally announced September 2019.

    Comments: 8 pages, 11 figures, 4 tables

  42. Comparative study of performance of parallel Alpha Beta Pruning for different architectures

    Authors: Shubhendra Pal Singhal, M. Sridevi

    Abstract: Optimization of searching the best possible action depending on various states like state of environment, system goal etc. has been a major area of study in computer systems. In any search algorithm, searching best possible solution from the pool of every possibility known can lead to the construction of the whole state search space popularly called as minimax algorithm. This may lead to a impract… ▽ More

    Submitted 29 October, 2019; v1 submitted 30 August, 2019; originally announced August 2019.

    Comments: 5 pages, 6 figures, Accepted in 2019 IEEE 9th International Advance Computing Conference(IEEE Xplore)

    Journal ref: 2019 IEEE 9th International Conference on Advanced Computing (IACC), Tiruchirappalli, India, 2019, pp. 115-119

  43. arXiv:1908.11648  [pdf, other

    cs.OS

    Porting of eChronos RTOS on RISC-V Architecture

    Authors: Shubhendra Pal Singhal, M. Sridevi, N Sathya Narayanan, M J Shankar Raman

    Abstract: eChronos is a formally verified Real Time Operating System(RTOS) designed for embedded micro-controllers. eChronos was targeted for tightly constrained devices without memory management units. Currently, eChronos is available on proprietary designs like ARM, PowerPC and Intel architectures. eChronos is adopted in safety critical systems like aircraft control system and medical implant devices. eCh… ▽ More

    Submitted 26 December, 2019; v1 submitted 30 August, 2019; originally announced August 2019.

    Comments: 11 pages, 3 figures, Accepted for Publication for Springer LNCS Germany

    Report number: Submission Id - 205

  44. arXiv:1906.07339  [pdf, other

    cs.SE cs.SI

    Reputation Systems -- Fair allocation of points to the editors in the collaborative community

    Authors: Shubhendra Pal Singhal

    Abstract: In this paper we are trying to determine a scheme for the fair allocation of points to the contributors of the collaborative community. The major problem of fair allocation of points among the contributors is that we have to analyze the improvement in the versions of an article. Lets say there is a contribution of major change in content which is relevant vs the contribution of adding a single com… ▽ More

    Submitted 28 June, 2019; v1 submitted 17 June, 2019; originally announced June 2019.

    Comments: 6 pages, 2 figures

    Report number: Volume: 06 Issue: 06, June 2019, Page number(2259-2263) MSC Class: 65-05

    Journal ref: International Research Journal of Engineering and Technology (IRJET) (2019), e-ISSN: 2395-0056, p-ISSN: 2395-0072, June 2019, "https://www.irjet.net/archives/V6/i6/IRJET-V6I6482.pdf"

  45. arXiv:1804.00379  [pdf, other

    cs.LG stat.ML

    Recall Traces: Backtracking Models for Efficient Reinforcement Learning

    Authors: Anirudh Goyal, Philemon Brakel, William Fedus, Soumye Singhal, Timothy Lillicrap, Sergey Levine, Hugo Larochelle, Yoshua Bengio

    Abstract: In many environments only a tiny subset of all states yield high reward. In these cases, few of the interactions with the environment provide a relevant learning signal. Hence, we may want to preferentially train on those high-reward states and the probable trajectories leading to them. To this end, we advocate for the use of a backtracking model that predicts the preceding states that terminate a… ▽ More

    Submitted 28 January, 2019; v1 submitted 1 April, 2018; originally announced April 2018.

    Comments: Accepted at ICLR 2019

  46. arXiv:1310.8540  [pdf, other

    cs.IT

    Quantitative Assessment of TV White Space in India

    Authors: Gaurang Naik, Sudesh Singhal, Animesh Kumar, Abhay Karandikar

    Abstract: Licensed but unutilized television (TV) band spectrum is called as TV white space in the literature. Ultra high frequency (UHF) TV band spectrum has very good wireless radio propagation characteristics. The amount of TV white space in the UHF TV band in India is of interest. Comprehensive quantitative assessment and estimates for the TV white space in the 470-590MHz band for four zones of India (a… ▽ More

    Submitted 31 October, 2013; originally announced October 2013.