Showing 1–18 of 18 results for author: Desarkar, M S

  1. arXiv:2403.06292  [pdf, other]

    cs.CV cs.CL

    Transformer based Multitask Learning for Image Captioning and Object Detection

    Authors: Debolena Basak, P. K. Srijith, Maunendra Sankar Desarkar

    Abstract: In several real-world scenarios like autonomous navigation and mobility, to obtain a better visual understanding of the surroundings, image captioning and object detection play a crucial role. This work introduces a novel multitask learning framework that combines image captioning and object detection into a joint model. We propose TICOD, Transformer-based Image Captioning and Object detection mod…

    Submitted 10 March, 2024; originally announced March 2024.

    Comments: Accepted at PAKDD 2024

  2. arXiv:2307.15455  [pdf, other]

    cs.CL

    Trie-NLG: Trie Context Augmentation to Improve Personalized Query Auto-Completion for Short and Unseen Prefixes

    Authors: Kaushal Kumar Maurya, Maunendra Sankar Desarkar, Manish Gupta, Puneet Agrawal

    Abstract: Query auto-completion (QAC) aims to suggest plausible completions for a given query prefix. Traditionally, QAC systems have leveraged tries curated from historical query logs to suggest most popular completions. In this context, there are two specific scenarios that are difficult to handle for any QAC system: short prefixes (which are inherently ambiguous) and unseen prefixes. Recently, personaliz…

    Submitted 23 October, 2023; v1 submitted 28 July, 2023; originally announced July 2023.

    Comments: ECML-PKDD 2023 (Journal Track)

    Journal ref: Data Mining and Knowledge Discovery (DAMI) 2023

  3. arXiv:2305.05214  [pdf, other]

    cs.CL

    CharSpan: Utilizing Lexical Similarity to Enable Zero-Shot Machine Translation for Extremely Low-resource Languages

    Authors: Kaushal Kumar Maurya, Rahul Kejriwal, Maunendra Sankar Desarkar, Anoop Kunchukuttan

    Abstract: We address the task of machine translation (MT) from extremely low-resource language (ELRL) to English by leveraging cross-lingual transfer from 'closely-related' high-resource language (HRL). The development of an MT system for ELRL is challenging because these languages typically lack parallel corpora and monolingual corpora, and their representations are absent from large multilingual language…

    Submitted 4 February, 2024; v1 submitted 9 May, 2023; originally announced May 2023.

    Journal ref: EACL 2024

  4. arXiv:2212.14599  [pdf]

    cs.LG cs.AI

    ComplAI: Theory of A Unified Framework for Multi-factor Assessment of Black-Box Supervised Machine Learning Models

    Authors: Arkadipta De, Satya Swaroop Gudipudi, Sourab Panchanan, Maunendra Sankar Desarkar

    Abstract: The advances in Artificial Intelligence are creating new opportunities to improve lives of people around the world, from business to healthcare, from lifestyle to education. For example, some systems profile the users using their demographic and behavioral characteristics to make certain domain-specific predictions. Often, such predictions impact the life of the user directly or indirectly (e.g.,…

    Submitted 30 December, 2022; originally announced December 2022.

    Comments: Full Length Theory Paper for Poster Paper accepted at ACM SAC 2023 (SIGAPP)

  5. arXiv:2210.06394  [pdf, other]

    cs.CL

    On Text Style Transfer via Style Masked Language Models

    Authors: Sharan Narasimhan, Pooja Shekar, Suvodip Dey, Maunendra Sankar Desarkar

    Abstract: Text Style Transfer (TST) is performable through approaches such as latent space disentanglement, cycle-consistency losses, prototype editing etc. The prototype editing approach, which is known to be quite successful in TST, involves two key phases a) Masking of source style-associated tokens and b) Reconstruction of this source-style masked sentence conditioned with the target style. We follow a…

    Submitted 12 October, 2022; originally announced October 2022.

  6. arXiv:2210.06282  [pdf, other]

    cs.CL

    DialoGen: Generalized Long-Range Context Representation for Dialogue Systems

    Authors: Suvodip Dey, Maunendra Sankar Desarkar, Asif Ekbal, P. K. Srijith

    Abstract: Long-range context modeling is crucial to both dialogue understanding and generation. The most popular method for dialogue context representation is to concatenate the last-$k$ utterances in chronological order. However, this method may not be ideal for conversations containing long-range dependencies, i.e., when there is a need to look beyond last-$k$ utterances to generate a meaningful response.…

    Submitted 3 October, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted at PACLIC 2023

  7. arXiv:2210.00213  [pdf, other]

    cs.LG

    HyperHawkes: Hypernetwork based Neural Temporal Point Process

    Authors: Manisha Dubey, P. K. Srijith, Maunendra Sankar Desarkar

    Abstract: Temporal point process serves as an essential tool for modeling time-to-event data in continuous time space. Despite having massive amounts of event sequence data from various domains like social media, healthcare etc., real world application of temporal point process faces two major challenges: 1) it is not generalizable to predict events from unseen sequences in dynamic environment 2) they are n…

    Submitted 1 October, 2022; originally announced October 2022.

    Comments: 9 pages, 2 figures

  8. arXiv:2205.02309  [pdf, other]

    cs.CL

    Towards Robust and Semantically Organised Latent Representations for Unsupervised Text Style Transfer

    Authors: Sharan Narasimhan, Suvodip Dey, Maunendra Sankar Desarkar

    Abstract: Recent studies show that auto-encoder based approaches successfully perform language generation, smooth sentence interpolation, and style transfer over unseen attributes using unlabelled datasets in a zero-shot manner. The latent space geometry of such models is organised well enough to perform on datasets where the style is "coarse-grained" i.e. a small fraction of words alone in a sentence are e…

    Submitted 4 May, 2022; originally announced May 2022.

    Comments: NAACL 2022 Main Conference paper

  9. arXiv:2204.03375  [pdf, other]

    cs.CL

    Towards Fair Evaluation of Dialogue State Tracking by Flexible Incorporation of Turn-level Performances

    Authors: Suvodip Dey, Ramamohan Kummara, Maunendra Sankar Desarkar

    Abstract: Dialogue State Tracking (DST) is primarily evaluated using Joint Goal Accuracy (JGA) defined as the fraction of turns where the ground-truth dialogue state exactly matches the prediction. Generally in DST, the dialogue state or belief state for a given turn contains all the intents shown by the user till that turn. Due to this cumulative nature of the belief state, it is difficult to get a correct…

    Submitted 7 April, 2022; originally announced April 2022.

    Comments: ACL 2022 Main Conference (short paper)

  10. arXiv:2203.10250  [pdf, other]

    cs.CL

    Meta-X$_{NLG}$: A Meta-Learning Approach Based on Language Clustering for Zero-Shot Cross-Lingual Transfer and Generation

    Authors: Kaushal Kumar Maurya, Maunendra Sankar Desarkar

    Abstract: Recently, the NLP community has witnessed a rapid advancement in multilingual and cross-lingual transfer research where the supervision is transferred from high-resource languages (HRLs) to low-resource languages (LRLs). However, the cross-lingual transfer is not uniform across languages, particularly in the zero-shot setting. Towards this goal, one promising research direction is to learn shareab…

    Submitted 19 March, 2022; originally announced March 2022.

    Journal ref: Findings of ACL 2022

  11. arXiv:2203.02912  [pdf, other]

    cs.CL

    Graph Neural Network Enhanced Language Models for Efficient Multilingual Text Classification

    Authors: Samujjwal Ghosh, Subhadeep Maji, Maunendra Sankar Desarkar

    Abstract: Online social media works as a source of various valuable and actionable information during disasters. This information might be available in multiple languages due to the nature of user generated content. An effective system to automatically identify and categorize this actionable information should be capable of handling multiple languages under limited supervision. However, existing works m…

    Submitted 6 March, 2022; originally announced March 2022.

    Comments: Under Review

  12. Supervised Graph Contrastive Pretraining for Text Classification

    Authors: Samujjwal Ghosh, Subhadeep Maji, Maunendra Sankar Desarkar

    Abstract: Contrastive pretraining techniques for text classification have been largely studied in an unsupervised setting. However, oftentimes labeled data from related tasks which share label semantics with the current task is available. We hypothesize that using this labeled data effectively can lead to better generalization on the current task. In this paper, we propose a novel way to effectively utilize labeled…

    Submitted 21 December, 2021; originally announced December 2021.

    Comments: A condensed version of this paper has been accepted to ACM SAC'22. DOI: https://doi.org/10.1145/3477314.3507194

  13. arXiv:2106.01597  [pdf, other]

    cs.CL cs.AI cs.LG

    ZmBART: An Unsupervised Cross-lingual Transfer Framework for Language Generation

    Authors: Kaushal Kumar Maurya, Maunendra Sankar Desarkar, Yoshinobu Kano, Kumari Deepshikha

    Abstract: Despite the recent advancement in NLP research, cross-lingual transfer for natural language generation is relatively understudied. In this work, we transfer supervision from high resource language (HRL) to multiple low-resource languages (LRLs) for natural language generation (NLG). We consider four NLG tasks (text summarization, question generation, news headline generation, and distractor genera…

    Submitted 3 June, 2021; originally announced June 2021.

    Comments: Accepted in Findings of ACL-IJCNLP 2021

  14. arXiv:2104.01436  [pdf, other]

    cs.CL cs.LG

    Unsupervised Domain Adaptation with Global and Local Graph Neural Networks in Limited Labeled Data Scenario: Application to Disaster Management

    Authors: Samujjwal Ghosh, Subhadeep Maji, Maunendra Sankar Desarkar

    Abstract: Identification and categorization of social media posts generated during disasters are crucial to reduce the sufferings of the affected people. However, lack of labeled data is a significant bottleneck in learning an effective categorization system for a disaster. This motivates us to study the problem as unsupervised domain adaptation (UDA) between a previous disaster with labeled data (source) a…

    Submitted 3 April, 2021; originally announced April 2021.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  15. arXiv:2101.06004  [pdf, other]

    cs.CL

    Walk in Wild: An Ensemble Approach for Hostility Detection in Hindi Posts

    Authors: Chander Shekhar, Bhavya Bagla, Kaushal Kumar Maurya, Maunendra Sankar Desarkar

    Abstract: As the reach of the internet increases, pejorative terms have started flooding social media platforms. This leads to the necessity of identifying hostile content on social media platforms. Identification of hostile content in low-resource languages like Hindi poses different challenges due to its diverse syntactic structure compared to English. In this paper, we develop a simple ensemble based mo…

    Submitted 15 January, 2021; originally announced January 2021.

  16. arXiv:2101.04998  [pdf]

    cs.CL

    Coarse and Fine-Grained Hostility Detection in Hindi Posts using Fine Tuned Multilingual Embeddings

    Authors: Arkadipta De, Venkatesh E, Kaushal Kumar Maurya, Maunendra Sankar Desarkar

    Abstract: Due to the wide adoption of social media platforms like Facebook, Twitter, etc., there is an emerging need to detect online posts that can go against the community acceptance standards. The hostility detection task has been well explored for resource-rich languages like English, but is unexplored for resource-constrained languages like Hindi due to the unavailability of large suitable data. We v…

    Submitted 13 January, 2021; originally announced January 2021.

    Comments: Accepted at Constrain 2021 Workshop in AAAI 2021 Conference

  17. HAP-SAP: Semantic Annotation in LBSNs using Latent Spatio-Temporal Hawkes Process

    Authors: Manisha Dubey, P. K. Srijith, Maunendra Sankar Desarkar

    Abstract: The prevalence of location-based social networks (LBSNs) has eased the understanding of human mobility patterns. Knowledge of human dynamics can aid in various ways like urban planning, managing traffic congestion, personalized recommendation etc. These dynamics are influenced by factors like social impact, periodicity in mobility, spatial proximity, influence among users and semantic categories e…

    Submitted 8 September, 2020; v1 submitted 5 September, 2020; originally announced September 2020.

    Comments: 11 pages

  18. arXiv:1909.06228  [pdf, other]

    cs.PL cs.LG cs.NE cs.SE

    IR2Vec: LLVM IR based Scalable Program Embeddings

    Authors: S. VenkataKeerthy, Rohit Aggarwal, Shalini Jain, Maunendra Sankar Desarkar, Ramakrishna Upadrasta, Y. N. Srikant

    Abstract: We propose IR2Vec, a Concise and Scalable encoding infrastructure to represent programs as a distributed embedding in continuous space. This distributed embedding is obtained by combining representation learning methods with flow information to capture the syntax as well as the semantics of the input programs. As our infrastructure is based on the Intermediate Representation (IR) of the source cod…

    Submitted 1 September, 2020; v1 submitted 13 September, 2019; originally announced September 2019.

    Comments: Accepted in ACM TACO