Zum Hauptinhalt springen

Showing 1–8 of 8 results for author: Wadhawan, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2306.01201  [pdf, other

    cs.CL cs.LG cs.SD eess.AS

    Learning When to Speak: Latency and Quality Trade-offs for Simultaneous Speech-to-Speech Translation with Offline Models

    Authors: Liam Dugan, Anshul Wadhawan, Kyle Spence, Chris Callison-Burch, Morgan McGuire, Victor Zordan

    Abstract: Recent work in speech-to-speech translation (S2ST) has focused primarily on offline settings, where the full input utterance is available before any output is given. This, however, is not reasonable in many real-world scenarios. In latency-sensitive applications, rather than waiting for the full utterance, translations should be spoken as soon as the information in the input is present. In this wo… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: To appear at INTERSPEECH 2023

  2. arXiv:2103.01679  [pdf, other

    cs.CL

    AraBERT and Farasa Segmentation Based Approach For Sarcasm and Sentiment Detection in Arabic Tweets

    Authors: Anshul Wadhawan

    Abstract: This paper presents our strategy to tackle the EACL WANLP-2021 Shared Task 2: Sarcasm and Sentiment Detection. One of the subtasks aims at developing a system that identifies whether a given Arabic tweet is sarcastic in nature or not, while the other aims to identify the sentiment of the Arabic tweet. We approach the task in two steps. The first step involves pre processing the provided ArSarcasm-… ▽ More

    Submitted 2 March, 2021; originally announced March 2021.

  3. arXiv:2102.12082  [pdf, other

    cs.CL

    Hopeful_Men@LT-EDI-EACL2021: Hope Speech Detection Using Indic Transliteration and Transformers

    Authors: Ishan Sanjeev Upadhyay, Nikhil E, Anshul Wadhawan, Radhika Mamidi

    Abstract: This paper aims to describe the approach we used to detect hope speech in the HopeEDI dataset. We experimented with two approaches. In the first approach, we used contextual embeddings to train classifiers using logistic regression, random forest, SVM, and LSTM based models.The second approach involved using a majority voting ensemble of 11 models which were obtained by fine-tuning pre-trained tra… ▽ More

    Submitted 24 February, 2021; v1 submitted 24 February, 2021; originally announced February 2021.

  4. arXiv:2102.09943  [pdf, other

    cs.CL

    Towards Emotion Recognition in Hindi-English Code-Mixed Data: A Transformer Based Approach

    Authors: Anshul Wadhawan, Akshita Aggarwal

    Abstract: In the last few years, emotion detection in social-media text has become a popular problem due to its wide ranging application in better understanding the consumers, in psychology, in aiding human interaction with computers, designing smart systems etc. Because of the availability of huge amounts of data from social-media, which is regularly used for expressing sentiments and opinions, this proble… ▽ More

    Submitted 28 February, 2021; v1 submitted 19 February, 2021; originally announced February 2021.

  5. arXiv:2102.09749  [pdf, other

    cs.CL

    Dialect Identification in Nuanced Arabic Tweets Using Farasa Segmentation and AraBERT

    Authors: Anshul Wadhawan

    Abstract: This paper presents our approach to address the EACL WANLP-2021 Shared Task 1: Nuanced Arabic Dialect Identification (NADI). The task is aimed at developing a system that identifies the geographical location(country/province) from where an Arabic tweet in the form of modern standard Arabic or dialect comes from. We solve the task in two parts. The first part involves pre-processing the provided da… ▽ More

    Submitted 22 February, 2021; v1 submitted 19 February, 2021; originally announced February 2021.

  6. PublishInCovid19 at WNUT 2020 Shared Task-1: Entity Recognition in Wet Lab Protocols using Structured Learning Ensemble and Contextualised Embeddings

    Authors: Janvijay Singh, Anshul Wadhawan

    Abstract: In this paper, we describe the approach that we employed to address the task of Entity Recognition over Wet Lab Protocols -- a shared task in EMNLP WNUT-2020 Workshop. Our approach is composed of two phases. In the first phase, we experiment with various contextualised word embeddings (like Flair, BERT-based) and a BiLSTM-CRF model to arrive at the best-performing architecture. In the second phase… ▽ More

    Submitted 15 October, 2020; v1 submitted 5 October, 2020; originally announced October 2020.

  7. "Did you really mean what you said?" : Sarcasm Detection in Hindi-English Code-Mixed Data using Bilingual Word Embeddings

    Authors: Akshita Aggarwal, Anshul Wadhawan, Anshima Chaudhary, Kavita Maurya

    Abstract: With the increased use of social media platforms by people across the world, many new interesting NLP problems have come into existence. One such being the detection of sarcasm in the social media texts. We present a corpus of tweets for training custom word embeddings and a Hinglish dataset labelled for sarcasm detection. We propose a deep learning based approach to address the issue of sarcasm d… ▽ More

    Submitted 15 October, 2020; v1 submitted 1 October, 2020; originally announced October 2020.

  8. Phonemer at WNUT-2020 Task 2: Sequence Classification Using COVID Twitter BERT and Bagging Ensemble Technique based on Plurality Voting

    Authors: Anshul Wadhawan

    Abstract: This paper presents the approach that we employed to tackle the EMNLP WNUT-2020 Shared Task 2 : Identification of informative COVID-19 English Tweets. The task is to develop a system that automatically identifies whether an English Tweet related to the novel coronavirus (COVID-19) is informative or not. We solve the task in three stages. The first stage involves pre-processing the dataset by filte… ▽ More

    Submitted 15 October, 2020; v1 submitted 1 October, 2020; originally announced October 2020.