Zum Hauptinhalt springen

Showing 1–9 of 9 results for author: Bahuleyan, H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2204.01171  [pdf, other

    cs.CL cs.AI cs.LG

    Why Exposure Bias Matters: An Imitation Learning Perspective of Error Accumulation in Language Generation

    Authors: Kushal Arora, Layla El Asri, Hareesh Bahuleyan, Jackie Chi Kit Cheung

    Abstract: Current language generation models suffer from issues such as repetition, incoherence, and hallucinations. An often-repeated hypothesis is that this brittleness of generation models is caused by the training and the generation procedure mismatch, also referred to as exposure bias. In this paper, we verify this hypothesis by analyzing exposure bias from an imitation learning perspective. We show th… ▽ More

    Submitted 9 January, 2023; v1 submitted 3 April, 2022; originally announced April 2022.

    Comments: Accepted in Findings of ACL 2022. v2: Equation 7 updated, typo fixes

  2. arXiv:2010.07665  [pdf, other

    cs.CL

    Diverse Keyphrase Generation with Neural Unlikelihood Training

    Authors: Hareesh Bahuleyan, Layla El Asri

    Abstract: In this paper, we study sequence-to-sequence (S2S) keyphrase generation models from the perspective of diversity. Recent advances in neural natural language generation have made possible remarkable progress on the task of keyphrase generation, demonstrated through improvements on quality metrics such as F1-score. However, the importance of diversity in keyphrase generation has been largely ignored… ▽ More

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: Accepted to COLING 2020

  3. arXiv:2004.10809  [pdf, other

    cs.CL

    Polarized-VAE: Proximity Based Disentangled Representation Learning for Text Generation

    Authors: Vikash Balasubramanian, Ivan Kobyzev, Hareesh Bahuleyan, Ilya Shapiro, Olga Vechtomova

    Abstract: Learning disentangled representations of real-world data is a challenging open problem. Most previous methods have focused on either supervised approaches which use attribute labels or unsupervised approaches that manipulate the factorization in the latent space of models such as the variational autoencoder (VAE) by training with task-specific losses. In this work, we propose polarized-VAE, an app… ▽ More

    Submitted 24 January, 2021; v1 submitted 22 April, 2020; originally announced April 2020.

    Comments: Camera Ready for EACL 2021

  4. arXiv:1812.08318  [pdf

    cs.CL cs.SD eess.AS

    Generating lyrics with variational autoencoder and multi-modal artist embeddings

    Authors: Olga Vechtomova, Hareesh Bahuleyan, Amirpasha Ghabussi, Vineet John

    Abstract: We present a system for generating song lyrics lines conditioned on the style of a specified artist. The system uses a variational autoencoder with artist embeddings. We propose the pre-training of artist embeddings with the representations learned by a CNN classifier, which is trained to predict artists based on MEL spectrograms of their song clips. This work is the first step towards combining a… ▽ More

    Submitted 19 December, 2018; originally announced December 2018.

    Comments: 5 pages, 5 tables, 1 figure

  5. arXiv:1808.09012  [pdf, other

    cs.CL

    Natural Language Generation with Neural Variational Models

    Authors: Hareesh Bahuleyan

    Abstract: In this thesis, we explore the use of deep neural networks for generation of natural language. Specifically, we implement two sequence-to-sequence neural variational models - variational autoencoders (VAE) and variational encoder-decoders (VED). VAEs for text generation are difficult to train due to issues associated with the Kullback-Leibler (KL) divergence term of the loss function vanishing to… ▽ More

    Submitted 27 August, 2018; originally announced August 2018.

    Comments: Masters Thesis, University of Waterloo

  6. arXiv:1808.04339  [pdf, other

    cs.CL

    Disentangled Representation Learning for Non-Parallel Text Style Transfer

    Authors: Vineet John, Lili Mou, Hareesh Bahuleyan, Olga Vechtomova

    Abstract: This paper tackles the problem of disentangling the latent variables of style and content in language models. We propose a simple yet effective approach, which incorporates auxiliary multi-task and adversarial objectives, for label prediction and bag-of-words prediction, respectively. We show, both qualitatively and quantitatively, that the style and content are indeed disentangled in the latent s… ▽ More

    Submitted 10 September, 2018; v1 submitted 13 August, 2018; originally announced August 2018.

    Comments: 11 pages, 7 figures, 6 tables; Preliminary work rejected by EMNLP-18

    MSC Class: 68T50 ACM Class: I.2.7

  7. arXiv:1806.08462  [pdf, other

    cs.CL cs.LG stat.ML

    Stochastic Wasserstein Autoencoder for Probabilistic Sentence Generation

    Authors: Hareesh Bahuleyan, Lili Mou, Hao Zhou, Olga Vechtomova

    Abstract: The variational autoencoder (VAE) imposes a probabilistic distribution (typically Gaussian) on the latent space and penalizes the Kullback--Leibler (KL) divergence between the posterior and prior. In NLP, VAEs are extremely difficult to train due to the problem of KL collapsing to zero. One has to implement various heuristics such as KL weight annealing and word dropout in a carefully engineered m… ▽ More

    Submitted 12 April, 2019; v1 submitted 21 June, 2018; originally announced June 2018.

    Comments: Accepted by NAACL-HLT 2019

  8. arXiv:1804.01149  [pdf, other

    cs.SD eess.AS

    Music Genre Classification using Machine Learning Techniques

    Authors: Hareesh Bahuleyan

    Abstract: Categorizing music files according to their genre is a challenging task in the area of music information retrieval (MIR). In this study, we compare the performance of two classes of models. The first is a deep learning approach wherein a CNN model is trained end-to-end, to predict the genre label of an audio signal, solely using its spectrogram. The second approach utilizes hand-crafted features,… ▽ More

    Submitted 3 April, 2018; originally announced April 2018.

    Comments: 12 Pages, 6 figures, 4 tables

  9. arXiv:1712.08207  [pdf, other

    cs.CL

    Variational Attention for Sequence-to-Sequence Models

    Authors: Hareesh Bahuleyan, Lili Mou, Olga Vechtomova, Pascal Poupart

    Abstract: The variational encoder-decoder (VED) encodes source information as a set of random variables using a neural network, which in turn is decoded into target data using another neural network. In natural language processing, sequence-to-sequence (Seq2Seq) models typically serve as encoder-decoder networks. When combined with a traditional (deterministic) attention mechanism, the variational latent sp… ▽ More

    Submitted 21 June, 2018; v1 submitted 21 December, 2017; originally announced December 2017.

    Comments: In Proceedings of COLING 2018. Also accepted by TADGM Workshop@ICML 2018 for presentation