Zum Hauptinhalt springen

Showing 1–7 of 7 results for author: Saluja, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2402.15021  [pdf, other

    cs.CV cs.CL

    CLoVe: Encoding Compositional Language in Contrastive Vision-Language Models

    Authors: Santiago Castro, Amir Ziai, Avneesh Saluja, Zhuoning Yuan, Rada Mihalcea

    Abstract: Recent years have witnessed a significant increase in the performance of Vision and Language tasks. Foundational Vision-Language Models (VLMs), such as CLIP, have been leveraged in multiple settings and demonstrated remarkable performance across several tasks. Such models excel at object-centric recognition yet learn text representations that seem invariant to word order, failing to compose known… ▽ More

    Submitted 29 February, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

  2. arXiv:2005.11197  [pdf, other

    cs.CL

    Simplify-then-Translate: Automatic Preprocessing for Black-Box Machine Translation

    Authors: Sneha Mehta, Bahareh Azarnoush, Boris Chen, Avneesh Saluja, Vinith Misra, Ballav Bihani, Ritwik Kumar

    Abstract: Black-box machine translation systems have proven incredibly useful for a variety of applications yet by design are hard to adapt, tune to a specific domain, or build on top of. In this work, we introduce a method to improve such systems via automatic pre-processing (APP) using sentence simplification. We first propose a method to automatically generate a large in-domain paraphrase corpus through… ▽ More

    Submitted 27 May, 2020; v1 submitted 22 May, 2020; originally announced May 2020.

  3. arXiv:2004.14532  [pdf, other

    cs.CL

    Hierarchical Encoders for Modeling and Interpreting Screenplays

    Authors: Gayatri Bhat, Avneesh Saluja, Melody Dye, Jan Florjanczyk

    Abstract: While natural language understanding of long-form documents is still an open challenge, such documents often contain structural information that can inform the design of models for encoding them. Movie scripts are an example of such richly structured text - scripts are segmented into scenes, which are further decomposed into dialogue and descriptive components. In this work, we propose a neural ar… ▽ More

    Submitted 29 April, 2020; originally announced April 2020.

    Comments: 12 pages, including references and appendix

  4. arXiv:1804.08666  [pdf, other

    cs.CL

    Using Aspect Extraction Approaches to Generate Review Summaries and User Profiles

    Authors: Christopher Mitcheltree, Skyler Wharton, Avneesh Saluja

    Abstract: Reviews of products or services on Internet marketplace websites contain a rich amount of information. Users often wish to survey reviews or review snippets from the perspective of a certain aspect, which has resulted in a large body of work on aspect identification and extraction from such corpora. In this work, we evaluate a newly-proposed neural model for aspect extraction on two practical task… ▽ More

    Submitted 3 June, 2020; v1 submitted 23 April, 2018; originally announced April 2018.

    Comments: Equal contribution from first two authors. Accepted for publication in the NAACL 2018 Industry Track

  5. arXiv:1801.10293  [pdf, other

    cs.CL

    Paraphrase-Supervised Models of Compositionality

    Authors: Avneesh Saluja, Chris Dyer, Jean-David Ruvini

    Abstract: Compositional vector space models of meaning promise new solutions to stubborn language understanding problems. This paper makes two contributions toward this end: (i) it uses automatically-extracted paraphrase examples as a source of supervision for training compositional models, replacing previous work which relied on manual annotations used for the same purpose, and (ii) develops a context-awar… ▽ More

    Submitted 30 January, 2018; originally announced January 2018.

    Comments: This paper was originally submitted for review at NAACL 2015 and ACL 2015. This version maintains the original author affiliation "as-is" (as of when the work was done)

  6. arXiv:1401.3413  [pdf, other

    cs.LG cs.IR

    Infinite Mixed Membership Matrix Factorization

    Authors: Avneesh Saluja, Mahdi Pakdaman, Dongzhen Piao, Ankur P. Parikh

    Abstract: Rating and recommendation systems have become a popular application area for applying a suite of machine learning techniques. Current approaches rely primarily on probabilistic interpretations and extensions of matrix factorization, which factorizes a user-item ratings matrix into latent user and item vectors. Most of these methods fail to model significant variations in item ratings from otherwis… ▽ More

    Submitted 14 January, 2014; originally announced January 2014.

    Comments: For ICDM 2013 Workshop Proceedings

  7. arXiv:1312.7077  [pdf, other

    cs.CL cs.LG stat.ML

    Language Modeling with Power Low Rank Ensembles

    Authors: Ankur P. Parikh, Avneesh Saluja, Chris Dyer, Eric P. Xing

    Abstract: We present power low rank ensembles (PLRE), a flexible framework for n-gram language modeling where ensembles of low rank matrices and tensors are used to obtain smoothed probability estimates of words in context. Our method can be understood as a generalization of n-gram modeling to non-integer n, and includes standard techniques such as absolute discounting and Kneser-Ney smoothing as special ca… ▽ More

    Submitted 3 October, 2014; v1 submitted 26 December, 2013; originally announced December 2013.