Showing 101–110 of 110 results for author: Mihalcea, R

  1. arXiv:1810.02508  [pdf, other]

    cs.CL

    MELD: A Multimodal Multi-Party Dataset for Emotion Recognition in Conversations

    Authors: Soujanya Poria, Devamanyu Hazarika, Navonil Majumder, Gautam Naik, Erik Cambria, Rada Mihalcea

    Abstract: Emotion recognition in conversations is a challenging task that has recently gained popularity due to its potential applications. Until now, however, a large-scale multimodal multi-party emotional conversational database containing more than two speakers per dialogue was missing. Thus, we propose the Multimodal EmotionLines Dataset (MELD), an extension and enhancement of EmotionLines. MELD contain…

    Submitted 4 June, 2019; v1 submitted 4 October, 2018; originally announced October 2018.

    Comments: https://affective-meld.github.io

  2. arXiv:1809.08761  [pdf, other]

    cs.CL cs.CV

    Speaker Naming in Movies

    Authors: Mahmoud Azab, Mingzhe Wang, Max Smith, Noriyuki Kojima, Jia Deng, Rada Mihalcea

    Abstract: We propose a new model for speaker naming in movies that leverages visual, textual, and acoustic modalities in a unified optimization framework. To evaluate the performance of our model, we introduce a new dataset consisting of six episodes of the Big Bang Theory TV show and eighteen full movies covering different genres. Our experiments show that our multimodal model significantly outperforms se…

    Submitted 24 September, 2018; originally announced September 2018.

  3. Multi-Label Transfer Learning for Multi-Relational Semantic Similarity

    Authors: Li Zhang, Steven R. Wilson, Rada Mihalcea

    Abstract: Multi-relational semantic similarity datasets define the semantic relations between two short texts in multiple ways, e.g., similarity, relatedness, and so on. Yet, all the systems to date designed to capture such relations target one relation at a time. We propose a multi-label transfer learning approach based on LSTM to make predictions for several relations simultaneously and aggregate the loss…

    Submitted 10 April, 2019; v1 submitted 31 May, 2018; originally announced May 2018.

    Comments: Accepted to *SEM 2019

    Journal ref: Proceedings of the Eighth Joint Conference on Lexical and Computational Semantics (*SEM 2019) (2019) 44-50

  4. arXiv:1805.06413  [pdf, other]

    cs.CL

    CASCADE: Contextual Sarcasm Detection in Online Discussion Forums

    Authors: Devamanyu Hazarika, Soujanya Poria, Sruthi Gorantla, Erik Cambria, Roger Zimmermann, Rada Mihalcea

    Abstract: The literature in automated sarcasm detection has mainly focused on lexical, syntactic and semantic-level analysis of text. However, a sarcastic sentence can be expressed with contextual presumptions, background and commonsense knowledge. In this paper, we propose CASCADE (a ContextuAl SarCasm DEtector) that adopts a hybrid approach of both content and context-driven modeling for sarcasm detection…

    Submitted 16 May, 2018; originally announced May 2018.

    Comments: Accepted in COLING 2018

  5. Factors Influencing the Surprising Instability of Word Embeddings

    Authors: Laura Wendlandt, Jonathan K. Kummerfeld, Rada Mihalcea

    Abstract: Despite the recent popularity of word embedding methods, there is only a small body of work exploring the limitations of these representations. In this paper, we consider one aspect of embedding spaces, namely their stability. We show that even relatively high frequency words (100-200 occurrences) are often unstable. We provide empirical evidence for how various factors contribute to the stability…

    Submitted 25 April, 2018; originally announced April 2018.

    Comments: NAACL HLT 2018

    Journal ref: NAACL-HLT (2018) 2092-2102

  6. arXiv:1804.07835  [pdf, other]

    cs.CL

    Direct Network Transfer: Transfer Learning of Sentence Embeddings for Semantic Similarity

    Authors: Li Zhang, Steven R. Wilson, Rada Mihalcea

    Abstract: Sentence encoders, which produce sentence embeddings using neural networks, are typically evaluated by how well they transfer to downstream tasks. This includes semantic similarity, an important task in natural language understanding. Although there has been much work dedicated to building sentence encoders, the accompanying transfer learning techniques have received relatively little attention. I…

    Submitted 31 October, 2018; v1 submitted 20 April, 2018; originally announced April 2018.

  7. arXiv:1708.07104  [pdf, ps, other]

    cs.CL

    Automatic Detection of Fake News

    Authors: Verónica Pérez-Rosas, Bennett Kleinberg, Alexandra Lefevre, Rada Mihalcea

    Abstract: The proliferation of misleading information in everyday access media outlets such as social media feeds, news blogs, and online newspapers has made it challenging to identify trustworthy news sources, thus increasing the need for computational tools able to provide insights into the reliability of online content. In this paper, we focus on the automatic identification of fake content in online ne…

    Submitted 23 August, 2017; originally announced August 2017.

  8. arXiv:1612.08205  [pdf, ps, other]

    cs.CL cs.SI

    Predicting the Industry of Users on Social Media

    Authors: Konstantinos Pappas, Rada Mihalcea

    Abstract: Automatic profiling of social media users is an important task for supporting a multitude of downstream applications. While a number of studies have used social media content to extract and study collective social attributes, there is a lack of substantial research that addresses the detection of a user's industry. We frame this task as classification using both feature engineering and ensemble le…

    Submitted 24 December, 2016; originally announced December 2016.

    Comments: 8 pages, 3 figures, 12 tables

  9. arXiv:1612.06685  [pdf, other]

    cs.CL

    Stateology: State-Level Interactive Charting of Language, Feelings, and Values

    Authors: Konstantinos Pappas, Steven Wilson, Rada Mihalcea

    Abstract: People's personality and motivations are manifest in their everyday language usage. With the emergence of social media, ample examples of such usage are procurable. In this paper, we aim to analyze the vocabulary used by close to 200,000 Blogger users in the U.S. with the purpose of geographically portraying various demographic, linguistic, and psychological dimensions at the state level. We give…

    Submitted 20 December, 2016; originally announced December 2016.

    Comments: 5 pages, 5 figures

  10. arXiv:1311.2978  [pdf, other]

    cs.CL

    Authorship Attribution Using Word Network Features

    Authors: Shibamouli Lahiri, Rada Mihalcea

    Abstract: In this paper, we explore a set of novel features for authorship attribution of documents. These features are derived from a word network representation of natural language text. As has been noted in previous studies, natural language tends to show complex network structure at word level, with low degrees of separation and scale-free (power law) degree distribution. There has also been work on aut…

    Submitted 12 November, 2013; originally announced November 2013.