Showing 1–11 of 11 results for author: Mathur, L

  1. arXiv:2407.03418  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV

    HEMM: Holistic Evaluation of Multimodal Foundation Models

    Authors: Paul Pu Liang, Akshay Goindani, Talha Chafekar, Leena Mathur, Haofei Yu, Ruslan Salakhutdinov, Louis-Philippe Morency

    Abstract: Multimodal foundation models that can holistically process text alongside images, video, audio, and other sensory modalities are increasingly used in a variety of real-world applications. However, it is challenging to characterize and study progress in multimodal foundation models, given the range of possible modeling decisions, tasks, and domains. In this paper, we introduce Holistic Evaluation o…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: Code available at https://github.com/pliang279/HEMM

  2. arXiv:2404.11023  [pdf, other]

    cs.HC cs.CL cs.LG

    Advancing Social Intelligence in AI Agents: Technical Challenges and Open Questions

    Authors: Leena Mathur, Paul Pu Liang, Louis-Philippe Morency

    Abstract: Building socially-intelligent AI agents (Social-AI) is a multidisciplinary, multimodal research goal that involves creating agents that can sense, perceive, reason about, learn from, and respond to affect, behavior, and cognition of other agents (human or artificial). Progress towards Social-AI has accelerated in the past decade across several computing communities, including natural language proc…

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: Position Paper, Under Review, 19 pages, 2 figures

  3. arXiv:2310.11667  [pdf, other]

    cs.AI cs.CL cs.LG

    SOTOPIA: Interactive Evaluation for Social Intelligence in Language Agents

    Authors: Xuhui Zhou, Hao Zhu, Leena Mathur, Ruohong Zhang, Haofei Yu, Zhengyang Qi, Louis-Philippe Morency, Yonatan Bisk, Daniel Fried, Graham Neubig, Maarten Sap

    Abstract: Humans are social beings; we pursue social goals in our daily interactions, which is a crucial aspect of social intelligence. Yet, AI systems' abilities in this realm remain elusive. We present SOTOPIA, an open-ended environment to simulate complex social interactions between artificial agents and evaluate their social intelligence. In our environment, agents role-play and interact under a wide va…

    Submitted 22 March, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: Preprint, 43 pages. The first two authors contributed equally

  4. arXiv:2305.14577  [pdf, other]

    cs.LG cs.CL

    Difference-Masking: Choosing What to Mask in Continued Pretraining

    Authors: Alex Wilf, Syeda Nahida Akter, Leena Mathur, Paul Pu Liang, Sheryl Mathew, Mengrou Shou, Eric Nyberg, Louis-Philippe Morency

    Abstract: The self-supervised objective of masking-and-predicting has led to promising performance gains on a variety of downstream tasks. However, while most approaches randomly mask tokens, there is strong intuition that deciding what to mask can substantially improve learning outcomes. We investigate this in the continued pretraining setting, in which pretrained models continue to pretrain on domain-specific…

    Submitted 17 October, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  5. arXiv:2305.10827  [pdf, other]

    cs.HC cs.AI

    Expanding the Role of Affective Phenomena in Multimodal Interaction Research

    Authors: Leena Mathur, Maja J Matarić, Louis-Philippe Morency

    Abstract: In recent decades, the field of affective computing has made substantial progress in advancing the ability of AI systems to recognize and express affective phenomena, such as affect and emotions, during human-human and human-machine interactions. This paper describes our examination of research at the intersection of multimodal interaction and affective computing, with the objective of observing t…

    Submitted 18 May, 2023; originally announced May 2023.

    Comments: 4 pages, 4 figures

  6. arXiv:2208.00344  [pdf, other]

    cs.CV cs.HC cs.LG

    Towards Intercultural Affect Recognition: Audio-Visual Affect Recognition in the Wild Across Six Cultures

    Authors: Leena Mathur, Ralph Adolphs, Maja J Matarić

    Abstract: In our multicultural world, affect-aware AI systems that support humans need the ability to perceive affect across variations in emotion expression patterns across cultures. These systems must perform well in cultural contexts without annotated affect datasets available for training models. A standard assumption in affective computing is that affect recognition models trained and used within the s…

    Submitted 31 October, 2022; v1 submitted 30 July, 2022; originally announced August 2022.

    Comments: Accepted at IEEE International Conference on Automatic Face and Gesture Recognition (FG 2023), publication and presentation at refereed IEEE workshop

  7. arXiv:2108.12531  [pdf, other]

    eess.AS cs.CL cs.LG

    Speech Representations and Phoneme Classification for Preserving the Endangered Language of Ladin

    Authors: Zane Durante, Leena Mathur, Eric Ye, Sichong Zhao, Tejas Ramdas, Khalil Iskarous

    Abstract: A vast majority of the world's 7,000 spoken languages are predicted to become extinct within this century, including the endangered language of Ladin from the Italian Alps. Linguists who work to preserve a language's phonetic and phonological structure can spend hours transcribing each minute of speech from native speakers. To address this problem in the context of Ladin, our paper presents the fi…

    Submitted 27 August, 2021; originally announced August 2021.

    Comments: Accepted to ISCA MLSLP 2021 (held with Interspeech 2021)

  8. arXiv:2108.07897  [pdf, other]

    cs.CV cs.LG

    Affect-Aware Deep Belief Network Representations for Multimodal Unsupervised Deception Detection

    Authors: Leena Mathur, Maja J Matarić

    Abstract: Automated systems that detect the social behavior of deception can enhance human well-being across medical, social work, and legal domains. Labeled datasets to train supervised deception detection models can rarely be collected for real-world, high-stakes contexts. To address this challenge, we propose the first unsupervised approach for detecting real-world, high-stakes deception in videos withou…

    Submitted 8 November, 2021; v1 submitted 17 August, 2021; originally announced August 2021.

    Comments: Accepted at IEEE International Conference on Automatic Face and Gesture Recognition (FG 2021), copyright 2021 IEEE

  9. arXiv:2107.14345  [pdf, other]

    cs.RO cs.HC cs.LG

    Modeling User Empathy Elicited by a Robot Storyteller

    Authors: Leena Mathur, Micol Spitale, Hao Xi, Jieyun Li, Maja J Matarić

    Abstract: Virtual and robotic agents capable of perceiving human empathy have the potential to participate in engaging and meaningful human-machine interactions that support human well-being. Prior research in computational empathy has focused on designing empathic agents that use verbal and nonverbal behaviors to simulate empathy and attempt to elicit empathic responses from humans. The challenge of develo…

    Submitted 29 July, 2021; originally announced July 2021.

    Comments: Accepted for publication and oral presentation at the International Conference on Affective Computing and Intelligent Interaction (ACII 2021)

  10. Unsupervised Audio-Visual Subspace Alignment for High-Stakes Deception Detection

    Authors: Leena Mathur, Maja J Matarić

    Abstract: Automated systems that detect deception in high-stakes situations can enhance societal well-being across medical, social work, and legal domains. Existing models for detecting high-stakes deception in videos have been supervised, but labeled datasets to train models can rarely be collected for most real-world applications. To address this problem, we propose the first multimodal unsupervised trans…

    Submitted 6 February, 2021; originally announced February 2021.

    Comments: Accepted at ICASSP 2021. © 2021 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of copyrighted components of this work

  11. arXiv:2008.13369  [pdf]

    cs.CV cs.HC cs.LG

    Introducing Representations of Facial Affect in Automated Multimodal Deception Detection

    Authors: Leena Mathur, Maja J Matarić

    Abstract: Automated deception detection systems can enhance health, justice, and security in society by helping humans detect deceivers in high-stakes situations across medical and legal domains, among others. This paper presents a novel analysis of the discriminative power of dimensional representations of facial affect for automated deception detection, along with interpretable features from visual, vocal…

    Submitted 31 August, 2020; originally announced August 2020.

    Comments: 10 pages, Accepted at ACM International Conference on Multimodal Interaction (ICMI), October 2020

    Journal ref: Proceedings of the 2020 International Conference on Multimodal Interaction (ICMI)