Zum Hauptinhalt springen

Showing 1–50 of 106 results for author: Fernandez, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.09226  [pdf, other

    cs.IR

    FabricQA-Extractor: A Question Answering System to Extract Information from Documents using Natural Language Questions

    Authors: Qiming Wang, Raul Castro Fernandez

    Abstract: Reading comprehension models answer questions posed in natural language when provided with a short passage of text. They present an opportunity to address a long-standing challenge in data management: the extraction of structured data from unstructured text. Consequently, several approaches are using these models to perform information extraction. However, these modern approaches leave an opportun… ▽ More

    Submitted 17 August, 2024; originally announced August 2024.

  2. arXiv:2408.04092  [pdf, other

    cs.DB

    Programmable Dataflows: Abstraction and Programming Model for Data Sharing

    Authors: Siyuan Xia, Chris Zhu, Tapan Srivastava, Bridget Fahey, Raul Castro Fernandez

    Abstract: Data sharing is central to a wide variety of applications such as fraud detection, ad matching, and research. The lack of data sharing abstractions makes the solution to each data sharing problem bespoke and cost-intensive, hampering value generation. In this paper, we first introduce a data sharing model to represent every data sharing problem with a sequence of dataflows. From the model, we dist… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

  3. arXiv:2408.01580  [pdf, other

    cs.DB

    Controlling Dataflows with a Bolt-on Data Escrow

    Authors: Zhiru Zhu, Raul Castro Fernandez

    Abstract: The data-driven economy has created tremendous value in our society. Individuals share their data with platforms in exchange for services such as search, social networks, and health recommendations. Platforms use the data to provide those services and create other revenue-generating opportunities, e.g., selling the data to data brokers. With the ever-expanding data economy comes the growing concer… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

  4. arXiv:2408.00253  [pdf, other

    cs.DB

    Saving Money for Analytical Workloads in the Cloud

    Authors: Tapan Srivastava, Raul Castro Fernandez

    Abstract: As users migrate their analytical workloads to cloud databases, it is becoming just as important to reduce monetary costs as it is to optimize query runtime. In the cloud, a query is billed based on either its compute time or the amount of data it processes. We observe that analytical queries are either compute- or IO-bound and each query type executes cheaper in a different pricing model. We expl… ▽ More

    Submitted 31 July, 2024; originally announced August 2024.

    Comments: 12 pages; VLDB 2024

  5. arXiv:2407.17914  [pdf, other

    cs.CL

    Modelling Multimodal Integration in Human Concept Processing with Vision-and-Language Models

    Authors: Anna Bavaresco, Marianne de Heer Kloots, Sandro Pezzelle, Raquel Fernández

    Abstract: Representations from deep neural networks (DNNs) have proven remarkably predictive of neural activity involved in both visual and linguistic processing. Despite these successes, most studies to date concern unimodal DNNs, encoding either visual or textual input but not both. Yet, there is growing evidence that human meaning representations integrate linguistic and sensory-motor information. Here w… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

  6. arXiv:2407.04559  [pdf, other

    cs.CL cs.AI cs.CV cs.LG

    Not (yet) the whole story: Evaluating Visual Storytelling Requires More than Measuring Coherence, Grounding, and Repetition

    Authors: Aditya K Surikuchi, Raquel Fernández, Sandro Pezzelle

    Abstract: Visual storytelling consists in generating a natural language story given a temporally ordered sequence of images. This task is not only challenging for models, but also very difficult to evaluate with automatic metrics since there is no consensus about what makes a story 'good'. In this paper, we introduce a novel method that measures story quality in terms of human likeness regarding three key a… ▽ More

    Submitted 29 August, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

  7. arXiv:2406.18403  [pdf, other

    cs.CL

    LLMs instead of Human Judges? A Large Scale Empirical Study across 20 NLP Evaluation Tasks

    Authors: Anna Bavaresco, Raffaella Bernardi, Leonardo Bertolazzi, Desmond Elliott, Raquel Fernández, Albert Gatt, Esam Ghaleb, Mario Giulianelli, Michael Hanna, Alexander Koller, André F. T. Martins, Philipp Mondorf, Vera Neplenbroek, Sandro Pezzelle, Barbara Plank, David Schlangen, Alessandro Suglia, Aditya K Surikuchi, Ece Takmaz, Alberto Testoni

    Abstract: There is an increasing trend towards evaluating NLP models with LLM-generated judgments instead of human judgments. In the absence of a comparison against human data, this raises concerns about the validity of these evaluations; in case they are conducted with proprietary models, this also raises concerns over reproducibility. We provide JUDGE-BENCH, a collection of 20 NLP datasets with human anno… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  8. arXiv:2406.13663  [pdf, other

    cs.CL cs.AI cs.LG

    Model Internals-based Answer Attribution for Trustworthy Retrieval-Augmented Generation

    Authors: Jirui Qi, Gabriele Sarti, Raquel Fernández, Arianna Bisazza

    Abstract: Ensuring the verifiability of model answers is a fundamental challenge for retrieval-augmented generation (RAG) in the question answering (QA) domain. Recently, self-citation prompting was proposed to make large language models (LLMs) generate citations to supporting documents along with their answers. However, self-citing LLMs often struggle to match the required format, refer to non-existent sou… ▽ More

    Submitted 1 July, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

    Comments: Under review. Code and data released at https://github.com/Betswish/MIRAGE

  9. arXiv:2406.07243  [pdf, other

    cs.CL

    MBBQ: A Dataset for Cross-Lingual Comparison of Stereotypes in Generative LLMs

    Authors: Vera Neplenbroek, Arianna Bisazza, Raquel Fernández

    Abstract: Generative large language models (LLMs) have been shown to exhibit harmful biases and stereotypes. While safety fine-tuning typically takes place in English, if at all, these models are being used by speakers of many different languages. There is existing evidence that the performance of these models is inconsistent across languages and that they discriminate based on demographic factors of the us… ▽ More

    Submitted 17 July, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted to COLM 2024

  10. arXiv:2406.05547  [pdf, other

    cs.SD cs.CL eess.AS

    Exploring the Benefits of Tokenization of Discrete Acoustic Units

    Authors: Avihu Dekel, Raul Fernandez

    Abstract: Tokenization algorithms that merge the units of a base vocabulary into larger, variable-rate units have become standard in natural language processing tasks. This idea, however, has been mostly overlooked when the vocabulary consists of phonemes or Discrete Acoustic Units (DAUs), an audio-based representation that is playing an increasingly important role due to the success of discrete language-mo… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

    Comments: Interspeech 2024

  11. arXiv:2405.20846  [pdf, other

    cs.CL cs.AI

    Don't Buy it! Reassessing the Ad Understanding Abilities of Contrastive Multimodal Models

    Authors: A. Bavaresco, A. Testoni, R. Fernández

    Abstract: Image-based advertisements are complex multimodal stimuli that often contain unusual visual elements and figurative language. Previous research on automatic ad understanding has reported impressive zero-shot accuracy of contrastive vision-and-language models (VLMs) on an ad-explanation retrieval task. Here, we examine the original task setup and show that contrastive VLMs can solve it by exploitin… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Accepted to the main conference ACL 2024

  12. arXiv:2405.08546  [pdf, other

    cs.CL

    Analysing Cross-Speaker Convergence in Face-to-Face Dialogue through the Lens of Automatically Detected Shared Linguistic Constructions

    Authors: Esam Ghaleb, Marlou Rasenberg, Wim Pouw, Ivan Toni, Judith Holler, Aslı Özyürek, Raquel Fernández

    Abstract: Conversation requires a substantial amount of coordination between dialogue participants, from managing turn taking to negotiating mutual understanding. Part of this coordination effort surfaces as the reuse of linguistic behaviour across speakers, a process often referred to as alignment. While the presence of linguistic alignment is well documented in the literature, several questions remain ope… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: Accepted for publication at the 46th Proceedings of the Annual Meeting of the Cognitive Science Society

  13. arXiv:2404.18798  [pdf, other

    cs.MA

    Multi-Agent Synchronization Tasks

    Authors: Rolando Fernandez, Garrett Warnell, Derrik E. Asher, Peter Stone

    Abstract: In multi-agent reinforcement learning (MARL), coordination plays a crucial role in enhancing agents' performance beyond what they could achieve through cooperation alone. The interdependence of agents' actions, coupled with the need for communication, leads to a domain where effective coordination is crucial. In this paper, we introduce and define $\textit{Multi-Agent Synchronization Tasks}$ (MSTs… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Adaptive Learning Agents Workshop at AAMAS 2024

  14. arXiv:2404.14952  [pdf, other

    cs.CV cs.AI

    Leveraging Speech for Gesture Detection in Multimodal Communication

    Authors: Esam Ghaleb, Ilya Burenko, Marlou Rasenberg, Wim Pouw, Ivan Toni, Peter Uhrig, Anna Wilson, Judith Holler, Aslı Özyürek, Raquel Fernández

    Abstract: Gestures are inherent to human interaction and often complement speech in face-to-face communication, forming a multimodal communication system. An important task in gesture analysis is detecting a gesture's beginning and end. Research on automatic gesture detection has primarily focused on visual and kinematic information to detect a limited set of isolated or silent gestures with low variability… ▽ More

    Submitted 23 April, 2024; originally announced April 2024.

  15. arXiv:2404.10250  [pdf, other

    cs.PL cs.HC cs.MM

    AniFrame: A Programming Language for 2D Drawing and Frame-Based Animation

    Authors: Mark Edward M. Gonzales, Hans Oswald A. Ibrahim, Elyssia Barrie H. Ong, Ryan Austin Fernandez

    Abstract: Creative coding is an experimentation-heavy activity that requires translating high-level visual ideas into code. However, most languages and libraries for creative coding may not be adequately intuitive for beginners. In this paper, we present AniFrame, a domain-specific language for drawing and animation. Designed for novice programmers, it (i) features animation-specific data types, operations,… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted for paper presentation at the 24th Philippine Computing Science Congress (PCSC 2024), held in Laguna, Philippines

    ACM Class: D.3.2; J.5

  16. arXiv:2403.11209  [pdf, other

    cs.CL cs.HC

    Creating an African American-Sounding TTS: Guidelines, Technical Challenges,and Surprising Evaluations

    Authors: Claudio Pinhanez, Raul Fernandez, Marcelo Grave, Julio Nogima, Ron Hoory

    Abstract: Representations of AI agents in user interfaces and robotics are predominantly White, not only in terms of facial and skin features, but also in the synthetic voices they use. In this paper we explore some unexpected challenges in the representation of race we found in the process of developing an U.S. English Text-to-Speech (TTS) system aimed to sound like an educated, professional, regional acce… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: Full version including appendixes

  17. arXiv:2403.08810  [pdf

    cs.NI cs.AI cs.AR cs.DC cs.IT cs.LG

    Comparison of edge computing methods in Internet of Things architectures for efficient estimation of indoor environmental parameters with Machine Learning

    Authors: Jose-Carlos Gamazo-Real, Raul Torres Fernandez, Adrian Murillo Armas

    Abstract: The large increase in the number of Internet of Things (IoT) devices have revolutionised the way data is processed, which added to the current trend from cloud to edge computing has resulted in the need for efficient and reliable data processing near the data sources using energy-efficient devices. Two methods based on low-cost edge-IoT architectures are proposed to implement lightweight Machine L… ▽ More

    Submitted 7 February, 2024; originally announced March 2024.

    Journal ref: Engineering Applications of Artificial Intelligence, 2023, vol. 126, Part D, no. 107149, pp. 1-27, ISSN 0952-1976

  18. arXiv:2402.16102  [pdf, other

    cs.CL

    Interpreting Predictive Probabilities: Model Confidence or Human Label Variation?

    Authors: Joris Baan, Raquel Fernández, Barbara Plank, Wilker Aziz

    Abstract: With the rise of increasingly powerful and user-facing NLP systems, there is growing interest in assessing whether they have a good representation of uncertainty by evaluating the quality of their predictive distribution over outcomes. We identify two main perspectives that drive starkly different evaluation protocols. The first treats predictive probability as an indication of model confidence; t… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: EACL 2024 main

  19. arXiv:2402.06509  [pdf, other

    cs.CL cs.AI

    Asking the Right Question at the Right Time: Human and Model Uncertainty Guidance to Ask Clarification Questions

    Authors: Alberto Testoni, Raquel Fernández

    Abstract: Clarification questions are an essential dialogue tool to signal misunderstanding, ambiguities, and under-specification in language use. While humans are able to resolve uncertainty by asking questions since childhood, modern dialogue systems struggle to generate effective questions. To make progress in this direction, in this work we take a collaborative dialogue task as a testbed and study how m… ▽ More

    Submitted 9 February, 2024; originally announced February 2024.

    Comments: Accepted at EACL 2024

  20. arXiv:2402.01352  [pdf, other

    cs.CL cs.AI cs.CV

    Describing Images $\textit{Fast and Slow}$: Quantifying and Predicting the Variation in Human Signals during Visuo-Linguistic Processes

    Authors: Ece Takmaz, Sandro Pezzelle, Raquel Fernández

    Abstract: There is an intricate relation between the properties of an image and how humans behave while describing the image. This behavior shows ample variation, as manifested in human signals such as eye movements and when humans start to describe the image. Despite the value of such signals of visuo-linguistic variation, they are virtually disregarded in the training of current pretrained models, which m… ▽ More

    Submitted 2 February, 2024; originally announced February 2024.

    Comments: To appear in EACL 2024

  21. arXiv:2311.01460  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Implicit Chain of Thought Reasoning via Knowledge Distillation

    Authors: Yuntian Deng, Kiran Prasad, Roland Fernandez, Paul Smolensky, Vishrav Chaudhary, Stuart Shieber

    Abstract: To augment language models with the ability to reason, researchers usually prompt or finetune them to produce chain of thought reasoning steps before producing the final answer. However, although people use natural language to reason effectively, it may be that LMs could reason more effectively with some intermediate computation that is not in natural language. In this work, we explore an alternat… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  22. arXiv:2310.17843  [pdf, other

    cs.LG cs.GT

    A Data-Centric Online Market for Machine Learning: From Discovery to Pricing

    Authors: Minbiao Han, Jonathan Light, Steven Xia, Sainyam Galhotra, Raul Castro Fernandez, Haifeng Xu

    Abstract: Data fuels machine learning (ML) - rich and high-quality training data is essential to the success of ML. However, to transform ML from the race among a few large corporations to an accessible technology that serves numerous normal users' data analysis requests, there still exist important challenges. One gap we observed is that many ML users can benefit from new data that other data owners posses… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

  23. arXiv:2310.17770  [pdf, other

    cs.AI cs.CL cs.CV cs.LG

    GROOViST: A Metric for Grounding Objects in Visual Storytelling

    Authors: Aditya K Surikuchi, Sandro Pezzelle, Raquel Fernández

    Abstract: A proper evaluation of stories generated for a sequence of images -- the task commonly referred to as visual storytelling -- must consider multiple aspects, such as coherence, grammatical correctness, and visual grounding. In this work, we focus on evaluating the degree of grounding, that is, the extent to which a story is about the entities shown in the images. We analyze current metrics, both de… ▽ More

    Submitted 26 October, 2023; originally announced October 2023.

    Comments: In EMNLP 2023 main conference proceedings (to appear)

  24. arXiv:2310.15061  [pdf, other

    cs.CL cs.AI cs.CV

    The BLA Benchmark: Investigating Basic Language Abilities of Pre-Trained Multimodal Models

    Authors: Xinyi Chen, Raquel Fernández, Sandro Pezzelle

    Abstract: Despite the impressive performance achieved by pre-trained language-and-vision models in downstream tasks, it remains an open question whether this reflects a proper understanding of image-text interaction. In this work, we explore to what extent they handle basic linguistic constructions -- active-passive voice, coordination, and relative clauses -- that even preschool children can typically mast… ▽ More

    Submitted 23 October, 2023; originally announced October 2023.

    Comments: This is the camera-ready version of the paper that will be published in the Proceedings of EMNLP 2023 (Singapore, 6-10 December 2023)

  25. arXiv:2310.13676  [pdf, other

    cs.CL

    Information Value: Measuring Utterance Predictability as Distance from Plausible Alternatives

    Authors: Mario Giulianelli, Sarenne Wallbridge, Raquel Fernández

    Abstract: We present information value, a measure which quantifies the predictability of an utterance relative to a set of plausible alternatives. We introduce a method to obtain interpretable estimates of information value using neural text generators, and exploit their psychometric predictive power to investigate the dimensions of predictability that drive human comprehension behaviour. Information value… ▽ More

    Submitted 20 October, 2023; originally announced October 2023.

    Comments: EMNLP 2023 (Main, Long paper)

  26. arXiv:2310.13104  [pdf, other

    cs.DB cs.CR

    Making Differential Privacy Easier to Use for Data Controllers and Data Analysts using a Privacy Risk Indicator and an Escrow-Based Platform

    Authors: Zhiru Zhu, Raul Castro Fernandez

    Abstract: Differential privacy (DP) enables private data analysis but is hard to use in practice. For data controllers who decide what output to release, choosing the amount of noise to add to the output is a non-trivial task because of the difficulty of interpreting the privacy parameter $ε$. For data analysts who submit queries, it is hard to understand the impact of the noise introduced by DP on their ta… ▽ More

    Submitted 2 March, 2024; v1 submitted 19 October, 2023; originally announced October 2023.

  27. arXiv:2310.10378  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    Cross-Lingual Consistency of Factual Knowledge in Multilingual Language Models

    Authors: Jirui Qi, Raquel Fernández, Arianna Bisazza

    Abstract: Multilingual large-scale Pretrained Language Models (PLMs) have been shown to store considerable amounts of factual knowledge, but large variations are observed across languages. With the ultimate goal of ensuring that users with different language backgrounds obtain consistent feedback from the same model, we study the cross-lingual consistency (CLC) of factual knowledge in various multilingual P… ▽ More

    Submitted 9 November, 2023; v1 submitted 16 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP2023 main conference. All code and data are released at https://github.com/Betswish/Cross-Lingual-Consistency

  28. arXiv:2309.11210  [pdf, other

    eess.AS cs.CL cs.SD

    Speak While You Think: Streaming Speech Synthesis During Text Generation

    Authors: Avihu Dekel, Slava Shechtman, Raul Fernandez, David Haws, Zvi Kons, Ron Hoory

    Abstract: Large Language Models (LLMs) demonstrate impressive capabilities, yet interaction with these models is mostly facilitated through text. Using Text-To-Speech to synthesize LLM outputs typically results in notable latency, which is impractical for fluent voice conversations. We propose LLM2Speech, an architecture to synthesize speech while text is being generated by an LLM which yields significant l… ▽ More

    Submitted 20 September, 2023; originally announced September 2023.

    Comments: Under review for ICASSP 2024

  29. Co-Speech Gesture Detection through Multi-Phase Sequence Labeling

    Authors: Esam Ghaleb, Ilya Burenko, Marlou Rasenberg, Wim Pouw, Peter Uhrig, Judith Holler, Ivan Toni, Aslı Özyürek, Raquel Fernández

    Abstract: Gestures are integral components of face-to-face communication. They unfold over time, often following predictable movement phases of preparation, stroke, and retraction. Yet, the prevalent approach to automatic gesture detection treats the problem as binary classification, classifying a segment as either containing a gesture or not, thus failing to capture its inherently sequential and contextual… ▽ More

    Submitted 23 April, 2024; v1 submitted 21 August, 2023; originally announced August 2023.

  30. arXiv:2308.05055  [pdf, other

    cs.IT cs.NI

    Enhancement of Direct LEO Satellite-to-Smartphone Communications by Distributed Beamforming

    Authors: Zhuoao Xu, Gaojie Chen, Ryan Fernandez, Yue Gao, Rahim Tafazolli

    Abstract: The low earth orbit (LEO) satellite network is undergoing rapid development with the maturing of satellite communications and rocket launch technologies, and the demand for a global coverage network. However, current satellite communication networks are constrained by limited transmitting signal power, resulting in the use of large-size and energy-consuming ground terminals to provide additional g… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: 11 pages, 14 figures, 2 tables

  31. arXiv:2308.04818  [pdf, other

    cs.NI

    Enhancement of Satellite-to-Phone Link Budget by Using Distributed Beamforming

    Authors: Zhuoao Xu, Yue Gao, Gaojie Chen, Ryan Fernandez, Vedaprabhu Basavarajappa, Rahim Tafazolli

    Abstract: Small satellites in Low Earth Orbit (LEO) attract much attention from both industry and academia. The latest production and launch technologies constantly drive the development of LEO constellations. However, the wideband signal, except text messages, cannot be transmitted directly from an LEO satellite to a standard mobile cellular phone due to the insufficient link budget. The current LEO conste… ▽ More

    Submitted 9 August, 2023; originally announced August 2023.

    Comments: 8 pages, 6 figures, 1 table

  32. arXiv:2307.15703  [pdf, other

    cs.CL cs.AI cs.LG

    Uncertainty in Natural Language Generation: From Theory to Applications

    Authors: Joris Baan, Nico Daheim, Evgenia Ilia, Dennis Ulmer, Haau-Sing Li, Raquel Fernández, Barbara Plank, Rico Sennrich, Chrysoula Zerva, Wilker Aziz

    Abstract: Recent advances of powerful Language Models have allowed Natural Language Generation (NLG) to emerge as an important technology that can not only perform traditional tasks like summarisation or translation, but also serve as a natural language interface to a variety of applications. As such, it is crucial that NLG systems are trustworthy and reliable, for example by indicating when they are likely… ▽ More

    Submitted 28 July, 2023; originally announced July 2023.

  33. arXiv:2307.13163  [pdf, other

    cs.RO

    Advancing Robot Autonomy for Long-Horizon Tasks

    Authors: Isabel M. Rayas Fernández

    Abstract: Autonomous robots have real-world applications in diverse fields, such as mobile manipulation and environmental exploration, and many such tasks benefit from a hands-off approach in terms of human user involvement over a long task horizon. However, the level of autonomy achievable by a deployment is limited in part by the problem definition or task specification required by the system. Task specif… ▽ More

    Submitted 24 July, 2023; originally announced July 2023.

    Comments: PhD dissertation. 160 pages

  34. arXiv:2307.00432  [pdf, other

    cs.DB cs.CR

    Saibot: A Differentially Private Data Search Platform

    Authors: Zezhou Huang, Jiaxiang Liu, Daniel Alabi, Raul Castro Fernandez, Eugene Wu

    Abstract: Recent data search platforms use ML task-based utility measures rather than metadata-based keywords, to search large dataset corpora. Requesters submit a training dataset and these platforms search for augmentations (join or union compatible datasets) that, when used to augment the requester's dataset, most improve model (e.g., linear regression) performance. Although effective, providers that man… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.

    Journal ref: VLDB 2023

  35. arXiv:2306.17747  [pdf, other

    cs.MA cs.AI math.DS math.OC nlin.AO

    Discriminatory or Samaritan -- which AI is needed for humanity? An Evolutionary Game Theory Analysis of Hybrid Human-AI populations

    Authors: Tim Booker, Manuel Miranda, Jesús A. Moreno López, José María Ramos Fernández, Max Reddel, Valeria Widler, Filippo Zimmaro, Alberto Antonioni, The Anh Han

    Abstract: As artificial intelligence (AI) systems are increasingly embedded in our lives, their presence leads to interactions that shape our behaviour, decision-making, and social interactions. Existing theoretical research has primarily focused on human-to-human interactions, overlooking the unique dynamics triggered by the presence of AI. In this paper, resorting to methods from evolutionary game theory,… ▽ More

    Submitted 3 July, 2023; v1 submitted 30 June, 2023; originally announced June 2023.

    Comments: This work is the result of the Complexity72h 2023 workshop

  36. arXiv:2306.02543  [pdf, other

    cs.LG

    Addressing Budget Allocation and Revenue Allocation in Data Market Environments Using an Adaptive Sampling Algorithm

    Authors: Boxin Zhao, Boxiang Lyu, Raul Castro Fernandez, Mladen Kolar

    Abstract: High-quality machine learning models are dependent on access to high-quality training data. When the data are not already available, it is tedious and costly to obtain them. Data markets help with identifying valuable training data: model consumers pay to train a model, the market uses that budget to identify data and train the model (the budget allocation problem), and finally the market compensa… ▽ More

    Submitted 4 June, 2023; originally announced June 2023.

    Comments: Published on International Conference on Machine Learning (ICML) 2023

  37. arXiv:2306.00751  [pdf, other

    cs.CL cs.LG

    Differentiable Tree Operations Promote Compositional Generalization

    Authors: Paul Soulos, Edward Hu, Kate McCurdy, Yunmo Chen, Roland Fernandez, Paul Smolensky, Jianfeng Gao

    Abstract: In the context of structure-to-structure transformation tasks, learning sequences of discrete symbolic operations poses significant challenges due to their non-differentiability. To facilitate the learning of these symbolic sequences, we introduce a differentiable tree interpreter that compiles high-level symbolic tree operations into subsymbolic matrix operations on tensors. We present a novel Di… ▽ More

    Submitted 1 June, 2023; originally announced June 2023.

    Comments: ICML 2023. Code available at https://github.com/psoulos/dtm

  38. arXiv:2305.19933  [pdf, other

    cs.CL cs.AI cs.CV

    Speaking the Language of Your Listener: Audience-Aware Adaptation via Plug-and-Play Theory of Mind

    Authors: Ece Takmaz, Nicolo' Brandizzi, Mario Giulianelli, Sandro Pezzelle, Raquel Fernández

    Abstract: Dialogue participants may have varying levels of knowledge about the topic under discussion. In such cases, it is essential for speakers to adapt their utterances by taking their audience into account. Yet, it is an open question how such adaptation can be modelled in computational agents. In this paper, we model a visually grounded referential game between a knowledgeable speaker and a listener w… ▽ More

    Submitted 31 May, 2023; originally announced May 2023.

    Comments: To appear in Findings of ACL 2023

  39. arXiv:2305.12050  [pdf, other

    cs.SE cs.AI

    AI-assisted Code Authoring at Scale: Fine-tuning, deploying, and mixed methods evaluation

    Authors: Vijayaraghavan Murali, Chandra Maddila, Imad Ahmad, Michael Bolin, Daniel Cheng, Negar Ghorbani, Renuka Fernandez, Nachiappan Nagappan, Peter C. Rigby

    Abstract: Generative LLMs have been shown to effectively power AI-based code authoring tools that can suggest entire statements or blocks of code during code authoring. In this paper we present CodeCompose, an AI-assisted code authoring tool developed and deployed at Meta internally. CodeCompose is based on the InCoder LLM that merges generative capabilities with bi-directionality. We have scaled up CodeCom… ▽ More

    Submitted 16 February, 2024; v1 submitted 19 May, 2023; originally announced May 2023.

  40. arXiv:2305.11993  [pdf, other

    cs.CL

    Interpretable Word Sense Representations via Definition Generation: The Case of Semantic Change Analysis

    Authors: Mario Giulianelli, Iris Luden, Raquel Fernandez, Andrey Kutuzov

    Abstract: We propose using automatically generated natural language definitions of contextualised word usages as interpretable word and word sense representations. Given a collection of usage examples for a target word, and the corresponding data-driven usage clusters (i.e., word senses), a definition is generated for each usage with a specialised Flan-T5 language model, and the most prototypical definition… ▽ More

    Submitted 25 July, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: ACL 2023

  41. arXiv:2305.11707  [pdf, other

    cs.CL cs.AI cs.LG

    What Comes Next? Evaluating Uncertainty in Neural Text Generators Against Human Production Variability

    Authors: Mario Giulianelli, Joris Baan, Wilker Aziz, Raquel Fernández, Barbara Plank

    Abstract: In Natural Language Generation (NLG) tasks, for any input, multiple communicative goals are plausible, and any goal can be put into words, or produced, in multiple ways. We characterise the extent to which human production varies lexically, syntactically, and semantically across four NLG tasks, connecting human production variability to aleatoric or data uncertainty. We then inspect the space of o… ▽ More

    Submitted 20 October, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: Camera ready version for EMNLP 2023

  42. arXiv:2305.10419  [pdf, other

    cs.DB

    Kitana: Efficient Data Augmentation Search for AutoML

    Authors: Zezhou Huang, Pranav Subramaniam, Raul Castro Fernandez, Eugene Wu

    Abstract: AutoML services provide a way for non-expert users to benefit from high-quality ML models without worrying about model design and deployment, in exchange for a charge per hour ($21.252 for VertexAI). However, existing AutoML services are model-centric, in that they are limited to extracting features and searching for models from initial training data-they are only as effective as the initial train… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

  43. arXiv:2305.03842  [pdf, other

    cs.DB

    Data Station: Delegated, Trustworthy, and Auditable Computation to Enable Data-Sharing Consortia with a Data Escrow

    Authors: Siyuan Xia, Zhiru Zhu, Chris Zhu, Jinjin Zhao, Kyle Chard, Aaron J. Elmore, Ian Foster, Michael Franklin, Sanjay Krishnan, Raul Castro Fernandez

    Abstract: Pooling and sharing data increases and distributes its value. But since data cannot be revoked once shared, scenarios that require controlled release of data for regulatory, privacy, and legal reasons default to not sharing. Because selectively controlling what data to release is difficult, the few data-sharing consortia that exist are often built around data-sharing agreements resulting from long… ▽ More

    Submitted 5 May, 2023; originally announced May 2023.

  44. arXiv:2304.09068  [pdf, other

    cs.DB cs.LG

    METAM: Goal-Oriented Data Discovery

    Authors: Sainyam Galhotra, Yue Gong, Raul Castro Fernandez

    Abstract: Data is a central component of machine learning and causal inference tasks. The availability of large amounts of data from sources such as open data repositories, data lakes and data marketplaces creates an opportunity to augment data and boost those tasks' performance. However, augmentation techniques rely on a user manually discovering and shortlisting useful candidate augmentations. Existing so… ▽ More

    Submitted 18 April, 2023; originally announced April 2023.

    Comments: ICDE 2023 paper

  45. arXiv:2304.06873  [pdf, other

    cs.RO cs.MA

    Reducing Network Load via Message Utility Estimation for Decentralized Multirobot Teams

    Authors: Isabel M. Rayas Fernández, Christopher E. Denniston, Gaurav S. Sukhatme

    Abstract: We are motivated by quantile estimation of algae concentration in lakes and how decentralized multirobot teams can effectively tackle this problem. We find that multirobot teams improve performance in this task over single robots, and communication-enabled teams further over communication-deprived teams; however, real robots are resource-constrained, and communication networks cannot support arbit… ▽ More

    Submitted 6 July, 2023; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: 7 pages, 1 table, 7 figures

  46. arXiv:2303.03539  [pdf, other

    cs.RO cs.MA

    A Study on Multirobot Quantile Estimation in Natural Environments

    Authors: Isabel M. Rayas Fernández, Christopher E. Denniston, Gaurav S. Sukhatme

    Abstract: Quantiles of a natural phenomena can provide scientists with an important understanding of different spreads of concentrations. When there are several available robots, it may be advantageous to pool resources in a collaborative way to improve performance. A multirobot team can be difficult to practically bring together and coordinate. To this end, we present a study across several axes of the imp… ▽ More

    Submitted 6 July, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: 7 pages, 2 tables, 7 figures

  47. Short-Term Aggregated Residential Load Forecasting using BiLSTM and CNN-BiLSTM

    Authors: Bharat Bohara, Raymond I. Fernandez, Vysali Gollapudi, Xingpeng Li

    Abstract: Higher penetration of renewable and smart home technologies at the residential level challenges grid stability as utility-customer interactions add complexity to power system operations. In response, short-term residential load forecasting has become an increasing area of focus. However, forecasting at the residential level is challenging due to the higher uncertainties involved. Recently deep neu… ▽ More

    Submitted 9 February, 2023; originally announced February 2023.

    Comments: This article has been accepted for publication in 2022 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies (3ICT). This preprint is for personal use - that is solely for the purpose of research, but republication/redistribution requires IEEE permission. Please check IEEE website for more information

    Journal ref: 2022 International Conference on Innovation and Intelligence for Informatics, Computing, and Technologies (3ICT)

  48. arXiv:2301.03560  [pdf, other

    cs.IR

    Solo: Data Discovery Using Natural Language Questions Via A Self-Supervised Approach

    Authors: Qiming Wang, Raul Castro Fernandez

    Abstract: Most deployed data discovery systems, such as Google Datasets, and open data portals only support keyword search. Keyword search is geared towards general audiences but limits the types of queries the systems can answer. We propose a new system that lets users write natural language questions directly. A major barrier to using this learned data discovery system is it needs expensive-to-collect tra… ▽ More

    Submitted 17 October, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: To appear at Sigmod 2024

  49. arXiv:2210.16133  [pdf, other

    cs.CL cs.AI cs.LG

    Stop Measuring Calibration When Humans Disagree

    Authors: Joris Baan, Wilker Aziz, Barbara Plank, Raquel Fernández

    Abstract: Calibration is a popular framework to evaluate whether a classifier knows when it does not know - i.e., its predictive probabilities are a good indication of how likely a prediction is to be correct. Correctness is commonly estimated against the human majority class. Recently, calibration to human majority has been measured on tasks where humans inherently disagree about which class applies. We sh… ▽ More

    Submitted 30 November, 2022; v1 submitted 28 October, 2022; originally announced October 2022.

    Comments: Accepted at EMNLP 2022

  50. arXiv:2210.08321  [pdf, other

    cs.CL

    Construction Repetition Reduces Information Rate in Dialogue

    Authors: Mario Giulianelli, Arabella Sinclair, Raquel Fernández

    Abstract: Speakers repeat constructions frequently in dialogue. Due to their peculiar information-theoretic properties, repetitions can be thought of as a strategy for cost-effective communication. In this study, we focus on the repetition of lexicalised constructions -- i.e., recurring multi-word units -- in English open-domain spoken dialogues. We hypothesise that speakers use construction repetition to m… ▽ More

    Submitted 15 October, 2022; originally announced October 2022.

    Comments: In Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing (AACL-IJCNLP 2022)