Showing 1–15 of 15 results for author: Pinhanez, C

Searching in archive cs.
  1. arXiv:2407.12832  [pdf, other]

    cs.CL

    Sentence-level Aggregation of Lexical Metrics Correlate Stronger with Human Judgements than Corpus-level Aggregation

    Authors: Paulo Cavalin, Pedro Henrique Domingues, Claudio Pinhanez

    Abstract: In this paper we show that corpus-level aggregation hinders considerably the capability of lexical metrics to accurately evaluate machine translation (MT) systems. With empirical experiments we demonstrate that averaging individual segment-level scores can make metrics such as BLEU and chrF correlate much stronger with human judgements and make them behave considerably more similar to neural metri…

    Submitted 3 July, 2024; originally announced July 2024.

  2. arXiv:2407.12620  [pdf, other]

    cs.CL cs.AI

    Harnessing the Power of Artificial Intelligence to Vitalize Endangered Indigenous Languages: Technologies and Experiences

    Authors: Claudio Pinhanez, Paulo Cavalin, Luciana Storto, Thomas Finbow, Alexander Cobbinah, Julio Nogima, Marisa Vasconcelos, Pedro Domingues, Priscila de Souza Mizukami, Nicole Grell, Majoí Gongora, Isabel Gonçalves

    Abstract: Since 2022 we have been exploring application areas and technologies in which Artificial Intelligence (AI) and modern Natural Language Processing (NLP), such as Large Language Models (LLMs), can be employed to foster the usage and facilitate the documentation of Indigenous languages which are in danger of disappearing. We start by discussing the decreasing diversity of languages in the world and h…

    Submitted 29 July, 2024; v1 submitted 17 July, 2024; originally announced July 2024.

  3. arXiv:2403.11209  [pdf, other]

    cs.CL cs.HC

    Creating an African American-Sounding TTS: Guidelines, Technical Challenges, and Surprising Evaluations

    Authors: Claudio Pinhanez, Raul Fernandez, Marcelo Grave, Julio Nogima, Ron Hoory

    Abstract: Representations of AI agents in user interfaces and robotics are predominantly White, not only in terms of facial and skin features, but also in the synthetic voices they use. In this paper we explore some unexpected challenges in the representation of race we found in the process of developing a U.S. English Text-to-Speech (TTS) system aimed to sound like an educated, professional, regional acce…

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: Full version including appendixes

  4. arXiv:2205.09021  [pdf, other]

    cs.LG cs.AI

    Exploring the Advantages of Dense-Vector to One-Hot Encoding of Intent Classes in Out-of-Scope Detection Tasks

    Authors: Claudio Pinhanez, Paulo Cavalin

    Abstract: This work explores the intrinsic limitations of the popular one-hot encoding method in classification of intents when detection of out-of-scope (OOS) inputs is required. Although recent work has shown that there can be significant improvements in OOS detection when the intent classes are represented as dense-vectors based on domain specific knowledge, we argue in this paper that such gains are mor…

    Submitted 18 May, 2022; originally announced May 2022.

  5. arXiv:2204.01489  [pdf, other]

    cs.CY cs.AI cs.SI

    Towards a New Science of Disinformation

    Authors: Claudio S. Pinhanez, German H. Flores, Marisa A. Vasconcelos, Mu Qiao, Nick Linck, Rogério de Paula, Yuya J. Ong

    Abstract: How can we best address the dangerous impact that deep learning-generated fake audios, photographs, and videos (a.k.a. deepfakes) may have in personal and societal life? We foresee that the availability of cheap deepfake technology will create a second wave of disinformation where people will receive specific, personalized disinformation through different channels, making the current approaches to…

    Submitted 17 March, 2022; originally announced April 2022.

  6. arXiv:2112.01281  [pdf, ps, other]

    cs.CY

    Expose Uncertainty, Instill Distrust, Avoid Explanations: Towards Ethical Guidelines for AI

    Authors: Claudio S. Pinhanez

    Abstract: In this position paper, I argue that the best way to help and protect humans using AI technology is to make them aware of the intrinsic limitations and problems of AI algorithms. To accomplish this, I suggest three ethical guidelines to be used in the presentation of results, mandating AI systems to expose uncertainty, to instill distrust, and, contrary to traditional views, to avoid explanations.…

    Submitted 29 November, 2021; originally announced December 2021.

    Comments: Presented in the NeurIPS 2021 workshop on Human-Centered AI. December 13th 2021

  7. arXiv:2012.09005  [pdf, other]

    cs.CL cs.LG

    Using Meta-Knowledge Mined from Identifiers to Improve Intent Recognition in Neuro-Symbolic Algorithms

    Authors: Claudio Pinhanez, Paulo Cavalin, Victor Ribeiro, Heloisa Candello, Julio Nogima, Ana Appel, Mauro Pichiliani, Maira Gatti de Bayser, Melina Guerra, Henrique Ferreira, Gabriel Malfatti

    Abstract: In this paper we explore the use of meta-knowledge embedded in intent identifiers to improve intent recognition in conversational systems. As evidenced by the analysis of thousands of real-world chatbots and in interviews with professional chatbot curators, developers and domain experts tend to organize the set of chatbot intents by identifying them using proto-taxonomies, i.e., meta-knowledge con…

    Submitted 16 December, 2020; originally announced December 2020.

  8. arXiv:2001.06350  [pdf, other]

    cs.CL cs.FL

    A Hybrid Solution to Learn Turn-Taking in Multi-Party Service-based Chat Groups

    Authors: Maira Gatti de Bayser, Melina Alberio Guerra, Paulo Cavalin, Claudio Pinhanez

    Abstract: To predict the next most likely participant to interact in a multi-party conversation is a difficult problem. In a text-based chat group, the only information available is the sender, the content of the text and the dialogue history. In this paper we present our study on how this information can be used on the prediction task through a corpus and architecture that integrates turn-taking classifie…

    Submitted 14 January, 2020; originally announced January 2020.

    Comments: arXiv admin note: text overlap with arXiv:1907.02090

  9. arXiv:1908.08931  [pdf, other]

    cs.CY cs.HC

    Machine Teaching by Domain Experts: Towards More Humane, Inclusive, and Intelligent Machine Learning Systems

    Authors: Claudio Pinhanez

    Abstract: This paper argues that a possible way to escape from the limitations of current machine learning (ML) systems is to allow their development directly by domain experts without the mediation of ML experts. This could be accomplished by making ML systems interactively teachable using concepts, definitions, and similar high level knowledge constructs. Pointing to the recent advances in machine teachin…

    Submitted 19 August, 2019; originally announced August 2019.

  10. arXiv:1907.02090  [pdf, other]

    cs.CL

    Learning Multi-Party Turn-Taking Models from Dialogue Logs

    Authors: Maira Gatti de Bayser, Paulo Cavalin, Claudio Pinhanez, Bianca Zadrozny

    Abstract: This paper investigates the application of machine learning (ML) techniques to enable intelligent systems to learn multi-party turn-taking models from dialogue logs. The specific ML task consists of determining who speaks next, after each utterance of a dialogue, given who has spoken and what was said in the previous utterances. With this goal, this paper presents comparisons of the accuracy of di…

    Submitted 3 July, 2019; originally announced July 2019.

  11. arXiv:1808.08157  [pdf]

    cs.HC cs.AI cs.MA

    Different but Equal: Comparing User Collaboration with Digital Personal Assistants vs. Teams of Expert Agents

    Authors: Claudio S. Pinhanez, Heloisa Candello, Mauro C. Pichiliani, Marisa Vasconcelos, Melina Guerra, Maíra G. de Bayser, Paulo Cavalin

    Abstract: This work compares user collaboration with conversational personal assistants vs. teams of expert chatbots. Two studies were performed to investigate whether each approach affects accomplishment of tasks and collaboration costs. Participants interacted with two equivalent financial advice chatbot systems, one composed of a single conversational adviser and the other based on a team of four experts…

    Submitted 24 August, 2018; originally announced August 2018.

  12. arXiv:1802.07117  [pdf, other]

    cs.CL

    Combining Textual Content and Structure to Improve Dialog Similarity

    Authors: Ana Paula Appel, Paulo Rodrigo Cavalin, Marisa Affonso Vasconcelos, Claudio Santos Pinhanez

    Abstract: Chatbots, taking advantage of the success of the messaging apps and recent advances in Artificial Intelligence, have become very popular, from helping business to improve customer services to chatting to users for the sake of conversation and engagement (celebrity or personal bots). However, developing and improving a chatbot requires understanding the data generated by its users. Dialog data ha…

    Submitted 20 February, 2018; originally announced February 2018.

    Comments: 5 pages

  13. arXiv:1802.07116  [pdf, other]

    cs.SI

    A Social Network Analysis Framework for Modeling Health Insurance Claims Data

    Authors: Ana Paula Appel, Vagner F. de Santana, Luis G. Moyano, Marcia Ito, Claudio Santos Pinhanez

    Abstract: Health insurance companies in Brazil have their data about claims organized having the view only for providers. In this way, they lose the physician view and how they share patients. Partnership between physicians can be viewed as a fruitful work in most of the cases but sometimes this could be a problem for health insurance companies and patients, for example a recommendation to visit another physici…

    Submitted 20 February, 2018; originally announced February 2018.

    Comments: 8 pages, 5 figures

  14. arXiv:1712.03012  [pdf]

    cs.HC cs.CY

    Computer Interfaces to Organizations: Perspectives on Borg-Human Interaction Design

    Authors: Claudio Pinhanez

    Abstract: We use the term borg to refer to the complex organizations composed of people, machines, and processes with which users frequently interact using computer interfaces and websites. Unlike interfaces to pure machines, we contend that borg-human interaction (BHI) happens in a context combining the anthropomorphization of the interface, conflict with users, and dramatization of the interaction process…

    Submitted 8 December, 2017; originally announced December 2017.

    Comments: 10 pages

  15. arXiv:1705.01214  [pdf, other]

    cs.CL

    A Hybrid Architecture for Multi-Party Conversational Systems

    Authors: Maira Gatti de Bayser, Paulo Cavalin, Renan Souza, Alan Braz, Heloisa Candello, Claudio Pinhanez, Jean-Pierre Briot

    Abstract: Multi-party Conversational Systems are systems with natural language interaction between one or more people or systems. From the moment that an utterance is sent to a group, to the moment that it is replied in the group by a member, several activities must be done by the system: utterance understanding, information search, reasoning, among others. In this paper we present the challenges of designi…

    Submitted 4 May, 2017; v1 submitted 2 May, 2017; originally announced May 2017.