Skip to main content

Showing 1–19 of 19 results for author: Yoshino, K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.03963  [pdf, other

    cs.CL cs.AI

    LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs

    Authors: LLM-jp, :, Akiko Aizawa, Eiji Aramaki, Bowen Chen, Fei Cheng, Hiroyuki Deguchi, Rintaro Enomoto, Kazuki Fujii, Kensuke Fukumoto, Takuya Fukushima, Namgi Han, Yuto Harada, Chikara Hashimoto, Tatsuya Hiraoka, Shohei Hisada, Sosuke Hosokawa, Lu Jie, Keisuke Kamata, Teruhito Kanazawa, Hiroki Kanezashi, Hiroshi Kataoka, Satoru Katsumata, Daisuke Kawahara, Seiya Kawano , et al. (57 additional authors not shown)

    Abstract: This paper introduces LLM-jp, a cross-organizational project for the research and development of Japanese large language models (LLMs). LLM-jp aims to develop open-source and strong Japanese LLMs, and as of this writing, more than 1,500 participants from academia and industry are working together for this purpose. This paper presents the background of the establishment of LLM-jp, summaries of its… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  2. arXiv:2406.09839  [pdf, other

    cs.CL cs.HC

    Rapport-Driven Virtual Agent: Rapport Building Dialogue Strategy for Improving User Experience at First Meeting

    Authors: Muhammad Yeza Baihaqi, Angel García Contreras, Seiya Kawano, Koichiro Yoshino

    Abstract: Rapport is known as a conversational aspect focusing on relationship building, which influences outcomes in collaborative tasks. This study aims to establish human-agent rapport through small talk by using a rapport-building strategy. We implemented this strategy for the virtual agents based on dialogue strategies by prompting a large language model (LLM). In particular, we utilized two dialogue s… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: will be presented at INTERSPEECH 2024

  3. arXiv:2403.19259  [pdf, other

    cs.CL

    J-CRe3: A Japanese Conversation Dataset for Real-world Reference Resolution

    Authors: Nobuhiro Ueda, Hideko Habe, Yoko Matsui, Akishige Yuguchi, Seiya Kawano, Yasutomo Kawanishi, Sadao Kurohashi, Koichiro Yoshino

    Abstract: Understanding expressions that refer to the physical world is crucial for such human-assisting systems in the real world, as robots that must perform actions that are expected by users. In real-world reference resolution, a system must ground the verbal information that appears in user interactions to the visual information observed in egocentric views. To this end, we propose a multimodal referen… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: LREC-COLING 2024

  4. arXiv:2403.17545  [pdf, other

    cs.CL cs.CV

    A Gaze-grounded Visual Question Answering Dataset for Clarifying Ambiguous Japanese Questions

    Authors: Shun Inadumi, Seiya Kawano, Akishige Yuguchi, Yasutomo Kawanishi, Koichiro Yoshino

    Abstract: Situated conversations, which refer to visual information as visual question answering (VQA), often contain ambiguities caused by reliance on directive information. This problem is exacerbated because some languages, such as Japanese, often omit subjective or objective terms. Such ambiguities in questions are often clarified by the contexts in conversational situations, such as joint attention wit… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: LREC-COLING 2024

  5. Whats New? Identifying the Unfolding of New Events in Narratives

    Authors: Seyed Mahed Mousavi, Shohei Tanaka, Gabriel Roccabruna, Koichiro Yoshino, Satoshi Nakamura, Giuseppe Riccardi

    Abstract: Narratives include a rich source of events unfolding over time and context. Automatic understanding of these events provides a summarised comprehension of the narrative for further computation (such as reasoning). In this paper, we study the Information Status (IS) of the events and propose a novel challenging task: the automatic identification of new events in a narrative. We define an event as a… ▽ More

    Submitted 8 August, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

  6. arXiv:2212.08475  [pdf, other

    cs.CL cs.AI cs.IR cs.LG cs.SI

    Best-Answer Prediction in Q&A Sites Using User Information

    Authors: Rafik Hadfi, Ahmed Moustafa, Kai Yoshino, Takayuki Ito

    Abstract: Community Question Answering (CQA) sites have spread and multiplied significantly in recent years. Sites like Reddit, Quora, and Stack Exchange are becoming popular amongst people interested in finding answers to diverse questions. One practical way of finding such answers is automatically predicting the best candidate given existing answers and comments. Many studies were conducted on answer pred… ▽ More

    Submitted 14 December, 2022; originally announced December 2022.

    Comments: 22 pages, 3 figures, 4 tables

    ACM Class: I.2.7; I.2.1

  7. arXiv:2210.02735  [pdf, ps, other

    cs.RO

    What Should the System Do Next?: Operative Action Captioning for Estimating System Actions

    Authors: Taiki Nakamura, Seiya Kawano, Akishige Yuguchi, Yasutomo Kawanishi, Koichiro Yoshino

    Abstract: Such human-assisting systems as robots need to correctly understand the surrounding situation based on observations and output the required support actions for humans. Language is one of the important channels to communicate with humans, and the robots are required to have the ability to express their understanding and action planning results. In this study, we propose a new task of operative acti… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: Under review in ICRA2023

  8. arXiv:2106.07999  [pdf, other

    cs.CL

    ARTA: Collection and Classification of Ambiguous Requests and Thoughtful Actions

    Authors: Shohei Tanaka, Koichiro Yoshino, Katsuhito Sudoh, Satoshi Nakamura

    Abstract: Human-assisting systems such as dialogue systems must take thoughtful, appropriate actions not only for clear and unambiguous user requests, but also for ambiguous user requests, even if the users themselves are not aware of their potential requirements. To construct such a dialogue agent, we collected a corpus and developed a model that classifies ambiguous user requests into corresponding system… ▽ More

    Submitted 15 June, 2021; originally announced June 2021.

    Comments: Accepted by The 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL2021)

  9. arXiv:2007.02598  [pdf, other

    cs.CL

    Reflection-based Word Attribute Transfer

    Authors: Yoichi Ishibashi, Katsuhito Sudoh, Koichiro Yoshino, Satoshi Nakamura

    Abstract: Word embeddings, which often represent such analogic relations as king - man + woman = queen, can be used to change a word's attribute, including its gender. For transferring king into queen in this analogy-based manner, we subtract a difference vector man - woman based on the knowledge that king is male. However, developing such knowledge is very costly for words and attributes. In this work, we… ▽ More

    Submitted 7 July, 2020; v1 submitted 6 July, 2020; originally announced July 2020.

    Comments: Accepted at ACL 2020 Student Research Workshop (SRW)

  10. arXiv:2003.10066  [pdf, other

    cs.CL cs.RO

    Caption Generation of Robot Behaviors based on Unsupervised Learning of Action Segments

    Authors: Koichiro Yoshino, Kohei Wakimoto, Yuta Nishimura, Satoshi Nakamura

    Abstract: Bridging robot action sequences and their natural language captions is an important task to increase explainability of human assisting robots in their recently evolving field. In this paper, we propose a system for generating natural language captions that describe behaviors of human assisting robots. The system describes robot actions by using robot observations; histories from actuator systems a… ▽ More

    Submitted 22 March, 2020; originally announced March 2020.

    Comments: Will appear in IWSDS2020

  11. arXiv:1906.09795  [pdf, other

    cs.CL

    Conversational Response Re-ranking Based on Event Causality and Role Factored Tensor Event Embedding

    Authors: Shohei Tanaka, Koichiro Yoshino, Katsuhito Sudoh, Satoshi Nakamura

    Abstract: We propose a novel method for selecting coherent and diverse responses for a given dialogue context. The proposed method re-ranks response candidates generated from conversational models by using event causality relations between events in a dialogue history and response candidates (e.g., ``be stressed out'' precedes ``relieve stress''). We use distributed event representation based on the Role Fa… ▽ More

    Submitted 24 June, 2019; originally announced June 2019.

    Comments: Accepted by 1st Workshop NLP for Conversational AI, ACL 2019 Workshop (ConvAI)

  12. arXiv:1905.11806  [pdf, other

    cs.CL

    An Incremental Turn-Taking Model For Task-Oriented Dialog Systems

    Authors: Andrei C. Coman, Koichiro Yoshino, Yukitoshi Murase, Satoshi Nakamura, Giuseppe Riccardi

    Abstract: In a human-machine dialog scenario, deciding the appropriate time for the machine to take the turn is an open research problem. In contrast, humans engaged in conversations are able to timely decide when to interrupt the speaker for competitive or non-competitive reasons. In state-of-the-art turn-by-turn dialog systems the decision on the next dialog action is taken at the end of the utterance. In… ▽ More

    Submitted 11 July, 2019; v1 submitted 28 May, 2019; originally announced May 2019.

    Comments: Accepted to INTERSPEECH 2019

  13. arXiv:1901.03461  [pdf, ps, other

    cs.CL

    Dialog System Technology Challenge 7

    Authors: Koichiro Yoshino, Chiori Hori, Julien Perez, Luis Fernando D'Haro, Lazaros Polymenakos, Chulaka Gunasekara, Walter S. Lasecki, Jonathan K. Kummerfeld, Michel Galley, Chris Brockett, Jianfeng Gao, Bill Dolan, Xiang Gao, Huda Alamari, Tim K. Marks, Devi Parikh, Dhruv Batra

    Abstract: This paper introduces the Seventh Dialog System Technology Challenges (DSTC), which use shared datasets to explore the problem of building dialog systems. Recently, end-to-end dialog modeling approaches have been applied to various dialog tasks. The seventh DSTC (DSTC7) focuses on developing technologies related to end-to-end dialog systems for (1) sentence selection, (2) sentence generation and (… ▽ More

    Submitted 10 January, 2019; originally announced January 2019.

    Comments: This paper is presented at NIPS2018 2nd Conversational AI workshop

  14. arXiv:1811.10728  [pdf, other

    cs.AI

    Optimization of Information-Seeking Dialogue Strategy for Argumentation-Based Dialogue System

    Authors: Hisao Katsumi, Takuya Hiraoka, Koichiro Yoshino, Kazeto Yamamoto, Shota Motoura, Kunihiko Sadamasa, Satoshi Nakamura

    Abstract: Argumentation-based dialogue systems, which can handle and exchange arguments through dialogue, have been widely researched. It is required that these systems have sufficient supporting information to argue their claims rationally; however, the systems often do not have enough of such information in realistic situations. One way to fill in the gap is acquiring such missing information from dialogu… ▽ More

    Submitted 26 November, 2018; originally announced November 2018.

    Comments: Accepted by AAAI2019 DEEP-DIAL 2019 workshop

  15. arXiv:1811.08100  [pdf, other

    cs.CL

    Another Diversity-Promoting Objective Function for Neural Dialogue Generation

    Authors: Ryo Nakamura, Katsuhito Sudoh, Koichiro Yoshino, Satoshi Nakamura

    Abstract: Although generation-based dialogue systems have been widely researched, the response generations by most existing systems have very low diversities. The most likely reason for this problem is Maximum Likelihood Estimation (MLE) with Softmax Cross-Entropy (SCE) loss. MLE trains models to generate the most frequent responses from enormous generation candidates, although in actual dialogues there are… ▽ More

    Submitted 20 November, 2018; v1 submitted 20 November, 2018; originally announced November 2018.

    Comments: AAAI 2019 Workshop on Reasoning and Learning for Human-Machine Dialogues (DEEP-DIAL 2019)

  16. arXiv:1802.08645  [pdf, other

    cs.CV

    Interactive Image Manipulation with Natural Language Instruction Commands

    Authors: Seitaro Shinagawa, Koichiro Yoshino, Sakriani Sakti, Yu Suzuki, Satoshi Nakamura

    Abstract: We propose an interactive image-manipulation system with natural language instruction, which can generate a target image from a source image and an instruction that describes the difference between the source and the target image. The system makes it possible to modify a generated image interactively and make natural language conditioned image generation more controllable. We construct a neural ne… ▽ More

    Submitted 23 February, 2018; originally announced February 2018.

    Comments: accepted at NIPS 2017 ViGIL workshop (https://nips2017vigil.github.io/)

  17. arXiv:1706.05765  [pdf, other

    cs.CL

    An Empirical Study of Mini-Batch Creation Strategies for Neural Machine Translation

    Authors: Makoto Morishita, Yusuke Oda, Graham Neubig, Koichiro Yoshino, Katsuhito Sudoh, Satoshi Nakamura

    Abstract: Training of neural machine translation (NMT) models usually uses mini-batches for efficiency purposes. During the mini-batched training process, it is necessary to pad shorter sentences in a mini-batch to be equal in length to the longest sentence therein for efficient computation. Previous work has noted that sorting the corpus based on the sentence length before making mini-batches reduces the a… ▽ More

    Submitted 18 June, 2017; originally announced June 2017.

    Comments: 8 pages, accepted to the First Workshop on Neural Machine Translation

  18. arXiv:1705.10962  [pdf, other

    cs.CL

    Analysis of the Effect of Dependency Information on Predicate-Argument Structure Analysis and Zero Anaphora Resolution

    Authors: Koichiro Yoshino, Shinsuke Mori, Satoshi Nakamura

    Abstract: This paper investigates and analyzes the effect of dependency information on predicate-argument structure analysis (PASA) and zero anaphora resolution (ZAR) for Japanese, and shows that a straightforward approach of PASA and ZAR works effectively even if dependency information was not available. We constructed an analyzer that directly predicts relationships of predicates and arguments with their… ▽ More

    Submitted 31 May, 2017; originally announced May 2017.

  19. arXiv:1704.06918  [pdf, ps, other

    cs.CL

    Neural Machine Translation via Binary Code Prediction

    Authors: Yusuke Oda, Philip Arthur, Graham Neubig, Koichiro Yoshino, Satoshi Nakamura

    Abstract: In this paper, we propose a new method for calculating the output layer in neural machine translation systems. The method is based on predicting a binary code for each word and can reduce computation time/memory requirements of the output layer to be logarithmic in vocabulary size in the best case. In addition, we also introduce two advanced approaches to improve the robustness of the proposed mod… ▽ More

    Submitted 23 April, 2017; originally announced April 2017.

    Comments: Accepted as a long paper at ACL2017