Skip to main content

Showing 1–12 of 12 results for author: Oda, Y

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.03963  [pdf, other

    cs.CL cs.AI

    LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs

    Authors: LLM-jp, :, Akiko Aizawa, Eiji Aramaki, Bowen Chen, Fei Cheng, Hiroyuki Deguchi, Rintaro Enomoto, Kazuki Fujii, Kensuke Fukumoto, Takuya Fukushima, Namgi Han, Yuto Harada, Chikara Hashimoto, Tatsuya Hiraoka, Shohei Hisada, Sosuke Hosokawa, Lu Jie, Keisuke Kamata, Teruhito Kanazawa, Hiroki Kanezashi, Hiroshi Kataoka, Satoru Katsumata, Daisuke Kawahara, Seiya Kawano , et al. (57 additional authors not shown)

    Abstract: This paper introduces LLM-jp, a cross-organizational project for the research and development of Japanese large language models (LLMs). LLM-jp aims to develop open-source and strong Japanese LLMs, and as of this writing, more than 1,500 participants from academia and industry are working together for this purpose. This paper presents the background of the establishment of LLM-jp, summaries of its… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  2. arXiv:2406.17185  [pdf, other

    cs.CL

    Vaporetto: Efficient Japanese Tokenization Based on Improved Pointwise Linear Classification

    Authors: Koichi Akabe, Shunsuke Kanda, Yusuke Oda, Shinsuke Mori

    Abstract: This paper proposes an approach to improve the runtime efficiency of Japanese tokenization based on the pointwise linear classification (PLC) framework, which formulates the whole tokenization process as a sequence of linear classification problems. Our approach optimizes tokenization by leveraging the characteristics of the PLC framework and the task definition. Our approach involves (1) composin… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  3. arXiv:2311.11690  [pdf, other

    cs.PL cs.AI cs.CL cs.SE

    Refactoring Programs Using Large Language Models with Few-Shot Examples

    Authors: Atsushi Shirafuji, Yusuke Oda, Jun Suzuki, Makoto Morishita, Yutaka Watanobe

    Abstract: A less complex and more straightforward program is a crucial factor that enhances its maintainability and makes writing secure and bug-free programs easier. However, due to its heavy workload and the risks of breaking the working programs, programmers are reluctant to do code refactoring, and thus, it also causes the loss of potential learning experiences. To mitigate this, we demonstrate the appl… ▽ More

    Submitted 20 November, 2023; originally announced November 2023.

    Comments: 10 pages, 10 figures, accepted to the 30th Asia-Pacific Software Engineering Conference (APSEC 2023)

  4. arXiv:2306.14583  [pdf, ps, other

    cs.CL cs.AI cs.SE

    Exploring the Robustness of Large Language Models for Solving Programming Problems

    Authors: Atsushi Shirafuji, Yutaka Watanobe, Takumi Ito, Makoto Morishita, Yuki Nakamura, Yusuke Oda, Jun Suzuki

    Abstract: Using large language models (LLMs) for source code has recently gained attention. LLMs, such as Transformer-based models like Codex and ChatGPT, have been shown to be highly capable of solving a wide range of programming problems. However, the extent to which LLMs understand problem descriptions and generate programs accordingly or just retrieve source code from the most relevant problem in traini… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

  5. Engineering faster double-array Aho-Corasick automata

    Authors: Shunsuke Kanda, Koichi Akabe, Yusuke Oda

    Abstract: Multiple pattern matching in strings is a fundamental problem in text processing applications such as regular expressions or tokenization. This paper studies efficient implementations of double-array Aho-Corasick automata (DAACs), data structures for quickly performing the multiple pattern matching. The practical performance of DAACs is improved by carefully designing the data structure, and many… ▽ More

    Submitted 23 June, 2024; v1 submitted 27 July, 2022; originally announced July 2022.

    Comments: Accepted by Software: Practice and Experience (Accepted version)

    Journal ref: Software: Practice and Experience (SPE), 53(6): 1332-1361, 2023

  6. arXiv:2205.09295  [pdf, other

    cs.CL

    Are Prompt-based Models Clueless?

    Authors: Pride Kavumba, Ryo Takahashi, Yusuke Oda

    Abstract: Finetuning large pre-trained language models with a task-specific head has advanced the state-of-the-art on many natural language understanding benchmarks. However, models with a task-specific head require a lot of training data, making them susceptible to learning and exploiting dataset-specific superficial cues that do not generalize to other datasets. Prompting has reduced the data requirement… ▽ More

    Submitted 19 May, 2022; v1 submitted 18 May, 2022; originally announced May 2022.

  7. The failure of cut-elimination in cyclic proof for first-order logic with inductive definitions

    Authors: Yukihiro Oda, James Brotherston, Makoto Tatsuta

    Abstract: A cyclic proof system is a proof system whose proof figure is a tree with cycles. The cut-elimination in a proof system is fundamental. It is conjectured that the cut-elimination in the cyclic proof system for first-order logic with inductive definitions does not hold. This paper shows that the conjecture is correct by giving a sequent not provable without the cut rule but provable in the cyclic p… ▽ More

    Submitted 14 February, 2024; v1 submitted 22 June, 2021; originally announced June 2021.

    Comments: 18 pages

    Journal ref: Journal of Logic and Computation, 2023;, exad068

  8. arXiv:1910.13299  [pdf, other

    cs.CL

    Findings of the Third Workshop on Neural Generation and Translation

    Authors: Hiroaki Hayashi, Yusuke Oda, Alexandra Birch, Ioannis Konstas, Andrew Finch, Minh-Thang Luong, Graham Neubig, Katsuhito Sudoh

    Abstract: This document describes the findings of the Third Workshop on Neural Generation and Translation, held in concert with the annual conference of the Empirical Methods in Natural Language Processing (EMNLP 2019). First, we summarize the research trends of papers presented in the proceedings. Second, we describe the results of the two shared tasks 1) efficient neural machine translation (NMT) where pa… ▽ More

    Submitted 29 October, 2019; v1 submitted 29 October, 2019; originally announced October 2019.

    Comments: Fixed the metadata (author list)

  9. arXiv:1806.02940  [pdf, other

    cs.CL

    Findings of the Second Workshop on Neural Machine Translation and Generation

    Authors: Alexandra Birch, Andrew Finch, Minh-Thang Luong, Graham Neubig, Yusuke Oda

    Abstract: This document describes the findings of the Second Workshop on Neural Machine Translation and Generation, held in concert with the annual conference of the Association for Computational Linguistics (ACL 2018). First, we summarize the research trends of papers presented in the proceedings, and note that there is particular interest in linguistic structure, domain adaptation, data augmentation, hand… ▽ More

    Submitted 18 June, 2018; v1 submitted 7 June, 2018; originally announced June 2018.

    Comments: WNMT 2018

  10. arXiv:1706.05765  [pdf, other

    cs.CL

    An Empirical Study of Mini-Batch Creation Strategies for Neural Machine Translation

    Authors: Makoto Morishita, Yusuke Oda, Graham Neubig, Koichiro Yoshino, Katsuhito Sudoh, Satoshi Nakamura

    Abstract: Training of neural machine translation (NMT) models usually uses mini-batches for efficiency purposes. During the mini-batched training process, it is necessary to pad shorter sentences in a mini-batch to be equal in length to the longest sentence therein for efficient computation. Previous work has noted that sorting the corpus based on the sentence length before making mini-batches reduces the a… ▽ More

    Submitted 18 June, 2017; originally announced June 2017.

    Comments: 8 pages, accepted to the First Workshop on Neural Machine Translation

  11. arXiv:1704.06918  [pdf, ps, other

    cs.CL

    Neural Machine Translation via Binary Code Prediction

    Authors: Yusuke Oda, Philip Arthur, Graham Neubig, Koichiro Yoshino, Satoshi Nakamura

    Abstract: In this paper, we propose a new method for calculating the output layer in neural machine translation systems. The method is based on predicting a binary code for each word and can reduce computation time/memory requirements of the output layer to be logarithmic in vocabulary size in the best case. In addition, we also introduce two advanced approaches to improve the robustness of the proposed mod… ▽ More

    Submitted 23 April, 2017; originally announced April 2017.

    Comments: Accepted as a long paper at ACL2017

  12. arXiv:1701.03980  [pdf, other

    stat.ML cs.CL cs.MS

    DyNet: The Dynamic Neural Network Toolkit

    Authors: Graham Neubig, Chris Dyer, Yoav Goldberg, Austin Matthews, Waleed Ammar, Antonios Anastasopoulos, Miguel Ballesteros, David Chiang, Daniel Clothiaux, Trevor Cohn, Kevin Duh, Manaal Faruqui, Cynthia Gan, Dan Garrette, Yangfeng Ji, Lingpeng Kong, Adhiguna Kuncoro, Gaurav Kumar, Chaitanya Malaviya, Paul Michel, Yusuke Oda, Matthew Richardson, Naomi Saphra, Swabha Swayamdipta, Pengcheng Yin

    Abstract: We describe DyNet, a toolkit for implementing neural network models based on dynamic declaration of network structure. In the static declaration strategy that is used in toolkits like Theano, CNTK, and TensorFlow, the user first defines a computation graph (a symbolic representation of the computation), and then examples are fed into an engine that executes this computation and computes its deriva… ▽ More

    Submitted 14 January, 2017; originally announced January 2017.

    Comments: 33 pages