Search | arXiv e-print repository

LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs

Authors: LLM-jp, :, Akiko Aizawa, Eiji Aramaki, Bowen Chen, Fei Cheng, Hiroyuki Deguchi, Rintaro Enomoto, Kazuki Fujii, Kensuke Fukumoto, Takuya Fukushima, Namgi Han, Yuto Harada, Chikara Hashimoto, Tatsuya Hiraoka, Shohei Hisada, Sosuke Hosokawa, Lu Jie, Keisuke Kamata, Teruhito Kanazawa, Hiroki Kanezashi, Hiroshi Kataoka, Satoru Katsumata, Daisuke Kawahara, Seiya Kawano , et al. (57 additional authors not shown)

Abstract: This paper introduces LLM-jp, a cross-organizational project for the research and development of Japanese large language models (LLMs). LLM-jp aims to develop open-source and strong Japanese LLMs, and as of this writing, more than 1,500 participants from academia and industry are working together for this purpose. This paper presents the background of the establishment of LLM-jp, summaries of its… ▽ More This paper introduces LLM-jp, a cross-organizational project for the research and development of Japanese large language models (LLMs). LLM-jp aims to develop open-source and strong Japanese LLMs, and as of this writing, more than 1,500 participants from academia and industry are working together for this purpose. This paper presents the background of the establishment of LLM-jp, summaries of its activities, and technical reports on the LLMs developed by LLM-jp. For the latest activities, visit https://llm-jp.nii.ac.jp/en/. △ Less

Submitted 4 July, 2024; originally announced July 2024.

arXiv:2305.06576 [pdf, other]

Clustering of Time-Varying Graphs Based on Temporal Label Smoothness

Authors: Katsuki Fukumoto, Koki Yamada, Yuichi Tanaka, Hoi-To Wai

Abstract: We propose a node clustering method for time-varying graphs based on the assumption that the cluster labels are changed smoothly over time. Clustering is one of the fundamental tasks in many science and engineering fields including signal processing, machine learning, and data mining. Although most existing studies focus on the clustering of nodes in static graphs, we often encounter time-varying… ▽ More We propose a node clustering method for time-varying graphs based on the assumption that the cluster labels are changed smoothly over time. Clustering is one of the fundamental tasks in many science and engineering fields including signal processing, machine learning, and data mining. Although most existing studies focus on the clustering of nodes in static graphs, we often encounter time-varying graphs for time-series data, e.g., social networks, brain functional connectivity, and point clouds. In this paper, we formulate a node clustering of time-varying graphs as an optimization problem based on spectral clustering, with a smoothness constraint of the node labels. We solve the problem with a primal-dual splitting algorithm. Experiments on synthetic and real-world time-varying graphs are performed to validate the effectiveness of the proposed approach. △ Less

Submitted 11 May, 2023; originally announced May 2023.

arXiv:2212.04298 [pdf, other]

Real-time Sampling-based Model Predictive Control based on Reverse Kullback-Leibler Divergence and Its Adaptive Acceleration

Authors: Taisuke Kobayashi, Kota Fukumoto

Abstract: Sampling-based model predictive control (MPC) can be applied to versatile robotic systems. However, the real-time control with it is a big challenge due to its unstable updates and poor convergence. This paper tackles this challenge with a novel derivation from reverse Kullback-Leibler divergence, which has a mode-seeking behavior and is likely to find one of the sub-optimal solutions early. With… ▽ More Sampling-based model predictive control (MPC) can be applied to versatile robotic systems. However, the real-time control with it is a big challenge due to its unstable updates and poor convergence. This paper tackles this challenge with a novel derivation from reverse Kullback-Leibler divergence, which has a mode-seeking behavior and is likely to find one of the sub-optimal solutions early. With this derivation, a weighted maximum likelihood estimation with positive/negative weights is obtained, solving by mirror descent (MD) algorithm. While the negative weights eliminate unnecessary actions, that requires to develop a practical implementation that avoids the interference with positive/negative updates based on rejection sampling. In addition, although the convergence of MD can be accelerated with Nesterov's acceleration method, it is modified for the proposed MPC with a heuristic of a step size adaptive to the noise estimated in update amounts. In the real-time simulations, the proposed method can solve more tasks statistically than the conventional method and accomplish more complex tasks only with a CPU due to the improved acceleration. In addition, its applicability is also demonstrated in a variable impedance control of a force-driven mobile robot. https://youtu.be/D8bFMzct1XM △ Less

Submitted 8 December, 2022; originally announced December 2022.

Comments: 12 pages, 12 figures

arXiv:2103.02777 [pdf, ps, other]

doi 10.1541/ieejeiss.141.155

Application of Reversible Data Hiding for Printing with Special Color Inks to Preserve Compatibility with Normal Printing

Authors: Kotoko Hiraoka, Kensuke Fukumoto, Takashi Yamazoe, Norimichi Tsumura, Satoshi Kaneko, Wataru Arai, Shoko Imaizumi

Abstract: We propose an efficient framework with compatibility between normal printing and printing with special color inks in this paper. Special color inks can be used for printing to represent some particular colors and specific optical properties, which are difficult to express using only CMYK inks. Special color layers are required in addition to the general color layer for printing with special color… ▽ More We propose an efficient framework with compatibility between normal printing and printing with special color inks in this paper. Special color inks can be used for printing to represent some particular colors and specific optical properties, which are difficult to express using only CMYK inks. Special color layers are required in addition to the general color layer for printing with special color inks. We introduce a reversible data hiding (RDH) method to embed the special color layers into the general color layer without visible artifacts. The proposed method can realize both normal printing and printing with special color inks by using a single layer. Our experimental results show that the quality of the marked image is virtually identical to that of the original image, i.e., the general color layer. △ Less

Submitted 3 March, 2021; originally announced March 2021.

Comments: 8 pages

Journal ref: IEEJ Trans. Electr. Inf. & Syst., vol.141, no.2, pp.155-162, February 2021

Showing 1–4 of 4 results for author: Fukumoto, K