
JaFIn: Japanese Financial Instruction Dataset

Authors: Kota Tanabe, Masahiro Suzuki, Hiroki Sakaji, Itsuki Noda

Abstract: We construct an instruction dataset for large language models (LLMs) in the Japanese finance domain. Domain adaptation of language models, including LLMs, is receiving more attention as language models become more popular. This study demonstrates the effectiveness of domain adaptation through instruction tuning. To achieve this, we propose a Japanese instruction tuning dataset called JaFIn, the Japanese Financial Instruction Dataset. JaFIn is manually constructed based on multiple data sources, including Japanese government websites, which provide extensive financial knowledge. We then use JaFIn to apply instruction tuning to several LLMs, demonstrating that our finance-specialized models have better domain adaptability than the original models. The finance-specialized LLMs were evaluated using a quantitative Japanese financial benchmark and qualitative response comparisons, showing improved performance over the originals.

Submitted 19 July, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

Comments: 10 pages, 1 figure. The paper is a camera-ready version for the 2024 IEEE Symposium on Computational Intelligence for Financial Engineering and Economics (CIFEr)

arXiv:2309.10430

Predicate Classification Using Optimal Transport Loss in Scene Graph Generation

Authors: Sorachi Kurita, Satoshi Oyama, Itsuki Noda

Abstract: In scene graph generation (SGG), learning with cross-entropy loss yields biased predictions owing to the severe imbalance in the distribution of the relationship labels in the dataset. Thus, this study proposes a method to generate scene graphs using optimal transport as a measure for comparing two probability distributions. We apply learning with the optimal transport loss, which reflects the similarity between the labels in terms of transportation cost, to predicate classification in SGG. In the proposed approach, the transportation cost is defined using the similarity of words obtained from a pre-trained model. Experimental evaluation demonstrates that the proposed method outperforms existing methods in terms of mean Recall@50 and mean Recall@100. Furthermore, it improves the recall of relationship labels that are scarce in the dataset.

Submitted 19 September, 2023; originally announced September 2023.
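
Illustrative note: the abstract above describes a loss whose transportation cost reflects word similarity between predicate labels. The following is a minimal sketch of that idea, not the authors' implementation. It assumes a one-hot target, in which case all predicted mass must move to the true label and the optimal transport cost reduces to a weighted sum of costs. The labels, the toy embeddings, and the choice of cost as 1 minus cosine similarity are all hypothetical stand-ins for whatever the paper actually uses.

```python
import numpy as np

def ot_loss_one_hot(probs, target, cost):
    """Optimal transport cost from a predicted distribution to a one-hot target.

    With a one-hot target every unit of predicted mass must be moved to the
    target label, so the cost is sum_i probs[i] * cost[i, target].
    """
    probs = np.asarray(probs, dtype=float)
    return float(probs @ cost[:, target])

# Hypothetical predicate labels with toy 2-D "word embeddings"
# (assumed stand-ins for vectors from a pre-trained model).
labels = ["on", "sitting on", "holding"]
emb = np.array([[1.0, 0.0], [0.8, 0.6], [0.0, 1.0]])

# Assumed cost: 1 - cosine similarity, so similar labels are cheap to confuse.
unit = emb / np.linalg.norm(emb, axis=1, keepdims=True)
cost = 1.0 - unit @ unit.T

# True label is "on" (index 0). Compare a near miss with a far miss.
near_miss = ot_loss_one_hot([0.1, 0.8, 0.1], target=0, cost=cost)
far_miss = ot_loss_one_hot([0.1, 0.1, 0.8], target=0, cost=cost)
```

Under these assumptions, predicting the semantically close label "sitting on" is penalized less than predicting "holding", which is the behavior that distinguishes such a loss from plain cross-entropy.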

arXiv:1811.08801

doi: 10.1007/978-3-030-20937-7_9

CARAVAN: a framework for comprehensive simulations on massive parallel machines

Authors: Yohsuke Murase, Hiroyasu Matsushima, Itsuki Noda, Tomio Kamada

Abstract: We present a software framework called CARAVAN, which was developed for comprehensive simulations on massive parallel computers. The framework runs user-developed simulators with various input parameters in parallel without requiring knowledge of parallel programming. The framework is useful for exploring high-dimensional parameter spaces, for which sampling points must be dynamically determined based on the previous results. Possible use cases include optimization, data assimilation, and Markov-chain Monte Carlo sampling in parameter spaces. As a demonstration, we applied CARAVAN to an evacuation planning problem in an urban area. We formulated the problem as a multi-objective optimization problem and searched for solutions using multi-agent simulations and a multi-objective evolutionary algorithm, which were developed as modules of the framework.

Submitted 21 November, 2018; v1 submitted 7 November, 2018; originally announced November 2018.

Comments: 14 pages, 5 figures, to appear in a Springer LNCS/LNAI proceedings series

Journal ref: In: Lin D., Ishida T., Zambonelli F., Noda I. (eds) Massively Multi-Agent Systems II. MMAS 2018. Lecture Notes in Computer Science, vol 11422. Springer, Cham
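
Illustrative note: the abstract above describes running a simulator over many input parameters in parallel, with new sampling points chosen from earlier results. The sketch below shows that general pattern using only Python's standard library. It does not use or reproduce CARAVAN's API; `simulate`, `adaptive_search`, and all parameters are hypothetical names chosen for illustration, and a thread pool stands in for a massively parallel machine.

```python
from concurrent.futures import ThreadPoolExecutor

def simulate(x):
    """Hypothetical stand-in for a user-developed simulator (minimum at x = 0.3)."""
    return (x - 0.3) ** 2

def adaptive_search(lo, hi, rounds=5, points=5, workers=4):
    """Evaluate a batch of points in parallel each round, then narrow the
    search range around the best point found so far."""
    best_x, best_y = None, float("inf")
    with ThreadPoolExecutor(max_workers=workers) as pool:
        for _ in range(rounds):
            step = (hi - lo) / (points - 1)
            xs = [lo + i * step for i in range(points)]
            # Run the whole batch concurrently; results keep input order.
            ys = list(pool.map(simulate, xs))
            y, x = min(zip(ys, xs))
            if y < best_y:
                best_x, best_y = x, y
            # The next sampling points depend on the results so far.
            lo, hi = best_x - step, best_x + step
    return best_x, best_y

best_x, best_y = adaptive_search(0.0, 1.0)
```

The point of the sketch is the control flow: each batch is independent and can run in parallel, while the choice of the next batch has to wait for the previous results.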

Showing 1–3 of 3 results for author: Noda, I