Skip to main content

Showing 1–13 of 13 results for author: Kawano, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.03963  [pdf, other

    cs.CL cs.AI

    LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs

    Authors: LLM-jp, :, Akiko Aizawa, Eiji Aramaki, Bowen Chen, Fei Cheng, Hiroyuki Deguchi, Rintaro Enomoto, Kazuki Fujii, Kensuke Fukumoto, Takuya Fukushima, Namgi Han, Yuto Harada, Chikara Hashimoto, Tatsuya Hiraoka, Shohei Hisada, Sosuke Hosokawa, Lu Jie, Keisuke Kamata, Teruhito Kanazawa, Hiroki Kanezashi, Hiroshi Kataoka, Satoru Katsumata, Daisuke Kawahara, Seiya Kawano , et al. (57 additional authors not shown)

    Abstract: This paper introduces LLM-jp, a cross-organizational project for the research and development of Japanese large language models (LLMs). LLM-jp aims to develop open-source and strong Japanese LLMs, and as of this writing, more than 1,500 participants from academia and industry are working together for this purpose. This paper presents the background of the establishment of LLM-jp, summaries of its… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  2. arXiv:2406.09839  [pdf, other

    cs.CL cs.HC

    Rapport-Driven Virtual Agent: Rapport Building Dialogue Strategy for Improving User Experience at First Meeting

    Authors: Muhammad Yeza Baihaqi, Angel García Contreras, Seiya Kawano, Koichiro Yoshino

    Abstract: Rapport is known as a conversational aspect focusing on relationship building, which influences outcomes in collaborative tasks. This study aims to establish human-agent rapport through small talk by using a rapport-building strategy. We implemented this strategy for the virtual agents based on dialogue strategies by prompting a large language model (LLM). In particular, we utilized two dialogue s… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: will be presented at INTERSPEECH 2024

  3. arXiv:2404.03250  [pdf, ps, other

    stat.ME cs.LG stat.ML

    Multi-task learning via robust regularized clustering with non-convex group penalties

    Authors: Akira Okazaki, Shuichi Kawano

    Abstract: Multi-task learning (MTL) aims to improve estimation and prediction performance by sharing common information among related tasks. One natural assumption in MTL is that tasks are classified into clusters based on their characteristics. However, existing MTL methods based on this assumption often ignore outlier tasks that have large task-specific components or no relation to other tasks. To address… ▽ More

    Submitted 27 May, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: 32 pages

  4. arXiv:2403.19259  [pdf, other

    cs.CL

    J-CRe3: A Japanese Conversation Dataset for Real-world Reference Resolution

    Authors: Nobuhiro Ueda, Hideko Habe, Yoko Matsui, Akishige Yuguchi, Seiya Kawano, Yasutomo Kawanishi, Sadao Kurohashi, Koichiro Yoshino

    Abstract: Understanding expressions that refer to the physical world is crucial for such human-assisting systems in the real world, as robots that must perform actions that are expected by users. In real-world reference resolution, a system must ground the verbal information that appears in user interactions to the visual information observed in egocentric views. To this end, we propose a multimodal referen… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: LREC-COLING 2024

  5. arXiv:2403.17545  [pdf, other

    cs.CL cs.CV

    A Gaze-grounded Visual Question Answering Dataset for Clarifying Ambiguous Japanese Questions

    Authors: Shun Inadumi, Seiya Kawano, Akishige Yuguchi, Yasutomo Kawanishi, Koichiro Yoshino

    Abstract: Situated conversations, which refer to visual information as visual question answering (VQA), often contain ambiguities caused by reliance on directive information. This problem is exacerbated because some languages, such as Japanese, often omit subjective or objective terms. Such ambiguities in questions are often clarified by the contexts in conversational situations, such as joint attention wit… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: LREC-COLING 2024

  6. ROSE: Rotation-based Squeezing Robotic Gripper toward Universal Handling of Objects

    Authors: Son Tien Bui, Shinya Kawano, Van Anh Ho

    Abstract: Robotics hand/grippers nowadays are not limited to manufacturing lines; instead, they are widely utilized in cluttered environments, such as restaurants, farms, and warehouses. In such scenarios, they need to deal with high uncertainty of the grasped objects' shapes, postures, surfaces, and material properties, which requires complex integration of sensing and decision-making process. On the other… ▽ More

    Submitted 10 February, 2024; originally announced February 2024.

    Comments: 9 pages, 9 figures, RSS2023 conference

    Journal ref: Robotics: Science and System 2023

  7. arXiv:2210.02735  [pdf, ps, other

    cs.RO

    What Should the System Do Next?: Operative Action Captioning for Estimating System Actions

    Authors: Taiki Nakamura, Seiya Kawano, Akishige Yuguchi, Yasutomo Kawanishi, Koichiro Yoshino

    Abstract: Such human-assisting systems as robots need to correctly understand the surrounding situation based on observations and output the required support actions for humans. Language is one of the important channels to communicate with humans, and the robots are required to have the ability to express their understanding and action planning results. In this study, we propose a new task of operative acti… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

    Comments: Under review in ICRA2023

  8. arXiv:2110.09040  [pdf, ps, other

    stat.ME cs.LG stat.ML

    A Bayesian approach to multi-task learning with network lasso

    Authors: Kaito Shimamura, Shuichi Kawano

    Abstract: Network lasso is a method for solving a multi-task learning problem through the regularized maximum likelihood method. A characteristic of network lasso is setting a different model for each sample. The relationships among the models are represented by relational coefficients. A crucial issue in network lasso is to provide appropriate values for these relational coefficients. In this paper, we pro… ▽ More

    Submitted 18 October, 2021; originally announced October 2021.

  9. arXiv:2009.02695  [pdf, other

    stat.ML cs.LG stat.ME

    Multilinear Common Component Analysis via Kronecker Product Representation

    Authors: Kohei Yoshikawa, Shuichi Kawano

    Abstract: We consider the problem of extracting a common structure from multiple tensor datasets. For this purpose, we propose multilinear common component analysis (MCCA) based on Kronecker products of mode-wise covariance matrices. MCCA constructs a common basis represented by linear combinations of the original variables which loses as little information of the multiple tensor datasets. We also develop a… ▽ More

    Submitted 20 November, 2020; v1 submitted 6 September, 2020; originally announced September 2020.

    Comments: 35 pages, 7 figures

  10. arXiv:2005.03419  [pdf, other

    stat.ML cs.LG

    Relevance Vector Machine with Weakly Informative Hyperprior and Extended Predictive Information Criterion

    Authors: Kazuaki. Murayama, Shuichi. Kawano

    Abstract: In the variational relevance vector machine, the gamma distribution is representative as a hyperprior over the noise precision of automatic relevance determination prior. Instead of the gamma hyperprior, we propose to use the inverse gamma hyperprior with a shape parameter close to zero and a scale parameter not necessary close to zero. This hyperprior is associated with the concept of a weakly in… ▽ More

    Submitted 7 May, 2020; originally announced May 2020.

    Comments: 29 pages, 12 captioned figures, 23 files of non-captioned figures

  11. arXiv:2002.09188  [pdf, ps, other

    stat.ML cs.LG stat.ME

    Sparse principal component regression via singular value decomposition approach

    Authors: Shuichi Kawano

    Abstract: Principal component regression (PCR) is a two-stage procedure: the first stage performs principal component analysis (PCA) and the second stage constructs a regression model whose explanatory variables are replaced by principal components obtained by the first stage. Since PCA is performed by using only explanatory variables, the principal components have no information about the response variable… ▽ More

    Submitted 21 February, 2020; originally announced February 2020.

    Comments: 30 pages

    Journal ref: Advances in Data Analysis and Classification 15 (2021) 795-823

  12. arXiv:1911.08703  [pdf, other

    stat.ML cs.LG stat.ME

    Bayesian sparse convex clustering via global-local shrinkage priors

    Authors: Kaito Shimamura, Shuichi Kawano

    Abstract: Sparse convex clustering is to cluster observations and conduct variable selection simultaneously in the framework of convex clustering. Although a weighted $L_1$ norm is usually employed for the regularization term in sparse convex clustering, its use increases the dependence on the data and reduces the estimation accuracy if the sample size is not sufficient. To tackle these problems, this paper… ▽ More

    Submitted 26 May, 2020; v1 submitted 19 November, 2019; originally announced November 2019.

  13. arXiv:1910.05083  [pdf, other

    stat.ML cs.LG stat.ME

    Sparse Reduced-Rank Regression for Simultaneous Rank and Variable Selection via Manifold Optimization

    Authors: Kohei Yoshikawa, Shuichi Kawano

    Abstract: We consider the problem of constructing a reduced-rank regression model whose coefficient parameter is represented as a singular value decomposition with sparse singular vectors. The traditional estimation procedure for the coefficient parameter often fails when the true rank of the parameter is high. To overcome this issue, we develop an estimation algorithm with rank and variable selection via s… ▽ More

    Submitted 1 November, 2019; v1 submitted 11 October, 2019; originally announced October 2019.

    Comments: 28 pages