Showing 1–15 of 15 results for author: Nakayama, K

Searching in archive cs.
  1. arXiv:2407.03963  [pdf, other]

    cs.CL cs.AI

    LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs

    Authors: LLM-jp: Akiko Aizawa, Eiji Aramaki, Bowen Chen, Fei Cheng, Hiroyuki Deguchi, Rintaro Enomoto, Kazuki Fujii, Kensuke Fukumoto, Takuya Fukushima, Namgi Han, Yuto Harada, Chikara Hashimoto, Tatsuya Hiraoka, Shohei Hisada, Sosuke Hosokawa, Lu Jie, Keisuke Kamata, Teruhito Kanazawa, Hiroki Kanezashi, Hiroshi Kataoka, Satoru Katsumata, Daisuke Kawahara, Seiya Kawano, et al. (57 additional authors not shown)

    Abstract: This paper introduces LLM-jp, a cross-organizational project for the research and development of Japanese large language models (LLMs). LLM-jp aims to develop open-source and strong Japanese LLMs, and as of this writing, more than 1,500 participants from academia and industry are working together for this purpose. This paper presents the background of the establishment of LLM-jp, summaries of its…

    Submitted 4 July, 2024; originally announced July 2024.

  2. arXiv:2401.08140  [pdf, other]

    cs.CV

    ProvNeRF: Modeling per Point Provenance in NeRFs as a Stochastic Process

    Authors: Kiyohiro Nakayama, Mikaela Angelina Uy, Yang You, Ke Li, Leonidas Guibas

    Abstract: Neural radiance fields (NeRFs) have gained popularity across various applications. However, they face challenges in the sparse view setting, lacking sufficient constraints from volume rendering. Reconstructing and understanding a 3D scene from sparse and unconstrained cameras is a long-standing problem in classical computer vision with diverse applications. While recent works have explored NeRFs i…

    Submitted 18 January, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

  3. arXiv:2312.09609  [pdf, other]

    cs.CV

    Semantic-Aware Transformation-Invariant RoI Align

    Authors: Guo-Ye Yang, George Kiyohiro Nakayama, Zi-Kai Xiao, Tai-Jiang Mu, Xiaolei Huang, Shi-Min Hu

    Abstract: Great progress has been made in learning-based object detection methods in the last decade. Two-stage detectors often have higher detection accuracy than one-stage detectors, due to the use of region of interest (RoI) feature extractors which extract transformation-invariant RoI features for different RoI proposals, making refinement of bounding boxes and prediction of object categories more robus…

    Submitted 15 December, 2023; originally announced December 2023.

  4. arXiv:2310.20685  [pdf, other]

    cs.CV

    NeRF Revisited: Fixing Quadrature Instability in Volume Rendering

    Authors: Mikaela Angelina Uy, Kiyohiro Nakayama, Guandao Yang, Rahul Krishna Thomas, Leonidas Guibas, Ke Li

    Abstract: Neural radiance fields (NeRF) rely on volume rendering to synthesize novel views. Volume rendering requires evaluating an integral along each ray, which is numerically approximated with a finite sum that corresponds to the exact integral along the ray under piecewise constant volume density. As a consequence, the rendered result is unstable w.r.t. the choice of samples along the ray, a phenomenon…

    Submitted 19 January, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

    Comments: NeurIPS 2023

  5. arXiv:2310.07376  [pdf, other]

    cs.CV cs.AI cs.MM

    Point Cloud Denoising and Outlier Detection with Local Geometric Structure by Dynamic Graph CNN

    Authors: Kosuke Nakayama, Hiroto Fukuta, Hiroshi Watanabe

    Abstract: The digitalization of society is rapidly developing toward the realization of the digital twin and metaverse. In particular, point clouds are attracting attention as a media format for 3D space. Point cloud data is contaminated with noise and outliers due to measurement errors. Therefore, denoising and outlier detection are necessary for point cloud processing. Among them, PointCleanNet is an effe…

    Submitted 21 October, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

    Comments: 2023 IEEE 12th Global Conference on Consumer Electronics (GCCE 2023)

  6. arXiv:2305.01921  [pdf, other]

    cs.CV

    DiffFacto: Controllable Part-Based 3D Point Cloud Generation with Cross Diffusion

    Authors: Kiyohiro Nakayama, Mikaela Angelina Uy, Jiahui Huang, Shi-Min Hu, Ke Li, Leonidas J Guibas

    Abstract: While the community of 3D point cloud generation has witnessed a big growth in recent years, there still lacks an effective way to enable intuitive user control in the generation process, hence limiting the general utility of such methods. Since an intuitive way of decomposing a shape is through its parts, we propose to tackle the task of controllable part-based point cloud generation. We introduc…

    Submitted 20 August, 2023; v1 submitted 3 May, 2023; originally announced May 2023.

  7. arXiv:2203.04557  [pdf, other]

    math.OC cs.DS

    Neighborhood persistency of the linear optimization relaxation of integer linear optimization

    Authors: Kei Kimura, Kotaro Nakayama

    Abstract: For an integer linear optimization (ILO) problem, persistency of its linear optimization (LO) relaxation is a property that for every optimal solution of the relaxation that assigns integer values to some variables, there exists an optimal solution of the ILO problem in which these variables retain the same values. Although persistency has been used to develop heuristic, approximation, and fixed-p…

    Submitted 9 March, 2022; originally announced March 2022.

    Comments: 17 pages

  8. arXiv:2004.13846  [pdf, ps, other]

    cs.CL cs.CV cs.LG

    Character-level Japanese Text Generation with Attention Mechanism for Chest Radiography Diagnosis

    Authors: Kenya Sakka, Kotaro Nakayama, Nisei Kimura, Taiki Inoue, Yusuke Iwasawa, Ryohei Yamaguchi, Yosimasa Kawazoe, Kazuhiko Ohe, Yutaka Matsuo

    Abstract: Chest radiography is a general method for diagnosing a patient's condition and identifying important information; therefore, radiography is used extensively in routine medical practice in various situations, such as emergency medical care and medical checkup. However, a high level of expertise is required to interpret chest radiographs. Thus, medical specialists spend considerable time in diagnosi…

    Submitted 8 June, 2020; v1 submitted 6 April, 2020; originally announced April 2020.

    Comments: 8 pages, 3 figures, 2 tables

  9. arXiv:1909.07452  [pdf, other]

    cs.LG cs.CR cs.DC stat.ML

    BAFFLE : Blockchain Based Aggregator Free Federated Learning

    Authors: Paritosh Ramanan, Kiyoshi Nakayama

    Abstract: A key aspect of Federated Learning (FL) is the requirement of a centralized aggregator to maintain and update the global model. However, in many cases orchestrating a centralized aggregator might be infeasible due to numerous operational constraints. In this paper, we introduce BAFFLE, an aggregator free, blockchain driven, FL environment that is inherently decentralized. BAFFLE leverages Smart Co…

    Submitted 18 October, 2020; v1 submitted 16 September, 2019; originally announced September 2019.

  10. arXiv:1805.08962  [pdf, ps, other]

    cs.RO

    Tool Exchangeable Grasp/Assembly Planner

    Authors: Kensuke Harada, Kento Nakayama, Weiwei Wan, Kazuyuki Nagata, Natsuki Yamanobe, Ixchel G. Ramirez-Alpizar

    Abstract: This paper proposes a novel assembly planner for a manipulator which can simultaneously plan assembly sequence, robot motion, grasping configuration, and exchange of grippers. Our assembly planner assumes multiple grippers and can automatically select a feasible one to assemble a part. For a given AND/OR graph of an assembly task, we consider generating the assembly graph from which assembly moti…

    Submitted 23 May, 2018; originally announced May 2018.

    Comments: To appear at Int. Conf. on Intelligent Autonomous Systems

    Journal ref: Int. Conf. on Intelligent Autonomous Systems, 2018

  11. arXiv:1801.08702  [pdf, ps, other]

    stat.ML cs.LG

    Improving Bi-directional Generation between Different Modalities with Variational Autoencoders

    Authors: Masahiro Suzuki, Kotaro Nakayama, Yutaka Matsuo

    Abstract: We investigate deep generative models that can exchange multiple modalities bi-directionally, e.g., generating images from corresponding texts and vice versa. A major approach to achieve this objective is to train a model that integrates all the information of different modalities into a joint representation and then to generate one modality from the corresponding other modality via this joint rep…

    Submitted 26 January, 2018; originally announced January 2018.

    Comments: Updated version of arXiv:1611.01891

  12. arXiv:1711.06406  [pdf, other]

    cs.CV

    Predicting Driver Attention in Critical Situations

    Authors: Ye Xia, Danqing Zhang, Jinkyu Kim, Ken Nakayama, Karl Zipser, David Whitney

    Abstract: Robust driver attention prediction for critical situations is a challenging computer vision problem, yet essential for autonomous driving. Because critical driving moments are so rare, collecting enough data for these situations is difficult with the conventional in-car data collection protocol---tracking eye movements during driving. Here, we first propose a new in-lab driver attention collection…

    Submitted 5 December, 2018; v1 submitted 16 November, 2017; originally announced November 2017.

    Comments: ACCV 2018

  13. arXiv:1706.03038  [pdf, other]

    cs.CV

    Okutama-Action: An Aerial View Video Dataset for Concurrent Human Action Detection

    Authors: Mohammadamin Barekatain, Miquel Martí, Hsueh-Fu Shih, Samuel Murray, Kotaro Nakayama, Yutaka Matsuo, Helmut Prendinger

    Abstract: Despite significant progress in the development of human action detection datasets and algorithms, no current dataset is representative of real-world aerial view scenarios. We present Okutama-Action, a new video dataset for aerial view concurrent human action detection. It consists of 43 minute-long fully-annotated sequences with 12 action classes. Okutama-Action features many challenges missing i…

    Submitted 15 June, 2017; v1 submitted 9 June, 2017; originally announced June 2017.

    Comments: Computer Vision and Pattern Recognition Workshops (CVPRW), Hawaii, USA, 2017

  14. arXiv:1611.08459  [pdf, other]

    cs.CL

    Neural Machine Translation with Latent Semantic of Image and Text

    Authors: Joji Toyama, Masanori Misono, Masahiro Suzuki, Kotaro Nakayama, Yutaka Matsuo

    Abstract: Although attention-based Neural Machine Translation has achieved great success, the attention mechanism cannot capture the entire meaning of the source sentence because it generates a target word depending heavily on the relevant parts of the source sentence. Earlier studies have introduced a latent variable to capture the entire meaning of the sentence and achieved impr…

    Submitted 25 November, 2016; originally announced November 2016.

  15. arXiv:1611.01891  [pdf, ps, other]

    stat.ML cs.LG

    Joint Multimodal Learning with Deep Generative Models

    Authors: Masahiro Suzuki, Kotaro Nakayama, Yutaka Matsuo

    Abstract: We investigate deep generative models that can exchange multiple modalities bi-directionally, e.g., generating images from corresponding texts and vice versa. Recently, some studies handle multiple modalities on deep generative models, such as variational autoencoders (VAEs). However, these models typically assume that modalities are forced to have a conditioned relation, i.e., we can only generat…

    Submitted 6 November, 2016; originally announced November 2016.