Showing 1–24 of 24 results for author: Nakamura, R

Searching in archive cs.
  1. arXiv:2407.03963  [pdf, other]

    cs.CL cs.AI

    LLM-jp: A Cross-organizational Project for the Research and Development of Fully Open Japanese LLMs

    Authors: LLM-jp: Akiko Aizawa, Eiji Aramaki, Bowen Chen, Fei Cheng, Hiroyuki Deguchi, Rintaro Enomoto, Kazuki Fujii, Kensuke Fukumoto, Takuya Fukushima, Namgi Han, Yuto Harada, Chikara Hashimoto, Tatsuya Hiraoka, Shohei Hisada, Sosuke Hosokawa, Lu Jie, Keisuke Kamata, Teruhito Kanazawa, Hiroki Kanezashi, Hiroshi Kataoka, Satoru Katsumata, Daisuke Kawahara, Seiya Kawano, et al. (57 additional authors not shown)

    Abstract: This paper introduces LLM-jp, a cross-organizational project for the research and development of Japanese large language models (LLMs). LLM-jp aims to develop open-source and strong Japanese LLMs, and as of this writing, more than 1,500 participants from academia and industry are working together for this purpose. This paper presents the background of the establishment of LLM-jp, summaries of its…

    Submitted 4 July, 2024; originally announced July 2024.

  2. arXiv:2401.03665  [pdf, other]

    cs.CV

    Primitive Geometry Segment Pre-training for 3D Medical Image Segmentation

    Authors: Ryu Tadokoro, Ryosuke Yamada, Kodai Nakashima, Ryo Nakamura, Hirokatsu Kataoka

    Abstract: The construction of 3D medical image datasets presents several issues, including requiring significant financial costs in data collection and specialized expertise for annotation, as well as strict privacy concerns for patient confidentiality compared to natural image datasets. Therefore, it has become a pressing issue in 3D medical image segmentation to enable data-efficient learning with limited…

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: Accepted to BMVC2023 (Oral)

    Report number: 152

    Journal ref: Proceedings of the British Machine Vision Conference (BMVC), 2023

  3. arXiv:2312.10737  [pdf, other]

    cs.CV cs.RO

    Traffic Incident Database with Multiple Labels Including Various Perspective Environmental Information

    Authors: Shota Nishiyama, Takuma Saito, Ryo Nakamura, Go Ohtani, Hirokatsu Kataoka, Kensho Hara

    Abstract: A large dataset of annotated traffic accidents is necessary to improve the accuracy of traffic accident recognition using deep learning models. Conventional traffic accident datasets provide annotations on traffic accidents and other teacher labels, improving traffic accident recognition performance. However, the labels annotated in conventional datasets need to be more comprehensive to describe t…

    Submitted 19 December, 2023; v1 submitted 17 December, 2023; originally announced December 2023.

    Comments: Conference paper accepted to IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2023. Reason for revision: corrected a missing space between sentences in the preview's abstract, which led to an unintended URL interpretation.

  4. arXiv:2308.14332  [pdf, other]

    cs.CV cs.RO

    Attention-Guided Lidar Segmentation and Odometry Using Image-to-Point Cloud Saliency Transfer

    Authors: Guanqun Ding, Nevrez Imamoglu, Ali Caglayan, Masahiro Murakawa, Ryosuke Nakamura

    Abstract: LiDAR odometry estimation and 3D semantic segmentation are crucial for autonomous driving, which has achieved remarkable advances recently. However, these tasks are challenging due to the imbalance of points in different semantic categories for 3D semantic segmentation and the influence of dynamic objects for LiDAR odometry estimation, which increases the importance of using representative/salient…

    Submitted 16 June, 2024; v1 submitted 28 August, 2023; originally announced August 2023.

    Comments: 33 pages, 12 Figures, 6 Tables, accepted to appear in Multimedia Systems journal (2024)

  5. arXiv:2307.14710  [pdf, other]

    cs.CV

    Pre-training Vision Transformers with Very Limited Synthesized Images

    Authors: Ryo Nakamura, Hirokatsu Kataoka, Sora Takashima, Edgar Josafat Martinez Noriega, Rio Yokota, Nakamasa Inoue

    Abstract: Formula-driven supervised learning (FDSL) is a pre-training method that relies on synthetic images generated from mathematical formulae such as fractals. Prior work on FDSL has shown that pre-training vision transformers on such synthetic datasets can yield competitive accuracy on a wide range of downstream tasks. These synthetic images are categorized according to the parameters in the mathematic…

    Submitted 30 July, 2023; v1 submitted 27 July, 2023; originally announced July 2023.

    Comments: Accepted to ICCV 2023

  6. arXiv:2212.00567  [pdf, other]

    cs.CV cs.RO

    P2Net: A Post-Processing Network for Refining Semantic Segmentation of LiDAR Point Cloud based on Consistency of Consecutive Frames

    Authors: Yutaka Momma, Weimin Wang, Edgar Simo-Serra, Satoshi Iizuka, Ryosuke Nakamura, Hiroshi Ishikawa

    Abstract: We present a lightweight post-processing method to refine the semantic segmentation results of point cloud sequences. Most existing methods usually segment frame by frame and encounter the inherent ambiguity of the problem: based on a measurement in a single frame, labels are sometimes difficult to predict even for humans. To remedy this problem, we propose to explicitly train a network to refine…

    Submitted 1 December, 2022; originally announced December 2022.

    Comments: 2020 IEEE International Conference on Systems, Man, and Cybernetics (SMC)

  7. arXiv:2208.02611  [pdf, other]

    cs.CV

    Surgical Skill Assessment via Video Semantic Aggregation

    Authors: Zhenqiang Li, Lin Gu, Weimin Wang, Ryosuke Nakamura, Yoichi Sato

    Abstract: Automated video-based assessment of surgical skills is a promising task in assisting young surgical trainees, especially in poor-resource areas. Existing works often resort to a CNN-LSTM joint framework that models long-term relationships by LSTMs on spatially pooled short-term CNN features. However, this practice would inevitably neglect the difference among semantic concepts such as tools, tissu…

    Submitted 4 August, 2022; originally announced August 2022.

    Comments: To appear in MICCAI 2022

  8. arXiv:2203.14188  [pdf, ps, other]

    cs.LG cs.CY cs.DC

    mdx: A Cloud Platform for Supporting Data Science and Cross-Disciplinary Research Collaborations

    Authors: Toyotaro Suzumura, Akiyoshi Sugiki, Hiroyuki Takizawa, Akira Imakura, Hiroshi Nakamura, Kenjiro Taura, Tomohiro Kudoh, Toshihiro Hanawa, Yuji Sekiya, Hiroki Kobayashi, Shin Matsushima, Yohei Kuga, Ryo Nakamura, Renhe Jiang, Junya Kawase, Masatoshi Hanai, Hiroshi Miyazaki, Tsutomu Ishizaki, Daisuke Shimotoku, Daisuke Miyamoto, Kento Aida, Atsuko Takefusa, Takashi Kurimoto, Koji Sasayama, Naoya Kitagawa, et al. (8 additional authors not shown)

    Abstract: The growing amount of data and advances in data science have created a need for a new kind of cloud platform that provides users with flexibility, strong security, and the ability to couple with supercomputers and edge devices through high-performance networks. We have built such a nation-wide cloud platform, called "mdx" to meet this need. The mdx platform's virtualization service, jointly operat…

    Submitted 26 March, 2022; originally announced March 2022.

  9. arXiv:2203.03951  [pdf]

    eess.IV cs.CV

    Efficient and Accurate Hyperspectral Pansharpening Using 3D VolumeNet and 2.5D Texture Transfer

    Authors: Yinao Li, Yutaro Iwamoto, Ryousuke Nakamura, Lanfen Lin, Ruofeng Tong, Yen-Wei Chen

    Abstract: Recently, convolutional neural networks (CNN) have obtained promising results in single-image SR for hyperspectral pansharpening. However, enhancing CNNs' representation ability with fewer parameters and a shorter prediction time is a challenging and critical task. In this paper, we propose a novel multi-spectral image fusion method using a combination of the previously proposed 3D CNN model Volum…

    Submitted 8 March, 2022; originally announced March 2022.

  10. arXiv:2201.12047   

    cs.CV

    Exploring Object-Aware Attention Guided Frame Association for RGB-D SLAM

    Authors: Ali Caglayan, Nevrez Imamoglu, Oguzhan Guclu, Ali Osman Serhatoglu, Weimin Wang, Ahmet Burak Can, Ryosuke Nakamura

    Abstract: Deep learning models as an emerging topic have shown great progress in various fields. Especially, visualization tools such as class activation mapping methods provided visual explanation on the reasoning of convolutional neural networks (CNNs). By using the gradients of the network layers, it is possible to demonstrate where the networks pay attention during a specific image recognition task. Mor…

    Submitted 13 March, 2023; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: This article has been removed by arXiv administrators because the submitter did not have the authority to grant the license at the time of submission

  11. arXiv:2112.03731  [pdf, other]

    cs.CV

    SalFBNet: Learning Pseudo-Saliency Distribution via Feedback Convolutional Networks

    Authors: Guanqun Ding, Nevrez Imamoglu, Ali Caglayan, Masahiro Murakawa, Ryosuke Nakamura

    Abstract: Feed-forward only convolutional neural networks (CNNs) may ignore intrinsic relationships and potential benefits of feedback connections in vision tasks such as saliency detection, despite their significant representation capabilities. In this work, we propose a feedback-recursive convolutional framework (SalFBNet) for saliency detection. The proposed feedback model can learn abundant contextual r…

    Submitted 10 January, 2022; v1 submitted 7 December, 2021; originally announced December 2021.

  12. Classifying DNS Servers based on Response Message Matrix using Machine Learning

    Authors: Keiichi Shima, Ryo Nakamura, Kazuya Okada, Tomohiro Ishihara, Daisuke Miyamoto, Yuji Sekiya

    Abstract: Improperly configured domain name system (DNS) servers are sometimes used as packet reflectors as part of a DoS or DDoS attack. Detecting packets created as a result of this activity is logically possible by monitoring the DNS request and response traffic. Any response that does not have a corresponding request can be considered a reflected message; checking and tracking every DNS packet, however,…

    Submitted 9 November, 2021; originally announced November 2021.

  13. arXiv:2004.12349  [pdf, other]

    cs.CV

    When CNNs Meet Random RNNs: Towards Multi-Level Analysis for RGB-D Object and Scene Recognition

    Authors: Ali Caglayan, Nevrez Imamoglu, Ahmet Burak Can, Ryosuke Nakamura

    Abstract: Recognizing objects and scenes are two challenging but essential tasks in image understanding. In particular, the use of RGB-D sensors in handling these tasks has emerged as an important area of focus for better visual understanding. Meanwhile, deep neural networks, specifically convolutional neural networks (CNNs), have become widespread and have been applied to many visual tasks by replacing han…

    Submitted 11 January, 2022; v1 submitted 26 April, 2020; originally announced April 2020.

    Comments: 15 pages, 9 figures, 5 tables (To appear in Computer Vision and Image Understanding, Elsevier)

  14. arXiv:2003.04260  [pdf, other]

    cs.CV cs.RO eess.IV

    SOIC: Semantic Online Initialization and Calibration for LiDAR and Camera

    Authors: Weimin Wang, Shohei Nobuhara, Ryosuke Nakamura, Ken Sakurada

    Abstract: This paper presents a novel semantic-based online extrinsic calibration approach, SOIC (so, I see), for Light Detection and Ranging (LiDAR) and camera sensors. Previous online calibration methods usually need prior knowledge of rough initial values for optimization. The proposed approach removes this limitation by converting the initialization problem to a Perspective-n-Point (PnP) problem with th…

    Submitted 9 March, 2020; originally announced March 2020.

  15. arXiv:1902.10993  [pdf, other]

    cs.CV

    Salient object detection on hyperspectral images using features learned from unsupervised segmentation task

    Authors: Nevrez Imamoglu, Guanqun Ding, Yuming Fang, Asako Kanezaki, Toru Kouyama, Ryosuke Nakamura

    Abstract: Various saliency detection algorithms from color images have been proposed to mimic eye fixation or attentive object detection response of human observers for the same scenes. However, developments on hyperspectral imaging systems enable us to obtain redundant spectral information of the observed scenes from the reflected light source from objects. A few studies using low-level features on hypersp…

    Submitted 28 February, 2019; originally announced February 2019.

    Comments: 5 pages, 3 figures, accepted to appear in IEEE ICASSP 2019 (accepted version)

  16. arXiv:1812.01285  [pdf, other]

    cs.CV

    Rare Event Detection using Disentangled Representation Learning

    Authors: Ryuhei Hamaguchi, Ken Sakurada, Ryosuke Nakamura

    Abstract: This paper presents a novel method for rare event detection from an image pair with class-imbalanced datasets. A straightforward approach for event detection tasks is to train a detection network from a large-scale dataset in an end-to-end manner. However, in many applications such as building change detection on satellite images, few positive samples are available for the training. Moreover, scen…

    Submitted 4 December, 2018; originally announced December 2018.

  17. arXiv:1811.08100  [pdf, other]

    cs.CL

    Another Diversity-Promoting Objective Function for Neural Dialogue Generation

    Authors: Ryo Nakamura, Katsuhito Sudoh, Koichiro Yoshino, Satoshi Nakamura

    Abstract: Although generation-based dialogue systems have been widely researched, the response generations by most existing systems have very low diversities. The most likely reason for this problem is Maximum Likelihood Estimation (MLE) with Softmax Cross-Entropy (SCE) loss. MLE trains models to generate the most frequent responses from enormous generation candidates, although in actual dialogues there are…

    Submitted 20 November, 2018; v1 submitted 20 November, 2018; originally announced November 2018.

    Comments: AAAI 2019 Workshop on Reasoning and Learning for Human-Machine Dialogues (DEEP-DIAL 2019)

  18. arXiv:1810.11856  [pdf, other]

    cs.CV

    Scale Estimation of Monocular SfM for a Multi-modal Stereo Camera

    Authors: Shinya Sumikura, Ken Sakurada, Nobuo Kawaguchi, Ryosuke Nakamura

    Abstract: This paper proposes a novel method of estimating the absolute scale of monocular SfM for a multi-modal stereo camera. In the fields of computer vision and robotics, scale estimation for monocular SfM has been widely investigated in order to simplify systems. This paper addresses the scale estimation problem for a stereo camera system in which two cameras capture different spectral images (e.g., RG…

    Submitted 28 October, 2018; originally announced October 2018.

    Comments: Accepted to ACCV 2018, please see the additional results here: http://youtu.be/xOLtvMZJseU

  19. arXiv:1808.02996  [pdf, other]

    cs.CV

    Object Detection in Satellite Imagery using 2-Step Convolutional Neural Networks

    Authors: Hiroki Miyamoto, Kazuki Uehara, Masahiro Murakawa, Hidenori Sakanashi, Hirokazu Nosato, Toru Kouyama, Ryosuke Nakamura

    Abstract: This paper presents an efficient object detection method from satellite imagery. Among a number of machine learning algorithms, we proposed a combination of two convolutional neural networks (CNN) aimed at high precision and high recall, respectively. We validated our models using golf courses as target objects. The proposed deep learning method demonstrated higher accuracy than previous object id…

    Submitted 8 August, 2018; originally announced August 2018.

    Comments: 4 pages, 5 figures

  20. arXiv:1806.11314  [pdf, other]

    cs.CV

    Hyperspectral Image Dataset for Benchmarking on Salient Object Detection

    Authors: Nevrez Imamoglu, Yu Oishi, Xiaoqiang Zhang, Guanqun Ding, Yuming Fang, Toru Kouyama, Ryosuke Nakamura

    Abstract: Many works have been done on salient object detection using supervised or unsupervised approaches on colour images. Recently, a few studies demonstrated that efficient salient object detection can also be implemented by using spectral features in visible spectrum of hyperspectral images from natural scenes. However, these models on hyperspectral salient object detection were tested with a very few…

    Submitted 1 July, 2018; v1 submitted 29 June, 2018; originally announced June 2018.

    Comments: 3 pages, 3 figures, 2 tables, appeared in the Proceedings of the 10th International Conference on Quality of Multimedia Experience (QoMEX 2018)

  21. arXiv:1712.02941  [pdf, other]

    cs.CV

    Dense Optical Flow based Change Detection Network Robust to Difference of Camera Viewpoints

    Authors: Ken Sakurada, Weimin Wang, Nobuo Kawaguchi, Ryosuke Nakamura

    Abstract: This paper presents a novel method for detecting scene changes from a pair of images with a difference of camera viewpoints using a dense optical flow based change detection network. In the case that camera poses of input images are fixed or known, such as with surveillance and satellite cameras, the pixel correspondence between the images captured at different times can be known. Hence, it is pos…

    Submitted 8 December, 2017; originally announced December 2017.

  22. Filmy Cloud Removal on Satellite Imagery with Multispectral Conditional Generative Adversarial Nets

    Authors: Kenji Enomoto, Ken Sakurada, Weimin Wang, Hiroshi Fukui, Masashi Matsuoka, Ryosuke Nakamura, Nobuo Kawaguchi

    Abstract: In this paper, we propose a method for cloud removal from visible light RGB satellite images by extending the conditional Generative Adversarial Networks (cGANs) from RGB images to multispectral images. Satellite images have been widely utilized for various purposes, such as natural environment monitoring (pollution, forest or rivers), transportation improvement and prompt emergency response to di…

    Submitted 13 October, 2017; originally announced October 2017.

  23. arXiv:1707.09099  [pdf, other]

    cs.CV

    Object Detection of Satellite Images Using Multi-Channel Higher-order Local Autocorrelation

    Authors: Kazuki Uehara, Hidenori Sakanashi, Hirokazu Nosato, Masahiro Murakawa, Hiroki Miyamoto, Ryosuke Nakamura

    Abstract: The Earth observation satellites have been monitoring the earth's surface for a long time, and the images taken by the satellites contain large amounts of valuable data. However, it is extremely hard work to manually analyze such huge data. Thus, a method of automatic object detection is needed for satellite images to facilitate efficient data analyses. This paper describes a new image feature ext…

    Submitted 27 July, 2017; originally announced July 2017.

    Comments: 6 pages, 2 column, 7 figures, Accepted by IEEE International Conference on Systems, Man, and Cybernetics (SMC) 2017

  24. arXiv:1704.06410  [pdf, other]

    cs.CV

    Solar Power Plant Detection on Multi-Spectral Satellite Imagery using Weakly-Supervised CNN with Feedback Features and m-PCNN Fusion

    Authors: Nevrez Imamoglu, Motoki Kimura, Hiroki Miyamoto, Aito Fujita, Ryosuke Nakamura

    Abstract: Most of the traditional convolutional neural networks (CNNs) implements bottom-up approach (feed-forward) for image classifications. However, many scientific studies demonstrate that visual perception in primates rely on both bottom-up and top-down connections. Therefore, in this work, we propose a CNN network with feedback structure for Solar power plant detection on middle-resolution satellite i…

    Submitted 21 June, 2017; v1 submitted 21 April, 2017; originally announced April 2017.

    Comments: 9 pages, 9 figures, 4 tables

    Journal ref: British Machine Vision Conference (BMVC) 2017