Zum Hauptinhalt springen

Showing 1–18 of 18 results for author: Hataya, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.16819  [pdf, other

    cs.LG stat.ML

    Automatic Domain Adaptation by Transformers in In-Context Learning

    Authors: Ryuichiro Hataya, Kota Matsui, Masaaki Imaizumi

    Abstract: Selecting or designing an appropriate domain adaptation algorithm for a given problem remains challenging. This paper presents a Transformer model that can provably approximate and opt for domain adaptation methods for a given dataset in the in-context learning framework, where a foundation model performs new tasks without updating its parameters at test time. Specifically, we prove that Transform… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  2. arXiv:2404.06218  [pdf, other

    cs.LG math.OA quant-ph

    Quantum Circuit $C^*$-algebra Net

    Authors: Yuka Hashimoto, Ryuichiro Hataya

    Abstract: This paper introduces quantum circuit $C^*$-algebra net, which provides a connection between $C^*$-algebra nets proposed in classical machine learning and quantum circuits. Using $C^*$-algebra, a generalization of the space of complex numbers, we can represent quantum gates as weight parameters of a neural network. By introducing additional parameters, we can induce interaction among multiple circ… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  3. arXiv:2402.02741  [pdf, other

    cs.LG stat.ML

    Glocal Hypergradient Estimation with Koopman Operator

    Authors: Ryuichiro Hataya, Yoshinobu Kawahara

    Abstract: Gradient-based hyperparameter optimization methods update hyperparameters using hypergradients, gradients of a meta criterion with respect to hyperparameters. Previous research used two distinct update strategies: optimizing hyperparameters using global hypergradients obtained after completing model training or local hypergradients derived after every few model updates. While global hypergradients… ▽ More

    Submitted 26 May, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

  4. arXiv:2402.02098  [pdf, other

    stat.ML cs.LG

    Self-attention Networks Localize When QK-eigenspectrum Concentrates

    Authors: Han Bao, Ryuichiro Hataya, Ryo Karakida

    Abstract: The self-attention mechanism prevails in modern machine learning. It has an interesting functionality of adaptively selecting tokens from an input sequence by modulating the degree of attention localization, which many researchers speculate is the basis of the powerful model performance but complicates the underlying mechanism of the learning dynamics. In recent years, mainly two arguments have co… ▽ More

    Submitted 3 February, 2024; originally announced February 2024.

  5. arXiv:2307.08187  [pdf, other

    cs.LG cs.AI

    An Empirical Study of Pre-trained Model Selection for Out-of-Distribution Generalization and Calibration

    Authors: Hiroki Naganuma, Ryuichiro Hataya, Ioannis Mitliagkas

    Abstract: In out-of-distribution (OOD) generalization tasks, fine-tuning pre-trained models has become a prevalent strategy. Different from most prior work that has focused on advancing learning algorithms, we systematically examined how pre-trained model size, pre-training dataset size, and training strategies impact generalization and uncertainty calibration on downstream tasks. We evaluated 100 models ac… ▽ More

    Submitted 30 May, 2024; v1 submitted 16 July, 2023; originally announced July 2023.

  6. arXiv:2306.16627  [pdf, other

    quant-ph cs.LG

    MNISQ: A Large-Scale Quantum Circuit Dataset for Machine Learning on/for Quantum Computers in the NISQ era

    Authors: Leonardo Placidi, Ryuichiro Hataya, Toshio Mori, Koki Aoyama, Hayata Morisaki, Kosuke Mitarai, Keisuke Fujii

    Abstract: We introduce the first large-scale dataset, MNISQ, for both the Quantum and the Classical Machine Learning community during the Noisy Intermediate-Scale Quantum era. MNISQ consists of 4,950,000 data points organized in 9 subdatasets. Building our dataset from the quantum encoding of classical information (e.g., MNIST dataset), we deliver a dataset in a dual form: in quantum form, as circuits, and… ▽ More

    Submitted 28 June, 2023; originally announced June 2023.

    Comments: Preprint. Under review

  7. arXiv:2303.03633  [pdf, other

    cs.CV

    Sketch-based Medical Image Retrieval

    Authors: Kazuma Kobayashi, Lin Gu, Ryuichiro Hataya, Takaaki Mizuno, Mototaka Miyake, Hirokazu Watanabe, Masamichi Takahashi, Yasuyuki Takamizawa, Yukihiro Yoshida, Satoshi Nakamura, Nobuji Kouno, Amina Bolatkan, Yusuke Kurose, Tatsuya Harada, Ryuji Hamamoto

    Abstract: The amount of medical images stored in hospitals is increasing faster than ever; however, utilizing the accumulated medical images has been limited. This is because existing content-based medical image retrieval (CBMIR) systems usually require example images to construct query vectors; nevertheless, example images cannot always be prepared. Besides, there can be images with rare characteristics th… ▽ More

    Submitted 6 March, 2023; originally announced March 2023.

  8. arXiv:2302.09726  [pdf, other

    cs.LG

    Nystrom Method for Accurate and Scalable Implicit Differentiation

    Authors: Ryuichiro Hataya, Makoto Yamada

    Abstract: The essential difficulty of gradient-based bilevel optimization using implicit differentiation is to estimate the inverse Hessian vector product with respect to neural network parameters. This paper proposes to tackle this problem by the Nystrom method and the Woodbury matrix identity, exploiting the low-rankness of the Hessian. Compared to existing methods using iterative approximation, such as c… ▽ More

    Submitted 19 February, 2023; originally announced February 2023.

    Comments: AISTATS 2023

  9. arXiv:2302.01191  [pdf, other

    math.OA cs.LG math.FA

    Noncommutative $C^*$-algebra Net: Learning Neural Networks with Powerful Product Structure in $C^*$-algebra

    Authors: Ryuichiro Hataya, Yuka Hashimoto

    Abstract: We propose a new generalization of neural network parameter spaces with noncommutative $C^*$-algebra, which possesses a rich noncommutative structure of products. We show that this noncommutative structure induces powerful effects in learning neural networks. Our framework has a wide range of applications, such as learning multiple related neural networks simultaneously with interactions and learn… ▽ More

    Submitted 6 July, 2024; v1 submitted 26 January, 2023; originally announced February 2023.

  10. arXiv:2211.08095  [pdf, other

    cs.CV

    Will Large-scale Generative Models Corrupt Future Datasets?

    Authors: Ryuichiro Hataya, Han Bao, Hiromi Arai

    Abstract: Recently proposed large-scale text-to-image generative models such as DALL$\cdot$E 2, Midjourney, and StableDiffusion can generate high-quality and realistic images from users' prompts. Not limited to the research community, ordinary Internet users enjoy these generative models, and consequently, a tremendous amount of generated images have been shared on the Internet. Meanwhile, today's success o… ▽ More

    Submitted 9 August, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

    Comments: ICCV 2023

  11. arXiv:2103.12328  [pdf, other

    cs.CV

    Decomposing Normal and Abnormal Features of Medical Images into Discrete Latent Codes for Content-Based Image Retrieval

    Authors: Kazuma Kobayashi, Ryuichiro Hataya, Yusuke Kurose, Mototaka Miyake, Masamichi Takahashi, Akiko Nakagawa, Tatsuya Harada, Ryuji Hamamoto

    Abstract: In medical imaging, the characteristics purely derived from a disease should reflect the extent to which abnormal findings deviate from the normal features. Indeed, physicians often need corresponding images without abnormal findings of interest or, conversely, images that contain similar abnormal findings regardless of normal anatomical context. This is called comparative diagnostic reading of me… ▽ More

    Submitted 23 March, 2021; originally announced March 2021.

  12. arXiv:2102.04600  [pdf, other

    physics.chem-ph cs.LG

    Graph Energy-based Model for Substructure Preserving Molecular Design

    Authors: Ryuichiro Hataya, Hideki Nakayama, Kazuki Yoshizoe

    Abstract: It is common practice for chemists to search chemical databases based on substructures of compounds for finding molecules with desired properties. The purpose of de novo molecular generation is to generate instead of search. Existing machine learning based molecular design methods have no or limited ability in generating novel molecules that preserves a target substructure. Our Graph Energy-based… ▽ More

    Submitted 8 February, 2021; originally announced February 2021.

    Comments: preprint

  13. arXiv:2011.06224  [pdf, other

    eess.IV cs.CV

    Decomposing Normal and Abnormal Features of Medical Images for Content-based Image Retrieval

    Authors: Kazuma Kobayashi, Ryuichiro Hataya, Yusuke Kurose, Tatsuya Harada, Ryuji Hamamoto

    Abstract: Medical images can be decomposed into normal and abnormal features, which is considered as the compositionality. Based on this idea, we propose an encoder-decoder network to decompose a medical image into two discrete latent codes: a normal anatomy code and an abnormal anatomy code. Using these latent codes, we demonstrate a similarity retrieval by focusing on either normal or abnormal features of… ▽ More

    Submitted 12 November, 2020; originally announced November 2020.

    Comments: Machine Learning for Health (ML4H) at NeurIPS 2020 - Extended Abstract

  14. arXiv:2006.07965  [pdf, other

    cs.CV

    Meta Approach to Data Augmentation Optimization

    Authors: Ryuichiro Hataya, Jan Zdenek, Kazuki Yoshizoe, Hideki Nakayama

    Abstract: Data augmentation policies drastically improve the performance of image recognition tasks, especially when the policies are optimized for the target data and tasks. In this paper, we propose to optimize image recognition models and data augmentation policies simultaneously to improve the performance using gradient descent. Unlike prior methods, our approach avoids using proxy tasks or reducing sea… ▽ More

    Submitted 14 June, 2020; originally announced June 2020.

  15. arXiv:2005.12573  [pdf, other

    eess.IV cs.CV

    Learning Global and Local Features of Normal Brain Anatomy for Unsupervised Abnormality Detection

    Authors: Kazuma Kobayashi, Ryuichiro Hataya, Yusuke Kurose, Amina Bolatkan, Mototaka Miyake, Hirokazu Watanabe, Masamichi Takahashi, Jun Itami, Tatsuya Harada, Ryuji Hamamoto

    Abstract: In real-world clinical practice, overlooking unanticipated findings can result in serious consequences. However, supervised learning, which is the foundation for the current success of deep learning, only encourages models to identify abnormalities that are defined in datasets in advance. Therefore, abnormality detection must be implemented in medical images that are not limited to a specific dise… ▽ More

    Submitted 8 May, 2021; v1 submitted 26 May, 2020; originally announced May 2020.

    Comments: This work has been submitted to the IEEE for possible publication. Copyright may be transferred without notice, after which this version may no longer be accessible

  16. arXiv:1911.06987  [pdf, other

    cs.CV

    Faster AutoAugment: Learning Augmentation Strategies using Backpropagation

    Authors: Ryuichiro Hataya, Jan Zdenek, Kazuki Yoshizoe, Hideki Nakayama

    Abstract: Data augmentation methods are indispensable heuristics to boost the performance of deep neural networks, especially in image recognition tasks. Recently, several studies have shown that augmentation strategies found by search algorithms outperform hand-made strategies. Such methods employ black-box search algorithms over image transformations with continuous or discrete parameters and require a lo… ▽ More

    Submitted 16 November, 2019; originally announced November 2019.

  17. arXiv:1904.08254  [pdf, other

    cs.CV cs.LG

    USE-Net: incorporating Squeeze-and-Excitation blocks into U-Net for prostate zonal segmentation of multi-institutional MRI datasets

    Authors: Leonardo Rundo, Changhee Han, Yudai Nagano, Jin Zhang, Ryuichiro Hataya, Carmelo Militello, Andrea Tangherloni, Marco S. Nobile, Claudio Ferretti, Daniela Besozzi, Maria Carla Gilardi, Salvatore Vitabile, Giancarlo Mauri, Hideki Nakayama, Paolo Cazzaniga

    Abstract: Prostate cancer is the most common malignant tumors in men but prostate Magnetic Resonance Imaging (MRI) analysis remains challenging. Besides whole prostate gland segmentation, the capability to differentiate between the blurry boundary of the Central Gland (CG) and Peripheral Zone (PZ) can lead to differential diagnosis, since tumor's frequency and severity differ in these regions. To tackle the… ▽ More

    Submitted 17 July, 2019; v1 submitted 17 April, 2019; originally announced April 2019.

    Comments: 44 pages, 6 figures, Accepted to Neurocomputing, Co-first authors: Leonardo Rundo and Changhee Han

  18. arXiv:1903.12571  [pdf, other

    cs.CV cs.AI

    CNN-based Prostate Zonal Segmentation on T2-weighted MR Images: A Cross-dataset Study

    Authors: Leonardo Rundo, Changhee Han, Jin Zhang, Ryuichiro Hataya, Yudai Nagano, Carmelo Militello, Claudio Ferretti, Marco S. Nobile, Andrea Tangherloni, Maria Carla Gilardi, Salvatore Vitabile, Hideki Nakayama, Giancarlo Mauri

    Abstract: Prostate cancer is the most common cancer among US men. However, prostate imaging is still challenging despite the advances in multi-parametric Magnetic Resonance Imaging (MRI), which provides both morphologic and functional information pertaining to the pathological regions. Along with whole prostate gland segmentation, distinguishing between the Central Gland (CG) and Peripheral Zone (PZ) can gu… ▽ More

    Submitted 29 March, 2019; originally announced March 2019.

    Comments: 12 pages, 3 figures, Accepted to Neural Approaches to Dynamics of Signal Exchanges as a Springer book chapter