Zum Hauptinhalt springen

Showing 1–25 of 25 results for author: Koyama, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.20879  [pdf, other

    cs.LG

    Flow matching achieves minimax optimal convergence

    Authors: Kenji Fukumizu, Taiji Suzuki, Noboru Isobe, Kazusato Oko, Masanori Koyama

    Abstract: Flow matching (FM) has gained significant attention as a simulation-free generative model. Unlike diffusion models, which are based on stochastic differential equations, FM employs a simpler approach by solving an ordinary differential equation with an initial condition from a normal distribution, thus streamlining the sample generation process. This paper discusses the convergence properties of F… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  2. arXiv:2405.09860  [pdf, other

    quant-ph cs.NI

    Optimal Switching Networks for Paired-Egress Bell State Analyzer Pools

    Authors: Marii Koyama, Claire Yun, Amin Taherkhani, Naphan Benchasattabuse, Bernard Ousmane Sane, Michal Hajdušek, Shota Nagayama, Rodney Van Meter

    Abstract: To scale quantum computers to useful levels, we must build networks of quantum computational nodes that can share entanglement for use in distributed forms of quantum algorithms. In one proposed architecture, node-to-node entanglement is created when nodes emit photons entangled with stationary memories, with the photons routed through a switched interconnect to a shared pool of Bell state analyze… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 11 pages, 8 figures, 1 table

  3. arXiv:2402.18839  [pdf, other

    cs.LG math.AP math.FA math.OC math.PR

    Extended Flow Matching: a Method of Conditional Generation with Generalized Continuity Equation

    Authors: Noboru Isobe, Masanori Koyama, Jinzhe Zhang, Kohei Hayashi, Kenji Fukumizu

    Abstract: The task of conditional generation is one of the most important applications of generative models, and numerous methods have been developed to date based on the celebrated flow-based models. However, many flow-based models in use today are not built to allow one to introduce an explicit inductive bias to how the conditional distribution to be generated changes with respect to conditions. This can… ▽ More

    Submitted 5 July, 2024; v1 submitted 28 February, 2024; originally announced February 2024.

    Comments: 27 pages, 10 figures, We have corrected an error in our experiment on COT-FM

    MSC Class: 68T07 (Primary); 49Q22 (Secondary)

  4. arXiv:2305.18484  [pdf, other

    stat.ML cs.LG

    Neural Fourier Transform: A General Approach to Equivariant Representation Learning

    Authors: Masanori Koyama, Kenji Fukumizu, Kohei Hayashi, Takeru Miyato

    Abstract: Symmetry learning has proven to be an effective approach for extracting the hidden structure of data, with the concept of equivariance relation playing the central role. However, most of the current studies are built on architectural theory and corresponding assumptions on the form of data. We propose Neural Fourier Transform (NFT), a general framework of learning the latent linear action of the g… ▽ More

    Submitted 14 February, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

  5. arXiv:2210.07413  [pdf, other

    stat.ML cs.LG

    Invariance-adapted decomposition and Lasso-type contrastive learning

    Authors: Masanori Koyama, Takeru Miyato, Kenji Fukumizu

    Abstract: Recent years have witnessed the effectiveness of contrastive learning in obtaining the representation of dataset that is useful in interpretation and downstream tasks. However, the mechanism that describes this effectiveness have not been thoroughly analyzed, and many studies have been conducted to investigate the data structures captured by contrastive learning. In particular, the recent study of… ▽ More

    Submitted 13 October, 2022; originally announced October 2022.

    Journal ref: 2022 ICML workshop of Topology, Algebra and Geometry in Machine Learning (spotlight)

  6. arXiv:2210.05972  [pdf, other

    cs.LG stat.ML

    Unsupervised Learning of Equivariant Structure from Sequences

    Authors: Takeru Miyato, Masanori Koyama, Kenji Fukumizu

    Abstract: In this study, we present meta-sequential prediction (MSP), an unsupervised framework to learn the symmetry from the time sequence of length at least three. Our method leverages the stationary property (e.g. constant velocity, constant acceleration) of the time sequence to learn the underlying equivariant structure of the dataset by simply training the encoder-decoder model to be able to predict t… ▽ More

    Submitted 12 October, 2022; originally announced October 2022.

    Comments: Accepted to NeurIPS 2022

  7. arXiv:2111.07679  [pdf, other

    stat.ML cs.LG

    Contrastive Representation Learning with Trainable Augmentation Channel

    Authors: Masanori Koyama, Kentaro Minami, Takeru Miyato, Yarin Gal

    Abstract: In contrastive representation learning, data representation is trained so that it can classify the image instances even when the images are altered by augmentations. However, depending on the datasets, some augmentations can damage the information of the images beyond recognition, and such augmentations can result in collapsed representations. We present a partial solution to this problem by forma… ▽ More

    Submitted 15 November, 2021; originally announced November 2021.

  8. arXiv:2008.01883  [pdf, other

    stat.ML cs.LG

    When is invariance useful in an Out-of-Distribution Generalization problem ?

    Authors: Masanori Koyama, Shoichiro Yamaguchi

    Abstract: The goal of Out-of-Distribution (OOD) generalization problem is to train a predictor that generalizes on all environments. Popular approaches in this field use the hypothesis that such a predictor shall be an \textit{invariant predictor} that captures the mechanism that remains constant across environments. While these approaches have been experimentally successful in various case studies, there i… ▽ More

    Submitted 25 November, 2021; v1 submitted 4 August, 2020; originally announced August 2020.

  9. arXiv:2007.10623  [pdf, other

    cs.LG stat.ML

    Learning Structured Latent Factors from Dependent Data:A Generative Model Framework from Information-Theoretic Perspective

    Authors: Ruixiang Zhang, Masanori Koyama, Katsuhiko Ishiguro

    Abstract: Learning controllable and generalizable representation of multivariate data with desired structural properties remains a fundamental problem in machine learning. In this paper, we present a novel framework for learning generative models with various underlying structures in the latent space. We represent the inductive bias in the form of mask variables to model the dependency structure in the grap… ▽ More

    Submitted 2 October, 2020; v1 submitted 21 July, 2020; originally announced July 2020.

    Comments: ICML2020 accepted paper. Author name fixed

  10. arXiv:2006.01488  [pdf, other

    stat.ML cs.LG

    Meta Learning as Bayes Risk Minimization

    Authors: Shin-ichi Maeda, Toshiki Nakanishi, Masanori Koyama

    Abstract: Meta-Learning is a family of methods that use a set of interrelated tasks to learn a model that can quickly learn a new query task from a possibly small contextual dataset. In this study, we use a probabilistic framework to formalize what it means for two tasks to be related and reframe the meta-learning problem into the problem of Bayesian risk minimization (BRM). In our formulation, the BRM opti… ▽ More

    Submitted 2 June, 2020; originally announced June 2020.

  11. arXiv:1909.09540  [pdf, other

    cs.LG stat.ML

    Reconnaissance and Planning algorithm for constrained MDP

    Authors: Shin-ichi Maeda, Hayato Watahiki, Shintarou Okada, Masanori Koyama

    Abstract: Practical reinforcement learning problems are often formulated as constrained Markov decision process (CMDP) problems, in which the agent has to maximize the expected return while satisfying a set of prescribed safety constraints. In this study, we propose a novel simulator-based method to approximately solve a CMDP problem without making any compromise on the safety constraints. We achieve this b… ▽ More

    Submitted 20 September, 2019; originally announced September 2019.

  12. arXiv:1907.10902  [pdf, other

    cs.LG stat.ML

    Optuna: A Next-generation Hyperparameter Optimization Framework

    Authors: Takuya Akiba, Shotaro Sano, Toshihiko Yanase, Takeru Ohta, Masanori Koyama

    Abstract: The purpose of this study is to introduce new design-criteria for next-generation hyperparameter optimization software. The criteria we propose include (1) define-by-run API that allows users to construct the parameter search space dynamically, (2) efficient implementation of both searching and pruning strategies, and (3) easy-to-setup, versatile architecture that can be deployed for various purpo… ▽ More

    Submitted 25 July, 2019; originally announced July 2019.

    Comments: 10 pages, Accepted at KDD 2019 Applied Data Science track

  13. arXiv:1905.13021  [pdf, other

    stat.ML cs.IT cs.LG

    Robustness to Adversarial Perturbations in Learning from Incomplete Data

    Authors: Amir Najafi, Shin-ichi Maeda, Masanori Koyama, Takeru Miyato

    Abstract: What is the role of unlabeled data in an inference problem, when the presumed underlying distribution is adversarially perturbed? To provide a concrete answer to this question, this paper unifies two major learning frameworks: Semi-Supervised Learning (SSL) and Distributionally Robust Learning (DRL). We develop a generalization theory for our framework based on a number of novel complexity measure… ▽ More

    Submitted 24 May, 2019; originally announced May 2019.

    Comments: 41 pages, 9 figures

  14. arXiv:1905.11722  [pdf, ps, other

    cs.LG stat.ML

    A Graph Theoretic Framework of Recomputation Algorithms for Memory-Efficient Backpropagation

    Authors: Mitsuru Kusumoto, Takuya Inoue, Gentaro Watanabe, Takuya Akiba, Masanori Koyama

    Abstract: Recomputation algorithms collectively refer to a family of methods that aims to reduce the memory consumption of the backpropagation by selectively discarding the intermediate results of the forward propagation and recomputing the discarded results as needed. In this paper, we will propose a novel and efficient recomputation method that can be applied to a wider range of neural nets than previous… ▽ More

    Submitted 28 May, 2019; originally announced May 2019.

  15. arXiv:1902.02992  [pdf, other

    stat.ML cs.LG

    A Wrapped Normal Distribution on Hyperbolic Space for Gradient-Based Learning

    Authors: Yoshihiro Nagano, Shoichiro Yamaguchi, Yasuhiro Fujita, Masanori Koyama

    Abstract: Hyperbolic space is a geometry that is known to be well-suited for representation learning of data with an underlying hierarchical structure. In this paper, we present a novel hyperbolic distribution called \textit{pseudo-hyperbolic Gaussian}, a Gaussian-like distribution on hyperbolic space whose density can be evaluated analytically and differentiated with respect to the parameters. Our distribu… ▽ More

    Submitted 9 May, 2019; v1 submitted 8 February, 2019; originally announced February 2019.

    Comments: 20 pages, 12 figures

  16. arXiv:1902.01020  [pdf, other

    cs.LG stat.ML

    Graph Warp Module: an Auxiliary Module for Boosting the Power of Graph Neural Networks in Molecular Graph Analysis

    Authors: Katsuhiko Ishiguro, Shin-ichi Maeda, Masanori Koyama

    Abstract: Graph Neural Network (GNN) is a popular architecture for the analysis of chemical molecules, and it has numerous applications in material and medicinal science. Current lines of GNNs developed for molecular analysis, however, do not fit well on the training set, and their performance does not scale well with the complexity of the network. In this paper, we propose an auxiliary module to be attache… ▽ More

    Submitted 24 May, 2019; v1 submitted 3 February, 2019; originally announced February 2019.

    Comments: Augmented experiments, title slightly modified

  17. arXiv:1811.10153  [pdf, other

    cs.CV cs.LG

    Spatially Controllable Image Synthesis with Internal Representation Collaging

    Authors: Ryohei Suzuki, Masanori Koyama, Takeru Miyato, Taizan Yonetsuji, Huachun Zhu

    Abstract: We present a novel CNN-based image editing strategy that allows the user to change the semantic information of an image over an arbitrary region by manipulating the feature-space representation of the image in a trained GAN model. We will present two variants of our strategy: (1) spatial conditional batch normalization (sCBN), a type of conditional batch normalization with user-specifiable spatial… ▽ More

    Submitted 9 April, 2019; v1 submitted 25 November, 2018; originally announced November 2018.

  18. Train Sparsely, Generate Densely: Memory-efficient Unsupervised Training of High-resolution Temporal GAN

    Authors: Masaki Saito, Shunta Saito, Masanori Koyama, Sosuke Kobayashi

    Abstract: Training of Generative Adversarial Network (GAN) on a video dataset is a challenge because of the sheer size of the dataset and the complexity of each observation. In general, the computational cost of training GAN scales exponentially with the resolution. In this study, we present a novel memory efficient method of unsupervised learning of high-resolution video dataset whose computational cost sc… ▽ More

    Submitted 1 June, 2020; v1 submitted 22 November, 2018; originally announced November 2018.

    Comments: Accepted at International Journal of Computer Vision. The source code is available at https://github.com/pfnet-research/tgan2

  19. arXiv:1802.05957  [pdf, other

    cs.LG cs.CV stat.ML

    Spectral Normalization for Generative Adversarial Networks

    Authors: Takeru Miyato, Toshiki Kataoka, Masanori Koyama, Yuichi Yoshida

    Abstract: One of the challenges in the study of generative adversarial networks is the instability of its training. In this paper, we propose a novel weight normalization technique called spectral normalization to stabilize the training of the discriminator. Our new normalization technique is computationally light and easy to incorporate into existing implementations. We tested the efficacy of spectral norm… ▽ More

    Submitted 16 February, 2018; originally announced February 2018.

    Comments: Published as a conference paper at ICLR 2018

  20. arXiv:1802.05637  [pdf, other

    cs.LG cs.CV stat.ML

    cGANs with Projection Discriminator

    Authors: Takeru Miyato, Masanori Koyama

    Abstract: We propose a novel, projection based way to incorporate the conditional information into the discriminator of GANs that respects the role of the conditional information in the underlining probabilistic model. This approach is in contrast with most frameworks of conditional GANs used in application today, which use the conditional information by concatenating the (embedded) conditional vector to th… ▽ More

    Submitted 14 August, 2018; v1 submitted 15 February, 2018; originally announced February 2018.

    Comments: Published as a conference paper at ICLR 2018

  21. arXiv:1710.07375  [pdf

    physics.app-ph cond-mat.mes-hall cs.ET

    SPICE Simulation of tunnel FET aiming at 32 kHz crystal-oscillator operation

    Authors: Tetsufumi Tanamoto, Chika Tanaka, Satoshi Takaya, Masato Koyama

    Abstract: We numerically investigate the possibility of using Tunnel field-effect transistor (TFET) in a 32 kHz crystal oscillator circuit to reduce power consumption. A simulation using SPICE (Simulation Program with Integrated Circuit Emphasis) is carried out based on a conventional CMOS transistor model. It is shown that the power consumption of TFET is one-tenth that of conventional low-power CMOS.

    Submitted 19 October, 2017; originally announced October 2017.

    Comments: 7 pages, 2 tables

  22. arXiv:1704.03976  [pdf, other

    stat.ML cs.LG

    Virtual Adversarial Training: A Regularization Method for Supervised and Semi-Supervised Learning

    Authors: Takeru Miyato, Shin-ichi Maeda, Masanori Koyama, Shin Ishii

    Abstract: We propose a new regularization method based on virtual adversarial loss: a new measure of local smoothness of the conditional label distribution given input. Virtual adversarial loss is defined as the robustness of the conditional label distribution around each input data point against local perturbation. Unlike adversarial training, our method defines the adversarial direction without label info… ▽ More

    Submitted 27 June, 2018; v1 submitted 12 April, 2017; originally announced April 2017.

    Comments: To be appeared in IEEE Transactions on Pattern Analysis and Machine Intelligence

  23. arXiv:1507.00677  [pdf, other

    stat.ML cs.LG

    Distributional Smoothing with Virtual Adversarial Training

    Authors: Takeru Miyato, Shin-ichi Maeda, Masanori Koyama, Ken Nakae, Shin Ishii

    Abstract: We propose local distributional smoothness (LDS), a new notion of smoothness for statistical model that can be used as a regularization term to promote the smoothness of the model distribution. We named the LDS based regularization as virtual adversarial training (VAT). The LDS of a model at an input datapoint is defined as the KL-divergence based robustness of the model distribution against local… ▽ More

    Submitted 11 June, 2016; v1 submitted 2 July, 2015; originally announced July 2015.

    Comments: Under review as a conference paper at ICLR 2016

  24. arXiv:1502.00093  [pdf, other

    stat.ML cs.LG q-bio.NC

    Deep learning of fMRI big data: a novel approach to subject-transfer decoding

    Authors: Sotetsu Koyamada, Yumi Shikauchi, Ken Nakae, Masanori Koyama, Shin Ishii

    Abstract: As a technology to read brain states from measurable brain activities, brain decoding are widely applied in industries and medical sciences. In spite of high demands in these applications for a universal decoder that can be applied to all individuals simultaneously, large variation in brain activities across individuals has limited the scope of many studies to the development of individual-specifi… ▽ More

    Submitted 31 January, 2015; originally announced February 2015.

  25. Principal Sensitivity Analysis

    Authors: Sotetsu Koyamada, Masanori Koyama, Ken Nakae, Shin Ishii

    Abstract: We present a novel algorithm (Principal Sensitivity Analysis; PSA) to analyze the knowledge of the classifier obtained from supervised machine learning techniques. In particular, we define principal sensitivity map (PSM) as the direction on the input space to which the trained classifier is most sensitive, and use analogously defined k-th PSM to define a basis for the input space. We train neural… ▽ More

    Submitted 11 March, 2015; v1 submitted 21 December, 2014; originally announced December 2014.