Showing 1–31 of 31 results for author: Kimura, M

  1. arXiv:2406.18806  [pdf, other]

    stat.ML cs.LG

    Density Ratio Estimation via Sampling along Generalized Geodesics on Statistical Manifolds

    Authors: Masanari Kimura, Howard Bondell

    Abstract: The density ratio of two probability distributions is one of the fundamental tools in mathematical and computational statistics and machine learning, and it has a variety of known applications. Therefore, density ratio estimation from finite samples is a very important task, but it is known to be unstable when the distributions are distant from each other. One approach to address this problem is d…

    Submitted 26 June, 2024; originally announced June 2024.

  2. arXiv:2405.14522  [pdf, other]

    cs.LG cs.AI cs.CL cs.CV stat.ML

    Explaining Black-box Model Predictions via Two-level Nested Feature Attributions with Consistency Property

    Authors: Yuya Yoshikawa, Masanari Kimura, Ryotaro Shimizu, Yuki Saito

    Abstract: Techniques that explain the predictions of black-box machine learning models are crucial to make the models transparent, thereby increasing trust in AI systems. The input features to the models often have a nested structure that consists of high- and low-level features, and each high-level feature is decomposed into multiple low-level features. For such inputs, both high-level feature attributions…

    Submitted 23 May, 2024; originally announced May 2024.

  3. arXiv:2405.00442  [pdf, other]

    stat.ML cs.AI cs.LG

    Geometric Insights into Focal Loss: Reducing Curvature for Enhanced Model Calibration

    Authors: Masanari Kimura, Hiroki Naganuma

    Abstract: The key factor in implementing machine learning algorithms in decision-making situations is not only the accuracy of the model but also its confidence level. The confidence level of a model in a classification problem is often given by the output vector of a softmax function for convenience. However, these values are known to deviate significantly from the actual expected model confidence. This pr…

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: This paper is under consideration at Pattern Recognition Letters

  4. arXiv:2403.17410  [pdf, other]

    cs.LG cs.AI stat.ML

    On permutation-invariant neural networks

    Authors: Masanari Kimura, Ryotaro Shimizu, Yuki Hirakawa, Ryosuke Goto, Yuki Saito

    Abstract: Conventional machine learning algorithms have traditionally been designed under the assumption that input data follows a vector-based format, with an emphasis on vector-centric paradigms. However, as the demand for tasks involving set-based inputs has grown, there has been a paradigm shift in the research community towards addressing these challenges. In recent years, the emergence of neural netwo…

    Submitted 28 March, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  5. arXiv:2403.10175  [pdf, other]

    cs.LG cs.AI stat.ML

    A Short Survey on Importance Weighting for Machine Learning

    Authors: Masanari Kimura, Hideitsu Hino

    Abstract: Importance weighting is a fundamental procedure in statistics and machine learning that weights the objective function or probability distribution based on the importance of the instance in some sense. The simplicity and usefulness of the idea has led to many applications of importance weighting. For example, it is known that supervised learning under an assumption about the difference between the…

    Submitted 14 May, 2024; v1 submitted 15 March, 2024; originally announced March 2024.

  6. arXiv:2402.06892  [pdf, other]

    cs.LG

    Understanding Test-Time Augmentation

    Authors: Masanari Kimura

    Abstract: Test-Time Augmentation (TTA) is a very powerful heuristic that takes advantage of data augmentation during testing to produce averaged output. Despite the experimental effectiveness of TTA, there is insufficient discussion of its theoretical aspects. In this paper, we aim to give theoretical guarantees for TTA and clarify its behavior.

    Submitted 10 February, 2024; originally announced February 2024.

  7. Information Geometrically Generalized Covariate Shift Adaptation

    Authors: Masanari Kimura, Hideitsu Hino

    Abstract: Many machine learning methods assume that the training and test data follow the same distribution. However, in the real world, this assumption is very often violated. In particular, the phenomenon that the marginal distribution of the data changes is called covariate shift, one of the most important research topics in machine learning. We show that the well-known family of covariate shift adaptati…

    Submitted 18 April, 2023; originally announced April 2023.

  8. arXiv:2302.12991  [pdf, other]

    stat.ML cs.LG

    Generalization Bounds for Set-to-Set Matching with Negative Sampling

    Authors: Masanari Kimura

    Abstract: The problem of matching two sets of multiple elements, namely set-to-set matching, has received a great deal of attention in recent years. In particular, it has been reported that good experimental results can be obtained by preparing a neural network as a matching function, especially in complex cases where, for example, each element of the set is an image. However, theoretical analysis of set-to…

    Submitted 25 February, 2023; originally announced February 2023.

    Comments: This paper is accepted at the International Conference on Neural Information Processing (ICONIP2022)

  9. arXiv:2210.17417  [pdf, other]

    cs.CV cs.LG

    Fashion-Specific Attributes Interpretation via Dual Gaussian Visual-Semantic Embedding

    Authors: Ryotaro Shimizu, Masanari Kimura, Masayuki Goto

    Abstract: Several techniques to map various types of components, such as words, attributes, and images, into the embedded space have been studied. Most of them estimate the embedded representation of target entity as a point in the projective space. Some models, such as Word2Gauss, assume a probability distribution behind the embedded representation, which enables the spread or variance of the meaning of em…

    Submitted 7 November, 2022; v1 submitted 27 October, 2022; originally announced October 2022.

  10. arXiv:2206.10936  [pdf, other]

    stat.ML cs.IT cs.LG

    Information Geometry of Dropout Training

    Authors: Masanari Kimura, Hideitsu Hino

    Abstract: Dropout is one of the most popular regularization techniques in neural network training. Because of its power and simplicity of idea, dropout has been analyzed extensively and many variants have been proposed. In this paper, several properties of dropout are discussed in a unified manner from the viewpoint of information geometry. We showed that dropout flattens the model manifold and that their r…

    Submitted 22 June, 2022; originally announced June 2022.

  11. arXiv:2203.00955  [pdf, other]

    cs.CV

    GRASP EARTH: Intuitive Software for Discovering Changes on the Planet

    Authors: Waku Hatakeyama, Shirou Kawakita, Ryohei Izawa, Masanari Kimura

    Abstract: Detecting changes on the Earth, such as urban development, deforestation, or natural disaster, is one of the research fields that is attracting a great deal of attention. One promising tool to solve these problems is satellite imagery. However, satellite images require huge amount of storage, therefore users are required to set Area of Interests first, which was not suitable for detecting potentia…

    Submitted 2 March, 2022; originally announced March 2022.

  12. arXiv:2108.13699  [pdf, other]

    cs.CV

    End-to-End Monocular Vanishing Point Detection Exploiting Lane Annotations

    Authors: Hiroto Honda, Motoki Kimura, Takumi Karasawa, Yusuke Uchida

    Abstract: Vanishing points (VPs) play a vital role in various computer vision tasks, especially for recognizing the 3D scenes from an image. In the real-world scenario of automobile applications, it is costly to manually obtain the external camera parameters when the camera is attached to the vehicle or the attachment is accidentally perturbed. In this paper we introduce a simple but effective end-to-end va…

    Submitted 31 August, 2021; originally announced August 2021.

  13. arXiv:2108.12992  [pdf, other]

    cs.LG cs.CV

    SHIFT15M: Fashion-specific dataset for set-to-set matching with several distribution shifts

    Authors: Masanari Kimura, Takuma Nakamura, Yuki Saito

    Abstract: This paper addresses the problem of set-to-set matching, which involves matching two different sets of items based on some criteria, especially in the case of high-dimensional items like images. Although neural networks have been applied to solve this problem, most machine learning-based approaches assume that the training and test data follow the same distribution, which is not always true in rea…

    Submitted 8 March, 2023; v1 submitted 30 August, 2021; originally announced August 2021.

  14. arXiv:2108.12216  [pdf, other]

    cs.CL

    Exploring the Capacity of a Large-scale Masked Language Model to Recognize Grammatical Errors

    Authors: Ryo Nagata, Manabu Kimura, Kazuaki Hanawa

    Abstract: In this paper, we explore the capacity of a language model-based method for grammatical error detection in detail. We first show that 5 to 10% of training data are enough for a BERT-based error detection method to achieve performance equivalent to a non-language model-based method can achieve with the full training data; recall improves much faster with respect to training data size in the BERT-ba…

    Submitted 27 August, 2021; originally announced August 2021.

  15. arXiv:2103.17060  [pdf, other]

    cs.IT math.ST stat.CO stat.ML

    $α$-Geodesical Skew Divergence

    Authors: Masanari Kimura, Hideitsu Hino

    Abstract: The asymmetric skew divergence smooths one of the distributions by mixing it, to a degree determined by the parameter $λ$, with the other distribution. Such divergence is an approximation of the KL divergence that does not require the target distribution to be absolutely continuous with respect to the source distribution. In this paper, an information geometric generalization of the skew divergenc…

    Submitted 25 April, 2021; v1 submitted 31 March, 2021; originally announced March 2021.

    Journal ref: Entropy. 2021; 23(5):528

  16. arXiv:2101.10229  [pdf, other]

    cs.LG cs.AI math.CA math.NA stat.ML

    Universal Approximation Properties for an ODENet and a ResNet: Mathematical Analysis and Numerical Experiments

    Authors: Yuto Aizawa, Masato Kimura, Kazunori Matsui

    Abstract: We prove a universal approximation property (UAP) for a class of ODENet and a class of ResNet, which are simplified mathematical models for deep learning systems with skip connections. The UAP can be stated as follows. Let $n$ and $m$ be the dimension of input and output data, and assume $m\leq n$. Then we show that ODENet of width $n+m$ with any non-polynomial continuous activation function can a…

    Submitted 17 May, 2023; v1 submitted 22 December, 2020; originally announced January 2021.

  17. arXiv:2007.03899  [pdf, other]

    cs.LG stat.ML

    Density Fixing: Simple yet Effective Regularization Method based on the Class Prior

    Authors: Masanari Kimura, Ryohei Izawa

    Abstract: Machine learning models suffer from overfitting, which is caused by a lack of labeled data. To tackle this problem, we proposed a framework of regularization methods, called density-fixing, that can be used commonly for supervised and semi-supervised learning. Our proposed regularization method improves the generalization performance by forcing the model to approximate the class's prior distributi…

    Submitted 6 September, 2020; v1 submitted 8 July, 2020; originally announced July 2020.

  18. arXiv:2006.06231  [pdf, other]

    stat.ML cs.LG

    Why Mixup Improves the Model Performance

    Authors: Masanari Kimura

    Abstract: Machine learning techniques are used in a wide range of domains. However, machine learning models often suffer from the problem of over-fitting. Many data augmentation methods have been proposed to tackle such a problem, and one of them is called mixup. Mixup is a recently proposed regularization procedure, which linearly interpolates a random pair of training examples. This regularization method…

    Submitted 17 June, 2021; v1 submitted 11 June, 2020; originally announced June 2020.

  19. arXiv:1912.02945  [pdf]

    cs.LG cs.MA cs.RO stat.ML

    A pedestrian path-planning model in accordance with obstacle's danger with reinforcement learning

    Authors: Thanh-Trung Trinh, Dinh-Minh Vu, Masaomi Kimura

    Abstract: Most microscopic pedestrian navigation models use the concept of "forces" applied to the pedestrian agents to replicate the navigation environment. While the approach could provide believable results in regular situations, it does not always resemble natural pedestrian navigation behaviour in many typical settings. In our research, we proposed a novel approach using reinforcement learning for simu…

    Submitted 5 December, 2019; originally announced December 2019.

  20. arXiv:1910.07129  [pdf, other]

    cs.CV eess.IV

    Large-Scale Landslides Detection from Satellite Images with Incomplete Labels

    Authors: Masanari Kimura

    Abstract: Earthquakes and tropical cyclones cause the suffering of millions of people around the world every year. The resulting landslides exacerbate the effects of these disasters. Landslide detection is, therefore, a critical task for the protection of human life and livelihood in mountainous areas. To tackle this problem, we propose a combination of satellite technology and Deep Neural Networks (DNNs).…

    Submitted 15 October, 2019; originally announced October 2019.

  21. arXiv:1909.07156  [pdf, other]

    cs.LG cs.AI stat.ML

    New Perspective of Interpretability of Deep Neural Networks

    Authors: Masanari Kimura, Masayuki Tanaka

    Abstract: Deep neural networks (DNNs) are known as black-box models. In other words, it is difficult to interpret the internal state of the model. Improving the interpretability of DNNs is one of the hot research topics. However, at present, the definition of interpretability for DNNs is vague, and the question of what is a highly explanatory model is still controversial. To address this issue, we provide t…

    Submitted 12 September, 2019; originally announced September 2019.

  22. arXiv:1907.02786  [pdf]

    cs.LG cs.SI

    Sequence to Sequence with Attention for Influenza Prevalence Prediction using Google Trends

    Authors: Kenjiro Kondo, Akihiko Ishikawa, Masashi Kimura

    Abstract: Early prediction of the prevalence of influenza reduces its impact. Various studies have been conducted to predict the number of influenza-infected people. However, these studies are not highly accurate especially in the distant future such as over one month. To deal with this problem, we investigate the sequence to sequence (Seq2Seq) with attention model using Google Trends data to assess and pre…

    Submitted 3 July, 2019; originally announced July 2019.

    Comments: 7 pages, ICCBB2019

  23. arXiv:1906.10822  [pdf, other]

    cs.LG stat.ML

    Gradient Noise Convolution (GNC): Smoothing Loss Function for Distributed Large-Batch SGD

    Authors: Kosuke Haruki, Taiji Suzuki, Yohei Hamakawa, Takeshi Toda, Ryuji Sakai, Masahiro Ozawa, Mitsuhiro Kimura

    Abstract: Large-batch stochastic gradient descent (SGD) is widely used for training in distributed deep learning because of its training-time efficiency, however, extremely large-batch SGD leads to poor generalization and easily converges to sharp minima, which prevents naive large-scale data-parallel SGD (DP-SGD) from converging to good minima. To overcome this difficulty, we propose gradient noise convolu…

    Submitted 25 June, 2019; originally announced June 2019.

    Comments: 19 pages, 11 figures, 7 tables

  24. arXiv:1905.10939  [pdf, other]

    cs.CV eess.IV

    PNUNet: Anomaly Detection using Positive-and-Negative Noise based on Self-Training Procedure

    Authors: Masanari Kimura

    Abstract: We propose the novel framework for anomaly detection in images. Our new framework, PNUNet, is based on many normal data and few anomalous data. We assume that some noises are added to the input images and learn to remove the noise. In addition, the proposed method achieves significant performance improvement by updating the noise assumed in the inputs using a self-training framework. The experimen…

    Submitted 26 May, 2019; originally announced May 2019.

  25. arXiv:1905.02719  [pdf, other]

    cs.CV cs.AI

    Intentional Attention Mask Transformation for Robust CNN Classification

    Authors: Masanari Kimura, Masayuki Tanaka

    Abstract: Convolutional Neural Networks have achieved impressive results in various tasks, but interpreting the internal mechanism is a challenging problem. To tackle this problem, we exploit a multi-channel attention mechanism in feature space. Our network architecture allows us to obtain an attention mask for each feature while existing CNN visualization methods provide only a common attention mask for al…

    Submitted 20 May, 2019; v1 submitted 7 May, 2019; originally announced May 2019.

    Comments: arXiv admin note: substantial text overlap with arXiv:1904.13078

  26. arXiv:1904.13078  [pdf, other]

    cs.CV cs.AI

    Interpretation of Feature Space using Multi-Channel Attentional Sub-Networks

    Authors: Masanari Kimura, Masayuki Tanaka

    Abstract: Convolutional Neural Networks have achieved impressive results in various tasks, but interpreting the internal mechanism is a challenging problem. To tackle this problem, we exploit a multi-channel attention mechanism in feature space. Our network architecture allows us to obtain an attention mask for each feature while existing CNN visualization methods provide only a common attention mask for al…

    Submitted 30 April, 2019; originally announced April 2019.

    Comments: CVPR2019 Workshop on Explainable AI

  27. arXiv:1807.01136  [pdf, other]

    cs.CV

    Anomaly Detection Using GANs for Visual Inspection in Noisy Training Data

    Authors: Masanari Kimura, Takashi Yanagihara

    Abstract: The detection and the quantification of anomalies in image data are critical tasks in industrial scenes such as detecting micro scratches on product. In recent years, due to the difficulty of defining anomalies and the limit of correcting their labels, research on unsupervised anomaly detection using generative models has attracted attention. Generally, in those studies, only normal images are use…

    Submitted 7 November, 2018; v1 submitted 3 July, 2018; originally announced July 2018.

  28. arXiv:1802.06368  [pdf, other]

    cs.LG cs.SI stat.ML

    Node Centralities and Classification Performance for Characterizing Node Embedding Algorithms

    Authors: Kento Nozawa, Masanari Kimura, Atsunori Kanemura

    Abstract: Embedding graph nodes into a vector space can allow the use of machine learning to e.g. predict node classes, but the study of node embedding algorithms is immature compared to the natural language processing field because of a diverse nature of graphs. We examine the performance of node embedding algorithms with respect to graph centrality measures that characterize diverse graphs, through system…

    Submitted 18 February, 2018; originally announced February 2018.

    Comments: Under review at ICLR 2018 workshop track

  29. arXiv:1704.06410  [pdf, other]

    cs.CV

    Solar Power Plant Detection on Multi-Spectral Satellite Imagery using Weakly-Supervised CNN with Feedback Features and m-PCNN Fusion

    Authors: Nevrez Imamoglu, Motoki Kimura, Hiroki Miyamoto, Aito Fujita, Ryosuke Nakamura

    Abstract: Most of the traditional convolutional neural networks (CNNs) implements bottom-up approach (feed-forward) for image classifications. However, many scientific studies demonstrate that visual perception in primates rely on both bottom-up and top-down connections. Therefore, in this work, we propose a CNN network with feedback structure for Solar power plant detection on middle-resolution satellite i…

    Submitted 21 June, 2017; v1 submitted 21 April, 2017; originally announced April 2017.

    Comments: 9 pages, 9 figures, 4 tables

    Journal ref: British Machine Vision Conference (BMVC) 2017

  30. arXiv:1204.4528  [pdf, ps, other]

    cs.SI physics.soc-ph

    Learning Asynchronous-Time Information Diffusion Models and its Application to Behavioral Data Analysis over Social Networks

    Authors: Kazumi Saito, Masahiro Kimura, Kouzou Ohara, Hiroshi Motoda

    Abstract: One of the interesting and important problems of information diffusion over a large social network is to identify an appropriate model from a limited amount of diffusion information. There are two contrasting approaches to model information diffusion: a push type model known as Independent Cascade (IC) model and a pull type model known as Linear Threshold (LT) model. We extend these two models (ca…

    Submitted 20 April, 2012; originally announced April 2012.

    Comments: 39 pages, 55 figures

  31. arXiv:1110.2659  [pdf, ps, other]

    cs.SI physics.soc-ph

    Efficient Detection of Hot Span in Information Diffusion from Observation

    Authors: Kouzou Ohara, Kazumi Saito, Masahiro Kimura, Hiroshi Motoda

    Abstract: We addressed the problem of detecting the change in behavior of information diffusion from a small amount of observation data, where the behavior changes were assumed to be effectively reflected in changes in the diffusion parameter value. The problem is to detect where in time and how long this change persisted and how big this change is. We solved this problem by searching the change pattern tha…

    Submitted 12 October, 2011; originally announced October 2011.

    Comments: 7 pages, 11 figures; IJCAI11 Workshop on Link Analysis in Heterogeneous Information Networks