Zum Hauptinhalt springen

Showing 1–16 of 16 results for author: Imakura, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2404.14164  [pdf, other

    cs.LG cs.DC

    New Solutions Based on the Generalized Eigenvalue Problem for the Data Collaboration Analysis

    Authors: Yuta Kawakami, Yuichi Takano, Akira Imakura

    Abstract: In recent years, the accumulation of data across various institutions has garnered attention for the technology of confidential data analysis, which improves analytical accuracy by sharing data between multiple institutions while protecting sensitive information. Among these methods, Data Collaboration Analysis (DCA) is noted for its efficiency in terms of computational cost and communication load… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 16 pages, 9 figures, preprint

    MSC Class: 15A18 ACM Class: C.2.4

  2. arXiv:2402.02672  [pdf, other

    stat.ME cs.CR cs.LG

    Estimation of conditional average treatment effects on distributed data: A privacy-preserving approach

    Authors: Yuji Kawamata, Ryoki Motai, Yukihiko Okada, Akira Imakura, Tetsuya Sakurai

    Abstract: Estimation of conditional average treatment effects (CATEs) is an important topic in sciences. CATEs can be estimated with high accuracy if distributed data across multiple parties can be centralized. However, it is difficult to aggregate such data owing to privacy concerns. To address this issue, we proposed data collaboration double machine learning, a method that can estimate CATE models with p… ▽ More

    Submitted 25 May, 2024; v1 submitted 4 February, 2024; originally announced February 2024.

    Comments: 51 pages, 11 figures

  3. arXiv:2308.00280  [pdf, other

    cs.LG

    Data Collaboration Analysis applied to Compound Datasets and the Introduction of Projection data to Non-IID settings

    Authors: Akihiro Mizoguchi, Anna Bogdanova, Akira Imakura, Tetsuya Sakurai

    Abstract: Given the time and expense associated with bringing a drug to market, numerous studies have been conducted to predict the properties of compounds based on their structure using machine learning. Federated learning has been applied to compound datasets to increase their prediction accuracy while safeguarding potentially proprietary information. However, federated learning is encumbered by low accur… ▽ More

    Submitted 1 August, 2023; originally announced August 2023.

  4. arXiv:2212.03373  [pdf, other

    cs.LG cs.AI

    Achieving Transparency in Distributed Machine Learning with Explainable Data Collaboration

    Authors: Anna Bogdanova, Akira Imakura, Tetsuya Sakurai, Tomoya Fujii, Teppei Sakamoto, Hiroyuki Abe

    Abstract: Transparency of Machine Learning models used for decision support in various industries becomes essential for ensuring their ethical use. To that end, feature attribution methods such as SHAP (SHapley Additive exPlanations) are widely used to explain the predictions of black-box machine learning models to customers and developers. However, a parallel trend has been to train machine learning models… ▽ More

    Submitted 6 December, 2022; originally announced December 2022.

    Comments: Presented at PKAW 2022 (arXiv:2211.03888) Report-no: PKAW/2022/03

    Report number: Report-no: PKAW/2022/03

  5. arXiv:2208.14611  [pdf, other

    cs.LG cs.CR

    Non-readily identifiable data collaboration analysis for multiple datasets including personal information

    Authors: Akira Imakura, Tetsuya Sakurai, Yukihiko Okada, Tomoya Fujii, Teppei Sakamoto, Hiroyuki Abe

    Abstract: Multi-source data fusion, in which multiple data sources are jointly analyzed to obtain improved information, has considerable research attention. For the datasets of multiple medical institutions, data confidentiality and cross-institutional communication are critical. In such cases, data collaboration (DC) analysis by sharing dimensionality-reduced intermediate representations without iterative… ▽ More

    Submitted 30 August, 2022; originally announced August 2022.

    Comments: 19 pages, 3 figures, 7 tables

  6. arXiv:2208.12458  [pdf, other

    cs.LG

    Another Use of SMOTE for Interpretable Data Collaboration Analysis

    Authors: Akira Imakura, Masateru Kihira, Yukihiko Okada, Tetsuya Sakurai

    Abstract: Recently, data collaboration (DC) analysis has been developed for privacy-preserving integrated analysis across multiple institutions. DC analysis centralizes individually constructed dimensionality-reduced intermediate representations and realizes integrated analysis via collaboration representations without sharing the original data. To construct the collaboration representations, each instituti… ▽ More

    Submitted 26 August, 2022; originally announced August 2022.

    Comments: 19 pages, 3 figures, 7 tables

  7. Collaborative causal inference on distributed data

    Authors: Yuji Kawamata, Ryoki Motai, Yukihiko Okada, Akira Imakura, Tetsuya Sakurai

    Abstract: In recent years, the development of technologies for causal inference with privacy preservation of distributed data has gained considerable attention. Many existing methods for distributed data focus on resolving the lack of subjects (samples) and can only reduce random errors in estimating treatment effects. In this study, we propose a data collaboration quasi-experiment (DC-QE) that resolves the… ▽ More

    Submitted 11 January, 2024; v1 submitted 16 August, 2022; originally announced August 2022.

    Comments: 16 pages, 4 figures

    Journal ref: Expert Systems with Applications, 123024 (2023)

  8. arXiv:2203.14188  [pdf, ps, other

    cs.LG cs.CY cs.DC

    mdx: A Cloud Platform for Supporting Data Science and Cross-Disciplinary Research Collaborations

    Authors: Toyotaro Suzumura, Akiyoshi Sugiki, Hiroyuki Takizawa, Akira Imakura, Hiroshi Nakamura, Kenjiro Taura, Tomohiro Kudoh, Toshihiro Hanawa, Yuji Sekiya, Hiroki Kobayashi, Shin Matsushima, Yohei Kuga, Ryo Nakamura, Renhe Jiang, Junya Kawase, Masatoshi Hanai, Hiroshi Miyazaki, Tsutomu Ishizaki, Daisuke Shimotoku, Daisuke Miyamoto, Kento Aida, Atsuko Takefusa, Takashi Kurimoto, Koji Sasayama, Naoya Kitagawa , et al. (8 additional authors not shown)

    Abstract: The growing amount of data and advances in data science have created a need for a new kind of cloud platform that provides users with flexibility, strong security, and the ability to couple with supercomputers and edge devices through high-performance networks. We have built such a nation-wide cloud platform, called "mdx" to meet this need. The mdx platform's virtualization service, jointly operat… ▽ More

    Submitted 26 March, 2022; originally announced March 2022.

  9. arXiv:2106.09852  [pdf, other

    cs.LG

    LSEC: Large-scale spectral ensemble clustering

    Authors: Hongmin Li, Xiucai Ye, Akira Imakura, Tetsuya Sakurai

    Abstract: Ensemble clustering is a fundamental problem in the machine learning field, combining multiple base clusterings into a better clustering result. However, most of the existing methods are unsuitable for large-scale ensemble clustering tasks due to the efficiency bottleneck. In this paper, we propose a large-scale spectral ensemble clustering (LSEC) method to strike a good balance between efficiency… ▽ More

    Submitted 17 June, 2021; originally announced June 2021.

    Comments: 22 pages

  10. Divide-and-conquer based Large-Scale Spectral Clustering

    Authors: Hongmin Li, Xiucai Ye, Akira Imakura, Tetsuya Sakurai

    Abstract: Spectral clustering is one of the most popular clustering methods. However, how to balance the efficiency and effectiveness of the large-scale spectral clustering with limited computing resources has not been properly solved for a long time. In this paper, we propose a divide-and-conquer based large-scale spectral clustering method to strike a good balance between efficiency and effectiveness. In… ▽ More

    Submitted 22 April, 2022; v1 submitted 30 April, 2021; originally announced April 2021.

    Comments: 14 pages, 6 figures, 10 tables

    Journal ref: Neurocomputing Volume 501, 28 August 2022, Pages 664-678

  11. arXiv:2101.11144  [pdf, other

    cs.LG cs.CR

    Accuracy and Privacy Evaluations of Collaborative Data Analysis

    Authors: Akira Imakura, Anna Bogdanova, Takaya Yamazoe, Kazumasa Omote, Tetsuya Sakurai

    Abstract: Distributed data analysis without revealing the individual data has recently attracted significant attention in several applications. A collaborative data analysis through sharing dimensionality reduced representations of data has been proposed as a non-model sharing-type federated learning. This paper analyzes the accuracy and privacy evaluations of this novel framework. In the accuracy analysis,… ▽ More

    Submitted 26 January, 2021; originally announced January 2021.

    Comments: 16 pages; 2 figures; 1 table

    Journal ref: To be presented at The Second AAAI Workshop on Privacy-Preserving Artificial Intelligence (PPAI-21) (2021)

  12. arXiv:2011.06803  [pdf, other

    cs.LG

    Federated Learning System without Model Sharing through Integration of Dimensional Reduced Data Representations

    Authors: Anna Bogdanova, Akie Nakai, Yukihiko Okada, Akira Imakura, Tetsuya Sakurai

    Abstract: Dimensionality Reduction is a commonly used element in a machine learning pipeline that helps to extract important features from high-dimensional data. In this work, we explore an alternative federated learning system that enables integration of dimensionality reduced representations of distributed data prior to a supervised learning task, thus avoiding model sharing among the parties. We compare… ▽ More

    Submitted 13 November, 2020; originally announced November 2020.

    Comments: 6 pages with 4 figures. To be presented at the Workshop on Federated Learning for Data Privacy and Confidentiality in Conjunction with IJCAI 2020 (FL-IJCAI'20)

  13. arXiv:2011.04437  [pdf, other

    cs.LG

    Interpretable collaborative data analysis on distributed data

    Authors: Akira Imakura, Hiroaki Inaba, Yukihiko Okada, Tetsuya Sakurai

    Abstract: This paper proposes an interpretable non-model sharing collaborative data analysis method as one of the federated learning systems, which is an emerging technology to analyze distributed data. Analyzing distributed data is essential in many applications such as medical, financial, and manufacturing data analyses due to privacy, and confidentiality concerns. In addition, interpretability of the obt… ▽ More

    Submitted 9 November, 2020; originally announced November 2020.

    Comments: 16 pages, 3 figures, 3 tables

  14. arXiv:1910.07174  [pdf, other

    cs.LG stat.ML

    Multiclass spectral feature scaling method for dimensionality reduction

    Authors: Momo Matsuda, Keiichi Morikuni, Akira Imakura, Xiucai Ye, Tetsuya Sakurai

    Abstract: Irregular features disrupt the desired classification. In this paper, we consider aggressively modifying scales of features in the original space according to the label information to form well-separated clusters in low-dimensional space. The proposed method exploits spectral clustering to derive scaling factors that are used to modify the features. Specifically, we reformulate the Laplacian eigen… ▽ More

    Submitted 16 October, 2019; originally announced October 2019.

  15. arXiv:1902.07535  [pdf, ps, other

    cs.LG stat.ML

    Data collaboration analysis for distributed datasets

    Authors: Akira Imakura, Tetsuya Sakurai

    Abstract: In this paper, we propose a data collaboration analysis method for distributed datasets. The proposed method is a centralized machine learning while training datasets and models remain distributed over some institutions. Recently, data became large and distributed with decreasing costs of data collection. If we can centralize these distributed datasets and analyse them as one dataset, we expect to… ▽ More

    Submitted 20 February, 2019; originally announced February 2019.

    Comments: 7 pages

  16. arXiv:1605.04639  [pdf, ps, other

    cs.LG cs.NE stat.ML

    Alternating optimization method based on nonnegative matrix factorizations for deep neural networks

    Authors: Tetsuya Sakurai, Akira Imakura, Yuto Inoue, Yasunori Futamura

    Abstract: The backpropagation algorithm for calculating gradients has been widely used in computation of weights for deep neural networks (DNNs). This method requires derivatives of objective functions and has some difficulties finding appropriate parameters such as learning rate. In this paper, we propose a novel approach for computing weight matrices of fully-connected DNNs by using two types of semi-nonn… ▽ More

    Submitted 15 May, 2016; originally announced May 2016.

    Comments: 9 pages, 2 figures