Showing 1–48 of 48 results for author: Sato, R

Searching in archive cs.
  1. arXiv:2405.15010  [pdf, other]

    cs.LG math.OC

    Polyak Meets Parameter-free Clipped Gradient Descent

    Authors: Yuki Takezawa, Han Bao, Ryoma Sato, Kenta Niwa, Makoto Yamada

    Abstract: Gradient descent and its variants are de facto standard algorithms for training machine learning models. As gradient descent is sensitive to its hyperparameters, we need to tune the hyperparameters carefully using a grid search, but it is time-consuming, especially when multiple hyperparameters exist. Recently, parameter-free methods that adjust the hyperparameters on the fly have been studied. Ho…

    Submitted 23 May, 2024; originally announced May 2024.

  2. arXiv:2404.19288  [pdf, other]

    cs.LG cs.AI stat.ML

    Training-free Graph Neural Networks and the Power of Labels as Features

    Authors: Ryoma Sato

    Abstract: We propose training-free graph neural networks (TFGNNs), which can be used without training and can also be improved with optional training, for transductive node classification. We first advocate labels as features (LaF), which is an admissible but not explored technique. We show that LaF provably enhances the expressive power of graph neural networks. We design TFGNNs based on this analysis. In…

    Submitted 15 August, 2024; v1 submitted 30 April, 2024; originally announced April 2024.

    Comments: TMLR 2024

  3. arXiv:2404.11049  [pdf, other]

    cs.LG cs.AI cs.CL

    Stepwise Alignment for Constrained Language Model Policy Optimization

    Authors: Akifumi Wachi, Thien Q. Tran, Rei Sato, Takumi Tanabe, Youhei Akimoto

    Abstract: Safety and trustworthiness are indispensable requirements for real-world applications of AI systems using large language models (LLMs). This paper formulates human value alignment as an optimization problem of the language model policy to maximize reward under a safety constraint, and then proposes an algorithm, Stepwise Alignment for Constrained Policy Optimization (SACPO). One key idea behind SA…

    Submitted 22 May, 2024; v1 submitted 16 April, 2024; originally announced April 2024.

  4. arXiv:2403.15757  [pdf, other]

    cs.IR cs.AI cs.CL cs.CR cs.LG

    User-Side Realization

    Authors: Ryoma Sato

    Abstract: Users are dissatisfied with services. Since the service is not tailor-made for a user, it is natural for dissatisfaction to arise. The problem is that even if users are dissatisfied, they often do not have the means to resolve their dissatisfaction. The user cannot alter the source code of the service, nor can they force the service provider to change. The user has no choice but to remain dissati…

    Submitted 23 March, 2024; originally announced March 2024.

    Comments: Doctoral Thesis

  5. arXiv:2312.04068  [pdf, other]

    cs.CR cs.AI cs.CL cs.LG

    Making Translators Privacy-aware on the User's Side

    Authors: Ryoma Sato

    Abstract: We propose PRISM to enable users of machine translation systems to preserve the privacy of data on their own initiative. There is a growing demand to apply machine translation systems to data that require privacy protection. While several machine translation engines claim to prioritize privacy, the extent and specifics of such protection are largely ambiguous. First, there is often a lack of clari…

    Submitted 7 December, 2023; originally announced December 2023.

  6. arXiv:2312.00044  [pdf]

    cs.CY cs.AI econ.GN

    Advancing AI Audits for Enhanced AI Governance

    Authors: Arisa Ema, Ryo Sato, Tomoharu Hase, Masafumi Nakano, Shinji Kamimura, Hiromu Kitamura

    Abstract: As artificial intelligence (AI) is integrated into various services and systems in society, many companies and organizations have proposed AI principles, policies, and made the related commitments. Conversely, some have proposed the need for independent audits, arguing that the voluntary principles adopted by the developers and providers of AI services and systems insufficiently address risk. This…

    Submitted 26 November, 2023; originally announced December 2023.

  7. arXiv:2310.20430  [pdf, ps, other]

    cs.PL

    Borrowable Fractional Ownership Types for Verification

    Authors: Takashi Nakayama, Yusuke Matsushita, Ken Sakayori, Ryosuke Sato, Naoki Kobayashi

    Abstract: Automated verification of functional correctness of imperative programs with references (a.k.a. pointers) is challenging because of reference aliasing. Ownership types have recently been applied to address this issue, but the existing approaches were limited in that they are effective only for a class of programs whose reference usage follows a certain style. To relax the limitation, we combine th…

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: An extended version of the paper to appear in Proceedings of VMCAI 2024

  8. arXiv:2310.08920  [pdf, other]

    cs.LG cs.AI cs.CR

    Embarrassingly Simple Text Watermarks

    Authors: Ryoma Sato, Yuki Takezawa, Han Bao, Kenta Niwa, Makoto Yamada

    Abstract: We propose Easymark, a family of embarrassingly simple yet effective watermarks. Text watermarking is becoming increasingly important with the advent of Large Language Models (LLM). LLMs can generate texts that cannot be distinguished from human-written texts. This is a serious problem for the credibility of the text. Easymark is a simple yet effective solution to this problem. Easymark can inject…

    Submitted 13 October, 2023; originally announced October 2023.

  9. arXiv:2310.00833  [pdf, other]

    cs.CL cs.LG

    Necessary and Sufficient Watermark for Large Language Models

    Authors: Yuki Takezawa, Ryoma Sato, Han Bao, Kenta Niwa, Makoto Yamada

    Abstract: In recent years, large language models (LLMs) have achieved remarkable performances in various NLP tasks. They can generate texts that are indistinguishable from those written by humans. Such remarkable performance of LLMs increases their risk of being used for malicious purposes, such as generating fake news articles. Therefore, it is necessary to develop methods for distinguishing texts written…

    Submitted 1 October, 2023; originally announced October 2023.

  10. arXiv:2305.11420  [pdf, other]

    cs.LG cs.DC stat.ML

    Beyond Exponential Graph: Communication-Efficient Topologies for Decentralized Learning via Finite-time Convergence

    Authors: Yuki Takezawa, Ryoma Sato, Han Bao, Kenta Niwa, Makoto Yamada

    Abstract: Decentralized learning has recently been attracting increasing attention for its applications in parallel computation and privacy preservation. Many recent studies stated that the underlying network topology with a faster consensus rate (a.k.a. spectral gap) leads to a better convergence rate and accuracy for decentralized learning. However, a topology with a fast consensus rate, e.g., the exponen…

    Submitted 15 October, 2023; v1 submitted 19 May, 2023; originally announced May 2023.

    Comments: NeurIPS 2023

  11. arXiv:2303.00231  [pdf, ps, other]

    cs.GT

    Polyhedral Clinching Auctions for Indivisible Goods

    Authors: Hiroshi Hirai, Ryosuke Sato

    Abstract: In this study, we propose the polyhedral clinching auction for indivisible goods, which has so far been studied for divisible goods. As in the divisible setting by Goel et al. (2015), our mechanism enjoys incentive compatibility, individual rationality, and Pareto optimality, and works with polymatroidal environments. A notable feature for the indivisible setting is that the whole procedure can be…

    Submitted 18 October, 2023; v1 submitted 28 February, 2023; originally announced March 2023.

    MSC Class: 91B26

  12. arXiv:2302.03458  [pdf, ps, other]

    cs.GT

    Polyhedral Clinching Auctions with a Single Sample

    Authors: Ryosuke Sato

    Abstract: In this study, we investigate auctions in two-sided markets with budget constraints on buyers. Our goal is to propose an efficient mechanism that satisfies dominant strategy incentive compatibility (DSIC), individual rationality (IR), and weak budget balance (WBB). To avoid several known impossibility theorems for each of two-sided markets and budget constraints, we assume prior information on sel…

    Submitted 7 February, 2023; originally announced February 2023.

    MSC Class: 91B26

  13. Few-Shot Image-to-Semantics Translation for Policy Transfer in Reinforcement Learning

    Authors: Rei Sato, Kazuto Fukuchi, Jun Sakuma, Youhei Akimoto

    Abstract: We investigate policy transfer using image-to-semantics translation to mitigate learning difficulties in vision-based robotics control agents. This problem assumes two environments: a simulator environment with semantics, that is, low-dimensional and essential information, as the state space, and a real-world environment with images as the state space. By learning mapping from images to semantics,…

    Submitted 30 January, 2023; originally announced January 2023.

    Comments: The 2022 International Joint Conference on Neural Networks (IJCNN2022)

  14. arXiv:2301.10956  [pdf, other]

    cs.LG cs.SI

    Graph Neural Networks can Recover the Hidden Features Solely from the Graph Structure

    Authors: Ryoma Sato

    Abstract: Graph Neural Networks (GNNs) are popular models for graph learning problems. GNNs show strong empirical performance in many practical tasks. However, the theoretical properties have not been completely elucidated. In this paper, we investigate whether GNNs can exploit the graph structure from the perspective of the expressive power of GNNs. In our analysis, we consider graph generation processes t…

    Submitted 23 March, 2024; v1 submitted 26 January, 2023; originally announced January 2023.

    Comments: ICML 2023

  15. arXiv:2212.04984  [pdf, other]

    cs.LG cs.AI

    Transformer-based normative modelling for anomaly detection of early schizophrenia

    Authors: Pedro F Da Costa, Jessica Dafflon, Sergio Leonardo Mendes, João Ricardo Sato, M. Jorge Cardoso, Robert Leech, Emily JH Jones, Walter H. L. Pinaya

    Abstract: Despite the impact of psychiatric disorders on clinical health, early-stage diagnosis remains a challenge. Machine learning studies have shown that classifiers tend to be overly narrow in the diagnosis prediction task. The overlap between conditions leads to high heterogeneity among participants that is not adequately captured by classification models. To address this issue, normative approaches h…

    Submitted 8 December, 2022; originally announced December 2022.

    Comments: 10 pages, 2 figures, 2 tables, presented at NeurIPS22@PAI4MH

  16. arXiv:2211.03413  [pdf, other]

    cs.LG cs.AI

    Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification

    Authors: Takumi Tanabe, Rei Sato, Kazuto Fukuchi, Jun Sakuma, Youhei Akimoto

    Abstract: In the field of reinforcement learning, because of the high cost and risk of policy training in the real world, policies are trained in a simulation environment and transferred to the corresponding real-world environment. However, the simulation environment does not perfectly mimic the real-world environment, leading to model misspecification. Multiple studies report significant deterioration of poli…

    Submitted 11 January, 2023; v1 submitted 7 November, 2022; originally announced November 2022.

    Comments: Neural Information Processing Systems 2022 (NeurIPS '22)

    ACM Class: I.2.6

  17. arXiv:2210.08205  [pdf, other]

    cs.LG

    Active Learning from the Web

    Authors: Ryoma Sato

    Abstract: Labeling data is one of the most costly processes in machine learning pipelines. Active learning is a standard approach to alleviating this problem. Pool-based active learning first builds a pool of unlabelled data and iteratively selects data to be labeled so that the total number of required labels is minimized, keeping the model performance high. Many effective criteria for choosing data from t…

    Submitted 10 February, 2023; v1 submitted 15 October, 2022; originally announced October 2022.

    Comments: WWW 2023

  18. arXiv:2209.15505  [pdf, other]

    cs.LG

    Momentum Tracking: Momentum Acceleration for Decentralized Deep Learning on Heterogeneous Data

    Authors: Yuki Takezawa, Han Bao, Kenta Niwa, Ryoma Sato, Makoto Yamada

    Abstract: SGD with momentum is one of the key components for improving the performance of neural networks. For decentralized learning, a straightforward approach using momentum is Distributed SGD (DSGD) with momentum (DSGDm). However, DSGDm performs worse than DSGD when the data distributions are statistically heterogeneous. Recently, several studies have addressed this issue and proposed methods with momen…

    Submitted 24 September, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: Transactions on Machine Learning Research 2023

  19. arXiv:2208.09864  [pdf, other]

    cs.IR

    Towards Principled User-side Recommender Systems

    Authors: Ryoma Sato

    Abstract: Traditionally, recommendation algorithms have been designed for service developers. However, recently, a new paradigm called user-side recommender systems has been proposed and they enable web service users to construct their own recommender systems without access to trade-secret data. This approach opens the door to user-defined fair systems even if the official recommender system of the service…

    Submitted 21 August, 2022; originally announced August 2022.

    Comments: CIKM 2022

  20. arXiv:2208.09862  [pdf, other]

    cs.DL cs.LG

    Twin Papers: A Simple Framework of Causal Inference for Citations via Coupling

    Authors: Ryoma Sato, Makoto Yamada, Hisashi Kashima

    Abstract: The research process includes many decisions, e.g., how to entitle and where to publish the paper. In this paper, we introduce a general framework for investigating the effects of such decisions. The main difficulty in investigating the effects is that we need to know counterfactual results, which are not available in reality. The key insight of our framework is inspired by the existing counterfac…

    Submitted 21 August, 2022; originally announced August 2022.

    Comments: CIKM 2022 short paper

  21. arXiv:2206.12116  [pdf, other]

    stat.ML cs.AI cs.LG

    Approximating 1-Wasserstein Distance with Trees

    Authors: Makoto Yamada, Yuki Takezawa, Ryoma Sato, Han Bao, Zornitsa Kozareva, Sujith Ravi

    Abstract: Wasserstein distance, which measures the discrepancy between distributions, shows efficacy in various types of natural language processing (NLP) and computer vision (CV) applications. One of the challenges in estimating Wasserstein distance is that it is computationally expensive and does not scale well for many distribution comparison tasks. In this paper, we aim to approximate the 1-Wasserstein…

    Submitted 24 June, 2022; originally announced June 2022.

  22. arXiv:2206.08521  [pdf, other]

    cs.IR

    CLEAR: A Fully User-side Image Search System

    Authors: Ryoma Sato

    Abstract: We use many search engines on the Internet in our daily lives. However, they are not perfect. Their scoring function may not model our intent or they may accept only text queries even though we want to carry out a similar image search. In such cases, we need to make a compromise: We continue to use the unsatisfactory service or leave the service. Recently, a new solution, user-side search systems,…

    Submitted 16 June, 2022; originally announced June 2022.

  23. arXiv:2205.01954  [pdf, other]

    cs.CL cs.AI cs.LG

    Word Tour: One-dimensional Word Embeddings via the Traveling Salesman Problem

    Authors: Ryoma Sato

    Abstract: Word embeddings are one of the most fundamental technologies used in natural language processing. Existing word embeddings are high-dimensional and consume considerable computational resources. In this study, we propose WordTour, unsupervised one-dimensional word embeddings. To achieve the challenging goal, we propose a decomposition of the desiderata of word embeddings into two parts, completenes…

    Submitted 4 May, 2022; originally announced May 2022.

    Comments: NAACL 2022

  24. arXiv:2203.08402  [pdf, ps, other]

    cs.PL

    Gradual Tensor Shape Checking

    Authors: Momoko Hattori, Naoki Kobayashi, Ryosuke Sato

    Abstract: Tensor shape mismatch is a common source of bugs in deep learning programs. We propose a new type-based approach to detect tensor shape mismatches. One of the main features of our approach is the best-effort shape inference. As the tensor shape inference problem is undecidable in general, we allow static type/shape inference to be performed only in a best-effort manner. If the static inference can…

    Submitted 25 March, 2023; v1 submitted 16 March, 2022; originally announced March 2022.

    Comments: 48 pages

    ACM Class: D.2.4

  25. arXiv:2203.07601  [pdf, ps, other]

    cs.PL

    Automatic HFL(Z) Validity Checking for Program Verification

    Authors: Naoki Kobayashi, Kento Tanahashi, Ryosuke Sato, Takeshi Tsukada

    Abstract: We propose an automated method for checking the validity of a formula of HFL(Z), a higher-order logic with fixpoint operators and integers. Combined with Kobayashi et al.'s reduction from higher-order program verification to HFL(Z) validity checking, our method yields a fully automated, uniform verification method for arbitrary temporal properties of higher-order functional programs expressible in…

    Submitted 8 December, 2022; v1 submitted 14 March, 2022; originally announced March 2022.

    Comments: A long version of the paper published in Proceedings of POPL 2023

  26. arXiv:2112.14921  [pdf, other]

    cs.IR cs.AI cs.CV cs.LG

    Retrieving Black-box Optimal Images from External Databases

    Authors: Ryoma Sato

    Abstract: Suppose we have a black-box function (e.g., deep neural network) that takes an image as input and outputs a value that indicates preference. How can we retrieve optimal images with respect to this function from an external database on the Internet? Standard retrieval problems in the literature (e.g., item recommendations) assume that an algorithm has full access to the set of items. In other words…

    Submitted 29 December, 2021; originally announced December 2021.

    Comments: WSDM 2022

  27. arXiv:2109.03431  [pdf, other]

    cs.AI cs.LG

    Fixed Support Tree-Sliced Wasserstein Barycenter

    Authors: Yuki Takezawa, Ryoma Sato, Zornitsa Kozareva, Sujith Ravi, Makoto Yamada

    Abstract: The Wasserstein barycenter has been widely studied in various fields, including natural language processing, and computer vision. However, it requires a high computational cost to solve the Wasserstein barycenter problem because the computation of the Wasserstein distance requires a quadratic time with respect to the number of supports. By contrast, the Wasserstein distance on a tree, called the t…

    Submitted 11 February, 2022; v1 submitted 8 September, 2021; originally announced September 2021.

    Comments: AISTATS 2022

  28. arXiv:2109.00311  [pdf, ps, other]

    cs.PL

    Termination Analysis for the $π$-Calculus by Reduction to Sequential Program Termination

    Authors: Tsubasa Shoshi, Takuma Ishikawa, Naoki Kobayashi, Ken Sakayori, Ryosuke Sato, Takeshi Tsukada

    Abstract: We propose an automated method for proving termination of $π$-calculus processes, based on a reduction to termination of sequential programs: we translate a $π$-calculus process to a sequential program, so that the termination of the latter implies that of the former. We can then use an off-the-shelf termination verification tool to check termination of the sequential program. Our approach has bee…

    Submitted 1 September, 2021; originally announced September 2021.

    Comments: A shorter version will appear in Proceedings of APLAS 2021

  29. arXiv:2108.07642  [pdf, ps, other]

    cs.LO cs.PL

    Symbolic Automatic Relations and Their Applications to SMT and CHC Solving

    Authors: Takumi Shimoda, Naoki Kobayashi, Ken Sakayori, Ryosuke Sato

    Abstract: Despite the recent advance of automated program verification, reasoning about recursive data structures remains a challenge for verification tools and their backends such as SMT and CHC solvers. To address the challenge, we introduce the notion of symbolic automatic relations (SARs), which combines symbolic automata and automatic relations, and inherits their good properties such as the closure…

    Submitted 17 August, 2021; originally announced August 2021.

    Comments: A shorter version will appear in Proceedings of SAS 2021

  30. arXiv:2105.14423  [pdf, other]

    cs.IR cs.DS

    Enumerating Fair Packages for Group Recommendations

    Authors: Ryoma Sato

    Abstract: Package-to-group recommender systems recommend a set of unified items to a group of people. Different from conventional settings, it is not easy to measure the utility of group recommendations because it involves more than one user. In particular, fairness is crucial in group recommendations. Even if some members in a group are substantially satisfied with a recommendation, it is undesirable if ot…

    Submitted 27 December, 2021; v1 submitted 30 May, 2021; originally announced May 2021.

    Comments: WSDM 2022

  31. arXiv:2105.14403  [pdf, other]

    cs.LG cs.CL cs.IR

    Re-evaluating Word Mover's Distance

    Authors: Ryoma Sato, Makoto Yamada, Hisashi Kashima

    Abstract: The word mover's distance (WMD) is a fundamental technique for measuring the similarity of two documents. As the crux of WMD, it can take advantage of the underlying geometry of the word space by employing an optimal transport formulation. The original study on WMD reported that WMD outperforms classical baselines such as bag-of-words (BOW) and TF-IDF by significant margins in various datasets. In…

    Submitted 15 June, 2022; v1 submitted 29 May, 2021; originally announced May 2021.

    Comments: ICML 2022

  32. arXiv:2105.13954  [pdf, other]

    math.OC cs.LG

    A Gradient Method for Multilevel Optimization

    Authors: Ryo Sato, Mirai Tanaka, Akiko Takeda

    Abstract: Although application examples of multilevel optimization have already been discussed since the 1990s, the development of solution methods was almost limited to bilevel cases due to the difficulty of the problem. In recent years, in machine learning, Franceschi et al. have proposed a method for solving bilevel optimization problems by replacing their lower-level problems with the $T$ steepest desce…

    Submitted 26 October, 2021; v1 submitted 28 May, 2021; originally announced May 2021.

    Comments: NeurIPS 2021 camera-ready, 27 pages

  33. arXiv:2105.12353  [pdf, other]

    cs.IR

    Private Recommender Systems: How Can Users Build Their Own Fair Recommender Systems without Log Data?

    Authors: Ryoma Sato

    Abstract: Fairness is a crucial property in recommender systems. Although some online services have adopted fairness-aware systems recently, many other services have not adopted them yet. In this work, we propose methods to enable the users to build their own fair recommender systems. Our methods can generate fair recommendations even when the service does not (or cannot) provide fair recommender systems. T…

    Submitted 19 January, 2022; v1 submitted 26 May, 2021; originally announced May 2021.

    Comments: SDM 2022

  34. arXiv:2101.11520  [pdf, other]

    cs.LG stat.ML

    Supervised Tree-Wasserstein Distance

    Authors: Yuki Takezawa, Ryoma Sato, Makoto Yamada

    Abstract: To measure the similarity of documents, the Wasserstein distance is a powerful tool, but it requires a high computational cost. Recently, for fast computation of the Wasserstein distance, methods for approximating the Wasserstein distance using a tree metric have been proposed. These tree-based methods allow fast comparisons of a large number of documents; however, they are unsupervised and do not…

    Submitted 23 July, 2021; v1 submitted 27 January, 2021; originally announced January 2021.

  35. arXiv:2012.08053  [pdf, other]

    cs.SE

    A Quantitative Study of Security Bug Fixes of GitHub Repositories

    Authors: Daito Nakano, Mingyang Yin, Ryosuke Sato, Abram Hindle, Yasutaka Kamei, Naoyasu Ubayashi

    Abstract: Software is prone to bugs and failures. Security bugs are those that expose or share privileged information and access in violation of the software's requirements. Given the seriousness of security bugs, there are centralized mechanisms for supporting and tracking these bugs across multiple products, one such mechanism is the Common Vulnerabilities and Exposures (CVE) ID description. When a bug ge…

    Submitted 14 December, 2020; originally announced December 2020.

  36. arXiv:2012.06138  [pdf, other]

    cs.LG cs.AI cs.NE

    AdvantageNAS: Efficient Neural Architecture Search with Credit Assignment

    Authors: Rei Sato, Jun Sakuma, Youhei Akimoto

    Abstract: Neural architecture search (NAS) is an approach for automatically designing a neural network architecture without human effort or expert knowledge. However, the high computational cost of NAS limits its use in commercial applications. Two recent NAS paradigms, namely one-shot and sparse propagation, which reduce the time and space complexities, respectively, provide clues for solving this problem.…

    Submitted 9 March, 2021; v1 submitted 11 December, 2020; originally announced December 2020.

    Comments: The Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI-21)

  37. arXiv:2010.09157  [pdf, other]

    cs.DL cs.IR cs.LG stat.ML

    Poincare: Recommending Publication Venues via Treatment Effect Estimation

    Authors: Ryoma Sato, Makoto Yamada, Hisashi Kashima

    Abstract: Choosing a publication venue for an academic paper is a crucial step in the research process. However, in many cases, decisions are based solely on the experience of researchers, which often leads to suboptimal results. Although there exist venue recommender systems for academic papers, they recommend venues where the paper is expected to be published. In this study, we aim to recommend publicatio…

    Submitted 2 September, 2022; v1 submitted 18 October, 2020; originally announced October 2020.

    Comments: Journal of Informetrics

  38. arXiv:2006.02703  [pdf, ps, other]

    cs.LG stat.ML

    Fast Unbalanced Optimal Transport on a Tree

    Authors: Ryoma Sato, Makoto Yamada, Hisashi Kashima

    Abstract: This study examines the time complexities of the unbalanced optimal transport problems from an algorithmic perspective for the first time. We reveal which problems in unbalanced optimal transport can/cannot be solved efficiently. Specifically, we prove that the Kantorovich Rubinstein distance and optimal partial transport in the Euclidean metric cannot be computed in strongly subquadratic time und…

    Submitted 7 January, 2021; v1 submitted 4 June, 2020; originally announced June 2020.

    Comments: Accepted to NeurIPS 2020

  39. arXiv:2005.12123  [pdf, other]

    stat.ML cs.LG

    Feature Robust Optimal Transport for High-dimensional Data

    Authors: Mathis Petrovich, Chao Liang, Ryoma Sato, Yanbin Liu, Yao-Hung Hubert Tsai, Linchao Zhu, Yi Yang, Ruslan Salakhutdinov, Makoto Yamada

    Abstract: Optimal transport is a machine learning problem with applications including distribution comparison, feature selection, and generative adversarial networks. In this paper, we propose feature-robust optimal transport (FROT) for high-dimensional data, which solves high-dimensional OT problems using feature selection to avoid the curse of dimensionality. Specifically, we find a transport plan with di…

    Submitted 29 September, 2020; v1 submitted 25 May, 2020; originally announced May 2020.

  40. arXiv:2003.04078  [pdf, ps, other]

    cs.LG stat.ML

    A Survey on The Expressive Power of Graph Neural Networks

    Authors: Ryoma Sato

    Abstract: Graph neural networks (GNNs) are effective machine learning models for various graph learning problems. Despite their empirical successes, the theoretical limitations of GNNs have been revealed recently. Consequently, many GNN models have been proposed to overcome these limitations. In this survey, we provide a comprehensive overview of the expressive power of GNNs and provably powerful variants o…

    Submitted 16 October, 2020; v1 submitted 9 March, 2020; originally announced March 2020.

    Comments: 42 pages

  41. arXiv:2002.03155  [pdf, ps, other]

    cs.LG stat.ML

    Random Features Strengthen Graph Neural Networks

    Authors: Ryoma Sato, Makoto Yamada, Hisashi Kashima

    Abstract: Graph neural networks (GNNs) are powerful machine learning models for various graph learning tasks. Recently, the limitations of the expressive power of various GNN models have been revealed. For example, GNNs cannot distinguish some non-isomorphic graphs and they cannot learn efficient graph algorithms. In this paper, we demonstrate that GNNs become powerful just by adding a random feature to eac…

    Submitted 18 January, 2021; v1 submitted 8 February, 2020; originally announced February 2020.

    Comments: Accepted to SDM 2021

  42. arXiv:2002.01615  [pdf, ps, other]

    stat.ML cs.LG

    Fast and Robust Comparison of Probability Measures in Heterogeneous Spaces

    Authors: Ryoma Sato, Marco Cuturi, Makoto Yamada, Hisashi Kashima

    Abstract: Comparing two probability measures supported on heterogeneous spaces is an increasingly important problem in machine learning. Such problems arise when comparing for instance two populations of biological cells, each described with its own set of features, or when looking at families of word embeddings trained across different corpora/languages. For such settings, the Gromov Wasserstein (GW) dista…

    Submitted 10 February, 2021; v1 submitted 4 February, 2020; originally announced February 2020.

  43. arXiv:1910.06155  [pdf]

    cs.OH stat.AP

    GeoSES -- um Índice Socioeconômico para Estudos de Saúde no Brasil

    Authors: Ligia Vizeu Barrozo, Michel Fornaciali, Carmen Diva Saldiva de André, Guilherme Augusto Zimeo Morais, Giselle Mansur, William Cabral-Miranda, João Ricardo Sato, Edson Amaro Júnior

    Abstract: Objective: to define an index that summarizes the main dimensions of the socioeconomic context for research purposes, evaluation and monitoring health inequalities. Methods: the index was created from the 2010 Brazilian Demographic Census, whose variables selection was guided by theoretical references for health studies, including seven socioeconomic dimensions: education, mobility, poverty, wealt…

    Submitted 9 October, 2019; originally announced October 2019.

    Comments: in Portuguese

  44. arXiv:1905.10261  [pdf, ps, other]

    cs.LG stat.ML

    Approximation Ratios of Graph Neural Networks for Combinatorial Problems

    Authors: Ryoma Sato, Makoto Yamada, Hisashi Kashima

    Abstract: In this paper, from a theoretical perspective, we study how powerful graph neural networks (GNNs) can be for learning approximation algorithms for combinatorial problems. To this end, we first establish a new class of GNNs that can solve a strictly wider variety of problems than existing GNNs. Then, we bridge the gap between GNN theory and the theory of distributed local algorithms. We theoretical…

    Submitted 8 November, 2019; v1 submitted 24 May, 2019; originally announced May 2019.

    Comments: Accepted to NeurIPS 2019

  45. arXiv:1902.09700  [pdf, ps, other]

    cs.LG stat.ML

    Learning to Sample Hard Instances for Graph Algorithms

    Authors: Ryoma Sato, Makoto Yamada, Hisashi Kashima

    Abstract: Hard instances, which require a long time for a specific algorithm to solve, help (1) analyze the algorithm for accelerating it and (2) build a good benchmark for evaluating the performance of algorithms. There exist several efforts for automatic generation of hard instances. For example, evolutionary algorithms have been utilized to generate hard instances. However, they generate only finite numb…

    Submitted 3 October, 2019; v1 submitted 25 February, 2019; originally announced February 2019.

    Comments: 16 pages, 4 figures, accepted by ACML 2019

  46. arXiv:1901.07868  [pdf, ps, other]

    cs.LG stat.ML

    Constant Time Graph Neural Networks

    Authors: Ryoma Sato, Makoto Yamada, Hisashi Kashima

    Abstract: The recent advancements in graph neural networks (GNNs) have led to state-of-the-art performances in various applications, including chemo-informatics, question-answering systems, and recommender systems. However, scaling up these methods to huge graphs, such as social networks and Web graphs, remains a challenge. In particular, the existing methods for accelerating GNNs either are not theoretical…

    Submitted 29 March, 2022; v1 submitted 23 January, 2019; originally announced January 2019.

    Comments: TKDD 2022

    Journal ref: ACM Trans. Knowl. Discov. Data. 16, 5, Article 92 (March 2022)

  47. arXiv:1708.04881  [pdf, other]

    cs.GT math.OC

    Polyhedral Clinching Auctions for Two-sided Markets

    Authors: Hiroshi Hirai, Ryosuke Sato

    Abstract: In this paper, we present a new model and two mechanisms for auctions in two-sided markets of buyers and sellers, where budget constraints are imposed on buyers. Our model incorporates polymatroidal environments, and is applicable to a wide variety of models that include multiunit auctions, matching markets and reservation exchange markets. Our mechanisms are built on polymatroidal network flow mo…

    Submitted 13 September, 2018; v1 submitted 14 August, 2017; originally announced August 2017.

    MSC Class: 91B26; 91-08

  48. arXiv:1208.2976  [pdf, other]

    stat.ME cs.SI physics.soc-ph q-bio.QM

    Discriminating different classes of biological networks by analyzing the graphs spectra distribution

    Authors: Daniel Yasumasa Takahashi, João Ricardo Sato, Carlos Eduardo Ferreira, André Fujita

    Abstract: The brain's structural and functional systems, protein-protein interaction, and gene networks are examples of biological systems that share some features of complex networks, such as highly connected nodes, modularity, and small-world topology. Recent studies indicate that some pathologies present topological network alterations relative to norms seen in the general population. Therefore, methods…

    Submitted 14 August, 2012; originally announced August 2012.