Zum Hauptinhalt springen

Showing 1–8 of 8 results for author: Kool, W

Searching in archive cs. Search in all archives.
.
  1. PyVRP: a high-performance VRP solver package

    Authors: Niels A. Wouda, Leon Lan, Wouter Kool

    Abstract: We introduce PyVRP, a Python package that implements hybrid genetic search in a state-of-the-art vehicle routing problem (VRP) solver. The package is designed for the VRP with time windows (VRPTW), but can be easily extended to support other VRP variants. PyVRP combines the flexibility of Python with the performance of C++, by implementing (only) performance critical parts of the algorithm in C++,… ▽ More

    Submitted 21 March, 2024; v1 submitted 22 November, 2023; originally announced March 2024.

    Comments: Pre-print of accepted paper in INFORMS Journal on Computing. 24 pages, 1 figure, 2 listings

  2. arXiv:2306.17100  [pdf, other

    cs.LG cs.AI

    RL4CO: an Extensive Reinforcement Learning for Combinatorial Optimization Benchmark

    Authors: Federico Berto, Chuanbo Hua, Junyoung Park, Laurin Luttmann, Yining Ma, Fanchen Bu, Jiarui Wang, Haoran Ye, Minsu Kim, Sanghyeok Choi, Nayeli Gast Zepeda, André Hottung, Jianan Zhou, Jieyi Bi, Yu Hu, Fei Liu, Hyeonah Kim, Jiwoo Son, Haeyeon Kim, Davide Angioni, Wouter Kool, Zhiguang Cao, Qingfu Zhang, Joungho Kim, Jie Zhang , et al. (8 additional authors not shown)

    Abstract: Deep reinforcement learning (RL) has recently shown significant benefits in solving combinatorial optimization (CO) problems, reducing reliance on domain expertise, and improving computational efficiency. However, the field lacks a unified benchmark for easy development and standardized comparison of algorithms across diverse CO problems. To fill this gap, we introduce RL4CO, a unified and extensi… ▽ More

    Submitted 21 June, 2024; v1 submitted 29 June, 2023; originally announced June 2023.

    Comments: A previous version was presented as a workshop paper at the NeurIPS 2023 GLFrontiers Workshop (Oral)

  3. arXiv:2110.01515  [pdf, other

    cs.LG stat.ML

    A Review of the Gumbel-max Trick and its Extensions for Discrete Stochasticity in Machine Learning

    Authors: Iris A. M. Huijben, Wouter Kool, Max B. Paulus, Ruud J. G. van Sloun

    Abstract: The Gumbel-max trick is a method to draw a sample from a categorical distribution, given by its unnormalized (log-)probabilities. Over the past years, the machine learning community has proposed several extensions of this trick to facilitate, e.g., drawing multiple samples, sampling from structured domains, or gradient estimation for error backpropagation in neural network optimization. The goal o… ▽ More

    Submitted 8 March, 2022; v1 submitted 4 October, 2021; originally announced October 2021.

    Comments: Accepted as a survey article in IEEE TPAMI

  4. arXiv:2109.11817  [pdf, other

    cs.LG stat.ML

    Unbiased Gradient Estimation with Balanced Assignments for Mixtures of Experts

    Authors: Wouter Kool, Chris J. Maddison, Andriy Mnih

    Abstract: Training large-scale mixture of experts models efficiently on modern hardware requires assigning datapoints in a batch to different experts, each with a limited capacity. Recently proposed assignment procedures lack a probabilistic interpretation and use biased estimators for training. As an alternative, we propose two unbiased estimators based on principled stochastic assignment procedures: one t… ▽ More

    Submitted 8 December, 2021; v1 submitted 24 September, 2021; originally announced September 2021.

    Comments: I (Still) Can't Believe It's Not Better Workshop at NeurIPS 2021

  5. arXiv:2102.11756  [pdf, other

    cs.LG stat.ML

    Deep Policy Dynamic Programming for Vehicle Routing Problems

    Authors: Wouter Kool, Herke van Hoof, Joaquim Gromicho, Max Welling

    Abstract: Routing problems are a class of combinatorial problems with many practical applications. Recently, end-to-end deep learning methods have been proposed to learn approximate solution heuristics for such problems. In contrast, classical dynamic programming (DP) algorithms guarantee optimal solutions, but scale badly with the problem size. We propose Deep Policy Dynamic Programming (DPDP), which aims… ▽ More

    Submitted 2 December, 2021; v1 submitted 23 February, 2021; originally announced February 2021.

    Comments: 21 pages

  6. arXiv:2002.06043  [pdf, other

    cs.LG stat.ML

    Estimating Gradients for Discrete Random Variables by Sampling without Replacement

    Authors: Wouter Kool, Herke van Hoof, Max Welling

    Abstract: We derive an unbiased estimator for expectations over discrete random variables based on sampling without replacement, which reduces variance as it avoids duplicate samples. We show that our estimator can be derived as the Rao-Blackwellization of three different estimators. Combining our estimator with REINFORCE, we obtain a policy gradient estimator and we reduce its variance using a built-in con… ▽ More

    Submitted 14 February, 2020; originally announced February 2020.

    Comments: ICLR 2020

  7. arXiv:1903.06059  [pdf, other

    cs.LG stat.ML

    Stochastic Beams and Where to Find Them: The Gumbel-Top-k Trick for Sampling Sequences Without Replacement

    Authors: Wouter Kool, Herke van Hoof, Max Welling

    Abstract: The well-known Gumbel-Max trick for sampling from a categorical distribution can be extended to sample $k$ elements without replacement. We show how to implicitly apply this 'Gumbel-Top-$k$' trick on a factorized distribution over sequences, allowing to draw exact samples without replacement using a Stochastic Beam Search. Even for exponentially large domains, the number of model evaluations grows… ▽ More

    Submitted 29 May, 2019; v1 submitted 14 March, 2019; originally announced March 2019.

    Comments: ICML 2019 ; 13 pages, 4 figures

  8. arXiv:1803.08475  [pdf, other

    stat.ML cs.LG

    Attention, Learn to Solve Routing Problems!

    Authors: Wouter Kool, Herke van Hoof, Max Welling

    Abstract: The recently presented idea to learn heuristics for combinatorial optimization problems is promising as it can save costly development. However, to push this idea towards practical implementation, we need better models and better ways of training. We contribute in both directions: we propose a model based on attention layers with benefits over the Pointer Network and we show how to train this mode… ▽ More

    Submitted 7 February, 2019; v1 submitted 22 March, 2018; originally announced March 2018.

    Comments: Accepted at ICLR 2019. 25 pages, 7 figures