REGAL: Transfer Learning For Fast Optimization of Computation Graphs

Paliwal, Aditya; Gimeno, Felix; Nair, Vinod; Li, Yujia; Lubin, Miles; Kohli, Pushmeet; Vinyals, Oriol

Computer Science > Machine Learning

arXiv:1905.02494v2 (cs)

[Submitted on 7 May 2019 (v1), revised 30 May 2019 (this version, v2), latest version 10 Feb 2020 (v4)]

Title:REGAL: Transfer Learning For Fast Optimization of Computation Graphs

Authors:Aditya Paliwal, Felix Gimeno, Vinod Nair, Yujia Li, Miles Lubin, Pushmeet Kohli, Oriol Vinyals

View PDF

Abstract:We present a deep reinforcement learning approach to optimizing the execution cost of computation graphs in a static compiler. The key idea is to combine a neural network policy with a genetic algorithm, the Biased Random-Key Genetic Algorithm (BRKGA). The policy is trained to predict, given an input graph to be optimized, the node-level probability distributions for sampling mutations and crossovers in BRKGA. Our approach, "REINFORCE-based Genetic Algorithm Learning" (REGAL), uses the policy's ability to transfer to new graphs to significantly improve the solution quality of the genetic algorithm for the same objective evaluation budget. As a concrete application, we show results for minimizing peak memory in TensorFlow graphs by jointly optimizing device placement and scheduling. REGAL achieves on average 3.56% lower peak memory than BRKGA on previously unseen graphs, outperforming all the algorithms we compare to, and giving 4.4x bigger improvement than the next best algorithm. We also evaluate REGAL on a production compiler team's performance benchmark of XLA graphs and achieve on average 3.74% lower peak memory than BRKGA, again outperforming all others. Our approach and analysis is made possible by collecting a dataset of 372 unique real-world TensorFlow graphs, more than an order of magnitude more data than previous work.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1905.02494 [cs.LG]
	(or arXiv:1905.02494v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1905.02494

Submission history

From: Miles Lubin [view email]
[v1] Tue, 7 May 2019 12:15:06 UTC (452 KB)
[v2] Thu, 30 May 2019 12:07:27 UTC (446 KB)
[v3] Thu, 26 Sep 2019 17:52:11 UTC (478 KB)
[v4] Mon, 10 Feb 2020 11:57:18 UTC (459 KB)

Computer Science > Machine Learning

Title:REGAL: Transfer Learning For Fast Optimization of Computation Graphs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:REGAL: Transfer Learning For Fast Optimization of Computation Graphs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators