GDP: Generalized Device Placement for Dataflow Graphs

Zhou, Yanqi; Roy, Sudip; Abdolrashidi, Amirali; Wong, Daniel; Ma, Peter C.; Xu, Qiumin; Zhong, Ming; Liu, Hanxiao; Goldie, Anna; Mirhoseini, Azalia; Laudon, James

Computer Science > Machine Learning

arXiv:1910.01578 (cs)

[Submitted on 28 Sep 2019]

Title:GDP: Generalized Device Placement for Dataflow Graphs

Authors:Yanqi Zhou, Sudip Roy, Amirali Abdolrashidi, Daniel Wong, Peter C. Ma, Qiumin Xu, Ming Zhong, Hanxiao Liu, Anna Goldie, Azalia Mirhoseini, James Laudon

View PDF

Abstract:Runtime and scalability of large neural networks can be significantly affected by the placement of operations in their dataflow graphs on suitable devices. With increasingly complex neural network architectures and heterogeneous device characteristics, finding a reasonable placement is extremely challenging even for domain experts. Most existing automated device placement approaches are impractical due to the significant amount of compute required and their inability to generalize to new, previously held-out graphs. To address both limitations, we propose an efficient end-to-end method based on a scalable sequential attention mechanism over a graph neural network that is transferable to new graphs. On a diverse set of representative deep learning models, including Inception-v3, AmoebaNet, Transformer-XL, and WaveNet, our method on average achieves 16% improvement over human experts and 9.2% improvement over the prior art with 15 times faster convergence. To further reduce the computation cost, we pre-train the policy network on a set of dataflow graphs and use a superposition network to fine-tune it on each individual graph, achieving state-of-the-art performance on large hold-out graphs with over 50k nodes, such as an 8-layer GNMT.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:1910.01578 [cs.LG]
	(or arXiv:1910.01578v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.1910.01578

Submission history

From: Yanqi Zhou [view email]
[v1] Sat, 28 Sep 2019 04:13:57 UTC (139 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.LG

< prev | next >

new | recent | 2019-10

Change to browse by:

cs
stat
stat.ML

References & Citations

DBLP - CS Bibliography

listing | bibtex

Yanqi Zhou
Sudip Roy
AmirAli Abdolrashidi
Ming Zhong
Hanxiao Liu

…

export BibTeX citation

Computer Science > Machine Learning

Title:GDP: Generalized Device Placement for Dataflow Graphs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:GDP: Generalized Device Placement for Dataflow Graphs

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators