Zum Hauptinhalt springen

Showing 1–1 of 1 results for author: Woolley, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:1410.0759  [pdf, other

    cs.NE cs.LG cs.MS

    cuDNN: Efficient Primitives for Deep Learning

    Authors: Sharan Chetlur, Cliff Woolley, Philippe Vandermersch, Jonathan Cohen, John Tran, Bryan Catanzaro, Evan Shelhamer

    Abstract: We present a library of efficient implementations of deep learning primitives. Deep learning workloads are computationally intensive, and optimizing their kernels is difficult and time-consuming. As parallel architectures evolve, kernels must be reoptimized, which makes maintaining codebases difficult over time. Similar issues have long been addressed in the HPC community by libraries such as the… ▽ More

    Submitted 17 December, 2014; v1 submitted 3 October, 2014; originally announced October 2014.