Zum Hauptinhalt springen

Showing 1–11 of 11 results for author: Moritz, P

Searching in archive cs. Search in all archives.
.
  1. arXiv:2107.06008  [pdf, other

    stat.ML cs.LG

    Wasserstein GAN: Deep Generation applied on Bitcoins financial time series

    Authors: Rikli Samuel, Bigler Daniel Nico, Pfenninger Moritz, Osterrieder Joerg

    Abstract: Modeling financial time series is challenging due to their high volatility and unexpected happenings on the market. Most financial models and algorithms trying to fill the lack of historical financial time series struggle to perform and are highly vulnerable to overfitting. As an alternative, we introduce in this paper a deep neural network called the WGAN-GP, a data-driven model that focuses on s… ▽ More

    Submitted 13 July, 2021; originally announced July 2021.

  2. arXiv:2002.05814  [pdf, other

    cs.DC cs.LG cs.NI

    Hoplite: Efficient and Fault-Tolerant Collective Communication for Task-Based Distributed Systems

    Authors: Siyuan Zhuang, Zhuohan Li, Danyang Zhuo, Stephanie Wang, Eric Liang, Robert Nishihara, Philipp Moritz, Ion Stoica

    Abstract: Task-based distributed frameworks (e.g., Ray, Dask, Hydro) have become increasingly popular for distributed applications that contain asynchronous and dynamic workloads, including asynchronous gradient descent, reinforcement learning, and model serving. As more data-intensive applications move to run on top of task-based systems, collective communication efficiency has become an important problem.… ▽ More

    Submitted 28 September, 2021; v1 submitted 13 February, 2020; originally announced February 2020.

    Comments: SIGCOMM 2021

  3. arXiv:1904.03646  [pdf, other

    cs.LG stat.ML

    Policy Gradient Search: Online Planning and Expert Iteration without Search Trees

    Authors: Thomas Anthony, Robert Nishihara, Philipp Moritz, Tim Salimans, John Schulman

    Abstract: Monte Carlo Tree Search (MCTS) algorithms perform simulation-based search to improve policies online. During search, the simulation policy is adapted to explore the most promising lines of play. MCTS has been used by state-of-the-art programs for many problems, however a disadvantage to MCTS is that it estimates the values of states with Monte Carlo averages, stored in a search tree; this does not… ▽ More

    Submitted 7 April, 2019; originally announced April 2019.

  4. arXiv:1807.05118  [pdf, other

    cs.LG cs.DC stat.ML

    Tune: A Research Platform for Distributed Model Selection and Training

    Authors: Richard Liaw, Eric Liang, Robert Nishihara, Philipp Moritz, Joseph E. Gonzalez, Ion Stoica

    Abstract: Modern machine learning algorithms are increasingly computationally demanding, requiring specialized hardware and distributed computation to achieve high performance in a reasonable time frame. Many hyperparameter search algorithms have been proposed for improving the efficiency of model selection, however their adaptation to the distributed compute environment is often ad-hoc. We propose Tune, a… ▽ More

    Submitted 13 July, 2018; originally announced July 2018.

    Comments: 8 Pages, Presented at the 2018 ICML AutoML workshop

  5. arXiv:1712.09381  [pdf, other

    cs.AI cs.DC cs.LG

    RLlib: Abstractions for Distributed Reinforcement Learning

    Authors: Eric Liang, Richard Liaw, Philipp Moritz, Robert Nishihara, Roy Fox, Ken Goldberg, Joseph E. Gonzalez, Michael I. Jordan, Ion Stoica

    Abstract: Reinforcement learning (RL) algorithms involve the deep nesting of highly irregular computation patterns, each of which typically exhibits opportunities for distributed computation. We argue for distributing RL components in a composable way by adapting algorithms for top-down hierarchical control, thereby encapsulating parallelism and resource requirements within short-running compute tasks. We d… ▽ More

    Submitted 28 June, 2018; v1 submitted 26 December, 2017; originally announced December 2017.

    Comments: Published in the International Conference on Machine Learning (ICML 2018), 10 pages

  6. arXiv:1712.05889  [pdf, other

    cs.DC cs.AI cs.LG stat.ML

    Ray: A Distributed Framework for Emerging AI Applications

    Authors: Philipp Moritz, Robert Nishihara, Stephanie Wang, Alexey Tumanov, Richard Liaw, Eric Liang, Melih Elibol, Zongheng Yang, William Paul, Michael I. Jordan, Ion Stoica

    Abstract: The next generation of AI applications will continuously interact with the environment and learn from these interactions. These applications impose new and demanding systems requirements, both in terms of performance and flexibility. In this paper, we consider these requirements and present Ray---a distributed system to address them. Ray implements a unified interface that can express both task-pa… ▽ More

    Submitted 29 September, 2018; v1 submitted 15 December, 2017; originally announced December 2017.

    Comments: 17 pages, 14 figures, 13th USENIX Symposium on Operating Systems Design and Implementation, 2018

  7. arXiv:1703.03924  [pdf, other

    cs.DC cs.AI cs.LG

    Real-Time Machine Learning: The Missing Pieces

    Authors: Robert Nishihara, Philipp Moritz, Stephanie Wang, Alexey Tumanov, William Paul, Johann Schleier-Smith, Richard Liaw, Mehrdad Niknami, Michael I. Jordan, Ion Stoica

    Abstract: Machine learning applications are increasingly deployed not only to serve predictions using static models, but also as tightly-integrated components of feedback loops involving dynamic, real-time decision making. These applications pose a new set of requirements, none of which are difficult to achieve in isolation, but the combination of which creates a challenge for existing distributed execution… ▽ More

    Submitted 19 May, 2017; v1 submitted 11 March, 2017; originally announced March 2017.

    Comments: 6 pages, 3 figures

  8. arXiv:1511.06051  [pdf, other

    stat.ML cs.DC cs.LG cs.NE math.OC

    SparkNet: Training Deep Networks in Spark

    Authors: Philipp Moritz, Robert Nishihara, Ion Stoica, Michael I. Jordan

    Abstract: Training deep networks is a time-consuming process, with networks for object recognition often requiring multiple days to train. For this reason, leveraging the resources of a cluster to speed up training is an important area of work. However, widely-popular batch-processing computational frameworks like MapReduce and Spark were not designed to support the asynchronous and communication-intensive… ▽ More

    Submitted 28 February, 2016; v1 submitted 18 November, 2015; originally announced November 2015.

    Comments: 12 pages, 7 figures

  9. arXiv:1508.02087  [pdf, other

    math.OC cs.LG math.NA stat.CO stat.ML

    A Linearly-Convergent Stochastic L-BFGS Algorithm

    Authors: Philipp Moritz, Robert Nishihara, Michael I. Jordan

    Abstract: We propose a new stochastic L-BFGS algorithm and prove a linear convergence rate for strongly convex and smooth functions. Our algorithm draws heavily from a recent stochastic variant of L-BFGS proposed in Byrd et al. (2014) as well as a recent approach to variance reduction for stochastic gradient descent from Johnson and Zhang (2013). We demonstrate experimentally that our algorithm performs wel… ▽ More

    Submitted 13 April, 2016; v1 submitted 9 August, 2015; originally announced August 2015.

    Comments: 10 pages, 3 figures in International Conference on Artificial Intelligence and Statistics, 2016

  10. arXiv:1506.02438  [pdf, other

    cs.LG cs.RO eess.SY

    High-Dimensional Continuous Control Using Generalized Advantage Estimation

    Authors: John Schulman, Philipp Moritz, Sergey Levine, Michael Jordan, Pieter Abbeel

    Abstract: Policy gradient methods are an appealing approach in reinforcement learning because they directly optimize the cumulative reward and can straightforwardly be used with nonlinear function approximators such as neural networks. The two main challenges are the large number of samples typically required, and the difficulty of obtaining stable and steady improvement despite the nonstationarity of the i… ▽ More

    Submitted 20 October, 2018; v1 submitted 8 June, 2015; originally announced June 2015.

  11. arXiv:1502.05477  [pdf, other

    cs.LG

    Trust Region Policy Optimization

    Authors: John Schulman, Sergey Levine, Philipp Moritz, Michael I. Jordan, Pieter Abbeel

    Abstract: We describe an iterative procedure for optimizing policies, with guaranteed monotonic improvement. By making several approximations to the theoretically-justified procedure, we develop a practical algorithm, called Trust Region Policy Optimization (TRPO). This algorithm is similar to natural policy gradient methods and is effective for optimizing large nonlinear policies such as neural networks. O… ▽ More

    Submitted 20 April, 2017; v1 submitted 19 February, 2015; originally announced February 2015.

    Comments: 16 pages, ICML 2015