Zum Hauptinhalt springen

Showing 1–4 of 4 results for author: Enos, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:1907.10203  [pdf, other

    cs.DC cs.LG

    Live Forensics for Distributed Storage Systems

    Authors: Saurabh Jha, Shengkun Cui, Tianyin Xu, Jeremy Enos, Mike Showerman, Mark Dalton, Zbigniew T. Kalbarczyk, William T. Kramer, Ravishankar K. Iyer

    Abstract: We present Kaleidoscope an innovative system that supports live forensics for application performance problems caused by either individual component failures or resource contention issues in large-scale distributed storage systems. The design of Kaleidoscope is driven by our study of I/O failures observed in a peta-scale storage system anonymized as PetaStore. Kaleidoscope is built on three key fe… ▽ More

    Submitted 23 July, 2019; originally announced July 2019.

  2. arXiv:1907.01019  [pdf, other

    cs.DC

    Understanding Fault Scenarios and Impacts through Fault Injection Experiments in Cielo

    Authors: Valerio Formicola, Saurabh Jha, Daniel Chen, Fei Deng, Amanda Bonnie, Mike Mason, Jim Brandt, Ann Gentile, Larry Kaplan, Jason Repik, Jeremy Enos, Mike Showerman, Annette Greiner, Zbigniew Kalbarczyk, Ravishankar K. Iyer, Bill Krammer

    Abstract: We present a set of fault injection experiments performed on the ACES (LANL/SNL) Cray XE supercomputer Cielo. We use this experimental campaign to improve the understanding of failure causes and propagation that we observed in the field failure data analysis of NCSA's Blue Waters. We use the data collected from the logs and from network performance counter data 1) to characterize the fault-error-f… ▽ More

    Submitted 1 July, 2019; originally announced July 2019.

    Comments: Presented at Cray User Group 2017

  3. BOSS-LDG: A Novel Computational Framework that Brings Together Blue Waters, Open Science Grid, Shifter and the LIGO Data Grid to Accelerate Gravitational Wave Discovery

    Authors: E. A. Huerta, Roland Haas, Edgar Fajardo, Daniel S. Katz, Stuart Anderson, Peter Couvares, Josh Willis, Timothy Bouvet, Jeremy Enos, William T. C. Kramer, Hon Wai Leong, David Wheeler

    Abstract: We present a novel computational framework that connects Blue Waters, the NSF-supported, leadership-class supercomputer operated by NCSA, to the Laser Interferometer Gravitational-Wave Observatory (LIGO) Data Grid via Open Science Grid technology. To enable this computational infrastructure, we configured, for the first time, a LIGO Data Grid Tier-1 Center that can submit heterogeneous LIGO workfl… ▽ More

    Submitted 25 September, 2017; originally announced September 2017.

    Comments: 10 pages, 10 figures. Accepted as a Full Research Paper to the 13th IEEE International Conference on eScience

    ACM Class: C.2.4; C.5.1; D.1.3; J.2

    Journal ref: 2017 IEEE 13th International Conference on e-Science

  4. arXiv:1703.00924  [pdf

    cs.DC

    Workload Analysis of Blue Waters

    Authors: Matthew D. Jones, Joseph P. White, Martins Innus, Robert L. DeLeon, Nikolay Simakov, Jeffrey T. Palmer, Steven M. Gallo, Thomas R. Furlani, Michael Showerman, Robert Brunner, Andry Kot, Gregory Bauer, Brett Bode, Jeremy Enos, William Kramer

    Abstract: Blue Waters is a Petascale-level supercomputer whose mission is to enable the national scientific and research community to solve "grand challenge" problems that are orders of magnitude more complex than can be carried out on other high performance computing systems. Given the important and unique role that Blue Waters plays in the U.S. research portfolio, it is important to have a detailed unders… ▽ More

    Submitted 2 March, 2017; originally announced March 2017.

    Comments: 107 pages, >100 figures (figure sizes reduced to save space, contact authors for version with full resolution)

    MSC Class: 68M14; 68M20; 68U20 ACM Class: I.6.3; J.2; J.3; J.4; J.5; K.6.4