Zum Hauptinhalt springen

Showing 1–13 of 13 results for author: Weitzel, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2308.07999  [pdf

    physics.comp-ph astro-ph.IM cs.PF

    IceCube experience using XRootD-based Origins with GPU workflows in PNRP

    Authors: David Schultz, Igor Sfiligoi, Benedikt Riedel, Fabio Andrijauskas, Derek Weitzel, Frank Würthwein

    Abstract: The IceCube Neutrino Observatory is a cubic kilometer neutrino telescope located at the geographic South Pole. Understanding detector systematic effects is a continuous process. This requires the Monte Carlo simulation to be updated periodically to quantify potential changes and improvements in science results with more detailed modeling of the systematic effects. IceCube's largest systematic effe… ▽ More

    Submitted 15 August, 2023; originally announced August 2023.

    Comments: 7 pages, 3 figures, 1 table, To be published in Proceedings of CHEP23

  2. Analyzing Transatlantic Network Traffic over Scientific Data Caches

    Authors: Z. Deng, A. Sim, K. Wu, C. Guok, D. Hazen, I. Monga, F. Andrijauskas, F. Wuerthwein, D. Weitzel

    Abstract: Large scientific collaborations often share huge volumes of data around the world. Consequently a significant amount of network bandwidth is needed for data replication and data access. Users in the same region may possibly share resources as well as data, especially when they are working on related topics with similar datasets. In this work, we study the network traffic patterns and resource util… ▽ More

    Submitted 17 July, 2023; v1 submitted 1 May, 2023; originally announced May 2023.

  3. arXiv:2112.03074  [pdf, other

    cs.NI hep-ex

    The Service Analysis and Network Diagnosis DataPipeline

    Authors: Derek Weitzel, Shawn McKee, Brian Paul Bockelman, John Thiltges, Marian Babik, Ilija Vukotic

    Abstract: Modern network performance monitoring toolkits, such as perfSONAR, take a remarkable number of measurements about the local network environment. To gain a complete picture of network performance, however, one needs to aggregate data across a large number of endpoints. The Service Analysis and Network Diagnosis (SAND) data pipeline collects data from diverse sources and ingests these measurements i… ▽ More

    Submitted 6 December, 2021; originally announced December 2021.

    Comments: 10 pages, to be published in 2021 IEEE Workshop on Innovating the Network for Data-Intensive Science

  4. Creating a content delivery network for general science on the internet backbone using XCaches

    Authors: Edgar Fajardo, Marian Zvada, Derek Weitzel, Mats Rynge, John Hicks, Mat Selmeci, Brian Lin, Pascal Paschos, Brian Bockelman, Igor Sfiligoi, Andrew Hanushevsky, Frank Würthwein

    Abstract: A general problem faced by computing on the grid for opportunistic users is that delivering cycles is simpler than delivering data to those cycles. In this project we show how we integrated XRootD caches placed on the internet backbone to implement a content delivery network for general science workflows. We will show that for some workflows on different science domains like high energy physics, g… ▽ More

    Submitted 28 September, 2020; v1 submitted 2 July, 2020; originally announced July 2020.

  5. WLCG Networks: Update on Monitoring and Analytics

    Authors: Marian Babik, Shawn McKee, Pedro Andrade, Brian Paul Bockelman, Robert Gardner, Edgar Mauricio Fajardo Hernandez, Edoardo Martelli, Ilija Vukotic, Derek Weitzel, Marian Zvada

    Abstract: WLCG relies on the network as a critical part of its infrastructure and therefore needs to guarantee effective network usage and prompt detection and resolution of any network issues including connection failures, congestion and traffic routing. The OSG Networking Area, in partnership with WLCG, is focused on being the primary source of networking information for its partners and constituents. It… ▽ More

    Submitted 1 July, 2020; originally announced July 2020.

    Comments: Accepted for publication in CHEP 2019 proceedings

  6. arXiv:2004.05729  [pdf, other

    cs.DC

    Exploring Erasure Coding Techniques for High Availability of Intermediate Data

    Authors: Zhe Zhang, Brian Bockelman, Derek Weitzel, David Swanson

    Abstract: Scientific computing workflows generate enormous distributed data that is short-lived, yet critical for job completion time. This class of data is called intermediate data. A common way to achieve high data availability is to replicate data. However, an increasing scale of intermediate data generated in modern scientific applications demands new storage techniques to improve storage efficiency. Er… ▽ More

    Submitted 12 April, 2020; originally announced April 2020.

  7. arXiv:2004.05723  [pdf, other

    cs.DC

    Trua: Efficient Task Replication for Flexible User-defined Availability in Scientific Grids

    Authors: Zhe Zhang, Brian Bockelman, Derek Weitzel, Xinkai Zhang, Hamid Vakilzadian, David Swanson

    Abstract: Failure is inevitable in scientific computing. As scientific applications and facilities increase their scales over the last decades, finding the root cause of a failure can be very complex or at times nearly impossible. Different scientific computing customers have varying availability demands as well as a diverse willingness to pay for availability. In contrast to existing solutions that try to… ▽ More

    Submitted 12 April, 2020; originally announced April 2020.

  8. arXiv:1907.03688  [pdf, other

    cs.DC

    Enabling Microsoft OneDrive Integration with HTCondor

    Authors: Derek Weitzel

    Abstract: Accessing data from distributed computing is essential in many workflows, but can be complicated for users of cyberinfrastructure. They must perform multiple steps to make data available to distributed computing using unfamiliar tools. Further, most research on data distribution has focused on the efficiency of providing data to computing resources rather than considering the ease of use for distr… ▽ More

    Submitted 8 July, 2019; originally announced July 2019.

    Comments: Humans in the Loop: Enabling and Facilitating Research on Cloud Computing Workshop at PEARC 19

  9. SciTokens: Demonstrating Capability-Based Access to Remote Scientific Data using HTCondor

    Authors: Alex Withers, Brian Bockelman, Derek Weitzel, Duncan Brown, Jason Patton, Jeff Gaynor, Jim Basney, Todd Tannenbaum, You Alex Gao, Zach Miller

    Abstract: The management of security credentials (e.g., passwords, secret keys) for computational science workflows is a burden for scientists and information security officers. Problems with credentials (e.g., expiration, privilege mismatch) cause workflows to fail to fetch needed input data or store valuable scientific results, distracting scientists from their research by requiring them to diagnose the p… ▽ More

    Submitted 22 May, 2019; originally announced May 2019.

    Comments: 8 pages, 3 figures, PEARC '19: Practice and Experience in Advanced Research Computing, July 28-August 1, 2019, Chicago, IL, USA. arXiv admin note: substantial text overlap with arXiv:1807.04728

  10. StashCache: A Distributed Caching Federation for the Open Science Grid

    Authors: Derek Weitzel, Marian Zvada, Ilija Vukotic, Rob Gardner, Brian Bockelman, Mats Rynge, Edgar Fajardo Hernandez, Brian Lin, Matyas Selmeci

    Abstract: Data distribution for opportunistic users is challenging as they neither own the computing resources they are using or any nearby storage. Users are motivated to use opportunistic computing to expand their data processing capacity, but they require storage and fast networking to distribute data to that processing. Since it requires significant management overhead, it is rare for resource providers… ▽ More

    Submitted 16 May, 2019; originally announced May 2019.

    Comments: In Practice and Experience in Advanced Research Computing (PEARC 19), July 28-August 1, 2019, Chicago, IL, USA. ACM, New York, NY, USA, 7 pages

  11. Discovering Job Preemptions in the Open Science Grid

    Authors: Zhe Zhang, Brian Bockelman, Derek Weitzel, David Swanson

    Abstract: The Open Science Grid(OSG) is a world-wide computing system which facilitates distributed computing for scientific research. It can distribute a computationally intensive job to geo-distributed clusters and process job's tasks in parallel. For compute clusters on the OSG, physical resources may be shared between OSG and cluster's local user-submitted jobs, with local jobs preempting OSG-based ones… ▽ More

    Submitted 17 July, 2018; originally announced July 2018.

    Comments: 8 pages

  12. SciTokens: Capability-Based Secure Access to Remote Scientific Data

    Authors: Alex Withers, Brian Bockelman, Derek Weitzel, Duncan Brown, Jeff Gaynor, Jim Basney, Todd Tannenbaum, Zach Miller

    Abstract: The management of security credentials (e.g., passwords, secret keys) for computational science workflows is a burden for scientists and information security officers. Problems with credentials (e.g., expiration, privilege mismatch) cause workflows to fail to fetch needed input data or store valuable scientific results, distracting scientists from their research by requiring them to diagnose the p… ▽ More

    Submitted 12 July, 2018; originally announced July 2018.

    Comments: 8 pages, 6 figures, PEARC '18: Practice and Experience in Advanced Research Computing, July 22--26, 2018, Pittsburgh, PA, USA

  13. arXiv:1705.06202  [pdf, other

    cs.DC astro-ph.IM

    Data Access for LIGO on the OSG

    Authors: Derek Weitzel, Brian Bockelman, Duncan A. Brown, Peter Couvares, Frank Würthwein, Edgar Fajardo Hernandez

    Abstract: During 2015 and 2016, the Laser Interferometer Gravitational-Wave Observatory (LIGO) conducted a three-month observing campaign. These observations delivered the first direct detection of gravitational waves from binary black hole mergers. To search for these signals, the LIGO Scientific Collaboration uses the PyCBC search pipeline. To deliver science results in a timely manner, LIGO collaborated… ▽ More

    Submitted 17 May, 2017; originally announced May 2017.

    Comments: 6 pages, 3 figures, submitted to PEARC17