Zum Hauptinhalt springen

Showing 1–29 of 29 results for author: Merzky, A

.
  1. arXiv:2407.16646  [pdf, other

    cs.SE cs.DC

    ExaWorks Software Development Kit: A Robust and Scalable Collection of Interoperable Workflow Technologies

    Authors: Matteo Turilli, Mihael Hategan-Marandiuc, Mikhail Titov, Ketan Maheshwari, Aymen Alsaadi, Andre Merzky, Ramon Arambula, Mikhail Zakharchanka, Matt Cowan, Justin M. Wozniak, Andreas Wilke, Ozgur Ozan Kilic, Kyle Chard, Rafael Ferreira da Silva, Shantenu Jha, Daniel Laney

    Abstract: Scientific discovery increasingly requires executing heterogeneous scientific workflows on high-performance computing (HPC) platforms. Heterogeneous workflows contain different types of tasks (e.g., simulation, analysis, and learning) that need to be mapped, scheduled, and launched on different computing. That requires a software stack that enables users to code their workflows and automate resour… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

  2. arXiv:2403.18073  [pdf, other

    cs.DC

    Workflow Mini-Apps: Portable, Scalable, Tunable & Faithful Representations of Scientific Workflows

    Authors: Ozgur Ozan Kilic, Tianle Wang, Matteo Turilli, Mikhail Titov, Andre Merzky, Line Pouchard, Shantenu Jha

    Abstract: Workflows are critical for scientific discovery. However, the sophistication, heterogeneity, and scale of workflows make building, testing, and optimizing them increasingly challenging. Furthermore, their complexity and heterogeneity make performance reproducibility hard. In this paper, we propose workflow mini-apps as a tool to address the challenges in building and testing workflows while contro… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  3. arXiv:2403.15721  [pdf, other

    cs.DC

    Design and Implementation of an Analysis Pipeline for Heterogeneous Data

    Authors: Arup Kumar Sarker, Aymen Alsaadi, Niranda Perera, Mills Staylor, Gregor von Laszewski, Matteo Turilli, Ozgur Ozan Kilic, Mikhail Titov, Andre Merzky, Shantenu Jha, Geoffrey Fox

    Abstract: Managing and preparing complex data for deep learning, a prevalent approach in large-scale data science can be challenging. Data transfer for model training also presents difficulties, impacting scientific fields like genomics, climate modeling, and astronomy. A large-scale solution like Google Pathways with a distributed execution environment for deep learning models exists but is proprietary. In… ▽ More

    Submitted 7 April, 2024; v1 submitted 23 March, 2024; originally announced March 2024.

    Comments: 14 pages, 16 figures, 2 tables

    ACM Class: H.2.4; D.2.7; D.2.2

  4. PSI/J: A Portable Interface for Submitting, Monitoring, and Managing Jobs

    Authors: Mihael Hategan-Marandiuc, Andre Merzky, Nicholson Collier, Ketan Maheshwari, Jonathan Ozik, Matteo Turilli, Andreas Wilke, Justin M. Wozniak, Kyle Chard, Ian Foster, Rafael Ferreira da Silva, Shantenu Jha, Daniel Laney

    Abstract: It is generally desirable for high-performance computing (HPC) applications to be portable between HPC systems, for example to make use of more performant hardware, make effective use of allocations, and to co-locate compute jobs with large datasets. Unfortunately, moving scientific applications between HPC systems is challenging for various reasons, most notably that HPC systems have different HP… ▽ More

    Submitted 20 September, 2023; v1 submitted 15 July, 2023; originally announced July 2023.

  5. RAPTOR: Ravenous Throughput Computing

    Authors: Andre Merzky, Matteo Turilli, Shantenu Jha

    Abstract: We describe the design, implementation and performance of the RADICAL-Pilot task overlay (RAPTOR). RAPTOR enables the execution of heterogeneous tasks -- i.e., functions and executables with arbitrary duration -- on HPC platforms, providing high throughput and high resource utilization. RAPTOR supports the high throughput virtual screening requirements of DOE's National Virtual Biotechnology Labor… ▽ More

    Submitted 31 August, 2022; originally announced September 2022.

    Comments: 10 pages, 9 figures. 22nd International Symposium on Cluster, Cloud and Internet Computing (CCGrid 2022)

  6. arXiv:2208.00056  [pdf

    physics.comp-ph

    Pipeline for Automating Compliance-based Elimination and Extension (PACE2): A Systematic Framework for High-throughput Biomolecular Material Simulation Workflows

    Authors: Srinivas C. Mushnoori, Ethan Zang, Akash Banerjee, Mason Hooten, Andre Merzky, Matteo Turilli, Shantenu Jha, Meenakshi Dutt

    Abstract: The formation of biomolecular materials via dynamical interfacial processes such as self-assembly and fusion, for diverse compositions and external conditions, can be efficiently probed using ensemble Molecular Dynamics. However, this approach requires a large number of simulations when investigating a large composition phase space. In addition, there is difficulty in predicting whether each simul… ▽ More

    Submitted 29 July, 2022; originally announced August 2022.

    Comments: 25 pages, 9 figures, 4 tables

  7. arXiv:2201.06962  [pdf, other

    cs.CE cs.DC physics.ao-ph

    A Scalable Solution for Running Ensemble Simulations for Photovoltaic Energy

    Authors: Weiming Hu, Guido Cervone, Matteo Turilli, Andre Merzky, Shantenu Jha

    Abstract: This chapter proposes and provides an in-depth discussion of a scalable solution for running ensemble simulation for solar energy production. Generating a forecast ensemble is computationally expensive. But with the help of Analog Ensemble, forecast ensembles can be generated with a single deterministic run of a weather forecast model. Weather ensembles are then used to simulate 11 10 KW photovolt… ▽ More

    Submitted 10 January, 2022; originally announced January 2022.

  8. arXiv:2108.13521  [pdf, other

    cs.DC

    ExaWorks: Workflows for Exascale

    Authors: Aymen Al-Saadi, Dong H. Ahn, Yadu Babuji, Kyle Chard, James Corbett, Mihael Hategan, Stephen Herbein, Shantenu Jha, Daniel Laney, Andre Merzky, Todd Munson, Michael Salim, Mikhail Titov, Matteo Turilli, Justin M. Wozniak

    Abstract: Exascale computers will offer transformative capabilities to combine data-driven and learning-based approaches with traditional simulation applications to accelerate scientific discovery and insight. These software combinations and integrations, however, are difficult to achieve due to challenges of coordination and deployment of heterogeneous software components on diverse and massive platforms.… ▽ More

    Submitted 30 August, 2021; originally announced August 2021.

  9. arXiv:2106.07036  [pdf, other

    q-bio.BM cs.LG

    Protein-Ligand Docking Surrogate Models: A SARS-CoV-2 Benchmark for Deep Learning Accelerated Virtual Screening

    Authors: Austin Clyde, Thomas Brettin, Alexander Partin, Hyunseung Yoo, Yadu Babuji, Ben Blaiszik, Andre Merzky, Matteo Turilli, Shantenu Jha, Arvind Ramanathan, Rick Stevens

    Abstract: We propose a benchmark to study surrogate model accuracy for protein-ligand docking. We share a dataset consisting of 200 million 3D complex structures and 2D structure scores across a consistent set of 13 million "in-stock" molecules over 15 receptors, or binding sites, across the SARS-CoV-2 proteome. Our work shows surrogate docking models have six orders of magnitude more throughput than standa… ▽ More

    Submitted 30 June, 2021; v1 submitted 13 June, 2021; originally announced June 2021.

  10. Workflows Community Summit: Advancing the State-of-the-art of Scientific Workflows Management Systems Research and Development

    Authors: Rafael Ferreira da Silva, Henri Casanova, Kyle Chard, Tainã Coleman, Dan Laney, Dong Ahn, Shantenu Jha, Dorran Howell, Stian Soiland-Reys, Ilkay Altintas, Douglas Thain, Rosa Filgueira, Yadu Babuji, Rosa M. Badia, Bartosz Balis, Silvina Caino-Lores, Scott Callaghan, Frederik Coppens, Michael R. Crusoe, Kaushik De, Frank Di Natale, Tu M. A. Do, Bjoern Enders, Thomas Fahringer, Anne Fouilloux , et al. (33 additional authors not shown)

    Abstract: Scientific workflows are a cornerstone of modern scientific computing, and they have underpinned some of the most significant discoveries of the last decade. Many of these workflows have high computational, storage, and/or communication demands, and thus must execute on a wide range of large-scale platforms, from large clouds to upcoming exascale HPC platforms. Workflows will play a crucial role i… ▽ More

    Submitted 9 June, 2021; originally announced June 2021.

  11. arXiv:2105.13185  [pdf, other

    cs.DC

    RADICAL-Pilot and Parsl: Executing Heterogeneous Workflows on HPC Platforms

    Authors: Aymen Alsaadi, Logan Ward, Andre Merzky, Kyle Chard, Ian Foster, Shantenu Jha, Matteo Turilli

    Abstract: Workflows applications are becoming increasingly important to support scientific discovery. That is leading to a proliferation of workflow management systems and, thus, to a fragmented software ecosystem. Integration among existing workflow tools can improve development efficiency and, ultimately, increase the sustainability of scientific workflow software. We describe our experience with integrat… ▽ More

    Submitted 30 August, 2022; v1 submitted 27 May, 2021; originally announced May 2021.

  12. arXiv:2103.02843  [pdf

    cs.DC cs.CE cs.LG physics.bio-ph q-bio.QM

    Pandemic Drugs at Pandemic Speed: Infrastructure for Accelerating COVID-19 Drug Discovery with Hybrid Machine Learning- and Physics-based Simulations on High Performance Computers

    Authors: Agastya P. Bhati, Shunzhou Wan, Dario Alfè, Austin R. Clyde, Mathis Bode, Li Tan, Mikhail Titov, Andre Merzky, Matteo Turilli, Shantenu Jha, Roger R. Highfield, Walter Rocchia, Nicola Scafuri, Sauro Succi, Dieter Kranzlmüller, Gerald Mathias, David Wifling, Yann Donon, Alberto Di Meglio, Sofia Vallecorsa, Heng Ma, Anda Trifan, Arvind Ramanathan, Tom Brettin, Alexander Partin , et al. (4 additional authors not shown)

    Abstract: The race to meet the challenges of the global pandemic has served as a reminder that the existing drug discovery process is expensive, inefficient and slow. There is a major bottleneck screening the vast number of potential small molecules to shortlist lead compounds for antiviral drug development. New opportunities to accelerate drug discovery lie at the interface between machine learning methods… ▽ More

    Submitted 4 September, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

    Journal ref: Interface Focus. 2021. 11 (6): 20210018

  13. arXiv:2103.00091  [pdf, other

    cs.DC

    Design and Performance Characterization of RADICAL-Pilot on Leadership-class Platforms

    Authors: Andre Merzky, Matteo Turilli, Mikhail Titov, Aymen Al-Saadi, Shantenu Jha

    Abstract: Many extreme scale scientific applications have workloads comprised of a large number of individual high-performance tasks. The Pilot abstraction decouples workload specification, resource management, and task execution via job placeholders and late-binding. As such, suitable implementations of the Pilot abstraction can support the collective execution of large number of tasks on supercomputers. W… ▽ More

    Submitted 2 November, 2021; v1 submitted 26 February, 2021; originally announced March 2021.

    Comments: arXiv admin note: text overlap with arXiv:1801.01843

  14. arXiv:2010.10517  [pdf, other

    cs.DC cs.CE

    Scalable HPC and AI Infrastructure for COVID-19 Therapeutics

    Authors: Hyungro Lee, Andre Merzky, Li Tan, Mikhail Titov, Matteo Turilli, Dario Alfe, Agastya Bhati, Alex Brace, Austin Clyde, Peter Coveney, Heng Ma, Arvind Ramanathan, Rick Stevens, Anda Trifan, Hubertus Van Dam, Shunzhou Wan, Sean Wilkinson, Shantenu Jha

    Abstract: COVID-19 has claimed more 1 million lives and resulted in over 40 million infections. There is an urgent need to identify drugs that can inhibit SARS-CoV-2. In response, the DOE recently established the Medical Therapeutics project as part of the National Virtual Biotechnology Laboratory, and tasked it with creating the computational infrastructure and methods necessary to advance therapeutics dev… ▽ More

    Submitted 20 October, 2020; originally announced October 2020.

  15. arXiv:2010.06574  [pdf, other

    cs.DC cs.CE q-bio.QM

    IMPECCABLE: Integrated Modeling PipelinE for COVID Cure by Assessing Better LEads

    Authors: Aymen Al Saadi, Dario Alfe, Yadu Babuji, Agastya Bhati, Ben Blaiszik, Thomas Brettin, Kyle Chard, Ryan Chard, Peter Coveney, Anda Trifan, Alex Brace, Austin Clyde, Ian Foster, Tom Gibbs, Shantenu Jha, Kristopher Keipert, Thorsten Kurth, Dieter Kranzlmüller, Hyungro Lee, Zhuozhao Li, Heng Ma, Andre Merzky, Gerald Mathias, Alexander Partin, Junqi Yin , et al. (11 additional authors not shown)

    Abstract: The drug discovery process currently employed in the pharmaceutical industry typically requires about 10 years and $2-3 billion to deliver one new drug. This is both too expensive and too slow, especially in emergencies like the COVID-19 pandemic. In silicomethodologies need to be improved to better select lead compounds that can proceed to later stages of the drug discovery protocol accelerating… ▽ More

    Submitted 13 October, 2020; originally announced October 2020.

  16. arXiv:1909.03057  [pdf, other

    cs.DC

    Characterizing the Performance of Executing Many-tasks on Summit

    Authors: Matteo Turilli, Andre Merzky, Thomas Naughton, Wael Elwasif, Shantenu Jha

    Abstract: Many scientific workloads are comprised of many tasks, where each task is an independent simulation or analysis of data. The execution of millions of tasks on heterogeneous HPC platforms requires scalable dynamic resource management and multi-level scheduling. RADICAL-Pilot (RP) -- an implementation of the Pilot abstraction, addresses these challenges and serves as an effective runtime system to e… ▽ More

    Submitted 8 September, 2019; originally announced September 2019.

  17. arXiv:1904.03085  [pdf, other

    cs.SE

    RADICAL-Cybertools: Middleware Building Blocks for Scalable Science

    Authors: Vivek Balasubramanian, Shantenu Jha, Andre Merzky, Matteo Turilli

    Abstract: RADICAL-Cybertools (RCT) are a set of software systems that serve as middleware to develop efficient and effective tools for scientific computing. Specifically, RCT enable executing many-task applications at extreme scale and on a variety of computing infrastructures. RCT are building blocks, designed to work as stand-alone systems, integrated among themselves or integrated with third-party system… ▽ More

    Submitted 5 April, 2019; originally announced April 2019.

  18. Middleware Building Blocks for Workflow Systems

    Authors: Matteo Turilli, Vivek Balasubramanian, Andre Merzky, Ioannis Paraskevakos, Shantenu Jha

    Abstract: This paper describes a building blocks approach to the design of scientific workflow systems. We discuss RADICAL-Cybertools as one implementation of the building blocks concept, showing how they are designed and developed in accordance with this approach. This paper offers three main contributions: (i) showing the relevance of the design principles underlying the building blocks approach to suppor… ▽ More

    Submitted 27 June, 2019; v1 submitted 24 March, 2019; originally announced March 2019.

  19. Synapse: Synthetic Application Profiler and Emulator

    Authors: Andre Merzky, Ming Tai Ha, Matteo Turilli, Shantenu Jha

    Abstract: Motivated by the need to emulate workload execution characteristics on high-performance and distributed heterogeneous resources, we introduce Synapse. Synapse is used as a proxy application (or "representative application") for real workloads, with the advantage that it can be tuned in different ways and dimensions, and also at levels of granularity that are not possible with real applications. Sy… ▽ More

    Submitted 2 August, 2018; originally announced August 2018.

    Comments: Large portions of this work originally appeared as arXiv:1506.00272, which was subsequently published as a workshop paper. This is an extended version published in the "Journal of Computational Science"

    Report number: 01

    Journal ref: Journal of Computational Science, 27C (2018) pp. 329-344

  20. arXiv:1801.02651  [pdf, other

    cs.DC

    Towards General Distributed Resource Selection

    Authors: Ming Tai Ha, Matteo Turilli, Andre Merzky, Shantenu Jha

    Abstract: The advantages of distributing workloads and utilizing multiple distributed resources are now well established. The type and degree of heterogeneity of distributed resources is increasing, and thus determining how to distribute the workloads becomes increasingly difficult, in particular with respect to the selection of suitable resources. We formulate and investigate the resource selection problem… ▽ More

    Submitted 8 January, 2018; originally announced January 2018.

  21. arXiv:1801.01843  [pdf, other

    cs.DC

    Design and Performance Characterization of RADICAL-Pilot on Titan

    Authors: Andre Merzky, Matteo Turilli, Manuel Maldonado, Shantenu Jha

    Abstract: Many extreme scale scientific applications have workloads comprised of a large number of individual high-performance tasks. The Pilot abstraction decouples workload specification, resource management, and task execution via job placeholders and late-binding. As such, suitable implementations of the Pilot abstraction can support the collective execution of large number of tasks on supercomputers. W… ▽ More

    Submitted 5 January, 2018; originally announced January 2018.

  22. arXiv:1609.03484  [pdf, other

    cs.SE

    Designing Workflow Systems Using Building Blocks

    Authors: Matteo Turilli, Andre Merzky, Vivek Balasubramanian, Manuel Maldonado, Shantenu Jha

    Abstract: We suggest there is a need for a fresh perspective on the design and development of workflow systems and argue for a building blocks approach. We outline a description of this approach and define the properties of software building blocks. We discuss RADICAL-Cybertools as one implementation of the building blocks concept, showing how they have been designed and developed in accordance with this ap… ▽ More

    Submitted 8 April, 2019; v1 submitted 12 September, 2016; originally announced September 2016.

  23. Evaluating Distributed Execution of Workloads

    Authors: Matteo Turilli, Yadu Nand Babuji, Andre Merzky, Ming Tai Ha, Michael Wilde, Daniel S. Katz, Shantenu Jha

    Abstract: Resource selection and task placement for distributed execution poses conceptual and implementation difficulties. Although resource selection and task placement are at the core of many tools and workflow systems, the methods are ad hoc rather than being based on models. Consequently, partial and non-interoperable implementations proliferate. We address both the conceptual and implementation diffic… ▽ More

    Submitted 2 November, 2021; v1 submitted 31 May, 2016; originally announced May 2016.

  24. arXiv:1601.05439  [pdf, other

    cs.DC

    RepEx: A Flexible Framework for Scalable Replica Exchange Molecular Dynamics Simulations

    Authors: Antons Treikalis, Andre Merzky, Haoyuan Chen, Tai-Sung Lee, Darrin M. York, Shantenu Jha

    Abstract: Replica Exchange (RE) simulations have emerged as an important algorithmic tool for the molecular sciences. RE simulations involve the concurrent execution of independent simulations which infrequently interact and exchange information. The next set of simulation parameters are based upon the outcome of the exchanges. Typically RE functionality is integrated into the molecular simulation softwar… ▽ More

    Submitted 20 January, 2016; originally announced January 2016.

    Comments: 12 pages, 13 figures

  25. arXiv:1512.08194  [pdf, other

    cs.DC

    Using Pilot Systems to Execute Many Task Workloads on Supercomputers

    Authors: Andre Merzky, Matteo Turilli, Manuel Maldonado, Mark Santcroos, Shantenu Jha

    Abstract: High performance computing systems have historically been designed to support applications comprised of mostly monolithic, single-job workloads. Pilot systems decouple workload specification, resource selection, and task execution via job placeholders and late-binding. Pilot systems help to satisfy the resource requirements of workloads comprised of multiple tasks. RADICAL-Pilot (RP) is a modular… ▽ More

    Submitted 30 July, 2018; v1 submitted 27 December, 2015; originally announced December 2015.

  26. arXiv:1506.00272  [pdf, other

    cs.DC

    Synapse: Synthetic Application Profiler and Emulator

    Authors: Andre Merzky, Shantenu Jha

    Abstract: We introduce Synapse motivated by the needs to estimate and emulate workload execution characteristics on high-performance and distributed heterogeneous resources. Synapse has a platform independent application profiler, and the ability to emulate profiled workloads on a variety of heterogeneous resources. Synapse is used as a proxy application (or "representative application") for real workloads,… ▽ More

    Submitted 15 February, 2016; v1 submitted 31 May, 2015; originally announced June 2015.

    Journal ref: 2016 IEEE International Parallel and Distributed Processing Symposium Workshops, Chicago, IL, USA, May 23-27, 2016

  27. arXiv:1504.04720  [pdf, other

    cs.DC

    Integrating Abstractions to Enhance the Execution of Distributed Applications

    Authors: Matteo Turilli, Feng Liu, Zhao Zhang, Andre Merzky, Michael Wilde, Jon Weissman, Daniel S. Katz, Shantenu Jha

    Abstract: One of the factors that limits the scale, performance, and sophistication of distributed applications is the difficulty of concurrently executing them on multiple distributed computing resources. In part, this is due to a poor understanding of the general properties and performance of the coupling between applications and dynamic resources. This paper addresses this issue by integrating abstractio… ▽ More

    Submitted 18 February, 2016; v1 submitted 18 April, 2015; originally announced April 2015.

  28. arXiv:1210.3271  [pdf

    cs.DC cs.CY

    Grid Computing: The Next Decade -- Report and Summary

    Authors: Jarek Nabrzyski, Krzysztof Kurowski, Daniel S. Katz, Andre Merzky

    Abstract: The evolution of the global scientific cyberinfrastructure (CI) has, over the last 10+ years, led to a large diversity of CI instances. While specialized, competing and alternative CI building blocks are inherent to a healthy ecosystem, it also becomes apparent that the increasing degree of fragmentation is hindering interoperation, and thus limiting collaboration, which is essential for modern sc… ▽ More

    Submitted 11 October, 2012; originally announced October 2012.

    Comments: 17 pages, 1 figure

  29. arXiv:1207.6644  [pdf, other

    cs.DC

    P*: A Model of Pilot-Abstractions

    Authors: Andre Luckow, Mark Santcroos, Ole Weidner, Andre Merzky, Pradeep Mantha, Shantenu Jha

    Abstract: Pilot-Jobs support effective distributed resource utilization, and are arguably one of the most widely-used distributed computing abstractions - as measured by the number and types of applications that use them, as well as the number of production distributed cyberinfrastructures that support them. In spite of broad uptake, there does not exist a well-defined, unifying conceptual model of Pilot-Jo… ▽ More

    Submitted 27 July, 2012; originally announced July 2012.

    Comments: 10 pages