Showing 1–7 of 7 results for author: Nichols, D

Searching in archive cs.
  1. arXiv:2404.18864

    cs.DC cs.AI cs.SE

    Performance-Aligned LLMs for Generating Fast Code

    Authors: Daniel Nichols, Pranav Polasam, Harshitha Menon, Aniruddha Marathe, Todd Gamblin, Abhinav Bhatele

    Abstract: Optimizing scientific software is a difficult task because codebases are often large and complex, and performance can depend upon several factors including the algorithm, its implementation, and hardware among others. Causes of poor performance can originate from disparate sources and be difficult to diagnose. Recent years have seen a multitude of work that use large language models (LLMs) to assi…

    Submitted 29 April, 2024; originally announced April 2024.

  2. arXiv:2401.13150

    cs.DC cs.PF

    Automated Programmatic Performance Analysis of Parallel Programs

    Authors: Onur Cankur, Aditya Tomar, Daniel Nichols, Connor Scully-Allison, Katherine E. Isaacs, Abhinav Bhatele

    Abstract: Developing efficient parallel applications is critical to advancing scientific development but requires significant performance analysis and optimization. Performance analysis tools help developers manage the increasing complexity and scale of performance data, but often rely on the user to manually explore low-level data and are rigid in how the data can be manipulated. We propose a Python-based…

    Submitted 23 January, 2024; originally announced January 2024.

  3. Can Large Language Models Write Parallel Code?

    Authors: Daniel Nichols, Joshua H. Davis, Zhaojun Xie, Arjun Rajaram, Abhinav Bhatele

    Abstract: Large language models are increasingly becoming a popular tool for software development. Their ability to model and generate source code has been demonstrated in a variety of contexts, including code completion, summarization, translation, and lookup. However, they often struggle to generate code for complex programs. In this paper, we study the capabilities of state-of-the-art language models to…

    Submitted 14 May, 2024; v1 submitted 23 January, 2024; originally announced January 2024.

    Journal ref: The 33rd International Symposium on High-Performance Parallel and Distributed Computing (HPDC '24), June 3-7, 2024, Pisa, Italy. ACM, New York, NY, USA, 14 pages

  4. HPC-Coder: Modeling Parallel Programs using Large Language Models

    Authors: Daniel Nichols, Aniruddha Marathe, Harshitha Menon, Todd Gamblin, Abhinav Bhatele

    Abstract: Parallel programs in high performance computing (HPC) continue to grow in complexity and scale in the exascale era. The diversity in hardware and parallel programming models make developing, optimizing, and maintaining parallel software even more burdensome for developers. One way to alleviate some of these burdens is with automated development and analysis tools. Such tools can perform complex an…

    Submitted 14 May, 2024; v1 submitted 29 June, 2023; originally announced June 2023.

    Journal ref: ISC High Performance 2024 Research Paper Proceedings (39th International Conference), Hamburg, Germany, 2024, pp. 1-12

  5. arXiv:2111.04949

    cs.LG cs.AI cs.DC

    A Survey and Empirical Evaluation of Parallel Deep Learning Frameworks

    Authors: Daniel Nichols, Siddharth Singh, Shu-Huai Lin, Abhinav Bhatele

    Abstract: The field of deep learning has witnessed a remarkable shift towards extremely compute- and memory-intensive neural networks. These newer larger models have enabled researchers to advance state-of-the-art tools across a variety of fields. This phenomenon has spurred the development of algorithms for distributed training of neural networks over a larger number of hardware accelerators. In this paper…

    Submitted 30 June, 2022; v1 submitted 8 November, 2021; originally announced November 2021.

  6. arXiv:2011.11188

    cs.LG

    Integrating Deep Learning in Domain Sciences at Exascale

    Authors: Rick Archibald, Edmond Chow, Eduardo D'Azevedo, Jack Dongarra, Markus Eisenbach, Rocco Febbo, Florent Lopez, Daniel Nichols, Stanimire Tomov, Kwai Wong, Junqi Yin

    Abstract: This paper presents some of the current challenges in designing deep learning artificial intelligence (AI) and integrating it with traditional high-performance computing (HPC) simulations. We evaluate existing packages for their ability to run deep learning models and applications on large-scale HPC systems efficiently, identify challenges, and propose new asynchronous parallelization and optimiza…

    Submitted 22 November, 2020; originally announced November 2020.

  7. arXiv:0711.3419

    cs.AI

    Translating OWL and Semantic Web Rules into Prolog: Moving Toward Description Logic Programs

    Authors: Ken Samuel, Leo Obrst, Suzette Stoutenberg, Karen Fox, Paul Franklin, Adrian Johnson, Ken Laskey, Deborah Nichols, Steve Lopez, Jason Peterson

    Abstract: To appear in Theory and Practice of Logic Programming (TPLP), 2008. We are researching the interaction between the rule and the ontology layers of the Semantic Web, by comparing two options: 1) using OWL and its rule extension SWRL to develop an integrated ontology/rule language, and 2) layering rules on top of an ontology with RuleML and OWL. Toward this end, we are developing the SWORIER sys…

    Submitted 21 November, 2007; originally announced November 2007.

    Comments: 21 pages, 5 figures, 19 tables. To appear in Theory and Practice of Logic Programming (TPLP), 2008