
Showing 1–8 of 8 results for author: Luszczek, P

Searching in archive cs.
  1. arXiv:2302.11474  [pdf, other]

    math.NA cs.MS math.OC

    Randomized Numerical Linear Algebra: A Perspective on the Field With an Eye to Software

    Authors: Riley Murray, James Demmel, Michael W. Mahoney, N. Benjamin Erichson, Maksim Melnichenko, Osman Asif Malik, Laura Grigori, Piotr Luszczek, Michał Dereziński, Miles E. Lopes, Tianyu Liang, Hengrui Luo, Jack Dongarra

    Abstract: Randomized numerical linear algebra - RandNLA, for short - concerns the use of randomization as a resource to develop improved algorithms for large-scale linear algebra computations. The origins of contemporary RandNLA lay in theoretical computer science, where it blossomed from a simple idea: randomization provides an avenue for computing approximate solutions to linear algebra problems more ef…

    Submitted 12 April, 2023; v1 submitted 22 February, 2023; originally announced February 2023.

    Comments: v1: this is the first arXiv release of LAPACK Working Note 299. v2: complete rewrite of the subsection on trace estimation, among other changes. See frontmatter page ii (pdf page 5) for revision history

  2. arXiv:2207.09281  [pdf, other]

    cs.MS

    Proposed Consistent Exception Handling for the BLAS and LAPACK

    Authors: James Demmel, Jack Dongarra, Mark Gates, Greg Henry, Julien Langou, Xiaoye Li, Piotr Luszczek, Weslley Pereira, Jason Riedy, Cindy Rubio-González

    Abstract: Numerical exceptions, which may be caused by overflow, operations like division by 0 or sqrt(-1), or convergence failures, are unavoidable in many cases, in particular when software is used on unforeseen and difficult inputs. As more aspects of society become automated, e.g., self-driving cars, health monitors, and cyber-physical systems more generally, it is becoming increasingly important to des…

    Submitted 19 July, 2022; originally announced July 2022.

  3. arXiv:2101.05808  [pdf, other]

    cond-mat.mtrl-sci cs.LG stat.AP

    Materials Fingerprinting Classification

    Authors: Adam Spannaus, Kody J. H. Law, Piotr Luszczek, Farzana Nasrin, Cassie Putman Micucci, Peter K. Liaw, Louis J. Santodonato, David J. Keffer, Vasileios Maroulas

    Abstract: Significant progress in many classes of materials could be made with the availability of experimentally-derived large datasets composed of atomic identities and three-dimensional coordinates. Methods for visualizing the local atomic structure, such as atom probe tomography (APT), which routinely generate datasets comprised of millions of atoms, are an important step in realizing this goal. However…

    Submitted 14 January, 2021; originally announced January 2021.

  4. arXiv:2011.03196  [pdf, other]

    cs.DC

    Task-Graph Scheduling Extensions for Efficient Synchronization and Communication

    Authors: Seonmyeong Bak, Oscar Hernandez, Mark Gates, Piotr Luszczek, Vivek Sarkar

    Abstract: Task graphs have been studied for decades as a foundation for scheduling irregular parallel applications and incorporated in programming models such as OpenMP. While many high-performance parallel libraries are based on task graphs, they also have additional scheduling requirements, such as synchronization from inner levels of data parallelism and internal blocking communications. In this paper, w…

    Submitted 6 November, 2020; originally announced November 2020.

  5. arXiv:2011.01850  [pdf, ps, other]

    math.NA cs.MS

    Improving the Performance of the GMRES Method using Mixed-Precision Techniques

    Authors: Neil Lindquist, Piotr Luszczek, Jack Dongarra

    Abstract: The GMRES method is used to solve sparse, non-symmetric systems of linear equations arising from many scientific applications. The solver performance within a single node is memory bound, due to the low arithmetic intensity of its computational kernels. To reduce the amount of data movement, and thus, to improve performance, we investigated the effect of using a mix of single and double precision…

    Submitted 3 November, 2020; originally announced November 2020.

    Comments: 16 pages. In the 17th Smoky Mountains Computational Sciences and Engineering Conference

  6. arXiv:2007.06674  [pdf, other]

    cs.MS math.NA

    A Survey of Numerical Methods Utilizing Mixed Precision Arithmetic

    Authors: Ahmad Abdelfattah, Hartwig Anzt, Erik G. Boman, Erin Carson, Terry Cojean, Jack Dongarra, Mark Gates, Thomas Grützmacher, Nicholas J. Higham, Sherry Li, Neil Lindquist, Yang Liu, Jennifer Loe, Piotr Luszczek, Pratik Nayak, Sri Pranesh, Siva Rajamanickam, Tobias Ribizel, Barry Smith, Kasia Swirydowicz, Stephen Thomas, Stanimire Tomov, Yaohung M. Tsai, Ichitaro Yamazaki, Ulrike Meier Yang

    Abstract: Within the past years, hardware vendors have started designing low precision special function units in response to the demand of the Machine Learning community and their demand for high compute power in low precision formats. Also the server-line products are increasingly featuring low-precision special function units, such as the NVIDIA tensor cores in ORNL's Summit supercomputer providing more t…

    Submitted 13 July, 2020; originally announced July 2020.

    Comments: Technical report as a part of the Exascale computing project (ECP)

    ACM Class: G.1.3; G.4

  7. arXiv:1706.07519  [pdf, other]

    cs.DC

    Interoperable Convergence of Storage, Networking, and Computation

    Authors: Micah Beck, Terry Moore, Piotr Luszczek, Anthony Danalis

    Abstract: In every form of digital store-and-forward communication, intermediate forwarding nodes are computers, with attendant memory and processing resources. This has inevitably stimulated efforts to create a wide-area infrastructure that goes beyond simple store-and-forward to create a platform that makes more general and varied use of the potential of this collection of increasingly powerful nodes. His…

    Submitted 17 November, 2018; v1 submitted 22 June, 2017; originally announced June 2017.

    Comments: 15 pages, 5 figures

  8. Accelerating Scientific Computations with Mixed Precision Algorithms

    Authors: Marc Baboulin, Alfredo Buttari, Jack Dongarra, Jakub Kurzak, Julie Langou, Julien Langou, Piotr Luszczek, Stanimire Tomov

    Abstract: On modern architectures, the performance of 32-bit operations is often at least twice as fast as the performance of 64-bit operations. By using a combination of 32-bit and 64-bit floating point arithmetic, the performance of many dense and sparse linear algebra algorithms can be significantly enhanced while maintaining the 64-bit accuracy of the resulting solution. The approach presented here ca…

    Submitted 20 August, 2008; originally announced August 2008.