Zum Hauptinhalt springen

Showing 1–8 of 8 results for author: Valeev, E

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.01834  [pdf, other

    physics.comp-ph cond-mat.mtrl-sci cs.DC physics.chem-ph

    3-center and 4-center 2-particle Gaussian AO integrals on modern accelerated processors

    Authors: Andrey Asadchev, Edward F. Valeev

    Abstract: We report an implementation of the McMurchie-Davidson (MD) algorithm for 3-center and 4-center 2-particle integrals over Gaussian atomic orbitals (AOs) with low and high angular momenta $l$ and varying degrees of contraction for graphical processing units (GPUs). This work builds upon our recent implementation of a matrix form of the MD algorithm that is efficient for GPU evaluation of 4-center 2-… ▽ More

    Submitted 30 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

  2. arXiv:2401.04836  [pdf, other

    cs.PL cs.DC cs.PF

    CoNST: Code Generator for Sparse Tensor Networks

    Authors: Saurabh Raje, Yufan Xu, Atanas Rountev, Edward F. Valeev, Saday Sadayappan

    Abstract: Sparse tensor networks are commonly used to represent contractions over sparse tensors. Tensor contractions are higher-order analogs of matrix multiplication. Tensor networks arise commonly in many domains of scientific computing and data science. After a transformation into a tree of binary contractions, the network is implemented as a sequence of individual contractions. Several critical aspects… ▽ More

    Submitted 9 January, 2024; originally announced January 2024.

  3. arXiv:2307.03452  [pdf, ps, other

    physics.comp-ph cs.CE physics.chem-ph

    High-performance evaluation of high angular momentum 4-center Gaussian integrals on modern accelerated processors

    Authors: Andrey Asadchev, Edward F. Valeev

    Abstract: We present a high-performance evaluation method for 4-center 2-particle integrals over Gaussian atomic orbitals with high angular momenta ($l\geq4$) and arbitrary contraction degrees on graphical processing units (GPUs) and other accelerators. The implementation uses the matrix form of McMurchie-Davidson recurrences. Evaluation of the 4-center integrals over four $l=6$ ($i$) Gaussian AOs in the do… ▽ More

    Submitted 19 December, 2023; v1 submitted 7 July, 2023; originally announced July 2023.

    Comments: 23 pages

  4. arXiv:2210.03192  [pdf, other

    physics.comp-ph cs.MS physics.chem-ph

    Memory-Efficient Recursive Evaluation of 3-Center Gaussian Integrals

    Authors: Andrey Asadchev, Edward F. Valeev

    Abstract: To improve the efficiency of Gaussian integral evaluation on modern accelerated architectures FLOP-efficient Obara-Saika-based recursive evaluation schemes are optimized for the memory footprint. For the 3-center 2-particle integrals that are key for the evaluation of Coulomb and other 2-particle interactions in the density-fitting approximation the use of multi-quantal recurrences (in which multi… ▽ More

    Submitted 16 January, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: 37 pages, 2 figures, 6 tables

  5. arXiv:2010.15584  [pdf, ps, other

    cs.CY

    Future Directions of the Cyberinfrastructure for Sustained Scientific Innovation (CSSI) Program

    Authors: Ritu Arora, Xiaosong Li, Bonnie Hurwitz, Daniel Fay, Dhabaleswar K. Panda, Edward Valeev, Shaowen Wang, Shirley Moore, Sunita Chandrasekaran, Ting Cao, Holly Bik, Matthew Curry, Tanzima Islam

    Abstract: The CSSI 2019 workshop was held on October 28-29, 2019, in Austin, Texas. The main objectives of this workshop were to (1) understand the impact of the CSSI program on the community over the last 9 years, (2) engage workshop participants in identifying gaps and opportunities in the current CSSI landscape, (3) gather ideas on the cyberinfrastructure needs and expectations of the community with resp… ▽ More

    Submitted 15 October, 2020; originally announced October 2020.

    Comments: This report was submitted in April 2020 to the National Science Foundation (NSF)

  6. Scalable Task-Based Algorithm for Multiplication of Block-Rank-Sparse Matrices

    Authors: Justus A. Calvin, Cannada A. Lewis, Edward F. Valeev

    Abstract: A task-based formulation of Scalable Universal Matrix Multiplication Algorithm (SUMMA), a popular algorithm for matrix multiplication (MM), is applied to the multiplication of hierarchy-free, rank-structured matrices that appear in the domain of quantum chemistry (QC). The novel features of our formulation are: (1) concurrent scheduling of multiple SUMMA iterations, and (2) fine-grained task-based… ▽ More

    Submitted 9 October, 2015; v1 submitted 1 September, 2015; originally announced September 2015.

    Comments: 8 pages, 6 figures, accepted to IA3 2015. arXiv admin note: text overlap with arXiv:1504.05046

  7. arXiv:1507.01888  [pdf, ps, other

    cs.MS cs.CE math.NA

    MADNESS: A Multiresolution, Adaptive Numerical Environment for Scientific Simulation

    Authors: Robert J. Harrison, Gregory Beylkin, Florian A. Bischoff, Justus A. Calvin, George I. Fann, Jacob Fosso-Tande, Diego Galindo, Jeff R. Hammond, Rebecca Hartman-Baker, Judith C. Hill, Jun Jia, Jakob S. Kottmann, M-J. Yvonne Ou, Laura E. Ratcliff, Matthew G. Reuter, Adam C. Richie-Halford, Nichols A. Romero, Hideo Sekino, William A. Shelton, Bryan E. Sundahl, W. Scott Thornton, Edward F. Valeev, Álvaro Vázquez-Mayagoitia, Nicholas Vence, Yukina Yokoi

    Abstract: MADNESS (multiresolution adaptive numerical environment for scientific simulation) is a high-level software environment for solving integral and differential equations in many dimensions that uses adaptive and fast harmonic analysis methods with guaranteed precision based on multiresolution analysis and separated representations. Underpinning the numerical capabilities is a powerful petascale para… ▽ More

    Submitted 5 July, 2015; originally announced July 2015.

    Journal ref: SIAM SISC 38, S123-S142 (2016)

  8. arXiv:1504.05046  [pdf, other

    cs.DC

    Task-Based Algorithm for Matrix Multiplication: A Step Towards Block-Sparse Tensor Computing

    Authors: Justus A. Calvin, Edward F. Valeev

    Abstract: Distributed-memory matrix multiplication (MM) is a key element of algorithms in many domains (machine learning, quantum physics). Conventional algorithms for dense MM rely on regular/uniform data decomposition to ensure load balance. These traits conflict with the irregular structure (block-sparse or rank-sparse within blocks) that is increasingly relevant for fast methods in quantum physics. To d… ▽ More

    Submitted 20 April, 2015; originally announced April 2015.

    Comments: submitted to SC15 (9 pages, 8 figures)