-
Separable Physics-Informed Neural Networks for the solution of elasticity problems
Authors:
Vasiliy A. Es'kin,
Danil V. Davydov,
Julia V. Gur'eva,
Alexey O. Malkhanov,
Mikhail E. Smorkalov
Abstract:
A method for solving elasticity problems based on separable physics-informed neural networks (SPINN) in conjunction with the deep energy method (DEM) is presented. Numerical experiments on a number of problems show that this method has a significantly higher convergence rate and accuracy than vanilla physics-informed neural networks (PINN) and even SPINN based on a system of partial differential equations (PDEs). In addition, using SPINN within the DEM framework makes it possible to solve problems of the linear theory of elasticity on complex geometries, which is unachievable with PINNs formulated in terms of partial differential equations. The problems considered are very close to industrial problems in terms of geometry, loading, and material parameters.
Submitted 24 January, 2024;
originally announced January 2024.
-
About optimal loss function for training physics-informed neural networks under respecting causality
Authors:
Vasiliy A. Es'kin,
Danil V. Davydov,
Ekaterina D. Egorova,
Alexey O. Malkhanov,
Mikhail A. Akhukov,
Mikhail E. Smorkalov
Abstract:
A method is presented that reduces a problem described by differential equations with initial and boundary conditions to a problem described only by differential equations. The advantage of using the modified problem in the physics-informed neural network (PINN) methodology is that the loss function can be represented as a single term associated with the differential equations, thus eliminating the need to tune the scaling coefficients for the terms related to boundary and initial conditions. The weighted loss functions respecting causality are modified, and new weighted loss functions based on generalized functions are derived. Numerical experiments have been carried out for a number of problems, demonstrating the accuracy of the proposed methods.
Submitted 5 April, 2023;
originally announced April 2023.
-
On Scale-out Deep Learning Training for Cloud and HPC
Authors:
Srinivas Sridharan,
Karthikeyan Vaidyanathan,
Dhiraj Kalamkar,
Dipankar Das,
Mikhail E. Smorkalov,
Mikhail Shiryaev,
Dheevatsa Mudigere,
Naveen Mellempudi,
Sasikanth Avancha,
Bharat Kaul,
Pradeep Dubey
Abstract:
The exponential growth in use of large deep neural networks has accelerated the need for training these deep neural networks in hours or even minutes. This can only be achieved through scalable and efficient distributed training, since a single node/card cannot satisfy the compute, memory, and I/O requirements of today's state-of-the-art deep neural networks. However, scaling synchronous Stochastic Gradient Descent (SGD) is still a challenging problem and requires continued research/development. This entails innovations spanning algorithms, frameworks, communication libraries, and system design. In this paper, we describe the philosophy, design, and implementation of Intel Machine Learning Scalability Library (MLSL) and present proof-points demonstrating scaling DL training on 100s to 1000s of nodes across Cloud and HPC systems.
Submitted 24 January, 2018;
originally announced January 2018.
-
Deep Learning at 15PF: Supervised and Semi-Supervised Classification for Scientific Data
Authors:
Thorsten Kurth,
Jian Zhang,
Nadathur Satish,
Ioannis Mitliagkas,
Evan Racah,
Mostofa Ali Patwary,
Tareq Malas,
Narayanan Sundaram,
Wahid Bhimji,
Mikhail Smorkalov,
Jack Deslippe,
Mikhail Shiryaev,
Srinivas Sridharan,
Prabhat,
Pradeep Dubey
Abstract:
This paper presents the first, 15-PetaFLOP Deep Learning system for solving scientific pattern classification problems on contemporary HPC architectures. We develop supervised convolutional architectures for discriminating signals in high-energy physics data as well as semi-supervised architectures for localizing and classifying extreme weather in climate data. Our Intelcaffe-based implementation obtains $\sim$2TFLOP/s on a single Cori Phase-II Xeon-Phi node. We use a hybrid strategy employing synchronous node-groups, while using asynchronous communication across groups. We use this strategy to scale training of a single model to $\sim$9600 Xeon-Phi nodes; obtaining peak performance of 11.73-15.07 PFLOP/s and sustained performance of 11.41-13.27 PFLOP/s. At scale, our HEP architecture produces state-of-the-art classification accuracy on a dataset with 10M images, exceeding that achieved by selections on high-level physics-motivated features. Our semi-supervised architecture successfully extracts weather patterns in a 15TB climate dataset. Our results demonstrate that Deep Learning can be optimized and scaled effectively on many-core, HPC systems.
Submitted 17 August, 2017;
originally announced August 2017.