-
HubbardNet: Efficient Predictions of the Bose-Hubbard Model Spectrum with Deep Neural Networks
Authors:
Ziyan Zhu,
Marios Mattheakis,
Weiwei Pan,
Efthimios Kaxiras
Abstract:
We present a deep neural network (DNN)-based model (HubbardNet) to variationally find the ground state and excited state wavefunctions of the one-dimensional and two-dimensional Bose-Hubbard model. Using this model for a square lattice with $M$ sites, we obtain the energy spectrum as an analytical function of the on-site Coulomb repulsion, $U$, and the total number of particles, $N$, from a single training. This approach bypasses the need to solve a new Hamiltonian for each different set of values $(U,N)$. Using HubbardNet, we identify the two ground state phases of the Bose-Hubbard model (Mott insulator and superfluid). We show that the DNN-parametrized solutions are in excellent agreement with results from the exact diagonalization of the Hamiltonian, and that HubbardNet outperforms exact diagonalization in terms of computational scaling. These advantages suggest that our model is promising for efficient and accurate computation of exact phase diagrams of many-body lattice Hamiltonians.
Submitted 25 March, 2023; v1 submitted 27 December, 2022;
originally announced December 2022.
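For context on the exact-diagonalization baseline mentioned in the abstract above, here is a minimal sketch for a small one-dimensional chain. It assumes the standard Bose-Hubbard form H = -t Σ (b_i† b_j + h.c.) + (U/2) Σ n_i (n_i - 1) with open boundaries. The hopping amplitude `t`, the function names, and the boundary choice are illustrative assumptions and are not taken from the HubbardNet paper.

```python
import itertools
import numpy as np

def fock_basis(M, N):
    """All occupation tuples (n_1, ..., n_M) with exactly N bosons on M sites."""
    return [s for s in itertools.product(range(N + 1), repeat=M) if sum(s) == N]

def bose_hubbard_hamiltonian(M, N, t=1.0, U=1.0):
    """Dense Hamiltonian of an open 1D chain in the Fock basis (assumed form)."""
    basis = fock_basis(M, N)
    index = {s: k for k, s in enumerate(basis)}
    H = np.zeros((len(basis), len(basis)))
    for s in basis:
        col = index[s]
        # On-site repulsion: (U/2) * sum_i n_i (n_i - 1)
        H[col, col] = 0.5 * U * sum(n * (n - 1) for n in s)
        # Nearest-neighbour hopping in both directions: -t * b_dst^dagger b_src
        for i in range(M - 1):
            for src, dst in ((i, i + 1), (i + 1, i)):
                if s[src] == 0:
                    continue
                amp = np.sqrt(s[src] * (s[dst] + 1))
                new = list(s)
                new[src] -= 1
                new[dst] += 1
                H[index[tuple(new)], col] -= t * amp
    return H

def ground_state_energy(M, N, t=1.0, U=1.0):
    """Lowest eigenvalue from full diagonalization."""
    return float(np.linalg.eigvalsh(bose_hubbard_hamiltonian(M, N, t, U))[0])
```

The basis size grows combinatorially with $M$ and $N$, and a separate diagonalization is needed for every $(U, N)$ pair, which is the cost the abstract says a single trained network avoids.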
-
First principles physics-informed neural network for quantum wavefunctions and eigenvalue surfaces
Authors:
Marios Mattheakis,
Gabriel R. Schleder,
Daniel T. Larson,
Efthimios Kaxiras
Abstract:
Physics-informed neural networks have been widely applied to learn general parametric solutions of differential equations. Here, we propose a neural network to discover parametric eigenvalue and eigenfunction surfaces of quantum systems. We apply our method to solve the hydrogen molecular ion. This is an ab-initio deep learning method that solves the Schrödinger equation with the Coulomb potential, yielding realistic wavefunctions that include a cusp at the ion positions. The neural solutions are continuous and differentiable functions of the interatomic distance, and their derivatives are analytically calculated by applying automatic differentiation. Such a parametric and analytical form of the solutions is useful for further calculations such as the determination of force fields.
Submitted 19 November, 2022; v1 submitted 8 November, 2022;
originally announced November 2022.
-
Transfer Learning with Physics-Informed Neural Networks for Efficient Simulation of Branched Flows
Authors:
Raphaël Pellegrin,
Blake Bullwinkel,
Marios Mattheakis,
Pavlos Protopapas
Abstract:
Physics-Informed Neural Networks (PINNs) offer a promising approach to solving differential equations and, more generally, to applying deep learning to problems in the physical sciences. We adopt a recently developed transfer learning approach for PINNs and introduce a multi-head model to efficiently obtain accurate solutions to nonlinear systems of ordinary differential equations with random potentials. In particular, we apply the method to simulate stochastic branched flows, a universal phenomenon in random wave dynamics. Finally, we compare the results achieved by feed forward and GAN-based PINNs on two physically relevant transfer learning tasks and show that our methods provide significant computational speedups in comparison to standard PINNs trained from scratch.
Submitted 31 October, 2022;
originally announced November 2022.
-
RcTorch: a PyTorch Reservoir Computing Package with Automated Hyper-Parameter Optimization
Authors:
Hayden Joy,
Marios Mattheakis,
Pavlos Protopapas
Abstract:
Reservoir computers (RCs) are among the fastest to train of all neural networks, especially when they are compared to other recurrent neural networks. RC has this advantage while still handling sequential data exceptionally well. However, RC adoption has lagged behind other neural network models because of the model's sensitivity to its hyper-parameters (HPs). A modern unified software package that automatically tunes these parameters is missing from the literature. Manually tuning these numbers is very difficult, and the cost of traditional grid search methods grows exponentially with the number of HPs considered, discouraging the use of the RC and limiting the complexity of the RC models which can be devised. We address these problems by introducing RcTorch, a PyTorch-based RC neural network package with automated HP tuning. Herein, we demonstrate the utility of RcTorch by using it to predict the complex dynamics of a driven pendulum being acted upon by varying forces. This work includes coding examples. Example Python Jupyter notebooks can be found on our GitHub repository at https://github.com/blindedjoy/RcTorch and documentation can be found at https://rctorch.readthedocs.io/.
Submitted 12 July, 2022;
originally announced July 2022.
-
Physics-informed neural networks for quantum control
Authors:
Ariel Norambuena,
Marios Mattheakis,
Francisco J. González,
Raúl Coto
Abstract:
Quantum control is a ubiquitous research field that has enabled physicists to delve into the dynamics and features of quantum systems, delivering powerful applications for various atomic, optical, mechanical, and solid-state systems. In recent years, traditional control techniques based on optimization processes have been translated into efficient artificial intelligence algorithms. Here, we introduce a computational method for optimal quantum control problems via physics-informed neural networks (PINNs). We apply our methodology to open quantum systems by efficiently solving the state-to-state transfer problem with high probabilities, short-time evolution, and using low-energy consumption controls. Furthermore, we illustrate the flexibility of PINNs to solve the same problem under changes in physical parameters and initial conditions, showing advantages in comparison with standard control techniques.
Submitted 7 December, 2023; v1 submitted 13 June, 2022;
originally announced June 2022.
-
Physics-Informed Neural Networks for Quantum Eigenvalue Problems
Authors:
Henry Jin,
Marios Mattheakis,
Pavlos Protopapas
Abstract:
Eigenvalue problems are critical to several fields of science and engineering. We expand on the method of using unsupervised neural networks for discovering eigenfunctions and eigenvalues for differential eigenvalue problems. The obtained solutions are given in an analytical and differentiable form that identically satisfies the desired boundary conditions. The network optimization is data-free and depends solely on the predictions of the neural network. We introduce two physics-informed loss functions. The first, called ortho-loss, motivates the network to discover pair-wise orthogonal eigenfunctions. The second loss term, called norm-loss, requests the discovery of normalized eigenfunctions and is used to avoid trivial solutions. We find that embedding even or odd symmetries into the neural network architecture further improves the convergence for relevant problems. Lastly, a patience condition can be used to automatically recognize eigenfunction solutions. This proposed unsupervised learning method is used to solve the finite well, multiple finite wells, and hydrogen atom eigenvalue quantum problems.
Submitted 24 February, 2022;
originally announced March 2022.
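The two loss terms named in the abstract above can be illustrated on a grid. This is a hedged sketch only: the quadratic penalty forms, the Riemann-sum inner product, and all function names are assumptions for illustration, and the paper's exact definitions may differ. The particle-in-a-box functions at the end serve purely as a sanity check.

```python
import numpy as np

def inner_product(f, g, dx):
    """Riemann-sum approximation of the integral of f(x) g(x) dx on a uniform grid."""
    return float(np.sum(f * g) * dx)

def norm_loss(psi, dx):
    """Penalizes deviation of <psi|psi> from 1, which rules out psi = 0 (assumed form)."""
    return (inner_product(psi, psi, dx) - 1.0) ** 2

def ortho_loss(psi, found, dx):
    """Penalizes overlap of psi with each previously found eigenfunction (assumed form)."""
    return sum(inner_product(psi, phi, dx) ** 2 for phi in found)

# Sanity check with two known orthonormal functions on [0, 1]
x = np.linspace(0.0, 1.0, 1001)
dx = x[1] - x[0]
psi1 = np.sqrt(2.0) * np.sin(np.pi * x)
psi2 = np.sqrt(2.0) * np.sin(2.0 * np.pi * x)
```

In a training loop these terms would be added to the differential-equation residual, so that a candidate which collapses to zero or repeats an already found state is penalized.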
-
One-Shot Transfer Learning of Physics-Informed Neural Networks
Authors:
Shaan Desai,
Marios Mattheakis,
Hayden Joy,
Pavlos Protopapas,
Stephen Roberts
Abstract:
Solving differential equations efficiently and accurately sits at the heart of progress in many areas of scientific research, from classical dynamical systems to quantum mechanics. There is a surge of interest in using Physics-Informed Neural Networks (PINNs) to tackle such problems as they provide numerous benefits over traditional numerical approaches. Despite the potential benefits of PINNs for solving differential equations, transfer learning with them remains underexplored. In this study, we present a general framework for transfer learning PINNs that results in one-shot inference for linear systems of both ordinary and partial differential equations. This means that highly accurate solutions to many unknown differential equations can be obtained instantaneously without retraining an entire network. We demonstrate the efficacy of the proposed deep learning approach by solving several real-world problems, such as first- and second-order linear ordinary differential equations, the Poisson equation, and the complex-valued time-dependent Schrödinger partial differential equation.
Submitted 5 July, 2022; v1 submitted 21 October, 2021;
originally announced October 2021.
-
Modeling the effect of the vaccination campaign on the Covid-19 pandemic
Authors:
Mattia Angeli,
Georgios Neofotistos,
Marios Mattheakis,
Efthimios Kaxiras
Abstract:
Population-wide vaccination is critical for containing the SARS-CoV-2 (Covid-19) pandemic when combined with restrictive and prevention measures. In this study, we introduce SAIVR, a mathematical model able to forecast the Covid-19 epidemic evolution during the vaccination campaign. SAIVR extends the widely used Susceptible-Infectious-Removed (SIR) model by considering the Asymptomatic (A) and Vaccinated (V) compartments. The model contains several parameters and initial conditions that are estimated by employing a semi-supervised machine learning procedure. After training an unsupervised neural network to solve the SAIVR differential equations, a supervised framework then estimates the optimal conditions and parameters that best fit recent infectious curves of 27 countries. Instructed by these results, we performed an extensive study on the temporal evolution of the pandemic under varying values of roll-out daily rates, vaccine efficacy, and a broad range of societal vaccine hesitancy/denial levels. The concept of herd immunity is questioned by studying future scenarios which involve different vaccination efforts and more infectious Covid-19 variants.
Submitted 27 August, 2021;
originally announced August 2021.
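As background for the abstract above, the following sketch integrates the classic SIR model that SAIVR is said to extend. It does not implement SAIVR: the Asymptomatic and Vaccinated compartments are omitted because their equations are not given here. The forward-Euler scheme, the function name, and the parameter names `beta` and `gamma` are illustrative assumptions.

```python
def simulate_sir(beta, gamma, s0, i0, r0, dt, steps):
    """Forward-Euler integration of the classic SIR equations:
    dS/dt = -beta*S*I/N,  dI/dt = beta*S*I/N - gamma*I,  dR/dt = gamma*I.
    Returns a list of (S, I, R) tuples, one per time point."""
    s, i, r = float(s0), float(i0), float(r0)
    n = s + i + r  # total population, conserved by the equations
    history = [(s, i, r)]
    for _ in range(steps):
        new_infections = beta * s * i / n * dt
        new_removals = gamma * i * dt
        s -= new_infections
        i += new_infections - new_removals
        r += new_removals
        history.append((s, i, r))
    return history
```

Fitting `beta`, `gamma`, and the initial conditions to reported case curves is the inverse problem that the abstract addresses with a semi-supervised procedure.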
-
Unsupervised Reservoir Computing for Solving Ordinary Differential Equations
Authors:
Marios Mattheakis,
Hayden Joy,
Pavlos Protopapas
Abstract:
There is a wave of interest in using unsupervised neural networks for solving differential equations. The existing methods are based on feed-forward networks, while recurrent neural network differential equation solvers have not yet been reported. We introduce an unsupervised reservoir computer (RC), an echo-state recurrent neural network capable of discovering approximate solutions that satisfy ordinary differential equations (ODEs). We suggest an approach to calculate time derivatives of recurrent neural network outputs without using backpropagation. The internal weights of an RC are fixed, while only a linear output layer is trained, yielding efficient training. However, RC performance strongly depends on finding the optimal hyper-parameters, which is a computationally expensive process. We use Bayesian optimization to efficiently discover optimal sets in a high-dimensional hyper-parameter space and numerically show that one set is robust and can be used to solve an ODE for different initial conditions and time ranges. A closed-form formula for the optimal output weights is derived to solve first-order linear equations in a backpropagation-free learning process. We extend the RC approach by solving nonlinear systems of ODEs using a hybrid optimization method consisting of gradient descent and Bayesian optimization. Evaluation of linear and nonlinear systems of equations demonstrates the efficiency of the RC ODE solver.
Submitted 25 August, 2021;
originally announced August 2021.
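The statement above that the internal weights are fixed and only a linear output layer is trained can be sketched with a generic echo-state network. Note the substitution: this is the standard supervised setting with a ridge-regression readout, not the unsupervised ODE solver or the closed-form formula from the paper. The reservoir size, weight ranges, and function names are illustrative assumptions.

```python
import numpy as np

def run_reservoir(inputs, n_res=50, spectral_radius=0.9, seed=0):
    """Drive a fixed random reservoir with a scalar input sequence; return all hidden states."""
    rng = np.random.default_rng(seed)
    W_in = rng.uniform(-0.5, 0.5, size=n_res)        # fixed input weights
    W = rng.uniform(-0.5, 0.5, size=(n_res, n_res))  # fixed recurrent weights
    # Rescale so the largest eigenvalue magnitude equals spectral_radius
    W *= spectral_radius / np.max(np.abs(np.linalg.eigvals(W)))
    states = np.zeros((len(inputs), n_res))
    h = np.zeros(n_res)
    for k, u in enumerate(inputs):
        h = np.tanh(W @ h + W_in * u)
        states[k] = h
    return states

def fit_readout(states, targets, ridge=1e-4):
    """Train only the linear output layer by solving the ridge-regression normal equations."""
    A = states.T @ states + ridge * np.eye(states.shape[1])
    return np.linalg.solve(A, states.T @ targets)

# Usage: predict the next value of a sine wave from the reservoir state
u = np.sin(np.linspace(0.0, 8.0 * np.pi, 200))
targets = np.roll(u, -1)
states = run_reservoir(u)
w_out = fit_readout(states, targets)
```

Because training reduces to one linear solve, no gradients flow through the recurrent weights; the remaining cost lies in choosing hyper-parameters such as `spectral_radius`, which is where the abstract brings in Bayesian optimization.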
-
Port-Hamiltonian Neural Networks for Learning Explicit Time-Dependent Dynamical Systems
Authors:
Shaan Desai,
Marios Mattheakis,
David Sondak,
Pavlos Protopapas,
Stephen Roberts
Abstract:
Accurately learning the temporal behavior of dynamical systems requires models with well-chosen learning biases. Recent innovations embed the Hamiltonian and Lagrangian formalisms into neural networks and demonstrate a significant improvement over other approaches in predicting trajectories of physical systems. These methods generally tackle autonomous systems that depend implicitly on time or systems for which a control signal is known a priori. Despite this success, many real-world dynamical systems are non-autonomous, driven by time-dependent forces, and subject to energy dissipation. In this study, we address the challenge of learning from such non-autonomous systems by embedding the port-Hamiltonian formalism into neural networks, a versatile framework that can capture energy dissipation and time-dependent control forces. We show that the proposed port-Hamiltonian neural network can efficiently learn the dynamics of nonlinear physical systems of practical interest and accurately recover the underlying stationary Hamiltonian, time-dependent force, and dissipative coefficient. A promising outcome of our network is its ability to learn and predict chaotic systems such as the Duffing equation, for which the trajectories are typically hard to learn.
Submitted 16 July, 2021;
originally announced July 2021.
-
Encoding Involutory Invariances in Neural Networks
Authors:
Anwesh Bhattacharya,
Marios Mattheakis,
Pavlos Protopapas
Abstract:
In certain situations, neural networks are trained upon data that obey underlying symmetries. However, the predictions do not respect the symmetries exactly unless embedded in the network structure. In this work, we introduce architectures that embed a special kind of symmetry, namely invariance with respect to involutory linear/affine transformations up to parity $p=\pm 1$. We provide rigorous theorems to show that the proposed network ensures such an invariance and present qualitative arguments for a special universal approximation theorem. An adaptation of our techniques to CNN tasks for datasets with inherent horizontal/vertical reflection symmetry is demonstrated. Extensive experiments indicate that the proposed model outperforms baseline feed-forward and physics-informed neural networks while identically respecting the underlying symmetry.
Submitted 26 April, 2022; v1 submitted 7 June, 2021;
originally announced June 2021.
-
A New Artificial Neuron Proposal with Trainable Simultaneous Local and Global Activation Function
Authors:
Tiago A. E. Ferreira,
Marios Mattheakis,
Pavlos Protopapas
Abstract:
The activation function plays a fundamental role in the artificial neural network learning process. However, there is no obvious choice or procedure to determine the best activation function, which depends on the problem. This study proposes a new artificial neuron, named the global-local neuron, with a trainable activation function composed of two components, one global and one local. The global component is a mathematical function that describes a general feature present across the whole problem domain. The local component is a function that can represent a localized behavior, like a transient or a perturbation. This new neuron can define the importance of each activation function component in the learning phase. Depending on the problem, it results in a purely global, a purely local, or a mixed global and local activation function after the training phase. Here, the trigonometric sine function was employed for the global component and the hyperbolic tangent for the local component. The proposed neuron was tested on problems where the target was a purely global function, a purely local function, or a composition of global and local functions. Two classes of test problems were investigated: regression problems and the solution of differential equations. The experimental tests demonstrated the superior performance of the global-local neuron network compared with simple neural networks with a sine or hyperbolic tangent activation function, and with a hybrid network that combines these two simple neural networks.
Submitted 15 January, 2021;
originally announced January 2021.
-
Unsupervised Neural Networks for Quantum Eigenvalue Problems
Authors:
Henry Jin,
Marios Mattheakis,
Pavlos Protopapas
Abstract:
Eigenvalue problems are critical to several fields of science and engineering. We present a novel unsupervised neural network for discovering eigenfunctions and eigenvalues for differential eigenvalue problems with solutions that identically satisfy the boundary conditions. A scanning mechanism is embedded allowing the method to find an arbitrary number of solutions. The network optimization is data-free and depends solely on the predictions. The unsupervised method is used to solve the quantum infinite well and quantum oscillator eigenvalue problems.
Submitted 10 October, 2020;
originally announced October 2020.
-
Semi-supervised Neural Networks solve an inverse problem for modeling Covid-19 spread
Authors:
Alessandro Paticchio,
Tommaso Scarlatti,
Marios Mattheakis,
Pavlos Protopapas,
Marco Brambilla
Abstract:
Studying the dynamics of COVID-19 is of paramount importance to understanding the efficiency of restrictive measures and developing strategies to defend against upcoming contagion waves. In this work, we study the spread of COVID-19 using a semi-supervised neural network and assuming a passive part of the population remains isolated from the virus dynamics. We start with an unsupervised neural network that learns solutions of differential equations for different modeling parameters and initial conditions. A supervised method then solves the inverse problem by estimating the optimal conditions that generate functions to fit the data for those infected by, recovered from, and deceased due to COVID-19. This semi-supervised approach incorporates real data to determine the evolution of the spread, the passive population, and the basic reproduction number for different countries.
Submitted 10 October, 2020;
originally announced October 2020.
-
Variational Integrator Graph Networks for Learning Energy Conserving Dynamical Systems
Authors:
Shaan Desai,
Marios Mattheakis,
Stephen Roberts
Abstract:
Recent advances show that neural networks embedded with physics-informed priors significantly outperform vanilla neural networks in learning and predicting the long term dynamics of complex physical systems from noisy data. Despite this success, there has only been a limited study on how to optimally combine physics priors to improve predictive performance. To tackle this problem we unpack and generalize recent innovations into individual inductive bias segments. As such, we are able to systematically investigate all possible combinations of inductive biases of which existing methods are a natural subset. Using this framework we introduce Variational Integrator Graph Networks - a novel method that unifies the strengths of existing approaches by combining an energy constraint, high-order symplectic variational integrators, and graph neural networks. We demonstrate, across an extensive ablation, that the proposed unifying framework outperforms existing methods, for data-efficient learning and in predictive accuracy, across both single and many-body problems studied in recent literature. We empirically show that the improvements arise because high order variational integrators combined with a potential energy constraint induce coupled learning of generalized position and momentum updates which can be formalized via the Partitioned Runge-Kutta method.
Submitted 16 July, 2021; v1 submitted 28 April, 2020;
originally announced April 2020.
-
Electronic structure calculations of twisted multi-layer graphene superlattices
Authors:
Georgios A. Tritsaris,
Stephen Carr,
Ziyan Zhu,
Yiqi Xie,
Steven B. Torrisi,
Jing Tang,
Marios Mattheakis,
Daniel Larson,
Efthimios Kaxiras
Abstract:
Quantum confinement endows two-dimensional (2D) layered materials with exceptional physics and novel properties compared to their bulk counterparts. Although certain two- and few-layer configurations of graphene have been realized and studied, a systematic investigation of the properties of arbitrarily layered graphene assemblies is still lacking. We introduce theoretical concepts and methods for the processing of materials information, and as a case study, apply them to investigate the electronic structure of multi-layer graphene-based assemblies in a high-throughput fashion. We provide a critical discussion of patterns and trends in tight binding band structures and we identify specific layered assemblies using low-dispersion electronic bands as indicators of potentially interesting physics like strongly correlated behavior. A combination of data-driven models for visualization and prediction is used to intelligently explore the materials space. This work more generally aims to increase confidence in the combined use of physics-based and data-driven modeling for the systematic refinement of knowledge about 2D layered materials, with implications for the development of novel quantum devices.
Submitted 30 January, 2020;
originally announced January 2020.
-
Hamiltonian neural networks for solving equations of motion
Authors:
Marios Mattheakis,
David Sondak,
Akshunna S. Dogra,
Pavlos Protopapas
Abstract:
There has been a wave of interest in applying machine learning to study dynamical systems. We present a Hamiltonian neural network that solves the differential equations that govern dynamical systems. This is an equation-driven machine learning method where the optimization process of the network depends solely on the predicted functions without using any ground truth data. The model learns solutions that satisfy, up to an arbitrarily small error, Hamilton's equations and, therefore, conserve the Hamiltonian invariants. The choice of an appropriate activation function drastically improves the predictability of the network. Moreover, an error analysis is derived, showing that the numerical errors depend on the overall network performance. The Hamiltonian network is then employed to solve the equations for the nonlinear oscillator and the chaotic Hénon-Heiles dynamical system. In both systems, a symplectic Euler integrator requires two orders of magnitude more evaluation points than the Hamiltonian network to achieve the same order of numerical error in the predicted phase space trajectories.
Submitted 26 April, 2022; v1 submitted 29 January, 2020;
originally announced January 2020.
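The symplectic Euler integrator used as the comparison baseline in the abstract above can be sketched as follows. The specific oscillator, H = p²/2 + x²/2 + x⁴/4, is an assumption chosen for illustration, since the abstract does not state the Hamiltonian, and the function names are hypothetical.

```python
def energy(x, p):
    """Assumed nonlinear-oscillator Hamiltonian: H = p^2/2 + x^2/2 + x^4/4."""
    return 0.5 * p * p + 0.5 * x * x + 0.25 * x ** 4

def force(x):
    """-dV/dx for the assumed potential V = x^2/2 + x^4/4."""
    return -(x + x ** 3)

def symplectic_euler(x0, p0, dt, steps):
    """Semi-implicit Euler: update p with the old x, then update x with the new p."""
    x, p = float(x0), float(p0)
    trajectory = [(x, p)]
    for _ in range(steps):
        p = p + dt * force(x)
        x = x + dt * p
        trajectory.append((x, p))
    return trajectory
```

The scheme does not conserve energy exactly, but its energy error stays bounded at order `dt` instead of drifting, so matching a given accuracy requires shrinking `dt` and therefore taking more evaluation points.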
-
LAN -- A materials notation for 2D layered assemblies
Authors:
Georgios A. Tritsaris,
Yiqi Xie,
Alexander M. Rush,
Stephen Carr,
Marios Mattheakis,
Efthimios Kaxiras
Abstract:
Two-dimensional (2D) layered materials offer intriguing possibilities for novel physics and applications. Before any attempt at exploring the materials space in a systematic fashion, or combining insights from theory, computation and experiment, a formal description of information about an assembly of arbitrary composition is required. Here, we introduce a domain-generic notation that is used to describe the space of 2D layered materials from monolayers to twisted assemblies of arbitrary composition, existent or not-yet-fabricated. The notation corresponds to a theoretical materials concept of stepwise assembly of layered structures using a sequence of rotation, vertical stacking, and other operations on individual 2D layers. Its scope is demonstrated with a number of example structures using common single-layer materials as building blocks. This work overall aims to contribute to the systematic codification, capture and transfer of materials knowledge in the area of 2D layered materials.
Submitted 30 January, 2020; v1 submitted 8 October, 2019;
originally announced October 2019.
-
Graphene epsilon-near-zero plasmonic crystals
Authors:
Marios Mattheakis,
Matthias Maier,
Wei Xi Boo,
Efthimios Kaxiras
Abstract:
Plasmonic crystals are a class of optical metamaterials that consist of engineered structures at the sub-wavelength scale. They exhibit optical properties that are not found under normal circumstances in nature, such as negative-refractive-index and epsilon-near-zero (ENZ) behavior. Graphene-based plasmonic crystals present linear, elliptical, or hyperbolic dispersion relations that exhibit ENZ behavior and normal or negative-index diffraction. The optical properties can be dynamically tuned by controlling the operating frequency and the doping level of graphene. We propose a construction approach to expand the frequency range of the ENZ behavior. We demonstrate how combining a host material that has an optical Lorentzian response with a graphene conductivity that follows a Drude model leads to an ENZ condition spanning a large frequency range.
Submitted 31 May, 2019;
originally announced June 2019.
-
Physical Symmetries Embedded in Neural Networks
Authors:
M. Mattheakis,
P. Protopapas,
D. Sondak,
M. Di Giovanni,
E. Kaxiras
Abstract:
Neural networks are a central technique in machine learning. Recent years have seen a wave of interest in applying neural networks to physical systems for which the governing dynamics are known and expressed through differential equations. Two fundamental challenges facing the development of neural networks in physics applications are their lack of interpretability and their physics-agnostic design. The focus of the present work is to embed physical constraints into the structure of the neural network to address the second fundamental challenge. By constraining tunable parameters (such as weights and biases) and adding special layers to the network, the desired constraints are guaranteed to be satisfied without the need for explicit regularization terms. This is demonstrated on supervised and unsupervised networks for two basic symmetries: even/odd symmetry of a function and energy conservation. In the supervised case, the network with embedded constraints is shown to perform well on regression problems while simultaneously obeying the desired constraints, whereas a traditional network fits the data but violates the underlying constraints. Finally, a new unsupervised neural network is proposed that guarantees energy conservation through an embedded symplectic structure. The symplectic neural network is used to solve a system of energy-conserving differential equations and outperforms an unsupervised, non-symplectic neural network.
Submitted 29 January, 2020; v1 submitted 18 April, 2019;
originally announced April 2019.
-
Homogenization of plasmonic crystals: Seeking the epsilon-near-zero effect
Authors:
Matthias Maier,
Marios Mattheakis,
Efthimios Kaxiras,
Mitchell Luskin,
Dionisios Margetis
Abstract:
By using an asymptotic analysis and numerical simulations, we derive and investigate a system of homogenized Maxwell's equations for conducting material sheets that are periodically arranged and embedded in a heterogeneous and anisotropic dielectric host. This structure is motivated by the need to design plasmonic crystals that enable the propagation of electromagnetic waves with no phase delay (epsilon-near-zero effect). Our microscopic model incorporates the surface conductivity of the two-dimensional (2D) material of each sheet and a corresponding line charge density through a line conductivity along possible edges of the sheets. Our analysis generalizes averaging principles inherent in previous Bloch-wave approaches. We investigate physical implications of our findings. In particular, we emphasize the role of the vector-valued corrector field, which expresses microscopic modes of surface waves on the 2D material. We demonstrate how our homogenization procedure may set the foundation for computational investigations of: effective optical responses of reasonably general geometries, and complicated design problems in the plasmonics of 2D materials.
Submitted 19 April, 2019; v1 submitted 21 September, 2018;
originally announced September 2018.
-
Machine learning with observers predicts complex spatiotemporal behavior
Authors:
G. Neofotistos,
M. Mattheakis,
G. D. Barmparis,
J. Hizanidis,
G. P. Tsironis,
E. Kaxiras
Abstract:
Chimeras and branching are two archetypical complex phenomena that appear in many physical systems; because of their different intrinsic dynamics, they delineate opposite non-trivial limits in the complexity of wave motion and present severe challenges in predicting chaotic and singular behavior in extended physical systems. We report on the long-term forecasting capability of Long Short-Term Memory (LSTM) and reservoir computing (RC) recurrent neural networks, when they are applied to the spatiotemporal evolution of turbulent chimeras in simulated arrays of coupled superconducting quantum interference devices (SQUIDs) or lasers, and branching in the electronic flow of two-dimensional graphene with random potential. We propose a new method in which we assign one LSTM network to each system node except for "observer" nodes which provide continual "ground truth" measurements as input; we refer to this method as "Observer LSTM" (OLSTM). We demonstrate that even a small number of observers greatly improves the data-driven (model-free) long-term forecasting capability of the LSTM networks and provide the framework for a consistent comparison between the RC and LSTM methods. We find that RC requires smaller training datasets than OLSTMs, but the latter requires fewer observers. Both methods are benchmarked against Feed-Forward neural networks (FNNs), also trained to make predictions with observers (OFNNs).
Submitted 27 July, 2018;
originally announced July 2018.
-
Emergence and dynamical properties of stochastic branching in the electronic flows of disordered Dirac solids
Authors:
Marios Mattheakis,
G. P. Tsironis,
Efthimios Kaxiras
Abstract:
Graphene and, more generally, Dirac solids are two-dimensional materials in which the electronic flow is ultrarelativistic. When a Dirac solid is deposited on a different substrate surface with roughness, a local random potential develops through an inhomogeneous charge impurity distribution. This external potential profoundly affects the charge flow and induces a chaotic pattern of current branches that develops through focusing and defocusing effects produced by the randomness of the surface. An additional bias voltage may be used to tune the branching pattern of the charge carrier currents. We employ analytical and numerical techniques in order to investigate the onset and the statistical properties of carrier branches in Dirac solids. We find a specific scaling-type relationship that connects the physical scale for the occurrence of branches with the characteristic medium properties, such as disorder and bias field. We use numerics to test and verify the theoretical prediction as well as a perturbative approach that gives a clear indication of the regime of validity of the approach. This work is relevant to device applications and may be tested experimentally.
Submitted 24 March, 2018; v1 submitted 24 January, 2018;
originally announced January 2018.
-
Universal behavior of dispersive Dirac cone in gradient-index plasmonic metamaterials
Authors:
Matthias Maier,
Marios Mattheakis,
Efthimios Kaxiras,
Mitchell Luskin,
Dionisios Margetis
Abstract:
We demonstrate analytically and numerically that the dispersive Dirac cone emulating an epsilon-near-zero (ENZ) behavior is a universal property within a family of plasmonic crystals consisting of two-dimensional (2D) metals. Our starting point is a periodic array of 2D metallic sheets embedded in an inhomogeneous and anisotropic dielectric host that allows for propagation of transverse-magnetic (TM) polarized waves. By invoking a systematic bifurcation argument for arbitrary dielectric profiles in one spatial dimension, we show how TM Bloch waves experience an effective dielectric function that averages out microscopic details of the host medium. The corresponding effective dispersion relation reduces to a Dirac cone when the conductivity of the metallic sheet and the period of the array satisfy a critical condition for ENZ behavior. Our analytical findings are in excellent agreement with numerical simulations.
Submitted 12 January, 2018; v1 submitted 6 November, 2017;
originally announced November 2017.
-
Extreme waves and branching flows in optical media
Authors:
M. Mattheakis,
G. P. Tsironis
Abstract:
We address light propagation properties in complex media consisting of random distributions of lenses that have specific focusing properties. We present both analytical and numerical techniques that can be used to study emergent properties of light organization in these media. As light propagates, it experiences multiple scattering leading to the formation of light bundles in the form of branches; these are random yet occur systematically in the medium, particularly in the weak scattering limit. On the other hand, in the strong scattering limit we find that coalescence of branches may lead to the formation of extreme waves of the "rogue wave" type. These waves appear at specific locations and arise in the linear as well as in the nonlinear regimes. We present both the weak and strong scattering limit and show that these complex phenomena can be studied numerically and analytically through simple models.
Submitted 6 April, 2017;
originally announced April 2017.
-
Graphene and active metamaterials: theoretical methods and physical properties
Authors:
Marios Mattheakis,
Giorgos P. Tsironis,
Efthimios Kaxiras
Abstract:
The interaction of light with matter has attracted the interest of scientists for a long time. The area of plasmonics emerges in this context through the interaction of light with valence electrons in metals. The random phase approximation in the long wavelength limit is used for analytical investigation of plasmons in three-dimensional metals, in a two-dimensional electron gas and finally in the most famous two-dimensional semi-metal, namely graphene. We show that plasmons in bulk metals as well as in a two-dimensional electron gas originate from classical laws, whereas quantum effects appear as non-local corrections. On the other hand, graphene plasmons are purely quantum modes and, thus, they would not exist in a "classical world". Furthermore, under certain circumstances, light is able to couple with plasmons on metallic surfaces, forming a surface plasmon polariton, which is very important in nanoplasmonics due to its subwavelength nature. In addition, we outline two applications that complete our theoretical investigation. Firstly, we examine how the presence of gain (active) dielectrics affects surface plasmon polariton properties and we find that there is a gain value for which the metallic losses are completely eliminated, resulting in lossless plasmon propagation. Secondly, we combine monolayers of graphene in a periodic order and construct a plasmonic metamaterial that provides tunable wave propagation properties, such as epsilon-near-zero behavior, normal and negative refraction.
Submitted 6 April, 2017;
originally announced April 2017.
-
Manipulating polarized light with a planar slab of Black Phosphorus
Authors:
Constantinos A. Valagiannopoulos,
Marios Mattheakis,
Sharmila N. Shirodkar,
Efthimios Kaxiras
Abstract:
Wave polarization contains valuable information for electromagnetic signal processing and the ability to manipulate it can be extremely useful in photonic devices. In this work, we propose designs comprised of one of the emerging and interesting two-dimensional media: Black Phosphorus. Due to substantial in-plane anisotropy, a single slab of Black Phosphorus can be very efficient for manipulating the polarization state of electromagnetic waves. We investigate Black Phosphorus slabs that filter the fields along one direction, rotate the polarization axis, or convert linear polarization to circular. These slabs can be employed as components in numerous mid-IR integrated devices.
Submitted 17 September, 2017; v1 submitted 18 March, 2017;
originally announced March 2017.
-
Quantum plasmons with optical-range frequencies in doped few-layer graphene
Authors:
Sharmila N. Shirodkar,
Marios Mattheakis,
Paul Cazeaux,
Prineha Narang,
Marin Soljačić,
Efthimios Kaxiras
Abstract:
Although plasmon modes exist in doped graphene, the limited range of doping achieved by gating restricts the plasmon frequencies to a range that does not include visible and infrared. Here we show, through the use of first-principles calculations, that the high levels of doping achieved by lithium intercalation in bilayer and trilayer graphene shift the plasmon frequencies into the visible range. To obtain physically meaningful results, we introduce a correction of the effect of plasmon interaction across the vacuum separating periodic images of the doped graphene layers, consisting of transparent boundary conditions in the direction perpendicular to the layers; this represents a significant improvement over the Exact Coulomb cutoff technique employed in earlier works. The resulting plasmon modes are due to local field effects and the non-local response of the material to external electromagnetic fields, requiring a fully quantum mechanical treatment. We describe the features of these quantum plasmons, including the dispersion relation, losses and field localization. Our findings point to a strategy for fine-tuning the plasmon frequencies in graphene and other two-dimensional materials.
Submitted 2 May, 2018; v1 submitted 4 March, 2017;
originally announced March 2017.
-
Epsilon-Near-Zero behavior from plasmonic Dirac point: theory and realization using two-dimensional materials
Authors:
Marios Mattheakis,
Constantinos A. Valagiannopoulos,
Efthimios Kaxiras
Abstract:
The electromagnetic response of a two-dimensional metal embedded in a periodic array of a dielectric host can give rise to a plasmonic Dirac point that emulates Epsilon-Near-Zero (ENZ) behavior. This theoretical result is extremely sensitive to structural features like periodicity of the dielectric medium and thickness imperfections. We propose that such a device can actually be realized by using graphene as the 2D metal and materials like the layered semiconducting transition-metal dichalcogenides or hexagonal boron nitride as the dielectric host. We propose a systematic approach, in terms of design characteristics, for constructing metamaterials with linear, elliptical and hyperbolic dispersion relations which produce ENZ behavior, normal or negative diffraction.
Submitted 11 October, 2016;
originally announced October 2016.
-
Phase transition in PT symmetric active plasmonic systems
Authors:
M. Mattheakis,
T. Oikonomou,
M. I. Molina,
G. P. Tsironis
Abstract:
Surface plasmon polaritons (SPPs) are coherent electromagnetic surface waves trapped on an insulator-conductor interface. The SPPs decay exponentially along the propagation direction due to conductor losses, restricting the SPP propagation length to a few microns. Gain materials can be used to counterbalance the aforementioned losses. We provide an exact expression for the gain, in terms of the optical properties of the interface, for which the losses are eliminated. In addition, we show that systems characterized by lossless SPP propagation are related to PT symmetric systems. Furthermore, we derive an analytical critical value of the gain describing a phase transition between lossless and prohibited SPP propagation. The regime of the aforementioned propagation can be directed by the optical properties of the system under scrutiny. Finally, we perform COMSOL simulations verifying the theoretical findings.
Submitted 12 October, 2015; v1 submitted 29 July, 2015;
originally announced July 2015.
-
Rogue events in complex linear and nonlinear photonic media
Authors:
M. Mattheakis,
I. J. Pitsios,
G. P. Tsironis,
S. Tzortzakis
Abstract:
Ocean rogue waves (RWs), huge solitary waves, have long attracted the interest of scientists. RWs emerge in a complex environment, and the relative importance of linear versus nonlinear processes is still unclear. Recent works have demonstrated that RWs appear in various other physical systems such as microwaves, nonlinear crystals, cold atoms, etc. In this work we investigate optical wave propagation in strongly scattering random lattices embedded in the bulk of transparent glasses. In the linear regime we observe the appearance of RWs that depend solely on the scattering properties of the medium. Interestingly, the addition of nonlinearity does not modify the RW statistics, whereas, as the nonlinearity is increased, multiple filamentation and intensity clamping destroy the RW statistics. Numerical simulations agree nicely with the experimental findings and altogether prove that optical rogue waves are generated through the linear strong scattering in such complex environments.
Submitted 2 September, 2015; v1 submitted 16 July, 2015;
originally announced July 2015.
-
Small-world networks of optical fiber lattices
Authors:
F. Perakis,
M. Mattheakis,
G. P. Tsironis
Abstract:
We use a simple dynamical model and explore coherent dynamics of wavepackets in complex networks of optical fibers. We start from a symmetric lattice and through the application of a Monte-Carlo criterion we introduce structural disorder and deform the lattice into a small-world network regime. We investigate in the latter both structural (correlation length) as well as dynamical (diffusion exponent) properties and find that both exhibit a rapid crossover from the ordered to the fully random regime. For a critical value of the structural disorder parameter $\rho \approx 0.25$, transport changes from ballistic to sub-diffusive due to the creation of strongly connected local clusters and channels of preferential transport in the small-world regime.
Submitted 8 August, 2014; v1 submitted 10 January, 2014;
originally announced January 2014.
-
Enhanced surface plasmon polariton propagation induced by active dielectrics
Authors:
C. Athanasopoulos,
M. Mattheakis,
G. P. Tsironis
Abstract:
We present numerical simulations for the propagation of surface plasmon polaritons in a dielectric-metal-dielectric waveguide using COMSOL Multiphysics software. We show that the use of an active dielectric with gain that compensates metal absorption losses substantially enhances plasmon propagation. Furthermore, the introduction of the active material induces, for a specific gain value, a root in the imaginary part of the propagation constant leading to infinite propagation of the surface plasmon. The computational approaches analyzed in this work can be used to define and tune the optimal conditions for surface plasmon polariton amplification and propagation.
Submitted 8 August, 2014; v1 submitted 22 November, 2013;
originally announced November 2013.
-
Luneburg lens waveguide networks
Authors:
Marios Mattheakis,
George Tsironis,
Vassilios Kovanis
Abstract:
We investigate certain configurations of Luneburg lenses that form light propagating and guiding networks. We study single Luneburg lens dynamics and apply the single lens ray tracing solution to various arrangements of multiple lenses. The wave propagating features of the Luneburg lens networks are also verified through direct numerical solutions of Maxwell's equations. We find that Luneburg lenses may form efficient waveguides for light propagation and guiding. The additional presence of nonlinearity improves the focusing characteristics of the networks.
Submitted 18 July, 2012;
originally announced July 2012.