-
Fleet Size and Spill for UAM Operation under Uncertain Demand
Authors:
Shangqing Cao,
Xuan Jiang,
Emin Burak Onat,
Bo Zou,
Mark Hansen,
Raja Sengupta,
Anjan Chakrabarty
Abstract:
Variation and imbalance in demand poses significant challenges to Urban Air Mobility (UAM) operations, affecting strategic decisions such as fleet sizing. To study the implications of demand variation on UAM fleet operations, we propose a stochastic passenger arrival time generation model that uses real-world data to infer demand distributions, and two integer programs that compute the zero-spill…
▽ More
Variation and imbalance in demand poses significant challenges to Urban Air Mobility (UAM) operations, affecting strategic decisions such as fleet sizing. To study the implications of demand variation on UAM fleet operations, we propose a stochastic passenger arrival time generation model that uses real-world data to infer demand distributions, and two integer programs that compute the zero-spill fleet size and the spill-minimizing flight schedules and charging policies, respectively. Our numerical experiment on a two-vertiport network shows that spill in relatively inelastic to fleet size and that the driving factor behind spill is the imbalance in demand.
△ Less
Submitted 1 July, 2024;
originally announced July 2024.
-
BEACON: A Bayesian Optimization Strategy for Novelty Search in Expensive Black-Box Systems
Authors:
Wei-Ting Tang,
Ankush Chakrabarty,
Joel A. Paulson
Abstract:
Novelty search (NS) refers to a class of exploration algorithms that automatically uncover diverse system behaviors through simulations or experiments. Systematically obtaining diverse outcomes is a key component in many real-world design problems such as material and drug discovery, neural architecture search, reinforcement learning, and robot navigation. Since the relationship between the inputs…
▽ More
Novelty search (NS) refers to a class of exploration algorithms that automatically uncover diverse system behaviors through simulations or experiments. Systematically obtaining diverse outcomes is a key component in many real-world design problems such as material and drug discovery, neural architecture search, reinforcement learning, and robot navigation. Since the relationship between the inputs and outputs (i.e., behaviors) of these complex systems is typically not available in closed form, NS requires a black-box perspective. Consequently, popular NS algorithms rely on evolutionary optimization and other meta-heuristics that require intensive sampling of the input space, which is impractical when the system is expensive to evaluate. We propose a Bayesian optimization inspired algorithm for sample-efficient NS that is specifically designed for such expensive black-box systems. Our approach models the input-to-behavior mapping with multi-output Gaussian processes (MOGP) and selects the next point to evaluate by maximizing a novelty metric that depends on a posterior sample drawn from the MOGP that promotes both exploration and exploitation. By leveraging advances in efficient posterior sampling and high-dimensional Gaussian process modeling, we discuss how our approach can be made scalable with respect to both amount of data and number of inputs. We test our approach on ten synthetic benchmark problems and eight real-world problems (with up to 2133 inputs) including new applications such as discovery of diverse metal organic frameworks for use in clean energy technology. We show that our approach greatly outperforms existing NS algorithms by finding substantially larger sets of diverse behaviors under limited sample budgets.
△ Less
Submitted 5 June, 2024;
originally announced June 2024.
-
A Simulation-Optimization Framework for Developing Wind-Resilient AAM Networks
Authors:
Emin Burak Onat,
Shangqing Cao,
Raiyan Rizwan,
Xuan Jiang,
Mark Hansen,
Raja Sengupta,
Anjan Chakrabarty
Abstract:
Environmental factors pose a significant challenge to the operational efficiency and safety of advanced air mobility (AAM) networks. This paper presents a simulation-optimization framework that dynamically integrates wind variability into AAM operations. We employ a nonlinear charging model within a multi-vertiport environment to optimize fleet size and scheduling. Our framework assesses the impac…
▽ More
Environmental factors pose a significant challenge to the operational efficiency and safety of advanced air mobility (AAM) networks. This paper presents a simulation-optimization framework that dynamically integrates wind variability into AAM operations. We employ a nonlinear charging model within a multi-vertiport environment to optimize fleet size and scheduling. Our framework assesses the impact of wind on operational parameters, providing strategies to enhance the resilience of AAM ecosystems. The results demonstrate that wind conditions exert significant influence on fleet size even for short-distance flights, their impact on fleet size and energy requirements becomes more pronounced over longer distances. Efficient management of fleet size and charging policies, particularly for long-distance networks, is needed to accommodate the variability of wind conditions effectively.
△ Less
Submitted 17 May, 2024;
originally announced May 2024.
-
Extended unitarity and absence of skin effect in periodically driven systems
Authors:
Aditi Chakrabarty,
Sanjoy Datta
Abstract:
One of the most striking features of non-Hermitian quasiperiodic systems with arbitrarily small asymmetry in the hopping amplitudes and open boundaries is the accumulation of all the bulk eigenstates at one of the edges of the system, termed in literature as the skin effect, below a critical strength of the potential. In this Letter, we uncover that a time-periodic drive in such systems can elimin…
▽ More
One of the most striking features of non-Hermitian quasiperiodic systems with arbitrarily small asymmetry in the hopping amplitudes and open boundaries is the accumulation of all the bulk eigenstates at one of the edges of the system, termed in literature as the skin effect, below a critical strength of the potential. In this Letter, we uncover that a time-periodic drive in such systems can eliminate the SE up to a finite strength of this asymmetry. Remarkably, the critical value for the onset of SE is independent of the driving frequency and approaches to the static behavior in the thermodynamic limit. We find that the absence of SE is intricately linked to the emergence of extended unitarity in the delocalized phase, providing dynamical stability to the system. Interestingly, under periodic boundary condition, our non-Hermitian system can be mapped to a Hermitian analogue in the large driving frequency limit that leads to the extended unitarity irrespective of the hopping asymmetry and the strength of the quasiperiodic potential, in stark contrast to the static limit. Additionally, we numerically verify that this behavior persists Based on our findings, we propose a possible experimental realization of our driven system, which could be used as a switch to control the light funneling mechanism.
△ Less
Submitted 18 April, 2024;
originally announced April 2024.
-
MPC of Uncertain Nonlinear Systems with Meta-Learning for Fast Adaptation of Neural Predictive Models
Authors:
Jiaqi Yan,
Ankush Chakrabarty,
Alisa Rupenyan,
John Lygeros
Abstract:
In this paper, we consider the problem of reference tracking in uncertain nonlinear systems. A neural State-Space Model (NSSM) is used to approximate the nonlinear system, where a deep encoder network learns the nonlinearity from data, and a state-space component captures the temporal relationship. This transforms the nonlinear system into a linear system in a latent space, enabling the applicatio…
▽ More
In this paper, we consider the problem of reference tracking in uncertain nonlinear systems. A neural State-Space Model (NSSM) is used to approximate the nonlinear system, where a deep encoder network learns the nonlinearity from data, and a state-space component captures the temporal relationship. This transforms the nonlinear system into a linear system in a latent space, enabling the application of model predictive control (MPC) to determine effective control actions. Our objective is to design the optimal controller using limited data from the \textit{target system} (the system of interest). To this end, we employ an implicit model-agnostic meta-learning (iMAML) framework that leverages information from \textit{source systems} (systems that share similarities with the target system) to expedite training in the target system and enhance its control performance. The framework consists of two phases: the (offine) meta-training phase learns a aggregated NSSM using data from source systems, and the (online) meta-inference phase quickly adapts this aggregated model to the target system using only a few data points and few online training iterations, based on local loss function gradients. The iMAML algorithm exploits the implicit function theorem to exactly compute the gradient during training, without relying on the entire optimization path. By focusing solely on the optimal solution, rather than the path, we can meta-train with less storage complexity and fewer approximations than other contemporary meta-learning algorithms. We demonstrate through numerical examples that our proposed method can yield accurate predictive models by adaptation, resulting in a downstream MPC that outperforms several baselines.
△ Less
Submitted 18 April, 2024;
originally announced April 2024.
-
Fortifying Fully Convolutional Generative Adversarial Networks for Image Super-Resolution Using Divergence Measures
Authors:
Arkaprabha Basu,
Kushal Bose,
Sankha Subhra Mullick,
Anish Chakrabarty,
Swagatam Das
Abstract:
Super-Resolution (SR) is a time-hallowed image processing problem that aims to improve the quality of a Low-Resolution (LR) sample up to the standard of its High-Resolution (HR) counterpart. We aim to address this by introducing Super-Resolution Generator (SuRGe), a fully-convolutional Generative Adversarial Network (GAN)-based architecture for SR. We show that distinct convolutional features obta…
▽ More
Super-Resolution (SR) is a time-hallowed image processing problem that aims to improve the quality of a Low-Resolution (LR) sample up to the standard of its High-Resolution (HR) counterpart. We aim to address this by introducing Super-Resolution Generator (SuRGe), a fully-convolutional Generative Adversarial Network (GAN)-based architecture for SR. We show that distinct convolutional features obtained at increasing depths of a GAN generator can be optimally combined by a set of learnable convex weights to improve the quality of generated SR samples. In the process, we employ the Jensen-Shannon and the Gromov-Wasserstein losses respectively between the SR-HR and LR-SR pairs of distributions to further aid the generator of SuRGe to better exploit the available information in an attempt to improve SR. Moreover, we train the discriminator of SuRGe with the Wasserstein loss with gradient penalty, to primarily prevent mode collapse. The proposed SuRGe, as an end-to-end GAN workflow tailor-made for super-resolution, offers improved performance while maintaining low inference time. The efficacy of SuRGe is substantiated by its superior performance compared to 18 state-of-the-art contenders on 10 benchmark datasets.
△ Less
Submitted 9 April, 2024;
originally announced April 2024.
-
Concurrent Density Estimation with Wasserstein Autoencoders: Some Statistical Insights
Authors:
Anish Chakrabarty,
Arkaprabha Basu,
Swagatam Das
Abstract:
Variational Autoencoders (VAEs) have been a pioneering force in the realm of deep generative models. Amongst its legions of progenies, Wasserstein Autoencoders (WAEs) stand out in particular due to the dual offering of heightened generative quality and a strong theoretical backbone. WAEs consist of an encoding and a decoding network forming a bottleneck with the prime objective of generating new s…
▽ More
Variational Autoencoders (VAEs) have been a pioneering force in the realm of deep generative models. Amongst its legions of progenies, Wasserstein Autoencoders (WAEs) stand out in particular due to the dual offering of heightened generative quality and a strong theoretical backbone. WAEs consist of an encoding and a decoding network forming a bottleneck with the prime objective of generating new samples resembling the ones it was catered to. In the process, they aim to achieve a target latent representation of the encoded data. Our work is an attempt to offer a theoretical understanding of the machinery behind WAEs. From a statistical viewpoint, we pose the problem as concurrent density estimation tasks based on neural network-induced transformations. This allows us to establish deterministic upper bounds on the realized errors WAEs commit. We also analyze the propagation of these stochastic errors in the presence of adversaries. As a result, both the large sample properties of the reconstructed distribution and the resilience of WAE models are explored.
△ Less
Submitted 11 December, 2023;
originally announced December 2023.
-
Evaluating eVTOL Network Performance and Fleet Dynamics through Simulation-Based Analysis
Authors:
Emin Burak Onat,
Vishwanath Bulusu,
Anjan Chakrabarty,
Mark Hansen,
Raja Sengupta,
Banavar Sridar
Abstract:
Urban Air Mobility (UAM) represents a promising solution for future transportation. In this study, we introduce VertiSim, an advanced event-driven simulator developed to evaluate e-VTOL transportation networks. Uniquely, VertiSim simultaneously models passenger, aircraft, and energy flows, reflecting the interrelated complexities of UAM systems. We utilized VertiSim to assess 19 operational scenar…
▽ More
Urban Air Mobility (UAM) represents a promising solution for future transportation. In this study, we introduce VertiSim, an advanced event-driven simulator developed to evaluate e-VTOL transportation networks. Uniquely, VertiSim simultaneously models passenger, aircraft, and energy flows, reflecting the interrelated complexities of UAM systems. We utilized VertiSim to assess 19 operational scenarios serving a daily demand for 2,834 passengers with varying fleet sizes and vertiport distances. The study aims to support stakeholders in making informed decisions about fleet size, network design, and infrastructure development by understanding tradeoffs in passenger delay time, operational costs, and fleet utilization. Our simulations, guided by a heuristic dispatch and charge policy, indicate that fleet size significantly influences passenger delay and energy consumption within UAM networks. We find that increasing the fleet size can reduce average passenger delays, but this comes at the cost of higher operational expenses due to an increase in the number of repositioning flights. Additionally, our analysis highlights how vertiport distances impact fleet utilization: longer distances result in reduced total idle time and increased cruise and charge times, leading to more efficient fleet utilization but also longer passenger delays. These findings are important for UAM network planning, especially in balancing fleet size with vertiport capacity and operational costs. Simulator demo is available at: https://tinyurl.com/vertisim-vis
△ Less
Submitted 5 December, 2023;
originally announced December 2023.
-
Safe multi-agent motion planning under uncertainty for drones using filtered reinforcement learning
Authors:
Sleiman Safaoui,
Abraham P. Vinod,
Ankush Chakrabarty,
Rien Quirynen,
Nobuyuki Yoshikawa,
Stefano Di Cairano
Abstract:
We consider the problem of safe multi-agent motion planning for drones in uncertain, cluttered workspaces. For this problem, we present a tractable motion planner that builds upon the strengths of reinforcement learning and constrained-control-based trajectory planning. First, we use single-agent reinforcement learning to learn motion plans from data that reach the target but may not be collision-…
▽ More
We consider the problem of safe multi-agent motion planning for drones in uncertain, cluttered workspaces. For this problem, we present a tractable motion planner that builds upon the strengths of reinforcement learning and constrained-control-based trajectory planning. First, we use single-agent reinforcement learning to learn motion plans from data that reach the target but may not be collision-free. Next, we use a convex optimization, chance constraints, and set-based methods for constrained control to ensure safety, despite the uncertainty in the workspace, agent motion, and sensing. The proposed approach can handle state and control constraints on the agents, and enforce collision avoidance among themselves and with static obstacles in the workspace with high probability. The proposed approach yields a safe, real-time implementable, multi-agent motion planner that is simpler to train than methods based solely on learning. Numerical simulations and experiments show the efficacy of the approach.
△ Less
Submitted 31 October, 2023;
originally announced November 2023.
-
Where are the Water Worlds? Identifying the Exo-water-worlds Using Models of Planet Formation and Atmospheric Evolution
Authors:
Aritra Chakrabarty,
Gijs D. Mulders
Abstract:
Planet formation models suggest that the small exoplanets that migrate from beyond the snowline of the protoplanetary disk likely contain water-ice-rich cores ($\sim 50\%$ by mass), also known as the water worlds. While the observed radius valley of the Kepler planets is well explained with the atmospheric dichotomy of the rocky planets, precise measurements of mass and radius of the transiting pl…
▽ More
Planet formation models suggest that the small exoplanets that migrate from beyond the snowline of the protoplanetary disk likely contain water-ice-rich cores ($\sim 50\%$ by mass), also known as the water worlds. While the observed radius valley of the Kepler planets is well explained with the atmospheric dichotomy of the rocky planets, precise measurements of mass and radius of the transiting planets hint at the existence of these water worlds. However, observations cannot confirm the core compositions of those planets owing to the degeneracy between the density of a bare water-ice-rich planet and the bulk density of a rocky planet with a thin atmosphere. We combine different formation models from the Genesis library with atmospheric escape models, such as photo-evaporation and impact stripping, to simulate planetary systems consistent with the observed radius valley. We then explore the possibility of water worlds being present in the currently observed sample by comparing them with the simulated planets in the mass-radius-orbital period space. We find that the migration models suggest $\gtrsim 10\%$ and $\gtrsim 20\%$ of the bare planets, i.e. planets without primordial H/He atmospheres, to be water-ice-rich around G- and M-type host stars respectively, consistent with the mass-radius distributions of the observed planets. However, most of the water worlds are predicted to be outside a period of 10 days. A unique identification of water worlds through radial velocity and transmission spectroscopy is likely to be more successful when targeting such planets with longer orbital periods.
△ Less
Submitted 7 February, 2024; v1 submitted 5 October, 2023;
originally announced October 2023.
-
Physics-Informed Machine Learning for Modeling and Control of Dynamical Systems
Authors:
Truong X. Nghiem,
Ján Drgoňa,
Colin Jones,
Zoltan Nagy,
Roland Schwan,
Biswadip Dey,
Ankush Chakrabarty,
Stefano Di Cairano,
Joel A. Paulson,
Andrea Carron,
Melanie N. Zeilinger,
Wenceslao Shaw Cortez,
Draguna L. Vrabie
Abstract:
Physics-informed machine learning (PIML) is a set of methods and tools that systematically integrate machine learning (ML) algorithms with physical constraints and abstract mathematical models developed in scientific and engineering domains. As opposed to purely data-driven methods, PIML models can be trained from additional information obtained by enforcing physical laws such as energy and mass c…
▽ More
Physics-informed machine learning (PIML) is a set of methods and tools that systematically integrate machine learning (ML) algorithms with physical constraints and abstract mathematical models developed in scientific and engineering domains. As opposed to purely data-driven methods, PIML models can be trained from additional information obtained by enforcing physical laws such as energy and mass conservation. More broadly, PIML models can include abstract properties and conditions such as stability, convexity, or invariance. The basic premise of PIML is that the integration of ML and physics can yield more effective, physically consistent, and data-efficient models. This paper aims to provide a tutorial-like overview of the recent advances in PIML for dynamical system modeling and control. Specifically, the paper covers an overview of the theory, fundamental concepts and methods, tools, and applications on topics of: 1) physics-informed learning for system identification; 2) physics-informed learning for control; 3) analysis and verification of PIML models; and 4) physics-informed digital twins. The paper is concluded with a perspective on open challenges and future research opportunities.
△ Less
Submitted 24 June, 2023;
originally announced June 2023.
-
Violation-Aware Contextual Bayesian Optimization for Controller Performance Optimization with Unmodeled Constraints
Authors:
Wenjie Xu,
Colin N Jones,
Bratislav Svetozarevic,
Christopher R. Laughman,
Ankush Chakrabarty
Abstract:
We study the problem of performance optimization of closed-loop control systems with unmodeled dynamics. Bayesian optimization (BO) has been demonstrated to be effective for improving closed-loop performance by automatically tuning controller gains or reference setpoints in a model-free manner. However, BO methods have rarely been tested on dynamical systems with unmodeled constraints and time-var…
▽ More
We study the problem of performance optimization of closed-loop control systems with unmodeled dynamics. Bayesian optimization (BO) has been demonstrated to be effective for improving closed-loop performance by automatically tuning controller gains or reference setpoints in a model-free manner. However, BO methods have rarely been tested on dynamical systems with unmodeled constraints and time-varying ambient conditions. In this paper, we propose a violation-aware contextual BO algorithm (VACBO) that optimizes closed-loop performance while simultaneously learning constraint-feasible solutions under time-varying ambient conditions. Unlike classical constrained BO methods which allow unlimited constraint violations, or 'safe' BO algorithms that are conservative and try to operate with near-zero violations, we allow budgeted constraint violations to improve constraint learning and accelerate optimization. We demonstrate the effectiveness of our proposed VACBO method for energy minimization of industrial vapor compression systems under time-varying ambient temperature and humidity.
△ Less
Submitted 28 January, 2023;
originally announced January 2023.
-
Effect of multiple scattering on the Transmission spectra and the Polarization phase curves for Earth-like Exoplanets
Authors:
Manika Singla,
Aritra Chakrabarty,
Sujan Sengupta
Abstract:
It is the most appropriate time to characterize the Earth-like exoplanets in order to detect biosignature beyond the Earth because such exoplanets will be the prime targets of big-budget missions like JWST, Roman Space Telescope, HabEx, LUVOIR, TMT, ELT, etc. We provide models for the transmission spectra of the Earth-like exoplanets by incorporating effects of multiple scattering. For this purpos…
▽ More
It is the most appropriate time to characterize the Earth-like exoplanets in order to detect biosignature beyond the Earth because such exoplanets will be the prime targets of big-budget missions like JWST, Roman Space Telescope, HabEx, LUVOIR, TMT, ELT, etc. We provide models for the transmission spectra of the Earth-like exoplanets by incorporating effects of multiple scattering. For this purpose we numerically solve the full multiple-scattering radiative transfer equations instead of using Beer-Bouguer-Lambert's law that doesn't include the diffuse radiation due to scattering. Our models demonstrate that the effect of this diffuse transmission radiation can be observationally significant, especially in the presence of clouds. We also calculate the reflection spectra and polarization phase curves of Earth-like exoplanets by considering both cloud-free and cloudy atmospheres. We solve the 3D vector radiative transfer equations numerically and calculate the phase curves of albedo and disk-integrated polarization by using appropriate scattering phase matrices and integrating the local Stokes vectors over the illuminated part of the disks along the line of sight. We present the effects of the globally averaged surface albedo on the reflection spectra and phase curves as the surface features of such planets are known to significantly dictate the nature of these observational quantities. Synergic observations of the spectra and phase curves will certainly prove to be useful in extracting more information and reducing the degeneracy among the estimated parameters of terrestrial exoplanets. Thus, our models will play a pivotal role in driving future observations.
△ Less
Submitted 20 January, 2023;
originally announced January 2023.
-
Clustering of large deviations in moving average processes: the long memory regime
Authors:
Arijit Chakrabarty,
Gennady Samorodnitsky
Abstract:
We investigate how large deviations events cluster in the framework of an infinite moving average process with light-tailed noise and long memory. The long memory makes clusters larger, and the asymptotic behaviour of the size of the cluster turns out to be described by the first hitting time of a randomly shifted fractional Brownian motion with drift.
We investigate how large deviations events cluster in the framework of an infinite moving average process with light-tailed noise and long memory. The long memory makes clusters larger, and the asymptotic behaviour of the size of the cluster turns out to be described by the first hitting time of a randomly shifted fractional Brownian motion with drift.
△ Less
Submitted 5 January, 2023;
originally announced January 2023.
-
Meta-Learning of Neural State-Space Models Using Data From Similar Systems
Authors:
Ankush Chakrabarty,
Gordon Wichern,
Christopher R. Laughman
Abstract:
Deep neural state-space models (SSMs) provide a powerful tool for modeling dynamical systems solely using operational data. Typically, neural SSMs are trained using data collected from the actual system under consideration, despite the likely existence of operational data from similar systems which have previously been deployed in the field. In this paper, we propose the use of model-agnostic meta…
▽ More
Deep neural state-space models (SSMs) provide a powerful tool for modeling dynamical systems solely using operational data. Typically, neural SSMs are trained using data collected from the actual system under consideration, despite the likely existence of operational data from similar systems which have previously been deployed in the field. In this paper, we propose the use of model-agnostic meta-learning (MAML) for constructing deep encoder network-based SSMs, by leveraging a combination of archived data from similar systems (used to meta-train offline) and limited data from the actual system (used for rapid online adaptation). We demonstrate using a numerical example that meta-learning can result in more accurate neural SSM models than supervised- or transfer-learning, despite few adaptation steps and limited online data. Additionally, we show that by carefully partitioning and adapting the encoder layers while fixing the state-transition operator, we can achieve comparable performance to MAML while reducing online adaptation complexity.
△ Less
Submitted 14 November, 2022;
originally announced November 2022.
-
Two-stream Multi-dimensional Convolutional Network for Real-time Violence Detection
Authors:
Dipon Kumar Ghosh,
Amitabha Chakrabarty
Abstract:
The increasing number of surveillance cameras and security concerns have made automatic violent activity detection from surveillance footage an active area for research. Modern deep learning methods have achieved good accuracy in violence detection and proved to be successful because of their applicability in intelligent surveillance systems. However, the models are computationally expensive and l…
▽ More
The increasing number of surveillance cameras and security concerns have made automatic violent activity detection from surveillance footage an active area for research. Modern deep learning methods have achieved good accuracy in violence detection and proved to be successful because of their applicability in intelligent surveillance systems. However, the models are computationally expensive and large in size because of their inefficient methods for feature extraction. This work presents a novel architecture for violence detection called Two-stream Multi-dimensional Convolutional Network (2s-MDCN), which uses RGB frames and optical flow to detect violence. Our proposed method extracts temporal and spatial information independently by 1D, 2D, and 3D convolutions. Despite combining multi-dimensional convolutional networks, our models are lightweight and efficient due to reduced channel capacity, yet they learn to extract meaningful spatial and temporal information. Additionally, combining RGB frames and optical flow yields 2.2% more accuracy than a single RGB stream. Regardless of having less complexity, our models obtained state-of-the-art accuracy of 89.7% on the largest violence detection benchmark dataset.
△ Less
Submitted 8 November, 2022;
originally announced November 2022.
-
Optimizing Closed-Loop Performance with Data from Similar Systems: A Bayesian Meta-Learning Approach
Authors:
Ankush Chakrabarty
Abstract:
Bayesian optimization (BO) has demonstrated potential for optimizing control performance in data-limited settings, especially for systems with unknown dynamics or unmodeled performance objectives. The BO algorithm efficiently trades-off exploration and exploitation by leveraging uncertainty estimates using surrogate models. These surrogates are usually learned using data collected from the target…
▽ More
Bayesian optimization (BO) has demonstrated potential for optimizing control performance in data-limited settings, especially for systems with unknown dynamics or unmodeled performance objectives. The BO algorithm efficiently trades-off exploration and exploitation by leveraging uncertainty estimates using surrogate models. These surrogates are usually learned using data collected from the target dynamical system to be optimized. Intuitively, the convergence rate of BO is better for surrogate models that can accurately predict the target system performance. In classical BO, initial surrogate models are constructed using very limited data points, and therefore rarely yield accurate predictions of system performance. In this paper, we propose the use of meta-learning to generate an initial surrogate model based on data collected from performance optimization tasks performed on a variety of systems that are different to the target system. To this end, we employ deep kernel networks (DKNs) which are simple to train and which comprise encoded Gaussian process models that integrate seamlessly with classical BO. The effectiveness of our proposed DKN-BO approach for speeding up control system performance optimization is demonstrated using a well-studied nonlinear system with unknown dynamics and an unmodeled performance function.
△ Less
Submitted 31 October, 2022;
originally announced November 2022.
-
Skin effect and dynamical delocalization in non-Hermitian quasicrystals with spin-orbit interaction
Authors:
Aditi Chakrabarty,
Sanjoy Datta
Abstract:
The investigations of the spectral and dynamical delocalization-localization (DL) transition have revealed intriguing features in a wide range of non-Hermitian systems. The present study aims at exploring the spectral and dynamical properties in a non-Hermitian quasiperiodic system with asymmetric hopping in the presence of Rashba Spin-Orbit (RSO) interaction. In particular, in such systems, we ha…
▽ More
The investigations of the spectral and dynamical delocalization-localization (DL) transition have revealed intriguing features in a wide range of non-Hermitian systems. The present study aims at exploring the spectral and dynamical properties in a non-Hermitian quasiperiodic system with asymmetric hopping in the presence of Rashba Spin-Orbit (RSO) interaction. In particular, in such systems, we have identified that the DL transition is associated with a concurrent change in the energy spectrum, where the eigenstates always break the time-reversal symmetry for all strenghts of the quasiperiodic potential, contrary to the systems without RSO interaction. Remarkably, we find that the reality of energy spectrum under the open boundary condition that is frequently symbolised as a hallmark of the skin-effect, is a system-size dependent phenomena, and appears even when the associated energies are indeed complex. In addition, it is demonstrated that the spin-flip term in the RSO interaction in fact possesses a tendency to diminish the directionality of the skin-effect. On scrutinizing the dynamical attributes in our non-Hermitian system, we unveil that in spite of the fact that the spectral DL transition accords with the dynamical phase transition, interestingly, the system comes across hyper-diffusive and negative diffusion dynamical regimes depending upon the strength of the RSO interaction, in the spectrally localized regime.
△ Less
Submitted 22 August, 2022;
originally announced August 2022.
-
Clustering of large deviations in moving average processes: the short memory regime
Authors:
Arijit Chakrabarty,
Gennady Samorodnitsky
Abstract:
We describe the cluster of large deviations events that arise when one such large deviations event occurs. We work in the framework of an infinite moving average process with a noise that has finite exponential moments.
We describe the cluster of large deviations events that arise when one such large deviations event occurs. We work in the framework of an infinite moving average process with a noise that has finite exponential moments.
△ Less
Submitted 8 December, 2023; v1 submitted 9 August, 2022;
originally announced August 2022.
-
Data-Driven Identification of Dynamic Quality Models in Drinking Water Networks
Authors:
Shen Wang,
Ankush Chakrabarty,
Ahmad F. Taha
Abstract:
Traditional control and monitoring of water quality in drinking water distribution networks (WDN) rely on mostly model- or toolbox-driven approaches, where the network topology and parameters are assumed to be known. In contrast, system identification (SysID) algorithms for generic dynamic system models seek to approximate such models using only input-output data without relying on network paramet…
▽ More
Traditional control and monitoring of water quality in drinking water distribution networks (WDN) rely on mostly model- or toolbox-driven approaches, where the network topology and parameters are assumed to be known. In contrast, system identification (SysID) algorithms for generic dynamic system models seek to approximate such models using only input-output data without relying on network parameters. The objective of this paper is to investigate SysID algorithms for water quality model approximation. This research problem is challenging due to (i) complex water quality and reaction dynamics and (ii) the mismatch between the requirements of SysID algorithms and the properties of water quality dynamics. In this paper, we present the first attempt to identify water quality models in WDNs using only input-output experimental data and classical SysID methods without knowing any WDN parameters. Properties of water quality models are introduced, the ensuing challenges caused by these properties when identifying water quality models are discussed, and remedial solutions are given. Through case studies, we demonstrate the applicability of SysID algorithms, show the corresponding performance in terms of accuracy and computational time, and explore the possible factors impacting water quality model identification.
△ Less
Submitted 23 January, 2023; v1 submitted 13 July, 2022;
originally announced July 2022.
-
Interval Bound Interpolation for Few-shot Learning with Few Tasks
Authors:
Shounak Datta,
Sankha Subhra Mullick,
Anish Chakrabarty,
Swagatam Das
Abstract:
Few-shot learning aims to transfer the knowledge acquired from training on a diverse set of tasks to unseen tasks from the same task distribution with a limited amount of labeled data. The underlying requirement for effective few-shot generalization is to learn a good representation of the task manifold. This becomes more difficult when only a limited number of tasks are available for training. In…
▽ More
Few-shot learning aims to transfer the knowledge acquired from training on a diverse set of tasks to unseen tasks from the same task distribution with a limited amount of labeled data. The underlying requirement for effective few-shot generalization is to learn a good representation of the task manifold. This becomes more difficult when only a limited number of tasks are available for training. In such a few-task few-shot setting, it is beneficial to explicitly preserve the local neighborhoods from the task manifold and exploit this to generate artificial tasks for training. To this end, we introduce the notion of interval bounds from the provably robust training literature to few-shot learning. The interval bounds are used to characterize neighborhoods around the training tasks. These neighborhoods can then be preserved by minimizing the distance between a task and its respective bounds. We then use a novel strategy to artificially form new tasks for training by interpolating between the available tasks and their respective interval bounds. We apply our framework to both model-agnostic meta-learning as well as prototype-based metric-learning paradigms. The efficacy of our proposed approach is evident from the improved performance on several datasets from diverse domains compared to current methods.
△ Less
Submitted 7 May, 2023; v1 submitted 7 April, 2022;
originally announced April 2022.
-
Polarization of Rotationally Oblate Self-Luminous Exoplanets with Anisotropic Atmospheres
Authors:
Aritra Chakrabarty,
Sujan Sengupta,
Mark S. Marley
Abstract:
Young self-luminous giant exoplanets are expected to be oblate in shape owing to the high rotational speeds observed for some objects. Similar to the case of brown dwarfs, the thermal emission from these planets should be polarized by scatterings of molecules and condensate cloud particles, and the rotation-induced asymmetry of the planet's disk would yield to net non-zero detectable polarization.…
▽ More
Young self-luminous giant exoplanets are expected to be oblate in shape owing to the high rotational speeds observed for some objects. Similar to the case of brown dwarfs, the thermal emission from these planets should be polarized by scatterings of molecules and condensate cloud particles, and the rotation-induced asymmetry of the planet's disk would yield to net non-zero detectable polarization. Considering an anisotropic atmosphere, we present here a three-dimensional approach to estimate the disk-averaged polarization that arises due to the oblateness of the planets. We solve the multiple-scattering vector radiative transfer equations at each location on the planet's disk and calculate the local Stokes vectors and then calculate the disk-integrated flux and linear polarization. For a cloud-free atmosphere, the polarization signal is observable only in the visible wavelength region. However, the presence of clouds in the planetary atmospheres leads to a detectable amount of polarization in the infrared wavelength region where the planetary thermal emission peaks. Considering different broad-band filters of the SPHERE-IRDIS instrument of the Very Large Telescope, we present generic models for the polarization at different wavelength bands as a function of their rotation period. We also present polarization models for the Exoplanets $β$ Pic b and ROXs 42B b as two representative cases which can guide future observations. Our insights on the polarization of young giant planets presented here would be useful for the upcoming polarimetric observations of the directly imaged planets.
△ Less
Submitted 19 January, 2022;
originally announced January 2022.
-
A Deep Learning Approach to Integrate Human-Level Understanding in a Chatbot
Authors:
Afia Fairoose Abedin,
Amirul Islam Al Mamun,
Rownak Jahan Nowrin,
Amitabha Chakrabarty,
Moin Mostakim,
Sudip Kumar Naskar
Abstract:
In recent times, a large number of people have been involved in establishing their own businesses. Unlike humans, chatbots can serve multiple customers at a time, are available 24/7 and reply in less than a fraction of a second. Though chatbots perform well in task-oriented activities, in most cases they fail to understand personalized opinions, statements or even queries which later impact the or…
▽ More
In recent times, a large number of people have been involved in establishing their own businesses. Unlike humans, chatbots can serve multiple customers at a time, are available 24/7 and reply in less than a fraction of a second. Though chatbots perform well in task-oriented activities, in most cases they fail to understand personalized opinions, statements or even queries which later impact the organization for poor service management. Lack of understanding capabilities in bots disinterest humans to continue conversations with them. Usually, chatbots give absurd responses when they are unable to interpret a user's text accurately. Extracting the client reviews from conversations by using chatbots, organizations can reduce the major gap of understanding between the users and the chatbot and improve their quality of products and services.Thus, in our research we incorporated all the key elements that are necessary for a chatbot to analyse and understand an input text precisely and accurately. We performed sentiment analysis, emotion detection, intent classification and named-entity recognition using deep learning to develop chatbots with humanistic understanding and intelligence. The efficiency of our approach can be demonstrated accordingly by the detailed analysis.
△ Less
Submitted 31 December, 2021;
originally announced January 2022.
-
Length of stationary Gaussian excursions
Authors:
Arijit Chakrabarty,
Manish Pandey,
Sukrit Chakraborty
Abstract:
Given that a stationary Gaussian process is above a high threshold, the length of time it spends before going below that threshold is studied. The asymptotic order is determined by the smoothness of the sample paths, which in turn is a function of the tails of the spectral measure. Two disjoint regimes are studied - one in which the second spectral moment is finite and the other in which the tails…
▽ More
Given that a stationary Gaussian process is above a high threshold, the length of time it spends before going below that threshold is studied. The asymptotic order is determined by the smoothness of the sample paths, which in turn is a function of the tails of the spectral measure. Two disjoint regimes are studied - one in which the second spectral moment is finite and the other in which the tails of the spectral measure are regularly varying and the second moment is infinite.
△ Less
Submitted 9 August, 2022; v1 submitted 25 November, 2021;
originally announced November 2021.
-
VABO: Violation-Aware Bayesian Optimization for Closed-Loop Control Performance Optimization with Unmodeled Constraints
Authors:
Wenjie Xu,
Colin N Jones,
Bratislav Svetozarevic,
Christopher R. Laughman,
Ankush Chakrabarty
Abstract:
We study the problem of performance optimization of closed-loop control systems with unmodeled dynamics. Bayesian optimization (BO) has been demonstrated effective for improving closed-loop performance by automatically tuning controller gains or reference setpoints in a model-free manner. However, BO methods have rarely been tested on dynamical systems with unmodeled constraints. In this paper, we…
▽ More
We study the problem of performance optimization of closed-loop control systems with unmodeled dynamics. Bayesian optimization (BO) has been demonstrated effective for improving closed-loop performance by automatically tuning controller gains or reference setpoints in a model-free manner. However, BO methods have rarely been tested on dynamical systems with unmodeled constraints. In this paper, we propose a violation-aware BO algorithm (VABO) that optimizes closed-loop performance while simultaneously learning constraint-feasible solutions. Unlike classical constrained BO methods which allow an unlimited constraint violations, or safe BO algorithms that are conservative and try to operate with near-zero violations, we allow budgeted constraint violations to improve constraint learning and accelerate optimization. We demonstrate the effectiveness of our proposed VABO method for energy minimization of industrial vapor compression systems.
△ Less
Submitted 14 October, 2021;
originally announced October 2021.
-
Statistical Regeneration Guarantees of the Wasserstein Autoencoder with Latent Space Consistency
Authors:
Anish Chakrabarty,
Swagatam Das
Abstract:
The introduction of Variational Autoencoders (VAE) has been marked as a breakthrough in the history of representation learning models. Besides having several accolades of its own, VAE has successfully flagged off a series of inventions in the form of its immediate successors. Wasserstein Autoencoder (WAE), being an heir to that realm carries with it all of the goodness and heightened generative pr…
▽ More
The introduction of Variational Autoencoders (VAE) has been marked as a breakthrough in the history of representation learning models. Besides having several accolades of its own, VAE has successfully flagged off a series of inventions in the form of its immediate successors. Wasserstein Autoencoder (WAE), being an heir to that realm carries with it all of the goodness and heightened generative promises, matching even the generative adversarial networks (GANs). Needless to say, recent years have witnessed a remarkable resurgence in statistical analyses of the GANs. Similar examinations for Autoencoders, however, despite their diverse applicability and notable empirical performance, remain largely absent. To close this gap, in this paper, we investigate the statistical properties of WAE. Firstly, we provide statistical guarantees that WAE achieves the target distribution in the latent space, utilizing the Vapnik Chervonenkis (VC) theory. The main result, consequently ensures the regeneration of the input distribution, harnessing the potential offered by Optimal Transport of measures under the Wasserstein metric. This study, in turn, hints at the class of distributions WAE can reconstruct after suffering a compression in the form of a latent law.
△ Less
Submitted 8 October, 2021;
originally announced October 2021.
-
A Deep Neural Network Approach for Crop Selection and Yield Prediction in Bangladesh
Authors:
Tanhim Islam,
Tanjir Alam Chisty,
Amitabha Chakrabarty
Abstract:
Agriculture is the essential ingredients to mankind which is a major source of livelihood. Agriculture work in Bangladesh is mostly done in old ways which directly affects our economy. In addition, institutions of agriculture are working with manual data which cannot provide a proper solution for crop selection and yield prediction. This paper shows the best way of crop selection and yield predict…
▽ More
Agriculture is the essential ingredients to mankind which is a major source of livelihood. Agriculture work in Bangladesh is mostly done in old ways which directly affects our economy. In addition, institutions of agriculture are working with manual data which cannot provide a proper solution for crop selection and yield prediction. This paper shows the best way of crop selection and yield prediction in minimum cost and effort. Artificial Neural Network is considered robust tools for modeling and prediction. This algorithm aims to get better output and prediction, as well as, support vector machine, Logistic Regression, and random forest algorithm is also considered in this study for comparing the accuracy and error rate. Moreover, all of these algorithms used here are just to see how well they performed for a dataset which is over 0.3 million. We have collected 46 parameters such as maximum and minimum temperature, average rainfall, humidity, climate, weather, and types of land, types of chemical fertilizer, types of soil, soil structure, soil composition, soil moisture, soil consistency, soil reaction and soil texture for applying into this prediction process. In this paper, we have suggested using the deep neural network for agricultural crop selection and yield prediction.
△ Less
Submitted 6 August, 2021;
originally announced August 2021.
-
Localization, $\mathcal{PT}$-Symmetry Breaking and Topological Transitions in non-Hermitian Quasicrystals
Authors:
Aruna Prasad Acharya,
Aditi Chakrabarty,
Deepak Kumar Sahu,
Sanjoy Datta
Abstract:
According to the topological band theory of a Hermitian system, the different electronic phases are classified in terms of topological invariants, wherein the transition between the two phases characterized by a different topological invariant is the primary signature of a topological phase transition. Recently, it has been argued that the delocalization-localization transition in a quasicrystal,…
▽ More
According to the topological band theory of a Hermitian system, the different electronic phases are classified in terms of topological invariants, wherein the transition between the two phases characterized by a different topological invariant is the primary signature of a topological phase transition. Recently, it has been argued that the delocalization-localization transition in a quasicrystal, described by the non-Hermitian $\mathcal{PT}$-symmetric extension of the Aubry-André-Harper (AAH) Hamiltonian can also be identified as a topological phase transition. Interestingly, the $\mathcal{PT}$-symmetry also breaks down at the same critical point. However, in this article, we have shown that the delocalization-localization transition and the $\mathcal{PT}$-symmetry breaking are not connected to a topological phase transition. To demonstrate this, we have studied the non-Hermitian $\mathcal{PT}$-symmetric AAH Hamiltonian in the presence of Rashba Spin-Orbit (RSO) coupling. We have obtained an analytical expression of the topological transition point and compared it with the numerically obtained critical points. We have found that, except in some special cases, the critical point and the topological transition point are not the same. In fact, the delocalization-localization transition takes place earlier than the topological transition whenever they do not coincide.
△ Less
Submitted 6 August, 2021;
originally announced August 2021.
-
Extremum Seeking Control with an Adaptive Gain Based On Gradient Estimation Error
Authors:
Claus Danielson,
Scott A. Bortoff,
Ankush Chakrabarty
Abstract:
This paper presents an extremum seeking control algorithm with an adaptive step-size that adjusts the aggressiveness of the controller based on the quality of the gradient estimate. The adaptive step-size ensures that the integral-action produced by the gradient descent does not destabilize the closed-loop system. To quantify the quality of the gradient estimate, we present a batch least squares e…
▽ More
This paper presents an extremum seeking control algorithm with an adaptive step-size that adjusts the aggressiveness of the controller based on the quality of the gradient estimate. The adaptive step-size ensures that the integral-action produced by the gradient descent does not destabilize the closed-loop system. To quantify the quality of the gradient estimate, we present a batch least squares estimator with a novel weighting and show that it produces bounded estimation errors, where the uncertainty is due to the curvature of the unknown cost function. The adaptive step-size then maximizes the decrease of the combined plant and controller Lyapunov function for the worst-case estimation error. We prove that our ESC is input-to-state stable with respect to the dither signal. Finally, we demonstrate our proposed ESC through five numerical examples; one illustrative, one practical, and three benchmarks.
△ Less
Submitted 18 December, 2021; v1 submitted 2 July, 2021;
originally announced July 2021.
-
Attentive Neural Processes and Batch Bayesian Optimization for Scalable Calibration of Physics-Informed Digital Twins
Authors:
Ankush Chakrabarty,
Gordon Wichern,
Christopher Laughman
Abstract:
Physics-informed dynamical system models form critical components of digital twins of the built environment. These digital twins enable the design of energy-efficient infrastructure, but must be properly calibrated to accurately reflect system behavior for downstream prediction and analysis. Dynamical system models of modern buildings are typically described by a large number of parameters and inc…
▽ More
Physics-informed dynamical system models form critical components of digital twins of the built environment. These digital twins enable the design of energy-efficient infrastructure, but must be properly calibrated to accurately reflect system behavior for downstream prediction and analysis. Dynamical system models of modern buildings are typically described by a large number of parameters and incur significant computational expenditure during simulations. To handle large-scale calibration of digital twins without exorbitant simulations, we propose ANP-BBO: a scalable and parallelizable batch-wise Bayesian optimization (BBO) methodology that leverages attentive neural processes (ANPs).
△ Less
Submitted 29 June, 2021;
originally announced June 2021.
-
Generic Models for Disk-Resolved and Disk-Integrated Phase Dependent Linear Polarization of Light Reflected from Exoplanets
Authors:
Aritra Chakrabarty,
Sujan Sengupta
Abstract:
Similar to the case of solar system planets, reflected starlight from exoplanets is expected to be polarized due to atmospheric scattering and the net disk integrated polarization should be non-zero owing to the asymmetrical illumination of the planetary disk. The computation of the disk-integrated reflected flux and its state of polarization involves techniques for the calculation of the local re…
▽ More
Similar to the case of solar system planets, reflected starlight from exoplanets is expected to be polarized due to atmospheric scattering and the net disk integrated polarization should be non-zero owing to the asymmetrical illumination of the planetary disk. The computation of the disk-integrated reflected flux and its state of polarization involves techniques for the calculation of the local reflection matrices as well as the numerical recipes for integration over the planetary disks. In this paper, we present a novel approach to calculate the azimuth-dependent reflected intensity vectors at each location on the planetary disk divided into grids. We achieve this by solving the vector radiative transfer equations that describe linear polarization. Our calculations incorporate self-consistent atmospheric models of exoplanets over a wide range of equilibrium temperature, surface gravity, atmospheric composition, and cloud structure. A comparison of the flux and the amount of polarization calculated by considering both single and multiple scattering exhibits the effect of depolarization due to multiple scattering of light depending on the scattering albedo of the atmosphere. We have benchmarked our basic calculations against some of the existing models. We have also presented our models for the hot Jupiter HD 189733 b, indicating the level of precision required by future observations to detect the polarization of this planet in the optical and near-infrared wavelength region. The generic nature and the accuracy offered by our models make them an effective tool for modeling the future observations of the polarized light reflected from exoplanets.
△ Less
Submitted 17 June, 2021;
originally announced June 2021.
-
Multi-band transit follow up observations of five hot-Jupiters with critical noise treatments: Improved physical properties
Authors:
Suman Saha,
Aritra Chakrabarty,
Sujan Sengupta
Abstract:
The most challenging limitation in transit photometry arises from the noises in the photometric signal. In particular, the ground-based telescopes are heavily affected by the noise due to perturbation in the Earth's atmosphere. Use of telescopes with large apertures can improve the photometric signal-to-noise ratio (S/N) to a great extent. However, detecting a transit signal out of a noisy light c…
▽ More
The most challenging limitation in transit photometry arises from the noises in the photometric signal. In particular, the ground-based telescopes are heavily affected by the noise due to perturbation in the Earth's atmosphere. Use of telescopes with large apertures can improve the photometric signal-to-noise ratio (S/N) to a great extent. However, detecting a transit signal out of a noisy light curve of the host star and precisely estimating the transit parameters call for various noise reduction techniques. Here, we present multi-band transit photometric follow-up observations of five hot-Jupiters e.g., HAT-P-30 b, HAT-P-54 b, WASP-43 b, TrES-3 b and XO-2 N b, using the 2m Himalayan Chandra Telescope (HCT) at the Indian Astronomical Observatory, Hanle and the 1.3m J. C. Bhattacharya Telescope (JCBT) at the Vainu Bappu Observatory, Kavalur. Our critical noise treatment approach includes techniques such as Wavelet Denoising and Gaussian Process regression, which effectively reduce both time-correlated and time-uncorrelated noise components from our transit light curves. In addition to these techniques, use of our state-of-the-art model algorithms have allowed us to estimate the physical properties of the target exoplanets with a better accuracy and precision compared to the previous studies.
△ Less
Submitted 20 May, 2021;
originally announced May 2021.
-
Simultaneous Multi-Pivot Neural Machine Translation
Authors:
Raj Dabre,
Aizhan Imankulova,
Masahiro Kaneko,
Abhisek Chakrabarty
Abstract:
Parallel corpora are indispensable for training neural machine translation (NMT) models, and parallel corpora for most language pairs do not exist or are scarce. In such cases, pivot language NMT can be helpful where a pivot language is used such that there exist parallel corpora between the source and pivot and pivot and target languages. Naturally, the quality of pivot language translation is mo…
▽ More
Parallel corpora are indispensable for training neural machine translation (NMT) models, and parallel corpora for most language pairs do not exist or are scarce. In such cases, pivot language NMT can be helpful where a pivot language is used such that there exist parallel corpora between the source and pivot and pivot and target languages. Naturally, the quality of pivot language translation is more inferior to what could be achieved with a direct parallel corpus of a reasonable size for that pair. In a real-time simultaneous translation setting, the quality of pivot language translation deteriorates even further given that the model has to output translations the moment a few source words become available. To solve this issue, we propose multi-pivot translation and apply it to a simultaneous translation setting involving pivot languages. Our approach involves simultaneously translating a source language into multiple pivots, which are then simultaneously translated together into the target language by leveraging multi-source NMT. Our experiments in a low-resource setting using the N-way parallel UN corpus for Arabic to English NMT via French and Spanish as pivots reveals that in a simultaneous pivot NMT setting, using two pivot languages can lead to an improvement of up to 5.8 BLEU.
△ Less
Submitted 15 April, 2021;
originally announced April 2021.
-
Model Order Reduction for Water Quality Dynamics
Authors:
Shen Wang,
Ahmad F. Taha,
Ankush Chakrabarty,
Lina Sela,
Ahmed Abokifa
Abstract:
A state-space representation of water quality dynamics describing disinfectant (e.g., chlorine) transport dynamics in drinking water distribution networks has been recently proposed. Such representation is a byproduct of space- and time-discretization of the PDE modeling transport dynamics. This results in a large state-space dimension even for small networks with tens of nodes. Although such a st…
▽ More
A state-space representation of water quality dynamics describing disinfectant (e.g., chlorine) transport dynamics in drinking water distribution networks has been recently proposed. Such representation is a byproduct of space- and time-discretization of the PDE modeling transport dynamics. This results in a large state-space dimension even for small networks with tens of nodes. Although such a state-space model provides a model-driven approach to predict water quality dynamics, incorporating it into model-based control algorithms or state estimators for large networks is challenging and at times intractable. To that end, this paper investigates model order reduction (MOR) methods for water quality dynamics with the objective of performing post-reduction feedback control. The presented investigation focuses on reducing state-dimension by orders of magnitude, the stability of the MOR methods, and the application of these methods to model predictive control.
△ Less
Submitted 18 February, 2022; v1 submitted 21 February, 2021;
originally announced February 2021.
-
Modeling of Vertical Dipole Above Lossy Dielectric Half-Space: Characteristic Mode Theory
Authors:
Sandip Ghosal,
Arijit De,
Raed M. Shubair,
Ajay Chakrabarty
Abstract:
This work introduces a theoretical extension of the characteristic mode formulation for analysing the vertical electric dipole lying above a lossy dielectric half-space. As the conventional characteristic formulation fails to maintain the orthogonality of the characteristic field modes over the infinite sphere, an alternate modal formulation is proposed here to maintain the orthogonality for both…
▽ More
This work introduces a theoretical extension of the characteristic mode formulation for analysing the vertical electric dipole lying above a lossy dielectric half-space. As the conventional characteristic formulation fails to maintain the orthogonality of the characteristic field modes over the infinite sphere, an alternate modal formulation is proposed here to maintain the orthogonality for both the current and field modes. The modal results are found to match closely with its method of moment counterparts. Later, the modes of an isolated dipole with no ground plane have been used to predict the role of the lossy ground plane through a theory of the linear combination of the eigenvectors. The proposed formulations have been studied with different heights from the ground plane and are compared with the direct modal solutions to validate its accuracy. It helps to provide a thorough understanding of how the isolated modes interact among each other to constitute the perturbed modes in the presence of the lossy half-space. It can find application to include the lossy earth effect in the study of the lightning fields and the path loss modelling of the antennas over the lossy ground.
△ Less
Submitted 8 September, 2020;
originally announced September 2020.
-
Large deviation principle for the maximal eigenvalue of inhomogeneous Erdős-Rényi random graphs
Authors:
Arijit Chakrabarty,
Rajat Subhra Hazra,
Frank den Hollander,
Matteo Sfragara
Abstract:
We consider an inhomogeneous Erdős-Rényi random graph $G_N$ with vertex set $[N] = \{1,\dots,N\}$ for which the pair of vertices $i,j \in [N]$, $i\neq j$, is connected by an edge with probability $r(\tfrac{i}{N},\tfrac{j}{N})$, independently of other pairs of vertices. Here, $r\colon\,[0,1]^2 \to (0,1)$ is a symmetric function that plays the role of a reference graphon. Let $λ_N$ be the maximal ei…
▽ More
We consider an inhomogeneous Erdős-Rényi random graph $G_N$ with vertex set $[N] = \{1,\dots,N\}$ for which the pair of vertices $i,j \in [N]$, $i\neq j$, is connected by an edge with probability $r(\tfrac{i}{N},\tfrac{j}{N})$, independently of other pairs of vertices. Here, $r\colon\,[0,1]^2 \to (0,1)$ is a symmetric function that plays the role of a reference graphon. Let $λ_N$ be the maximal eigenvalue of the adjacency matrix of $G_N$. It is known that $λ_N/N$ satisfies a large deviation principle as $N \to \infty$. The associated rate function $ψ_r$ is given by a variational formula that involves the rate function $I_r$ of a large deviation principle on graphon space. We analyse this variational formula in order to identify the properties of $ψ_r$, specially when the reference graphon is of rank 1.
△ Less
Submitted 19 August, 2020;
originally announced August 2020.
-
Effects of Thermal Emission on the Transmission Spectra of Hot Jupiters
Authors:
Aritra Chakrabarty,
Sujan Sengupta
Abstract:
The atmosphere on the dayside of a highly irradiated close-in gas giant (also known as a hot Jupiter) absorbs a significant part of the incident stellar radiation which again gets re-emitted in the infrared wavelengths both from the day and the night sides of the planet. The re-emitted thermal radiation from the night side facing the observers during the transit event of such a planet contributes…
▽ More
The atmosphere on the dayside of a highly irradiated close-in gas giant (also known as a hot Jupiter) absorbs a significant part of the incident stellar radiation which again gets re-emitted in the infrared wavelengths both from the day and the night sides of the planet. The re-emitted thermal radiation from the night side facing the observers during the transit event of such a planet contributes to the transmitted stellar radiation. We demonstrate that the transit spectra at the infrared region get altered significantly when such re-emitted thermal radiation of the planet is included. We assess the effects of the thermal emission of the hot Jupiters on the transit spectra by simulating observational spectroscopic data with corresponding errors from the different channels of the upcoming James Webb Space Telescope. We find that the effect is statistically significant with respect to the noise levels of those simulated data. Hence, we convey the important message that the planetary thermal re-emission must be taken into consideration in the retrieval models of transit spectra for hot Jupiters for a more accurate interpretation of the observed transit spectra.
△ Less
Submitted 7 June, 2020;
originally announced June 2020.
-
Safe Learning-based Observers for Unknown Nonlinear Systems using Bayesian Optimization
Authors:
Ankush Chakrabarty,
Mouhacine Benosman
Abstract:
Data generated from dynamical systems with unknown dynamics enable the learning of state observers that are: robust to modeling error, computationally tractable to design, and capable of operating with guaranteed performance. In this paper, a modular design methodology is formulated, that consists of three design phases: (i) an initial robust observer design that enables one to learn the dynamics…
▽ More
Data generated from dynamical systems with unknown dynamics enable the learning of state observers that are: robust to modeling error, computationally tractable to design, and capable of operating with guaranteed performance. In this paper, a modular design methodology is formulated, that consists of three design phases: (i) an initial robust observer design that enables one to learn the dynamics without allowing the state estimation error to diverge (hence, safe); (ii) a learning phase wherein the unmodeled components are estimated using Bayesian optimization and Gaussian processes; and, (iii) a re-design phase that leverages the learned dynamics to improve convergence rate of the state estimation error. The potential of our proposed learning-based observer is demonstrated on a benchmark nonlinear system. Additionally, certificates of guaranteed estimation performance are provided.
△ Less
Submitted 25 June, 2021; v1 submitted 12 May, 2020;
originally announced May 2020.
-
Optical Transmission Spectra of Hot-Jupiters: Effects of Scattering
Authors:
Sujan Sengupta,
Aritra Chakrabarty,
Giovanna Tinetti
Abstract:
We present new grids of transmission spectra for hot-Jupiters by solving the multiple scattering radiative transfer equations with non-zero scattering albedo instead of using the Beer-Bouguer-Lambert law for the change in the transmitted stellar intensity. The diffused reflection and transmission due to scattering increases the transmitted stellar flux resulting into a decrease in the transmission…
▽ More
We present new grids of transmission spectra for hot-Jupiters by solving the multiple scattering radiative transfer equations with non-zero scattering albedo instead of using the Beer-Bouguer-Lambert law for the change in the transmitted stellar intensity. The diffused reflection and transmission due to scattering increases the transmitted stellar flux resulting into a decrease in the transmission depth. Thus we demonstrate that scattering plays a double role in determining the optical transmission spectra -- increasing the total optical depth of the medium and adding the diffused radiation due to scattering to the transmitted stellar radiation. The resulting effects yield into an increase in the transmitted flux and hence reduction in the transmission depth. For a cloudless planetary atmosphere, Rayleigh scattering albedo alters the transmission depth up to about 0.6 micron but the change in the transmission depth due to forward scattering by cloud or haze is significant throughout the optical and near-infrared regions. However, at wavelength longer than about 1.2 $μ$m, the scattering albedo becomes negligible and hence the transmission spectra match with that calculated without solving the radiative transfer equations. We compare our model spectra with existing theoretical models and find significant difference at wavelength shorter than one micron. We also compare our models with observational data for a few hot-Jupiters which may help constructing better retrieval models in future.
△ Less
Submitted 28 December, 2019;
originally announced December 2019.
-
A statistical test for correspondence of texts to the Zipf-Mandelbrot law
Authors:
Anik Chakrabarty,
Mikhail Chebunin,
Artyom Kovalevskii,
Ilya Pupyshev,
Natalia Zakrevskaya,
Qianqian Zhou
Abstract:
We analyse correspondence of a text to a simple probabilistic model. The model assumes that the words are selected independently from an infinite dictionary. The probability distribution correspond to the Zipf---Mandelbrot law. We count sequentially the numbers of different words in the text and get the process of the numbers of different words. Then we estimate Zipf---Mandelbrot law parameters us…
▽ More
We analyse correspondence of a text to a simple probabilistic model. The model assumes that the words are selected independently from an infinite dictionary. The probability distribution correspond to the Zipf---Mandelbrot law. We count sequentially the numbers of different words in the text and get the process of the numbers of different words. Then we estimate Zipf---Mandelbrot law parameters using the same sequence and construct an estimate of the expectation of the number of different words in the text. Then we subtract the corresponding values of the estimate from the sequence and normalize along the coordinate axes, obtaining a random process on a segment from 0 to 1. We prove that this process (the empirical text bridge) converges weakly in the uniform metric on $C (0,1)$ to a centered Gaussian process with continuous a.s. paths. We develop and implement an algorithm for approximate calculation of eigenvalues of the covariance function of the limit Gaussian process, and then an algorithm for calculating the probability distribution of the integral of the square of this process. We use the algorithm to analyze uniformity of texts in English, French, Russian and Chinese.
△ Less
Submitted 25 December, 2019;
originally announced December 2019.
-
Eigenvalues outside the bulk of inhomogeneous Erdős-Rënyi random graphs
Authors:
Arijit Chakrabarty,
Sukrit Chakraborty,
Rajat Subhra Hazra
Abstract:
The article considers an inhomogeneous Erdős-Rënyi random graph on $\{1,\ldots, N\}$, where an edge is placed between vertices $i$ and $j$ with probability $\varepsilon_N f(i/N,j/N)$, for $i\le j$, the choice being made independent for each pair. The function $f$ is assumed to be non-negative definite, symmetric, bounded and of finite rank $k$. We study the edge of the spectrum of the adjacency ma…
▽ More
The article considers an inhomogeneous Erdős-Rënyi random graph on $\{1,\ldots, N\}$, where an edge is placed between vertices $i$ and $j$ with probability $\varepsilon_N f(i/N,j/N)$, for $i\le j$, the choice being made independent for each pair. The function $f$ is assumed to be non-negative definite, symmetric, bounded and of finite rank $k$. We study the edge of the spectrum of the adjacency matrix of such an inhomogeneous Erdős-Rényi random graph under the assumption that $N\varepsilon_N\to \infty$ sufficiently fast. Although the bulk of the spectrum of the adjacency matrix, scaled by $\sqrt{N\varepsilon_N}$, is compactly supported, the $k$-th largest eigenvalue goes to infinity. It turns out that the largest eigenvalue after appropriate scaling and centering converge to a Gaussian law, if the largest eigenvalue of $f$ has multiplicity $1$. If $f$ has $k$ distinct non-zero eigenvalues, then the joint distribution of the $k$ largest eigenvalues converge jointly to a multivariate Gaussian law. The first order behaviour of the eigenvectors is derived as a by-product of the above results. The results complement the homogeneous case derived by Erdős et al.(2013).
△ Less
Submitted 27 February, 2024; v1 submitted 19 November, 2019;
originally announced November 2019.
-
Near-Field Radiation Exposure Control in Slot-Loaded Microstrip Antenna: A Characteristic Mode Approach
Authors:
Sandip Ghosal,
Arijit De,
Raed M. Shubair,
Ajay Chakrabarty
Abstract:
Microstip antenna topology is commonly loaded with a narrow slot to manipulate the resonance frequency or impedance bandwidth. However, the tuning of the resonance frequency or impedance bandwidth results in the variation of the current and field distributions. In this regard, this work adopts the concept of characteristic modes to gain an initial understanding of the perturbation mechanism of the…
▽ More
Microstip antenna topology is commonly loaded with a narrow slot to manipulate the resonance frequency or impedance bandwidth. However, the tuning of the resonance frequency or impedance bandwidth results in the variation of the current and field distributions. In this regard, this work adopts the concept of characteristic modes to gain an initial understanding of the perturbation mechanism of the rectangular patch when loaded with a slot. The performance of microstrip antennas with finite ground plane is then studied using full-wave simulation. It has been found that the distribution of the induced current density is highly dependent on the orientation of the slot The incorporation of a narrow slot suppresses the nearby orthogonal eigen mode and, as a consequence, the radiation behavior is affected. Specifically, in the presence of biological tissues in the near-field region, both antenna input impedance properties and the realized gain are dependent on the slot orientation. Different examples are included for understanding the impact of slot loading on the energy absorption by biological tissues, by calculating the the specific absorption rate (SAR). The proposed analysis facilitates the design of miniaturized antenna geometries for biomedical applications via systematic loading of narrow slots.
△ Less
Submitted 28 July, 2019;
originally announced July 2019.
-
Safe Approximate Dynamic Programming Via Kernelized Lipschitz Estimation
Authors:
Ankush Chakrabarty,
Devesh K. Jha,
Gregery T. Buzzard,
Yebin Wang,
Kyriakos Vamvoudakis
Abstract:
We develop a method for obtaining safe initial policies for reinforcement learning via approximate dynamic programming (ADP) techniques for uncertain systems evolving with discrete-time dynamics. We employ kernelized Lipschitz estimation and semidefinite programming for computing admissible initial control policies with provably high probability. Such admissible controllers enable safe initializat…
▽ More
We develop a method for obtaining safe initial policies for reinforcement learning via approximate dynamic programming (ADP) techniques for uncertain systems evolving with discrete-time dynamics. We employ kernelized Lipschitz estimation and semidefinite programming for computing admissible initial control policies with provably high probability. Such admissible controllers enable safe initialization and constraint enforcement while providing exponential stability of the equilibrium of the closed-loop system.
△ Less
Submitted 3 July, 2019;
originally announced July 2019.
-
Approximate Dynamic Programming For Linear Systems with State and Input Constraints
Authors:
Ankush Chakrabarty,
Rien Quirynen,
Claus Danielson,
Weinan Gao
Abstract:
Enforcing state and input constraints during reinforcement learning (RL) in continuous state spaces is an open but crucial problem which remains a roadblock to using RL in safety-critical applications. This paper leverages invariant sets to update control policies within an approximate dynamic programming (ADP) framework that guarantees constraint satisfaction for all time and converges to the opt…
▽ More
Enforcing state and input constraints during reinforcement learning (RL) in continuous state spaces is an open but crucial problem which remains a roadblock to using RL in safety-critical applications. This paper leverages invariant sets to update control policies within an approximate dynamic programming (ADP) framework that guarantees constraint satisfaction for all time and converges to the optimal policy (in a linear quadratic regulator sense) asymptotically. An algorithm for implementing the proposed constrained ADP approach in a data-driven manner is provided. The potential of this formalism is demonstrated via numerical examples.
△ Less
Submitted 26 June, 2019;
originally announced June 2019.
-
Precise photometric transit follow-up observations of five close-in exoplanets : update on their physical properties
Authors:
Aritra Chakrabarty,
Sujan Sengupta
Abstract:
We report the results of the high precision photometric follow-up observations of five transiting hot jupiters - WASP-33b, WASP-50b, WASP-12b, HATS-18b and HAT-P-36b. The observations are made from the 2m Himalayan Chandra Telescope at Indian Astronomical Observatory, Hanle and the 1.3m J. C. Bhattacharyya Telescope at Vainu Bappu Observatory, Kavalur. This exercise is a part of the capability tes…
▽ More
We report the results of the high precision photometric follow-up observations of five transiting hot jupiters - WASP-33b, WASP-50b, WASP-12b, HATS-18b and HAT-P-36b. The observations are made from the 2m Himalayan Chandra Telescope at Indian Astronomical Observatory, Hanle and the 1.3m J. C. Bhattacharyya Telescope at Vainu Bappu Observatory, Kavalur. This exercise is a part of the capability testing of the two telescopes and their back-end instruments. Leveraging the large aperture of both the telescopes used, the images taken during several nights were used to produce the transit light curves with high photometric S/N ($>200$) by performing differential photometry. In order to reduce the fluctuations in the transit light curves due to various sources such as stellar activity, varying sky transparency etc. we preprocessed them using wavelet denoising and applied Gaussian process correlated noise modeling technique while modeling the transit light curves. To demonstrate the efficiency of the wavelet denoising process we have also included the results without the denoising process. A state-of-the-art algorithm used for modeling the transit light curves provided the physical parameters of the planets with more precise values than reported earlier.
△ Less
Submitted 27 May, 2019;
originally announced May 2019.
-
High minima of non-smooth Gaussian processes
Authors:
Zhixin Wu,
Arijit Chakrabarty,
Gennady Samorodnitsky
Abstract:
In this short note we study the asymptotic behaviour of the minima over compact intervals of Gaussian processes, whose paths are not necessarily smooth. We show that, beyond the logarithmic large deviation Gaussian estimates, this problem is closely related to the classical small-ball problem. Under certain conditions we estimate the term describing the correction to the large deviation behaviour.…
▽ More
In this short note we study the asymptotic behaviour of the minima over compact intervals of Gaussian processes, whose paths are not necessarily smooth. We show that, beyond the logarithmic large deviation Gaussian estimates, this problem is closely related to the classical small-ball problem. Under certain conditions we estimate the term describing the correction to the large deviation behaviour. In addition, the asymptotic distribution of the location of the minimum, conditionally on the minimum exceeding a high threshold, is also studied.
△ Less
Submitted 24 August, 2019; v1 submitted 28 February, 2019;
originally announced February 2019.
-
L2 Observers for a Class of Nonlinear Systems with Unknown Inputs
Authors:
Martin Corless,
Ankush Chakrabarty
Abstract:
We consider the problem of estimating the state and unknown input for a large class of nonlinear systems subject to unknown exogenous inputs. The exogenous inputs themselves are modeled as being generated by a nonlinear system subject to unknown inputs. The nonlinearities considered in this work are characterized by multiplier matrices that include many commonly encountered nonlinearities. We obta…
▽ More
We consider the problem of estimating the state and unknown input for a large class of nonlinear systems subject to unknown exogenous inputs. The exogenous inputs themselves are modeled as being generated by a nonlinear system subject to unknown inputs. The nonlinearities considered in this work are characterized by multiplier matrices that include many commonly encountered nonlinearities. We obtain a linear matrix inequality (LMI), that, if feasible, provides the gains for an observer which results in certified L2 performance of the error dynamics associated with the observer. We also present conditions which guarantee that the L2 norm of the error can be made arbitrarily small and investigate conditions for feasibility of the proposed LMIs.
△ Less
Submitted 21 February, 2019;
originally announced February 2019.
-
Spectra of Adjacency and Laplacian Matrices of Inhomogeneous Erdős-Rényi Random Graphs
Authors:
Arijit Chakrabarty,
Rajat Subhra Hazra,
Frank den Hollander,
Matteo Sfragara
Abstract:
Inhomogeneous Erdős-Rényi random graphs $\mathbb G_N$ on $N$ vertices in the non-dense regime are considered in this paper. The edge between the pair of vertices $\{i,j\}$ is retained with probability $\varepsilon_N\,f(\frac{i}{N},\frac{j}{N})$, $1 \leq i \neq j \leq N$, independently of other edges, where $f\colon\,[0,1] \times [0,1] \to [0,\infty)$ is a continuous function such that…
▽ More
Inhomogeneous Erdős-Rényi random graphs $\mathbb G_N$ on $N$ vertices in the non-dense regime are considered in this paper. The edge between the pair of vertices $\{i,j\}$ is retained with probability $\varepsilon_N\,f(\frac{i}{N},\frac{j}{N})$, $1 \leq i \neq j \leq N$, independently of other edges, where $f\colon\,[0,1] \times [0,1] \to [0,\infty)$ is a continuous function such that $f(x,y)=f(y,x)$ for all $x,y \in [0,1]$. We study the empirical distribution of both the adjacency matrix $A_N$ and the Laplacian matrix $Δ_N$ associated with $\mathbb G_N$ in the limit as $N \to \infty$ when $\lim_{N\to\infty} \varepsilon_N = 0$ and $\lim_{N\to\infty} N\varepsilon_N = \infty$. In particular, it is shown that the empirical spectral distributions of $A_N$ and $Δ_N$, after appropriate scaling and centering, converge to deterministic limits weakly in probability. For the special case where $f(x,y) = r(x)r(y)$ with $r\colon\,[0,1] \to [0,\infty)$ a continuous function, we give an explicit characterization of the limiting distributions. Furthermore, applications of the results to constrained random graphs, Chung-Lu random graphs and social networks are shown.
△ Less
Submitted 15 October, 2019; v1 submitted 26 July, 2018;
originally announced July 2018.
-
Regular variation and free regular infinitely divisible laws
Authors:
Arijit Chakrabarty,
Sukrit Chakraborty,
Rajat Subhra Hazra
Abstract:
In this article the relation between the tail behaviours of a free regular infinitely divisible (positively supported) probability measure and its Lévy measure is studied. An important example of such a measure is the compound free Poisson distribution, which often occurs as a limiting spectral distribution of certain sequences of random matrices. We also describe a connection between an analogous…
▽ More
In this article the relation between the tail behaviours of a free regular infinitely divisible (positively supported) probability measure and its Lévy measure is studied. An important example of such a measure is the compound free Poisson distribution, which often occurs as a limiting spectral distribution of certain sequences of random matrices. We also describe a connection between an analogous classical result of Embrechts et al. [1979] and our result using the Bercovici-Pata bijection.
△ Less
Submitted 3 October, 2018; v1 submitted 19 April, 2018;
originally announced April 2018.
-
A note on the folklore of free independence
Authors:
Arijit Chakrabarty,
Sukrit Chakraborty,
Rajat Subhra Hazra
Abstract:
It is shown that a Wishart matrix of standard complex normal random variables is asymptotically freely independent of an independent random matrix, under minimal conditions, in two different sense of asymptotic free independence.
It is shown that a Wishart matrix of standard complex normal random variables is asymptotically freely independent of an independent random matrix, under minimal conditions, in two different sense of asymptotic free independence.
△ Less
Submitted 3 February, 2018;
originally announced February 2018.