-
Local Adaptivity in Federated Learning: Convergence and Consistency
Authors:
Jianyu Wang,
Zheng Xu,
Zachary Garrett,
Zachary Charles,
Luyang Liu,
Gauri Joshi
Abstract:
The federated learning (FL) framework trains a machine learning model using decentralized data stored at edge client devices by periodically aggregating locally trained models. Popular optimization algorithms of FL use vanilla (stochastic) gradient descent for both local updates at clients and global updates at the aggregating server. Recently, adaptive optimization methods such as AdaGrad have be…
▽ More
The federated learning (FL) framework trains a machine learning model using decentralized data stored at edge client devices by periodically aggregating locally trained models. Popular optimization algorithms of FL use vanilla (stochastic) gradient descent for both local updates at clients and global updates at the aggregating server. Recently, adaptive optimization methods such as AdaGrad have been studied for server updates. However, the effect of using adaptive optimization methods for local updates at clients is not yet understood. We show in both theory and practice that while local adaptive methods can accelerate convergence, they can cause a non-vanishing solution bias, where the final converged solution may be different from the stationary point of the global objective function. We propose correction techniques to overcome this inconsistency and complement the local adaptive methods for FL. Extensive experiments on realistic federated training tasks show that the proposed algorithms can achieve faster convergence and higher test accuracy than the baselines without local adaptivity.
△ Less
Submitted 4 June, 2021;
originally announced June 2021.
-
Photometric and Spectroscopic Analysis of Five Am Stars
Authors:
Gireesh C. Joshi
Abstract:
The photometric analysis of sample Am stars is carried out to determine the stellar characteristics and to constrain the stellar dynamics. The spectroscopic analysis of the studied Am stars confirms their general characteristics of Am stars. The available data on elemental abundances for HD 113878 and HD 118660 have shown different characteristics during different epochs of observations. The basic…
▽ More
The photometric analysis of sample Am stars is carried out to determine the stellar characteristics and to constrain the stellar dynamics. The spectroscopic analysis of the studied Am stars confirms their general characteristics of Am stars. The available data on elemental abundances for HD 113878 and HD 118660 have shown different characteristics during different epochs of observations. The basic stellar parameters (mass, luminosity, radius, life time, distance, proper-motion, etc.) are also determined to identify the stellar habitat zones for earth like exoplanet. Such information is important to identify suitable planets for human settlement in the near future. In this connection, the tidal radius and boundaries of the habitable zone of each star have been computed to support the search of an extra-terrestrial life around them. Asteroseismic mass scale test shows greater stellar masses comparable to the solar mass.
△ Less
Submitted 2 June, 2021;
originally announced June 2021.
-
A Review on Explainability in Multimodal Deep Neural Nets
Authors:
Gargi Joshi,
Rahee Walambe,
Ketan Kotecha
Abstract:
Artificial Intelligence techniques powered by deep neural nets have achieved much success in several application domains, most significantly and notably in the Computer Vision applications and Natural Language Processing tasks. Surpassing human-level performance propelled the research in the applications where different modalities amongst language, vision, sensory, text play an important role in a…
▽ More
Artificial Intelligence techniques powered by deep neural nets have achieved much success in several application domains, most significantly and notably in the Computer Vision applications and Natural Language Processing tasks. Surpassing human-level performance propelled the research in the applications where different modalities amongst language, vision, sensory, text play an important role in accurate predictions and identification. Several multimodal fusion methods employing deep learning models are proposed in the literature. Despite their outstanding performance, the complex, opaque and black-box nature of the deep neural nets limits their social acceptance and usability. This has given rise to the quest for model interpretability and explainability, more so in the complex tasks involving multimodal AI methods. This paper extensively reviews the present literature to present a comprehensive survey and commentary on the explainability in multimodal deep neural nets, especially for the vision and language tasks. Several topics on multimodal AI and its applications for generic domains have been covered in this paper, including the significance, datasets, fundamental building blocks of the methods and techniques, challenges, applications, and future trends in this domain
△ Less
Submitted 18 May, 2021; v1 submitted 17 May, 2021;
originally announced May 2021.
-
Adaptive Policy Transfer in Reinforcement Learning
Authors:
Girish Joshi,
Girish Chowdhary
Abstract:
Efficient and robust policy transfer remains a key challenge for reinforcement learning to become viable for real-wold robotics. Policy transfer through warm initialization, imitation, or interacting over a large set of agents with randomized instances, have been commonly applied to solve a variety of Reinforcement Learning tasks. However, this seems far from how skill transfer happens in the biol…
▽ More
Efficient and robust policy transfer remains a key challenge for reinforcement learning to become viable for real-wold robotics. Policy transfer through warm initialization, imitation, or interacting over a large set of agents with randomized instances, have been commonly applied to solve a variety of Reinforcement Learning tasks. However, this seems far from how skill transfer happens in the biological world: Humans and animals are able to quickly adapt the learned behaviors between similar tasks and learn new skills when presented with new situations. Here we seek to answer the question: Will learning to combine adaptation and exploration lead to a more efficient transfer of policies between domains? We introduce a principled mechanism that can "Adapt-to-Learn", that is adapt the source policy to learn to solve a target task with significant transition differences and uncertainties. We show that the presented method learns to seamlessly combine learning from adaptation and exploration and leads to a robust policy transfer algorithm with significantly reduced sample complexity in transferring skills between related tasks.
△ Less
Submitted 10 May, 2021;
originally announced May 2021.
-
Adaptive Quantization of Model Updates for Communication-Efficient Federated Learning
Authors:
Divyansh Jhunjhunwala,
Advait Gadhikar,
Gauri Joshi,
Yonina C. Eldar
Abstract:
Communication of model updates between client nodes and the central aggregating server is a major bottleneck in federated learning, especially in bandwidth-limited settings and high-dimensional models. Gradient quantization is an effective way of reducing the number of bits required to communicate each model update, albeit at the cost of having a higher error floor due to the higher variance of th…
▽ More
Communication of model updates between client nodes and the central aggregating server is a major bottleneck in federated learning, especially in bandwidth-limited settings and high-dimensional models. Gradient quantization is an effective way of reducing the number of bits required to communicate each model update, albeit at the cost of having a higher error floor due to the higher variance of the stochastic gradients. In this work, we propose an adaptive quantization strategy called AdaQuantFL that aims to achieve communication efficiency as well as a low error floor by changing the number of quantization levels during the course of training. Experiments on training deep neural networks show that our method can converge in much fewer communicated bits as compared to fixed quantization level setups, with little or no impact on training and test accuracy.
△ Less
Submitted 8 February, 2021;
originally announced February 2021.
-
Temperature Dependence of $β-Ga_2O_3$ Heteroepitaxy on c-plane Sapphire using Low Pressure Chemical Vapor Deposition
Authors:
Gavax Joshi,
Yogesh Singh Chauhan,
Amit Verma
Abstract:
$β-Ga_2O_3$ has drawn significant attention for power electronics and deep ultraviolet (UV) photodetectors owing to its wide bandgap of ~ 4.4 - 4.9 eV and high electric breakdown strength ~7-8 MV/cm. Growth of $β-Ga_2O_3…
▽ More
$β-Ga_2O_3$ has drawn significant attention for power electronics and deep ultraviolet (UV) photodetectors owing to its wide bandgap of ~ 4.4 - 4.9 eV and high electric breakdown strength ~7-8 MV/cm. Growth of $β-Ga_2O_3$ epitaxial thin films with high growth rate has been recently reported using low pressure chemical vapor deposition (LPCVD) technique. In this work, we have investigated the effect of growth temperature on $β-Ga_2O_3$ films grown on c-plane sapphire substrates using LPCVD. We performed growths by varying temperatures from 800$^°$C to 950$^°$C while keeping all other growth parameters (Ar/O$_2$ gas flow rates, growth pressure, and Gallium precursor to substrate distance) constant. Optical, structural, and surface characterizations are performed to determine the bandgap, phase purity, crystal orientation, and crystalline quality of the grown thin films. Amorphous islands of $Ga_2O_3$ are observed at growth temperature of 800$^°$C while continuous and crystalline (-201) oriented $β-Ga_2O_3$ thin films are achieved for growth temperatures of 850$^°$C to 950$^°$C. Crystallinity of the films is found to improve with increase in growth temperature with a minimum rocking full width at half maximum of 1.52$^°$ in sample grown at 925$^°$C. For all the samples grown at and above 875$^°$C, transmittance measurements revealed an optical bandgap of ~4.77-4.80 eV with high growth rate of ~6 $μ$m/hr.
△ Less
Submitted 7 February, 2021;
originally announced February 2021.
-
Critical anomalous metals near superconductivity in models with random interactions
Authors:
Chenyuan Li,
Darshan G. Joshi,
Subir Sachdev
Abstract:
Anomalous metals are observed in numerous experiments on disordered two-dimensional systems proximate to superconductivity. A characteristic feature of an anomalous metal is that its low temperature conductivity has a weakly temperature dependent value, significantly higher than that of a disordered Fermi liquid. We propose a dynamical mean-field model of an anomalous metal: interacting electrons…
▽ More
Anomalous metals are observed in numerous experiments on disordered two-dimensional systems proximate to superconductivity. A characteristic feature of an anomalous metal is that its low temperature conductivity has a weakly temperature dependent value, significantly higher than that of a disordered Fermi liquid. We propose a dynamical mean-field model of an anomalous metal: interacting electrons similar in structure to that of the well-studied universal Hamiltonian of mesoscopic metallic grains, but with independent random interactions between pairs of sites, involving Cooper pair hopping and spin exchange. We find evidence for critical anomalous phases or points between a superconducting phase and a disordered Fermi liquid phase in this model. Our results are obtained by a renormalization group analysis in a weak coupling limit, and a complementary solution at large $M$ when the spin symmetry is generalized to USp($M$). The large $M$ limit describes the anomalous metal by fractionalization of the electron into spinons, holons, and doublons, with these partons forming critical non-Fermi liquids in the Sachdev-Ye-Kitaev class. We compute the low temperature conductivity in the large $M$ limit, and find temperature-independent values moderately enhanced from that in the disordered metal.
△ Less
Submitted 29 March, 2021; v1 submitted 2 February, 2021;
originally announced February 2021.
-
The cumulative star-formation histories of dwarf galaxies with TNG50. I: Environment-driven diversity and connection to quenching
Authors:
Gandhali D. Joshi,
Annalisa Pillepich,
Dylan Nelson,
Elad Zinger,
Federico Marinacci,
Volker Springel,
Mark Vogelsberger,
Lars Hernquist
Abstract:
We present the cumulative star-formation histories (SFHs) of >15000 dwarf galaxies ($M_{*}=10^{7-10}M_{\odot}$) from the TNG50 run of the IllustrisTNG suite across a vast range of environments. The key factors determining the dwarfs' SFHs are their status as central or satellite and their stellar mass, with centrals and more massive dwarfs assembling their stellar mass at later times on average co…
▽ More
We present the cumulative star-formation histories (SFHs) of >15000 dwarf galaxies ($M_{*}=10^{7-10}M_{\odot}$) from the TNG50 run of the IllustrisTNG suite across a vast range of environments. The key factors determining the dwarfs' SFHs are their status as central or satellite and their stellar mass, with centrals and more massive dwarfs assembling their stellar mass at later times on average compared to satellites and lower mass dwarfs. The satellites (in hosts of total mass $M_{200c,\,host}=10^{12-14.3}M_{\odot}$) assembled 90% of their z=0 stellar mass ~$7.0_{-5.5}^{+3.3}$ Gyr ago, while the centrals did so only ~$1.0_{-0.5}^{+4.0}$ Gyr ago. TNG50 predicts a large diversity in SFHs for both centrals and satellites, so that the stacked cumulative SFHs are representative of the TNG50 dwarf populations only in an average sense and individual dwarfs can have significantly different cumulative SFHs. Satellite dwarfs with the highest stellar mass to host mass ratios have the latest stellar mass assembly. Satellites at fixed stellar and host halo mass, found closer to the cluster centre, or accreted at earlier times, show significantly earlier stellar mass assembly. These trends, as well as the shapes of the SFHs themselves, are a manifestation of the varying proportions within a given subsample of quenched vs. star-forming galaxies, which exhibit markedly distinct SFH shapes. We also find a subtle effect whereby satellite dwarfs in the most massive hosts at z=0 have higher SFRs at early times, well before final infall into their z=0 host, compared to a control sample of centrals mass-matched at the time of accretion. This suggests that the large-scale environment can have a mild effect even on future satellites by providing the conditions for enhanced SF at early epochs. Our results are useful theoretical predictions for comparison to future resolved-stellar-population observations.
△ Less
Submitted 28 January, 2021;
originally announced January 2021.
-
Photometric Search for variable stars in the field of two Northern open clusters, DOLIDGE 14 and NGC 1960
Authors:
Gireesh C. Joshi
Abstract:
The aim of present work is extract and analyses the light curves of the stars in the field of two clusters, NGC 1960 and DOLIDZE 14. The photometric calibration is performed by comprehensive method of secondary standard transformation and differential photometry using two comparison stars per candidate variable star. The resultant light curves for each potential variable star are displayed and the…
▽ More
The aim of present work is extract and analyses the light curves of the stars in the field of two clusters, NGC 1960 and DOLIDZE 14. The photometric calibration is performed by comprehensive method of secondary standard transformation and differential photometry using two comparison stars per candidate variable star. The resultant light curves for each potential variable star are displayed and their period analyzed by two different methods. The period and classification of 18 discovered short periodic type variable stars of NGC 1960 are discussed, which consist of four known variable stars and fourteen new variable stars. In the case of DOLIDZE 14, four discovered variables consist of one miscellaneous, one rotational, two $binary$ type variable stars. In the case of NGC~1960, the 12 selected comparison stars appear to be likely candidate for long periodic variability and 4 stars may be standard stars. The variation in brightness of other twenty comparison stars is non-pulsating with an irregular pattern. Membership analysis of variable stars is performed using their distance, kinematic probability and location in $(U-B)$ vs $(B-V)$ TCD. C-M diagrams were constructed to confirm the evolutionary state of the new variable stars.
△ Less
Submitted 1 August, 2023; v1 submitted 3 January, 2021;
originally announced January 2021.
-
Synergy via Redundancy: Adaptive Replication Strategies and Fundamental Limits
Authors:
Gauri Joshi,
Dhruva Kaushal
Abstract:
The maximum possible throughput (or the rate of job completion) of a multi-server system is typically the sum of the service rates of individual servers. Recent work shows that launching multiple replicas of a job and canceling them as soon as one copy finishes can boost the throughput, especially when the service time distribution has high variability. This means that redundancy can, in fact, cre…
▽ More
The maximum possible throughput (or the rate of job completion) of a multi-server system is typically the sum of the service rates of individual servers. Recent work shows that launching multiple replicas of a job and canceling them as soon as one copy finishes can boost the throughput, especially when the service time distribution has high variability. This means that redundancy can, in fact, create synergy among servers such that their overall throughput is greater than the sum of individual servers. This work seeks to find the fundamental limit of the throughput boost achieved by job replication and the optimal replication policy to achieve it. While most previous works consider upfront replication policies, we expand the set of possible policies to delayed launch of replicas. The search for the optimal adaptive replication policy can be formulated as a Markov Decision Process, using which we propose two myopic replication policies, MaxRate and AdaRep, to adaptively replicate jobs. In order to quantify the optimality gap of these and other policies, we derive upper bounds on the service capacity, which provide fundamental limits on the throughput of queueing systems with redundancy.
△ Less
Submitted 25 December, 2020;
originally announced December 2020.
-
Bandit-based Communication-Efficient Client Selection Strategies for Federated Learning
Authors:
Yae Jee Cho,
Samarth Gupta,
Gauri Joshi,
Osman Yağan
Abstract:
Due to communication constraints and intermittent client availability in federated learning, only a subset of clients can participate in each training round. While most prior works assume uniform and unbiased client selection, recent work on biased client selection has shown that selecting clients with higher local losses can improve error convergence speed. However, previously proposed biased sel…
▽ More
Due to communication constraints and intermittent client availability in federated learning, only a subset of clients can participate in each training round. While most prior works assume uniform and unbiased client selection, recent work on biased client selection has shown that selecting clients with higher local losses can improve error convergence speed. However, previously proposed biased selection strategies either require additional communication cost for evaluating the exact local loss or utilize stale local loss, which can even make the model diverge. In this paper, we present a bandit-based communication-efficient client selection strategy UCB-CS that achieves faster convergence with lower communication overhead. We also demonstrate how client selection can be used to improve fairness.
△ Less
Submitted 14 December, 2020;
originally announced December 2020.
-
Asynchronous Deep Model Reference Adaptive Control
Authors:
Girish Joshi,
Jasvir Virdi,
Girish Chowdhary
Abstract:
In this paper, we present Asynchronous implementation of Deep Neural Network-based Model Reference Adaptive Control (DMRAC). We evaluate this new neuro-adaptive control architecture through flight tests on a small quadcopter. We demonstrate that a single DMRAC controller can handle significant nonlinearities due to severe system faults and deliberate wind disturbances while executing high-bandwidt…
▽ More
In this paper, we present Asynchronous implementation of Deep Neural Network-based Model Reference Adaptive Control (DMRAC). We evaluate this new neuro-adaptive control architecture through flight tests on a small quadcopter. We demonstrate that a single DMRAC controller can handle significant nonlinearities due to severe system faults and deliberate wind disturbances while executing high-bandwidth attitude control. We also show that the architecture has long-term learning abilities across different flight regimes, and can generalize to fly different flight trajectories than those on which it was trained. These results demonstrating the efficacy of this architecture for high bandwidth closed-loop attitude control of unstable and nonlinear robots operating in adverse situations. To achieve these results, we designed a software+communication architecture to ensure online real-time inference of the deep network on a high-bandwidth computation-limited platform. We expect that this architecture will benefit other deep learning in the closed-loop experiments on robots.
△ Less
Submitted 4 November, 2020;
originally announced November 2020.
-
Client Selection in Federated Learning: Convergence Analysis and Power-of-Choice Selection Strategies
Authors:
Yae Jee Cho,
Jianyu Wang,
Gauri Joshi
Abstract:
Federated learning is a distributed optimization paradigm that enables a large number of resource-limited client nodes to cooperatively train a model without data sharing. Several works have analyzed the convergence of federated learning by accounting of data heterogeneity, communication and computation limitations, and partial client participation. However, they assume unbiased client participati…
▽ More
Federated learning is a distributed optimization paradigm that enables a large number of resource-limited client nodes to cooperatively train a model without data sharing. Several works have analyzed the convergence of federated learning by accounting of data heterogeneity, communication and computation limitations, and partial client participation. However, they assume unbiased client participation, where clients are selected at random or in proportion of their data sizes. In this paper, we present the first convergence analysis of federated optimization for biased client selection strategies, and quantify how the selection bias affects convergence speed. We reveal that biasing client selection towards clients with higher local loss achieves faster error convergence. Using this insight, we propose Power-of-Choice, a communication- and computation-efficient client selection framework that can flexibly span the trade-off between convergence speed and solution bias. Our experiments demonstrate that Power-of-Choice strategies converge up to 3 $\times$ faster and give $10$% higher test accuracy than the baseline random selection.
△ Less
Submitted 2 October, 2020;
originally announced October 2020.
-
A new architecture for hand-worn Sign language to Speech translator
Authors:
Sai Charan Bodda,
Palki Gupta,
Gaurav Joshi,
Ayush Chaturvedi
Abstract:
People with speech and hearing impairments often rely on sign language to communicate with others but most of the general population cannot understand sign language and sign language itself is a difficult language to learn, so there is a definite need for technologies to translate sign language to speech. In this paper, we describe the design and implementation of Smart glove, a hand-worn hardware…
▽ More
People with speech and hearing impairments often rely on sign language to communicate with others but most of the general population cannot understand sign language and sign language itself is a difficult language to learn, so there is a definite need for technologies to translate sign language to speech. In this paper, we describe the design and implementation of Smart glove, a hand-worn hardware device capable of translating American Sign Language gestures into English speech by tracking the finger's orientation, gestures and hand motion. It uses hardware sensors like Flex, Accelerometer and gyroscope and intelligent software to capture and translate the gestures into speech. This paper explains the translation of both Alphabet and Word gestures. New approaches and algorithms are proposed and implemented to address hardware-dependent issues in existing glove based designs. The whole device is designed to be modular with distributed processing units to encourage modular enhancement, reducing complexity, and interrelation between subsystems.Decision Trees are used in gesture recognition and error correction. We hope that the henceforth mentioned design and architecture would be the basis for the advancement in research related to sensor-based sign language translation along with research for smart glove and cybernetic accessories.
△ Less
Submitted 8 September, 2020;
originally announced September 2020.
-
Service Rate Region: A New Aspect of Coded Distributed System Design
Authors:
Mehmet Aktas,
Gauri Joshi,
Swanand Kadhe,
Fatemeh Kazemi,
Emina Soljanin
Abstract:
Erasure coding has been recognized as a powerful method to mitigate delays due to slow or straggling nodes in distributed systems. This work shows that erasure coding of data objects can flexibly handle skews in the request rates. Coding can help boost the \emph{service rate region}, that is, increase the overall volume of data access requests that the system can handle. This paper aims to postula…
▽ More
Erasure coding has been recognized as a powerful method to mitigate delays due to slow or straggling nodes in distributed systems. This work shows that erasure coding of data objects can flexibly handle skews in the request rates. Coding can help boost the \emph{service rate region}, that is, increase the overall volume of data access requests that the system can handle. This paper aims to postulate the service rate region as an important consideration in the design of erasure-coded distributed systems. We highlight several open problems that can be grouped into two broad threads: 1) characterizing the service rate region of a given code and finding the optimal request allocation, and2) designing the underlying erasure code for a given service rate region. As contributions along the first thread, we characterize the rate regions of maximum-distance-separable, locally repairable, and Simplex codes. We show the effectiveness of hybrid codes that combine replication and erasure coding in terms of code design. We also discover fundamental connections between multi-set batch codes and the problem of maximizing the service rate region.
△ Less
Submitted 27 June, 2021; v1 submitted 3 September, 2020;
originally announced September 2020.
-
Quenched fractions in the IllustrisTNG simulations: the roles of AGN feedback, environment, and pre-processing
Authors:
Martina Donnari,
Annalisa Pillepich,
Gandhali D. Joshi,
Dylan Nelson,
Shy Genel,
Federico Marinacci,
Vicente Rodriguez-Gomez,
Ruediger Pakmor,
Paul Torrey,
Mark Vogelsberger,
Lars Hernquist
Abstract:
We use the IllustrisTNG simulations to show how the fractions of quenched galaxies vary across different environments and cosmic time, and to quantify the role AGN feedback and preprocessing play in quenching group and cluster satellites. At $z=0$, we select galaxies with $M_* = 10^{9-12} M_{\odot}$ residing within ($\leq R_{200c}$) groups and clusters of total host mass…
▽ More
We use the IllustrisTNG simulations to show how the fractions of quenched galaxies vary across different environments and cosmic time, and to quantify the role AGN feedback and preprocessing play in quenching group and cluster satellites. At $z=0$, we select galaxies with $M_* = 10^{9-12} M_{\odot}$ residing within ($\leq R_{200c}$) groups and clusters of total host mass $M_{200c}=10^{13-15.2} M_{\odot}$. TNG predicts a quenched fraction of $\sim70-90\%$ (on average) for centrals and satellites $\gtrsim 10^{10.5} M_{\odot}$, regardless of host mass, cosmic time ($0\leq z\leq0.5$), clustercentric distance and time since infall in the $z=0$ host. Low-mass centrals ($\lesssim 10^{10} M_{\odot}$), instead, are rarely quenched unless they become members of groups ($10^{13-14} M_{\odot}$) or clusters ($\geq10^{14} M_{\odot}$), where the quenched fraction rises to $\sim80\%$. The fraction of low-mass passive galaxies is higher closer to the host center and for more massive hosts. The population of low-mass satellites accreted $\gtrsim$4-6 Gyr ago in massive hosts is almost entirely passive, thus suggesting an upper limit for the time needed for environmental quenching to occur. In fact, $\sim30\%$ of group and cluster satellites that are quenched at $z=0$ were already quenched before falling into their current host, and the bulk of them quenched as early as 4 to 10 billion years ago. For low-mass galaxies ($\lesssim10^{10-10.5}M_{\odot}$), this is due to preprocessing, whereby current satellites may have been members of other hosts, and hence have undergone environmental processes, before falling into their final host, this mechanism being more common and more effective for the purposes of quenching for satellites found today in more massive hosts. On the other hand, massive galaxies quench on their own and because of AGN feedback, regardless of whether they are centrals or satellites.
△ Less
Submitted 14 October, 2020; v1 submitted 31 July, 2020;
originally announced August 2020.
-
Probabilistic Neighbourhood Component Analysis: Sample Efficient Uncertainty Estimation in Deep Learning
Authors:
Ankur Mallick,
Chaitanya Dwivedi,
Bhavya Kailkhura,
Gauri Joshi,
T. Yong-Jin Han
Abstract:
While Deep Neural Networks (DNNs) achieve state-of-the-art accuracy in various applications, they often fall short in accurately estimating their predictive uncertainty and, in turn, fail to recognize when these predictions may be wrong. Several uncertainty-aware models, such as Bayesian Neural Network (BNNs) and Deep Ensembles have been proposed in the literature for quantifying predictive uncert…
▽ More
While Deep Neural Networks (DNNs) achieve state-of-the-art accuracy in various applications, they often fall short in accurately estimating their predictive uncertainty and, in turn, fail to recognize when these predictions may be wrong. Several uncertainty-aware models, such as Bayesian Neural Network (BNNs) and Deep Ensembles have been proposed in the literature for quantifying predictive uncertainty. However, research in this area has been largely confined to the big data regime. In this work, we show that the uncertainty estimation capability of state-of-the-art BNNs and Deep Ensemble models degrades significantly when the amount of training data is small. To address the issue of accurate uncertainty estimation in the small-data regime, we propose a probabilistic generalization of the popular sample-efficient non-parametric kNN approach. Our approach enables deep kNN classifier to accurately quantify underlying uncertainties in its prediction. We demonstrate the usefulness of the proposed approach by achieving superior uncertainty quantification as compared to state-of-the-art on a real-world application of COVID-19 diagnosis from chest X-Rays. Our code is available at https://github.com/ankurmallick/sample-efficient-uq
△ Less
Submitted 18 July, 2020;
originally announced July 2020.
-
Tackling the Objective Inconsistency Problem in Heterogeneous Federated Optimization
Authors:
Jianyu Wang,
Qinghua Liu,
Hao Liang,
Gauri Joshi,
H. Vincent Poor
Abstract:
In federated optimization, heterogeneity in the clients' local datasets and computation speeds results in large variations in the number of local updates performed by each client in each communication round. Naive weighted aggregation of such models causes objective inconsistency, that is, the global model converges to a stationary point of a mismatched objective function which can be arbitrarily…
▽ More
In federated optimization, heterogeneity in the clients' local datasets and computation speeds results in large variations in the number of local updates performed by each client in each communication round. Naive weighted aggregation of such models causes objective inconsistency, that is, the global model converges to a stationary point of a mismatched objective function which can be arbitrarily different from the true objective. This paper provides a general framework to analyze the convergence of federated heterogeneous optimization algorithms. It subsumes previously proposed methods such as FedAvg and FedProx and provides the first principled understanding of the solution bias and the convergence slowdown due to objective inconsistency. Using insights from this analysis, we propose FedNova, a normalized averaging method that eliminates objective inconsistency while preserving fast error convergence.
△ Less
Submitted 15 July, 2020;
originally announced July 2020.
-
Anomalous density fluctuations in a random $t$-$J$ model
Authors:
Darshan G. Joshi,
Subir Sachdev
Abstract:
A previous work (Joshi et al., arXiv:1912.08822) found a deconfined critical point at non-zero doping in a $t$-$J$ model with all-to-all and random hopping and spin exchange, and argued for its relevance to the phenomenology of the cuprates. We extend this model to include all-to-all and random density-density interactions of mean-square strength $K$. In a fixed realization of the disorder, and fo…
▽ More
A previous work (Joshi et al., arXiv:1912.08822) found a deconfined critical point at non-zero doping in a $t$-$J$ model with all-to-all and random hopping and spin exchange, and argued for its relevance to the phenomenology of the cuprates. We extend this model to include all-to-all and random density-density interactions of mean-square strength $K$. In a fixed realization of the disorder, and for specific values of the hopping, exchange, and density interactions, the model is supersymmetric; but, we find no supersymmetry after independent averages over the interactions. Using the previously developed renormalization group analysis, we find a new fixed point at non-zero $K$. However, this fixed point is unstable towards the previously found fixed point at $K=0$ in our perturbative analysis. We compute the exponent characterizing density fluctuations at both fixed points: this exponent determines the spectrum of electron energy-loss spectroscopy.
△ Less
Submitted 9 September, 2020; v1 submitted 24 June, 2020;
originally announced June 2020.
-
Robust and Precision Satellite Formation Flying Guidance Using Adaptive Optimal Control Techniques
Authors:
Girish Joshi
Abstract:
The main focus of the work presented in this thesis is to develop an optimal control based formation flying control strategy for high precision formation flying of small satellites that have restricted computation and storage capacity. Using the recently developed model predictive static programming (MPSP), and Generalized MPSP algorithm a suboptimal guidance logic is presented for the formation f…
▽ More
The main focus of the work presented in this thesis is to develop an optimal control based formation flying control strategy for high precision formation flying of small satellites that have restricted computation and storage capacity. Using the recently developed model predictive static programming (MPSP), and Generalized MPSP algorithm a suboptimal guidance logic is presented for the formation flying of small satellites. The proposed guidance scheme is valid both for high eccentricity chief satellite orbits as well as the large separation distance between the chief and deputy satellites. Comparative study with standard Linear Quadratic Regulator (LQR) solution (which serves as a guess solution for MPSP) and another nonlinear controller, Finite-time State-Dependent Ricatti Equation (SDRE) reveals that MPSP guidance achieves the objective with higher accuracy and with a lesser amount of control usage.
△ Less
Submitted 31 May, 2020;
originally announced June 2020.
-
The fate of disk galaxies in IllustrisTNG clusters
Authors:
Gandhali D. Joshi,
Annalisa Pillepich,
Dylan Nelson,
Federico Marinacci,
Volker Springel,
Vicente Rodriguez-Gomez,
Mark Vogelsberger,
Lars Hernquist
Abstract:
We study the stellar morphological evolution of disc galaxies within clusters in the TNG50 and TNG100 runs from the IllustrisTNG simulation suite. We select satellites of masses $10^{9.7} \leq M_{*,z=0}/\text{M}_{\odot} \leq 10^{11.6}$ residing in clusters of masses $10^{14} \lesssim M_{\text{200c,z=0}}/\text{M}_{\odot} \leq 10^{14.6}$ at $z=0$ and that were discs at accretion according to a kinem…
▽ More
We study the stellar morphological evolution of disc galaxies within clusters in the TNG50 and TNG100 runs from the IllustrisTNG simulation suite. We select satellites of masses $10^{9.7} \leq M_{*,z=0}/\text{M}_{\odot} \leq 10^{11.6}$ residing in clusters of masses $10^{14} \lesssim M_{\text{200c,z=0}}/\text{M}_{\odot} \leq 10^{14.6}$ at $z=0$ and that were discs at accretion according to a kinematic morphology indicator (the circularity fraction). These are traced from the time of accretion to $z=0$ and compared to a control sample of central galaxies mass-matched at accretion. Most cluster discs become non-discy by $z=0$, in stark contrast with the control discs, of which a significant fraction remains discy over the same timescales. Cluster discs become non-discy accompanied by gas removal and star formation quenching, loss of dark matter and little growth or a loss of stellar mass. In contrast, control discs transform while also losing gas mass and quenching, but growing significantly in dark matter and stellar mass. Most cluster satellites change morphologies on similar timescales regardless of stellar mass, in $\sim0.5-4$ Gyr after accretion. Cluster discs that experienced more numerous and closer pericentric passages show the largest change in morphology. Morphological change in all cases requires the presence of a gravitational perturbation to drive stellar orbits to non-discy configurations, along with gas removal/heating to prevent replenishment of the disc through continued star-formation. For cluster discs, the perturbation is impulsive tidal shocking at pericentres and not tidal stripping of outer disc stellar material, whereas for control discs, a combination of mergers and AGN feedback appears to be the key driving force behind morphological transformations.
△ Less
Submitted 14 July, 2020; v1 submitted 2 April, 2020;
originally announced April 2020.
-
Slow and Stale Gradients Can Win the Race
Authors:
Sanghamitra Dutta,
Jianyu Wang,
Gauri Joshi
Abstract:
Distributed Stochastic Gradient Descent (SGD) when run in a synchronous manner, suffers from delays in runtime as it waits for the slowest workers (stragglers). Asynchronous methods can alleviate stragglers, but cause gradient staleness that can adversely affect the convergence error. In this work, we present a novel theoretical characterization of the speedup offered by asynchronous methods by an…
▽ More
Distributed Stochastic Gradient Descent (SGD) when run in a synchronous manner, suffers from delays in runtime as it waits for the slowest workers (stragglers). Asynchronous methods can alleviate stragglers, but cause gradient staleness that can adversely affect the convergence error. In this work, we present a novel theoretical characterization of the speedup offered by asynchronous methods by analyzing the trade-off between the error in the trained model and the actual training runtime(wallclock time). The main novelty in our work is that our runtime analysis considers random straggling delays, which helps us design and compare distributed SGD algorithms that strike a balance between straggling and staleness. We also provide a new error convergence analysis of asynchronous SGD variants without bounded or exponential delay assumptions. Finally, based on our theoretical characterization of the error-runtime trade-off, we propose a method of gradually varying synchronicity in distributed SGD and demonstrate its performance on CIFAR10 dataset.
△ Less
Submitted 23 March, 2020;
originally announced March 2020.
-
Machine Learning on Volatile Instances
Authors:
Xiaoxi Zhang,
Jianyu Wang,
Gauri Joshi,
Carlee Joe-Wong
Abstract:
Due to the massive size of the neural network models and training datasets used in machine learning today, it is imperative to distribute stochastic gradient descent (SGD) by splitting up tasks such as gradient evaluation across multiple worker nodes. However, running distributed SGD can be prohibitively expensive because it may require specialized computing resources such as GPUs for extended per…
▽ More
Due to the massive size of the neural network models and training datasets used in machine learning today, it is imperative to distribute stochastic gradient descent (SGD) by splitting up tasks such as gradient evaluation across multiple worker nodes. However, running distributed SGD can be prohibitively expensive because it may require specialized computing resources such as GPUs for extended periods of time. We propose cost-effective strategies to exploit volatile cloud instances that are cheaper than standard instances, but may be interrupted by higher priority workloads. To the best of our knowledge, this work is the first to quantify how variations in the number of active worker nodes (as a result of preemption) affects SGD convergence and the time to train the model. By understanding these trade-offs between preemption probability of the instances, accuracy, and training time, we are able to derive practical strategies for configuring distributed SGD jobs on volatile instances such as Amazon EC2 spot instances and other preemptible cloud instances. Experimental results show that our strategies achieve good training performance at substantially lower cost.
△ Less
Submitted 12 March, 2020;
originally announced March 2020.
-
Metal-insulator transition in a random Hubbard model
Authors:
Grigory Tarnopolsky,
Chenyuan Li,
Darshan G. Joshi,
Subir Sachdev
Abstract:
We examine the metal-insulator transition in a half-filled Hubbard model of electrons with random and all-to-all hopping and exchange, and an on-site non-random repulsion, the Hubbard $U$. We argue that recent numerical results of Cha et al. (arXiv:2002.07181) can be understood in terms of a deconfined critical point between a disordered Fermi liquid and an insulating spin glass. We find a deconfi…
▽ More
We examine the metal-insulator transition in a half-filled Hubbard model of electrons with random and all-to-all hopping and exchange, and an on-site non-random repulsion, the Hubbard $U$. We argue that recent numerical results of Cha et al. (arXiv:2002.07181) can be understood in terms of a deconfined critical point between a disordered Fermi liquid and an insulating spin glass. We find a deconfined critical point in a previously proposed large $M$ theory which generalizes the SU(2) spin symmetry to SU($M$), and obtain exponents for the electron and spin correlators which agree with those of Cha et al. We also present a renormalization group analysis, and argue for the presence of an additional metallic spin glass phase at half-filling and small $U$.
△ Less
Submitted 27 February, 2020;
originally announced February 2020.
-
The distinct stellar-to-halo mass relations of satellite and central galaxies: insights from the IllustrisTNG simulations
Authors:
Christoph Engler,
Annalisa Pillepich,
Gandhali D. Joshi,
Dylan Nelson,
Anna Pasquali,
Eva K. Grebel,
Thorsten Lisker,
Elad Zinger,
Martina Donnari,
Federico Marinacci,
Mark Vogelsberger,
Lars Hernquist
Abstract:
We study the stellar-to-halo mass relation (SHMR) for central and satellite galaxies with total dynamical masses above 10^10.5 Msun using the suite of cosmological magneto-hydrodynamical simulations IllustrisTNG. In particular, we quantify environmental effects on satellite populations from TNG50, TNG100, and TNG300 located within the virial radius of group- and cluster-like hosts with total masse…
▽ More
We study the stellar-to-halo mass relation (SHMR) for central and satellite galaxies with total dynamical masses above 10^10.5 Msun using the suite of cosmological magneto-hydrodynamical simulations IllustrisTNG. In particular, we quantify environmental effects on satellite populations from TNG50, TNG100, and TNG300 located within the virial radius of group- and cluster-like hosts with total masses of 10^12-15.2 Msun. At fixed stellar mass, the satellite SHMR exhibits a distinct shift towards lower dynamical mass compared to the SHMR of centrals. Conversely, at fixed dynamical mass, satellite galaxies appear to have larger stellar-to-total mass fractions than centrals by up to a factor of a few. The systematic deviation from the central SHMR is larger for satellites in more massive hosts, at smaller cluster-centric distances, with earlier infall times, and that inhabit higher local density environments; moreover, it is in place already at early times (z < 2). Systematic environmental effects might contribute to the perceived galaxy-to-galaxy variation in the measured SHMR when galaxies cannot be separated into satellites and centrals. The SHMR of satellites exhibits a larger scatter than centrals, over the whole range of dynamical mass (by up to 0.8 dex). The shift of the satellite SHMR results mostly from tidal stripping of their dark matter, which affects satellites in an outside-in fashion: the departure of the satellite SHMR from the centrals' relation diminishes for measurements of dynamical mass in progressively smaller apertures. Finally, we provide a family of fitting functions for the SHMR predicted by IllustrisTNG.
△ Less
Submitted 4 February, 2022; v1 submitted 25 February, 2020;
originally announced February 2020.
-
Overlap Local-SGD: An Algorithmic Approach to Hide Communication Delays in Distributed SGD
Authors:
Jianyu Wang,
Hao Liang,
Gauri Joshi
Abstract:
Distributed stochastic gradient descent (SGD) is essential for scaling the machine learning algorithms to a large number of computing nodes. However, the infrastructures variability such as high communication delay or random node slowdown greatly impedes the performance of distributed SGD algorithm, especially in a wireless system or sensor networks. In this paper, we propose an algorithmic approa…
▽ More
Distributed stochastic gradient descent (SGD) is essential for scaling the machine learning algorithms to a large number of computing nodes. However, the infrastructures variability such as high communication delay or random node slowdown greatly impedes the performance of distributed SGD algorithm, especially in a wireless system or sensor networks. In this paper, we propose an algorithmic approach named Overlap-Local-SGD (and its momentum variant) to overlap the communication and computation so as to speedup the distributed training procedure. The approach can help to mitigate the straggler effects as well. We achieve this by adding an anchor model on each node. After multiple local updates, locally trained models will be pulled back towards the synchronized anchor model rather than communicating with others. Experimental results of training a deep neural network on CIFAR-10 dataset demonstrate the effectiveness of Overlap-Local-SGD. We also provide a convergence guarantee for the proposed algorithm under non-convex objective functions.
△ Less
Submitted 21 February, 2020;
originally announced February 2020.
-
Deconfined critical point in a doped random quantum Heisenberg magnet
Authors:
Darshan G. Joshi,
Chenyuan Li,
Grigory Tarnopolsky,
Antoine Georges,
Subir Sachdev
Abstract:
We describe the phase diagram of electrons on a fully connected lattice with random hopping, subject to a random Heisenberg spin exchange interactions between any pair of sites and a constraint of no double occupancy. A perturbative renormalization group analysis yields a critical point with fractionalized excitations at a non-zero critical value $p_c$ of the hole doping $p$ away from the half-fil…
▽ More
We describe the phase diagram of electrons on a fully connected lattice with random hopping, subject to a random Heisenberg spin exchange interactions between any pair of sites and a constraint of no double occupancy. A perturbative renormalization group analysis yields a critical point with fractionalized excitations at a non-zero critical value $p_c$ of the hole doping $p$ away from the half-filled insulator. We compute the renormalization group to two loops, but some exponents are obtained to all loop order. We argue that the critical point $p_c$ is flanked by confining phases: a disordered Fermi liquid with carrier density $1+p$ for $p>p_c$, and a metallic spin glass with carrier density $p$ for $p<p_c$. Additional evidence for the critical behavior is obtained from a large $M$ analysis of a model which extends the SU(2) spin symmetry to SU($M$). We discuss the relationship of the vicinity of this deconfined quantum critical point to key aspects of cuprate phenomenology.
△ Less
Submitted 18 February, 2020; v1 submitted 18 December, 2019;
originally announced December 2019.
-
Advances and Open Problems in Federated Learning
Authors:
Peter Kairouz,
H. Brendan McMahan,
Brendan Avent,
Aurélien Bellet,
Mehdi Bennis,
Arjun Nitin Bhagoji,
Kallista Bonawitz,
Zachary Charles,
Graham Cormode,
Rachel Cummings,
Rafael G. L. D'Oliveira,
Hubert Eichner,
Salim El Rouayheb,
David Evans,
Josh Gardner,
Zachary Garrett,
Adrià Gascón,
Badih Ghazi,
Phillip B. Gibbons,
Marco Gruteser,
Zaid Harchaoui,
Chaoyang He,
Lie He,
Zhouyuan Huo,
Ben Hutchinson
, et al. (34 additional authors not shown)
Abstract:
Federated learning (FL) is a machine learning setting where many clients (e.g. mobile devices or whole organizations) collaboratively train a model under the orchestration of a central server (e.g. service provider), while keeping the training data decentralized. FL embodies the principles of focused data collection and minimization, and can mitigate many of the systemic privacy risks and costs re…
▽ More
Federated learning (FL) is a machine learning setting where many clients (e.g. mobile devices or whole organizations) collaboratively train a model under the orchestration of a central server (e.g. service provider), while keeping the training data decentralized. FL embodies the principles of focused data collection and minimization, and can mitigate many of the systemic privacy risks and costs resulting from traditional, centralized machine learning and data science approaches. Motivated by the explosive growth in FL research, this paper discusses recent advances and presents an extensive collection of open problems and challenges.
△ Less
Submitted 8 March, 2021; v1 submitted 10 December, 2019;
originally announced December 2019.
-
Multi-Armed Bandits with Correlated Arms
Authors:
Samarth Gupta,
Shreyas Chaudhari,
Gauri Joshi,
Osman Yağan
Abstract:
We consider a multi-armed bandit framework where the rewards obtained by pulling different arms are correlated. We develop a unified approach to leverage these reward correlations and present fundamental generalizations of classic bandit algorithms to the correlated setting. We present a unified proof technique to analyze the proposed algorithms. Rigorous analysis of C-UCB (the correlated bandit v…
▽ More
We consider a multi-armed bandit framework where the rewards obtained by pulling different arms are correlated. We develop a unified approach to leverage these reward correlations and present fundamental generalizations of classic bandit algorithms to the correlated setting. We present a unified proof technique to analyze the proposed algorithms. Rigorous analysis of C-UCB (the correlated bandit version of Upper-confidence-bound) reveals that the algorithm ends up pulling certain sub-optimal arms, termed as non-competitive, only O(1) times, as opposed to the O(log T) pulls required by classic bandit algorithms such as UCB, TS etc. We present regret-lower bound and show that when arms are correlated through a latent random source, our algorithms obtain order-optimal regret. We validate the proposed algorithms via experiments on the MovieLens and Goodreads datasets, and show significant improvement over classical bandit algorithms.
△ Less
Submitted 10 September, 2021; v1 submitted 6 November, 2019;
originally announced November 2019.
-
Emergence of Griffiths phase, re-entrant cluster glass,metamagnetic transition and field induced unusual spin dynamics in Tb2CoMnO6
Authors:
Khyati Anand,
Arkadeb Pal,
Prajyoti Singh,
Md. Alam,
Amish G Joshi,
Anita Mohan,
Sandip Chatterjee
Abstract:
The structural and magnetic properties of double perovskiteTb2CoMnO6 have been investigated. Electronic structure analysis by XPS study reveals the presence of mixed oxidation state (Mn4+/Mn3+ and Co2+/Co3+) of B-site ions. The dc and ac magnetization measurements reveal different interesting phases such as Griffith phase, re-entrant spin glass, metamagnetic steps, Hopkinson like peak and also unu…
▽ More
The structural and magnetic properties of double perovskiteTb2CoMnO6 have been investigated. Electronic structure analysis by XPS study reveals the presence of mixed oxidation state (Mn4+/Mn3+ and Co2+/Co3+) of B-site ions. The dc and ac magnetization measurements reveal different interesting phases such as Griffith phase, re-entrant spin glass, metamagnetic steps, Hopkinson like peak and also unusual slow relaxation. The M-H curve indicates the presence of competing AFM/FM interactions. The disorder in Tb2CoMnO6 leads to spin frustration at low temperature giving rise to the re-entrant spin glass. Moreover, the field-dependent ac susceptibility studies unraveled the presence of Hopkinson like peak associated with the domain wall motion and the large anisotropy field. The further study yielded that the relaxation associated with this peak is unusually slow.
△ Less
Submitted 30 October, 2019;
originally announced October 2019.
-
Adjustable coupling and in-situ variable frequency EPR probe with loop-gap resonators for spectroscopy up to X-band
Authors:
G. Joshi,
J. Kubasek,
I. Nikolov,
B. Sheehan,
T. A. costa,
R. A. A. Cassaro,
J. R. Friedman
Abstract:
In standard electron paramagnetic resonance (EPR) spectroscopy, the frequency of an experiment is set and the spectrum is acquired using magnetic field as the independent variable. There are cases in which it is desirable instead to fix the field and tune the frequency such as when studying avoided level crossings. We have designed and tested an adjustable frequency and variable coupling EPR probe…
▽ More
In standard electron paramagnetic resonance (EPR) spectroscopy, the frequency of an experiment is set and the spectrum is acquired using magnetic field as the independent variable. There are cases in which it is desirable instead to fix the field and tune the frequency such as when studying avoided level crossings. We have designed and tested an adjustable frequency and variable coupling EPR probe with loop-gap resonators (LGRs) that works at a temperature down to 1.8 K. The frequency is tuned by adjusting the height of a dielectric piece of sapphire inserted into the gap of an LGR; coupling of the microwave antenna is varied with the height of antenna above the LGR. Both coupling antenna and dielectric are located within the cryogenic sample chamber, but their motion is controlled with external micrometers located outside the cryostat. The frequency of the LGR can be adjusted by more than 1 GHz. To cover a wide range of frequencies, different LGRs can be designed to cover frequencies up to X-band. We demonstrate the operation of our probe by mapping out avoided crossings for the Ni$_4$ single-molecule magnet to determine the tunnel splittings with high precision.
△ Less
Submitted 23 October, 2019;
originally announced October 2019.
-
Deep Kernels with Probabilistic Embeddings for Small-Data Learning
Authors:
Ankur Mallick,
Chaitanya Dwivedi,
Bhavya Kailkhura,
Gauri Joshi,
T. Yong-Jin Han
Abstract:
Gaussian Processes (GPs) are known to provide accurate predictions and uncertainty estimates even with small amounts of labeled data by capturing similarity between data points through their kernel function. However traditional GP kernels are not very effective at capturing similarity between high dimensional data points. Neural networks can be used to learn good representations that encode intric…
▽ More
Gaussian Processes (GPs) are known to provide accurate predictions and uncertainty estimates even with small amounts of labeled data by capturing similarity between data points through their kernel function. However traditional GP kernels are not very effective at capturing similarity between high dimensional data points. Neural networks can be used to learn good representations that encode intricate structures in high dimensional data, and can be used as inputs to the GP kernel. However the huge data requirement of neural networks makes this approach ineffective in small data settings. To solves the conflicting problems of representation learning and data efficiency, we propose to learn deep kernels on probabilistic embeddings by using a probabilistic neural network. Our approach maps high-dimensional data to a probability distribution in a low dimensional subspace and then computes a kernel between these distributions to capture similarity. To enable end-to-end learning, we derive a functional gradient descent procedure for training the model. Experiments on a variety of datasets show that our approach outperforms the state-of-the-art in GP kernel learning in both supervised and semi-supervised settings. We also extend our approach to other small-data paradigms such as few-shot classification where it outperforms previous approaches on mini-Imagenet and CUB datasets.
△ Less
Submitted 13 November, 2021; v1 submitted 13 October, 2019;
originally announced October 2019.
-
Accelerating Deep Learning by Focusing on the Biggest Losers
Authors:
Angela H. Jiang,
Daniel L. -K. Wong,
Giulio Zhou,
David G. Andersen,
Jeffrey Dean,
Gregory R. Ganger,
Gauri Joshi,
Michael Kaminksy,
Michael Kozuch,
Zachary C. Lipton,
Padmanabhan Pillai
Abstract:
This paper introduces Selective-Backprop, a technique that accelerates the training of deep neural networks (DNNs) by prioritizing examples with high loss at each iteration. Selective-Backprop uses the output of a training example's forward pass to decide whether to use that example to compute gradients and update parameters, or to skip immediately to the next example. By reducing the number of co…
▽ More
This paper introduces Selective-Backprop, a technique that accelerates the training of deep neural networks (DNNs) by prioritizing examples with high loss at each iteration. Selective-Backprop uses the output of a training example's forward pass to decide whether to use that example to compute gradients and update parameters, or to skip immediately to the next example. By reducing the number of computationally-expensive backpropagation steps performed, Selective-Backprop accelerates training. Evaluation on CIFAR10, CIFAR100, and SVHN, across a variety of modern image models, shows that Selective-Backprop converges to target error rates up to 3.5x faster than with standard SGD and between 1.02--1.8x faster than a state-of-the-art importance sampling approach. Further acceleration of 26% can be achieved by using stale forward pass results for selection, thus also skipping forward passes of low priority examples.
△ Less
Submitted 1 October, 2019;
originally announced October 2019.
-
Perdeuteration of poly[2-methoxy-5-(2'-ethylhexyloxy)-1,4-phenylenevinylene] (d-MEHPPV): control of microscopic charge-carrier spin-spin coupling and of magnetic-field effects in optoelectronic devices
Authors:
Dani M. Stoltzfus,
Gajadhar Joshi,
Henna Popli,
Shirin Jamali,
Marzieh Kavand,
Sebastian Milster,
Tobias Grünbaum,
Sebastian Bange,
Adnan Nahlawi,
Mandefro Y. Teferi,
Sabastian I. Atwood,
Anna E. Leung,
Tamim A. Darwish,
Hans Malissa,
Paul L. Burn,
John M. Lupton,
Christoph Boehme
Abstract:
Control of the effective local hyperfine fields in a conjugated polymer, poly[2-methoxy-5-(2'-ethylhexyloxy)-1,4-phenylenevinylene] (MEHPPV), by isotopic engineering is reported. These fields, evident as a frequency-independent line broadening mechanism in electrically detected magnetic resonance spectroscopy (EDMR), originate from the unresolved hyperfine coupling between the electronic spin of c…
▽ More
Control of the effective local hyperfine fields in a conjugated polymer, poly[2-methoxy-5-(2'-ethylhexyloxy)-1,4-phenylenevinylene] (MEHPPV), by isotopic engineering is reported. These fields, evident as a frequency-independent line broadening mechanism in electrically detected magnetic resonance spectroscopy (EDMR), originate from the unresolved hyperfine coupling between the electronic spin of charge carrier pairs and the nuclear spins of surrounding hydrogen isotopes. The room temperature study of effects caused by complete deuteration of this polymer through magnetoresistance, magnetoelectroluminescence, coherent pulsed and multi-frequency EDMR, as well as inverse spin-Hall effect measurements, confirm the weak hyperfine broadening of charge carrier magnetic resonance lines. As a consequence, we can resolve coherent charge-carrier spin-beating, allowing for direct measurements of the magnitude of electronic spin-spin interactions. In addition, the weak hyperfine coupling allows us to resolve substantial spin-orbit coupling effects in EDMR spectra, even at low magnetic field strengths. These results illustrate the dramatic influence of hyperfine fields on the spin physics of organic light-emitting diode (OLED) materials at room temperature, and point to routes to reaching exotic ultra-strong resonant-drive regimes needed for the study of light-matter interactions.
△ Less
Submitted 26 September, 2019;
originally announced September 2019.
-
Deep Model Reference Adaptive Control
Authors:
Girish Joshi,
Girish Chowdhary
Abstract:
We present a new neuroadaptive architecture: Deep Neural Network based Model Reference Adaptive Control (DMRAC). Our architecture utilizes the power of deep neural network representations for modeling significant nonlinearities while marrying it with the boundedness guarantees that characterize MRAC based controllers. We demonstrate through simulations and analysis that DMRAC can subsume previousl…
▽ More
We present a new neuroadaptive architecture: Deep Neural Network based Model Reference Adaptive Control (DMRAC). Our architecture utilizes the power of deep neural network representations for modeling significant nonlinearities while marrying it with the boundedness guarantees that characterize MRAC based controllers. We demonstrate through simulations and analysis that DMRAC can subsume previously studied learning based MRAC methods, such as concurrent learning and GP-MRAC. This makes DMRAC a highly powerful architecture for high-performance control of nonlinear systems with long-term learning properties.
△ Less
Submitted 18 September, 2019;
originally announced September 2019.
-
Symmetry-enforced band crossings in trigonal materials: Accordion states and Weyl nodal lines
Authors:
Y. -H. Chan,
Berkay Kilic,
Moritz M. Hirschmann,
Ching-Kai Chiu,
Leslie M. Schoop,
Darshan G. Joshi,
Andreas P. Schnyder
Abstract:
Nonsymmoprhic symmetries, such as screw rotations or glide reflections, can enforce band crossings within high-symmetry lines or planes of the Brillouin zone. When these band degeneracies are close to the Fermi energy, they can give rise to a number of unusual phenomena: e.g., anomalous magnetoelectric responses, transverse Hall currents, and exotic surface states. In this paper, we present a comp…
▽ More
Nonsymmoprhic symmetries, such as screw rotations or glide reflections, can enforce band crossings within high-symmetry lines or planes of the Brillouin zone. When these band degeneracies are close to the Fermi energy, they can give rise to a number of unusual phenomena: e.g., anomalous magnetoelectric responses, transverse Hall currents, and exotic surface states. In this paper, we present a comprehensive classification of such nonsymmorphic band crossings in trigonal materials with strong spin-orbit coupling. We find that in trigonal systems there are two different types of nonsymmorphic band degeneracies: (i) Weyl points protected by screw rotations with an accordion-like dispersion, and (ii) Weyl nodal lines protected by glide reflections. We report a number of existing materials, where these band crossings are realized near the Fermi energy. This includes Cu2SrSnS4 and elemental tellurium (Te), which exhibit accordion Weyl points; and the tellurium-silicon clathrate Te16Si38, which shows Weyl nodal lines. The ab-initio band structures and surface states of these materials are studied in detail, and implications for experiments are briefly discussed.
△ Less
Submitted 2 August, 2019;
originally announced August 2019.
-
Study of Band structure, Transport and magnetic properties of BiFeO3-TbMnO3 composite
Authors:
Prince Kr. Gupta,
Surajit Ghosh,
Arkadeb Pal,
Somnath Roy,
Amish G Joshi,
A. K. Ghosh,
Sandip Chatterjee
Abstract:
Magnetoelectric multiferroic composite of two types of multiferroic (Type I and II) consisting BiFeO3 and TbMnO3 is studied for enhanced magnetic and transport properties. A narrower band gap is estimated from the UV-visible absorption spectrum from that of BiFeO3 and TbMnO3. With known value of band gap, the band structure was estimated from the valence band x-ray photoemission spectra (XPS) and…
▽ More
Magnetoelectric multiferroic composite of two types of multiferroic (Type I and II) consisting BiFeO3 and TbMnO3 is studied for enhanced magnetic and transport properties. A narrower band gap is estimated from the UV-visible absorption spectrum from that of BiFeO3 and TbMnO3. With known value of band gap, the band structure was estimated from the valence band x-ray photoemission spectra (XPS) and ultra violet photoemission spectra (UPS). The valence and conduction band was found at 1.0 eV and 0.45 eV above and below the Fermi level respectively. Thus the insulating behavior of the system is understood from the reconstruction of the energy bands at the interface which happens due to lattice mismatch of the two materials. The large coercivity and the increase on the magnetization value are understood to be due to superexchange interaction between different Mn ions (Mn2+, Mn3+ and Mn4+). From the composition study of EDXA and core level x-ray photoemission spectra oxygen vacancy was found which in turn creates the mixed valence state of Mn to maintain the charge neutrality.
△ Less
Submitted 22 June, 2019;
originally announced June 2019.
-
MATCHA: Speeding Up Decentralized SGD via Matching Decomposition Sampling
Authors:
Jianyu Wang,
Anit Kumar Sahu,
Zhouyi Yang,
Gauri Joshi,
Soummya Kar
Abstract:
This paper studies the problem of error-runtime trade-off, typically encountered in decentralized training based on stochastic gradient descent (SGD) using a given network. While a denser (sparser) network topology results in faster (slower) error convergence in terms of iterations, it incurs more (less) communication time/delay per iteration. In this paper, we propose MATCHA, an algorithm that ca…
▽ More
This paper studies the problem of error-runtime trade-off, typically encountered in decentralized training based on stochastic gradient descent (SGD) using a given network. While a denser (sparser) network topology results in faster (slower) error convergence in terms of iterations, it incurs more (less) communication time/delay per iteration. In this paper, we propose MATCHA, an algorithm that can achieve a win-win in this error-runtime trade-off for any arbitrary network topology. The main idea of MATCHA is to parallelize inter-node communication by decomposing the topology into matchings. To preserve fast error convergence speed, it identifies and communicates more frequently over critical links, and saves communication time by using other links less frequently. Experiments on a suite of datasets and deep neural networks validate the theoretical analyses and demonstrate that MATCHA takes up to $5\times$ less time than vanilla decentralized SGD to reach the same training loss.
△ Less
Submitted 18 November, 2019; v1 submitted 22 May, 2019;
originally announced May 2019.
-
Identification of point defects on Co-Ni co-doping in SnO$_{2}$ nanocrystals and their effect on the structural and optical properties
Authors:
S. Roy,
Brijmohan Prajapati,
A. Singh,
Amish G. Joshi,
S. Chatterjee,
Anup K. Ghosh
Abstract:
Sn$_{0.97-y}$Co$_{0.03}$Ni$_{y}$O$_{2}$ (0 $\leq y \leq$ 0.04) nanocrystals, with average crystallite size in the range of 7.3 nm ($y$=0.00) to 5.6 nm ($y$=0.04), have been synthesized using pH-controlled chemical co-precipitation technique. The non-stoichiometric Sn related defects and the O related stoichiometric Frenkel defects arising in the nanocrystals because of co-doping have been identifi…
▽ More
Sn$_{0.97-y}$Co$_{0.03}$Ni$_{y}$O$_{2}$ (0 $\leq y \leq$ 0.04) nanocrystals, with average crystallite size in the range of 7.3 nm ($y$=0.00) to 5.6 nm ($y$=0.04), have been synthesized using pH-controlled chemical co-precipitation technique. The non-stoichiometric Sn related defects and the O related stoichiometric Frenkel defects arising in the nanocrystals because of co-doping have been identified and their effect on the structural and optical properties of the nanocrystals have been extensively studied. It has been observed, using XPS that on increasing the Ni co-doping concentration ($y$), the non-stoichiometric Sn defect Sn$_{\text{Sn}}^{"}$ increases in compensation of existing defect Sn$_{i}^{....}$ for $y$ = 0.00 nanocrystals. High resolution transmission electron microscopy (HR-TEM) also confirms the existence of Sn$_{\text{Sn}}^{"}$. Regarding the Frenkel defect, XPS results indicate that the concentration of $V_{\text{O}}$ and O$_{i}$, manifested in the form of dangling bond related surface defect states,increases with increase in $y$. Temperature dependent magnetisation measurement of the nanocrystals confirm the charge state of $V_{\text{O}}$. The point defects have been found to affect the structural properties in a way that distortion in octahedral geometry of complete Sn-O octahderon effectively reduces whereas distortion in the trigonal planar coordination geometry of O increases. The investigation of Urbach edge indicates an enhancement in the disorder in the nanocrystals on co-doping. The optical band gap of the nanocrystals has been found to be red shifted upto $y$=0.02 and then a gradual blue shift has been observed. A direct effect of the O related defect has been observed on the blue luminescence of the nanocrystals such that the spectral contribution of blue luminescence in the total emission intensity increases by 72% for $y$=0.04 as compared to $y$=0.00.
△ Less
Submitted 3 June, 2019; v1 submitted 6 May, 2019;
originally announced May 2019.
-
Probing the Griffiths like phase, unconventional dual glassy states, giant exchange bias effects and its correlation with its electronic structure in Pr2-xSrxCoMnO6
Authors:
Arkadeb Pal,
Prajyoti Singh,
Vinod K Gangwar,
Amish G Joshi,
G. D. Dwivedi,
Prince K Gupta,
Md. Alam,
Khyati Anand,
Anup K Ghosh,
Sandip Chatterjee
Abstract:
Electronic structure, electrical transport, dc and ac magnetization properties of the hole substituted (Sr2+) partially B-site disordered double perovskite Pr2-xSrxCoMnO6 system have been investigated. Electronic structure was probed by employing X-ray photoemission spectroscopy (XPS) measurements. The study suggested the presence of mixed valence states of the B-site ions (Co2+/Co3+ and Mn3+/Mn4+…
▽ More
Electronic structure, electrical transport, dc and ac magnetization properties of the hole substituted (Sr2+) partially B-site disordered double perovskite Pr2-xSrxCoMnO6 system have been investigated. Electronic structure was probed by employing X-ray photoemission spectroscopy (XPS) measurements. The study suggested the presence of mixed valence states of the B-site ions (Co2+/Co3+ and Mn3+/Mn4+) with significant enhancement of the average oxidation states due to hole doping. The mere absence of electronic states near the Fermi level in the valence band (VB) spectra for both of the pure (x=0.0) and Sr doped (x=0.5) systems indicated the insulating nature of the samples. Sr substitution is observed to increase the spectral weight near the Fermi level suggesting for an enhanced conductivity of the hole doped system. The temperature variation of electrical resistivity measurements revealed the insulating nature for both the systems, thus supporting the VB spectra results. The dc magnetization data divulged a Griffiths like phase above the long range ordering temperature. A typical re-entrant spin glass like phase driven by the inherent anti-site disorder (ASD) has been maidenly recognized by ac susceptibility study for both the pure and doped systems. Most interestingly, the emergence of a new cluster glass like phase (immediately below the magnetic ordering temperature and above the spin-glass transition temperature) solely driven by the Sr substitution has been unravelled by ac magnetization dynamics study. The isothermal magnetization measurements further probed the exhibition of the giant exchange bias effect emanated from the existence of multiple magnetic phases.
△ Less
Submitted 20 April, 2019; v1 submitted 18 April, 2019;
originally announced April 2019.
-
Stellar Variability with Photometric and Spectroscopic Analysis of five Am Field Stars
Authors:
Gireesh C. Joshi
Abstract:
The spectroscopic and photometric analysis of sample Am stars are carried out to determine the stellar characteristics of each studied star. The CCD photometric analysis of HD 98851 and HD 207561 show clear evidence of pulsation variability of 1.55 hr and 5.8 min respectively. Similarly, a clear evidence of the photometric variability is also found for an Am star HD 73045 which is likely to be pul…
▽ More
The spectroscopic and photometric analysis of sample Am stars are carried out to determine the stellar characteristics of each studied star. The CCD photometric analysis of HD 98851 and HD 207561 show clear evidence of pulsation variability of 1.55 hr and 5.8 min respectively. Similarly, a clear evidence of the photometric variability is also found for an Am star HD 73045 which is likely to be pulsating in nature with a period of about 36-min. We are also found dissimilar behaviour of elemental abundances of various ions for HD 113878 and HD 118660. The basic stellar parameters (mass, luminosity, radius, life time, distance, proper-motion etc.) are determined for each sample stars. The tidal radius and boundaries of habitable zone of each star are also computed to search the extra-terrestrial life. Asteroseismic mass scale test shows greater stellar masses compare to the solar mass.
△ Less
Submitted 9 April, 2019; v1 submitted 6 April, 2019;
originally announced April 2019.
-
MLSys: The New Frontier of Machine Learning Systems
Authors:
Alexander Ratner,
Dan Alistarh,
Gustavo Alonso,
David G. Andersen,
Peter Bailis,
Sarah Bird,
Nicholas Carlini,
Bryan Catanzaro,
Jennifer Chayes,
Eric Chung,
Bill Dally,
Jeff Dean,
Inderjit S. Dhillon,
Alexandros Dimakis,
Pradeep Dubey,
Charles Elkan,
Grigori Fursin,
Gregory R. Ganger,
Lise Getoor,
Phillip B. Gibbons,
Garth A. Gibson,
Joseph E. Gonzalez,
Justin Gottschlich,
Song Han,
Kim Hazelwood
, et al. (44 additional authors not shown)
Abstract:
Machine learning (ML) techniques are enjoying rapidly increasing adoption. However, designing and implementing the systems that support ML models in real-world deployments remains a significant obstacle, in large part due to the radically different development and deployment profile of modern ML methods, and the range of practical concerns that come with broader adoption. We propose to foster a ne…
▽ More
Machine learning (ML) techniques are enjoying rapidly increasing adoption. However, designing and implementing the systems that support ML models in real-world deployments remains a significant obstacle, in large part due to the radically different development and deployment profile of modern ML methods, and the range of practical concerns that come with broader adoption. We propose to foster a new systems machine learning research community at the intersection of the traditional systems and ML communities, focused on topics such as hardware systems for ML, software systems for ML, and ML optimized for metrics beyond predictive accuracy. To do this, we describe a new conference, MLSys, that explicitly targets research at the intersection of systems and machine learning with a program committee split evenly between experts in systems and ML, and an explicit focus on topics at the intersection of the two.
△ Less
Submitted 1 December, 2019; v1 submitted 29 March, 2019;
originally announced April 2019.
-
Hybrid Direct-Indirect Adaptive Control of Nonlinear System with Unmatched Uncertainty
Authors:
Girish Joshi,
Girish Chowdhary
Abstract:
In this paper, we present a hybrid direct-indirect model reference adaptive controller (MRAC), to address a class of problems with matched and unmatched uncertainties. In the proposed architecture, the unmatched uncertainty is estimated online through a companion observer model. Upon convergence of the observer, the unmatched uncertainty estimate is remodeled into a state dependent linear form to…
▽ More
In this paper, we present a hybrid direct-indirect model reference adaptive controller (MRAC), to address a class of problems with matched and unmatched uncertainties. In the proposed architecture, the unmatched uncertainty is estimated online through a companion observer model. Upon convergence of the observer, the unmatched uncertainty estimate is remodeled into a state dependent linear form to augment the nominal system dynamics. Meanwhile, a direct adaptive controller designed for a switching system cancels the effect of matched uncertainty in the system and achieves reference model tracking. We demonstrate that the proposed hybrid controller can handle a broad class of nonlinear systems with both matched and unmatched uncertainties
△ Less
Submitted 21 February, 2019;
originally announced February 2019.
-
Quenching low-mass satellite galaxies: evidence for a threshold ICM density
Authors:
Ian D Roberts,
Laura C Parker,
Toby Brown,
Gandhali D Joshi,
Julie Hlavacek-Larrondo,
James Wadsley
Abstract:
We compile a sample of SDSS galaxy clusters with high-quality Chandra X-ray data to directly study the influence of the dense intra-cluster medium (ICM) on the quenching of satellite galaxies. We study the quenched fractions of satellite galaxies as a function of ICM density for low- ($10^9 \lesssim M_\star \lesssim 10^{10}\,\mathrm{M_\odot}$), intermediate- (…
▽ More
We compile a sample of SDSS galaxy clusters with high-quality Chandra X-ray data to directly study the influence of the dense intra-cluster medium (ICM) on the quenching of satellite galaxies. We study the quenched fractions of satellite galaxies as a function of ICM density for low- ($10^9 \lesssim M_\star \lesssim 10^{10}\,\mathrm{M_\odot}$), intermediate- ($10^{10} \lesssim M_\star \lesssim 10^{10.5}\,\mathrm{M_\odot}$), and high-mass ($M_\star \gtrsim 10^{10.5}\,\mathrm{M_\odot}$) satellite galaxies with $>\!3000$ satellite galaxies across 24 low-redshift ($z < 0.1$) clusters. For low-mass galaxies we find evidence for a broken powerlaw trend between satellite quenched fraction and local ICM density. The quenched fraction increases modestly at ICM densities below a threshold before increasing sharply beyond this threshold toward the cluster center. We show that this increase in quenched fraction at high ICM density is well matched by a simple, analytic model of ram pressure stripping. These results are consistent with a picture where low-mass cluster galaxies experience an initial, slow-quenching mode driven by steady gas depletion, followed by rapid quenching associated with ram pressure of cold-gas stripping near (one quarter of the virial radius, on average) the cluster center.
△ Less
Submitted 7 February, 2019;
originally announced February 2019.
-
IC 361:- Near infrared and UBVRI photometric Analysis
Authors:
Gireesh C. Joshi
Abstract:
We present here the detailed optical and infra-red photometric analysis of the open star cluster IC 361. On studying the radial density profile, radial extent of the cluster is found to be 8.0 +/- 0.5 arcmin. The basic physical parameters of the cluster such as E(B-V) = 0.56 +/- 0.10 mag, E(V-K) = 1.72+/-0.12 mag, log(Age)=9.10+/-0.05, and (m-M)0 = 12.54 +/-.05 mag are obtained using the color-col…
▽ More
We present here the detailed optical and infra-red photometric analysis of the open star cluster IC 361. On studying the radial density profile, radial extent of the cluster is found to be 8.0 +/- 0.5 arcmin. The basic physical parameters of the cluster such as E(B-V) = 0.56 +/- 0.10 mag, E(V-K) = 1.72+/-0.12 mag, log(Age)=9.10+/-0.05, and (m-M)0 = 12.54 +/-.05 mag are obtained using the color-color and color-magnitude diagrams. IC 361 is found to be located at a distance of 3.22 +/- 0.07 kpc. Using the archival proper motion catalogues, we estimate mean proper motions of IC 361 as 4.97+/-0.17 mas yr-1 and -5.80+/-0.18 mas yr-1 in the direction of RA and DEC, respectively. We derive the luminosity and mass functions for the cluster main sequence stars. The mass function slope is found to be -1.06+/-0.09 which is too low compare than Salpeter value.
△ Less
Submitted 6 January, 2019;
originally announced January 2019.
-
Service Rate Region of Content Access from Erasure Coded Storage
Authors:
Sarah Anderson,
Ann Johnston,
Gauri Joshi,
Gretchen Matthews,
Carolyn Mayer,
Emina Soljanin
Abstract:
We consider storage systems in which $K$ files are stored over $N$ nodes. A node may be systematic for a particular file in the sense that access to it gives access to the file. Alternatively, a node may be coded, meaning that it gives access to a particular file only when combined with other nodes (which may be coded or systematic). Requests for file $f_k$ arrive at rate $λ_k$, and we are interes…
▽ More
We consider storage systems in which $K$ files are stored over $N$ nodes. A node may be systematic for a particular file in the sense that access to it gives access to the file. Alternatively, a node may be coded, meaning that it gives access to a particular file only when combined with other nodes (which may be coded or systematic). Requests for file $f_k$ arrive at rate $λ_k$, and we are interested in the rate that can be served by a particular system. In this paper, we determine the set of request arrival rates for the a $3$-file coded storage system. We also provide an algorithm to maximize the rate of requests served for file $K$ given $λ_1,\dots, λ_{K-1}$ in a general $K$-file case.
△ Less
Submitted 8 January, 2019;
originally announced January 2019.
-
Probing the multi spin-phonon coupling and local B-site disorder in Pr2CoFeO6 by Raman spectroscopy and correlation with its electronic structure by X-ray photoemission spectroscopy
Authors:
Arkadeb Pal,
Surajit Ghosh,
Amish G. Joshi,
P. K. Gupta,
P. Prakash,
Amitabh Das,
A. K. Ghosh,
Sandip Chatterjee
Abstract:
Electronic structure near Fermi level of Pr2CoFeO6 (at 300 K) was investigated by X-ray photoemission spectroscopy (XPS) technique. All three cations, i.e., Pr, Co and Fe were found to be trivalent in nature. XPS analysis also suggested the system to be insulating in nature. Moreover, Raman spectroscopy study indicated the random distribution of the B-site ions (Co/Fe) triggered by same charge sta…
▽ More
Electronic structure near Fermi level of Pr2CoFeO6 (at 300 K) was investigated by X-ray photoemission spectroscopy (XPS) technique. All three cations, i.e., Pr, Co and Fe were found to be trivalent in nature. XPS analysis also suggested the system to be insulating in nature. Moreover, Raman spectroscopy study indicated the random distribution of the B-site ions (Co/Fe) triggered by same charge states. In temperature-dependent Raman study, the relative heights of the two observed phonon modes exhibited anomalous behaviour near magnetic transition temperature TN~270 K, thus indicating towards interplay between spin and phonon in the system. Furthermore, clear anomalous softening was observed below TN which confirmed the existence of strong spin-phonon coupling occurring for at least two phonon modes of the system. The line width analysis of the phonon modes essentially ruled out the role of magnetostriction effect in the observed phonon anomaly. The investigation of the lattice parameter variation across TN (obtained from the temperature-dependent neutron diffraction measurements) further confirmed the existence of the spin-phonon coupling.
△ Less
Submitted 2 January, 2019;
originally announced January 2019.
-
DSCnet: Replicating Lidar Point Clouds with Deep Sensor Cloning
Authors:
Paden Tomasello,
Sammy Sidhu,
Anting Shen,
Matthew W. Moskewicz,
Nobie Redmon,
Gayatri Joshi,
Romi Phadte,
Paras Jain,
Forrest Iandola
Abstract:
Convolutional neural networks (CNNs) have become increasingly popular for solving a variety of computer vision tasks, ranging from image classification to image segmentation. Recently, autonomous vehicles have created a demand for depth information, which is often obtained using hardware sensors such as Light detection and ranging (LIDAR). Although it can provide precise distance measurements, mos…
▽ More
Convolutional neural networks (CNNs) have become increasingly popular for solving a variety of computer vision tasks, ranging from image classification to image segmentation. Recently, autonomous vehicles have created a demand for depth information, which is often obtained using hardware sensors such as Light detection and ranging (LIDAR). Although it can provide precise distance measurements, most LIDARs are still far too expensive to sell in mass-produced consumer vehicles, which has motivated methods to generate depth information from commodity automotive sensors like cameras.
In this paper, we propose an approach called Deep Sensor Cloning (DSC). The idea is to use Convolutional Neural Networks in conjunction with inexpensive sensors to replicate the 3D point-clouds that are created by expensive LIDARs. To accomplish this, we develop a new dataset (DSDepth) and a new family of CNN architectures (DSCnets). While previous tasks such as KITTI depth prediction use an interpolated RGB-D images as ground-truth for training, we instead use DSCnets to directly predict LIDAR point-clouds. When we compare the output of our models to a $75,000 LIDAR, we find that our most accurate DSCnet achieves a relative error of 5.77% using a single camera and 4.69% using stereo cameras.
△ Less
Submitted 26 November, 2018; v1 submitted 16 November, 2018;
originally announced November 2018.
-
The trajectories of galaxies in groups: mass loss and preprocessing
Authors:
Gandhali D. Joshi,
Laura C. Parker,
James Wadsley,
Benjamin W. Keller
Abstract:
We present a study of environmental effects and preprocessing in a large galaxy group using a high-resolution, zoom-in simulation run with the GASOLINE2 hydrodynamics code. We categorize galaxies that were always in distinct haloes as unaccreted, galaxies that were distinct before accretion onto the main group as single, and galaxies that were in external sub-groups before accretion onto the main…
▽ More
We present a study of environmental effects and preprocessing in a large galaxy group using a high-resolution, zoom-in simulation run with the GASOLINE2 hydrodynamics code. We categorize galaxies that were always in distinct haloes as unaccreted, galaxies that were distinct before accretion onto the main group as single, and galaxies that were in external sub-groups before accretion onto the main group as grouped.
The unaccreted galaxy population experiences steady growth in dark matter, gas and stellar mass. Both single- and group-accreted galaxies begin to lose dark matter and gas after first accretion onto any host but continue to grow in stellar mass. Individual trajectories show that galaxies cease mass growth within roughly three virial radii of the main group. Single galaxies continue to form stars until the group virial radius is crossed, when they begin to lose both dark matter and gas. Grouped galaxies peak in mass when joining their external sub-group, indicating that they experience preprocessing. Most accreted galaxies retain their accumulated stellar mass. The total mass loss is dominated by tidal stripping, with evidence for additional gas stripping via ram pressure. Most accreted galaxies are quenched $\sim$(0.5-2.5) Gyr after accretion onto any group.
These differing histories place unaccreted, single and grouped galaxies in distinct regions of the stellar mass-to-halo mass (SMHM) relation. This suggests that preprocessed galaxies are a key source of scatter in the SMHM relation for mixed galaxy populations.
△ Less
Submitted 15 November, 2018;
originally announced November 2018.
-
Adaptive Communication Strategies to Achieve the Best Error-Runtime Trade-off in Local-Update SGD
Authors:
Jianyu Wang,
Gauri Joshi
Abstract:
Large-scale machine learning training, in particular distributed stochastic gradient descent, needs to be robust to inherent system variability such as node straggling and random communication delays. This work considers a distributed training framework where each worker node is allowed to perform local model updates and the resulting models are averaged periodically. We analyze the true speed of…
▽ More
Large-scale machine learning training, in particular distributed stochastic gradient descent, needs to be robust to inherent system variability such as node straggling and random communication delays. This work considers a distributed training framework where each worker node is allowed to perform local model updates and the resulting models are averaged periodically. We analyze the true speed of error convergence with respect to wall-clock time (instead of the number of iterations), and analyze how it is affected by the frequency of averaging. The main contribution is the design of AdaComm, an adaptive communication strategy that starts with infrequent averaging to save communication delay and improve convergence speed, and then increases the communication frequency in order to achieve a low error floor. Rigorous experiments on training deep neural networks show that AdaComm can take $3 \times$ less time than fully synchronous SGD, and still reach the same final training loss.
△ Less
Submitted 7 March, 2019; v1 submitted 18 October, 2018;
originally announced October 2018.