-
Attention is All You Need in Speech Separation
Authors:
Cem Subakan,
Mirco Ravanelli,
Samuele Cornell,
Mirko Bronzi,
Jianyuan Zhong
Abstract:
Recurrent Neural Networks (RNNs) have long been the dominant architecture in sequence-to-sequence learning. RNNs, however, are inherently sequential models that do not allow parallelization of their computations. Transformers are emerging as a natural alternative to standard RNNs, replacing recurrent computations with a multi-head attention mechanism. In this paper, we propose the SepFormer, a nov…
▽ More
Recurrent Neural Networks (RNNs) have long been the dominant architecture in sequence-to-sequence learning. RNNs, however, are inherently sequential models that do not allow parallelization of their computations. Transformers are emerging as a natural alternative to standard RNNs, replacing recurrent computations with a multi-head attention mechanism. In this paper, we propose the SepFormer, a novel RNN-free Transformer-based neural network for speech separation. The SepFormer learns short and long-term dependencies with a multi-scale approach that employs transformers. The proposed model achieves state-of-the-art (SOTA) performance on the standard WSJ0-2/3mix datasets. It reaches an SI-SNRi of 22.3 dB on WSJ0-2mix and an SI-SNRi of 19.5 dB on WSJ0-3mix. The SepFormer inherits the parallelization advantages of Transformers and achieves a competitive performance even when downsampling the encoded representation by a factor of 8. It is thus significantly faster and it is less memory-demanding than the latest speech separation systems with comparable performance.
△ Less
Submitted 8 March, 2021; v1 submitted 25 October, 2020;
originally announced October 2020.
-
Global invariant manifolds delineating transition and escape dynamics in dissipative systems
Authors:
Jun Zhong,
Shane D. Ross
Abstract:
Invariant manifolds play an important role in organizing global dynamical behaviors. For example, it is found that in multi-well conservative systems where the potential energy wells are connected by index-1 saddles, the motion between potential wells is governed by the invariant manifolds of a periodic orbit around the saddle. In two degree of freedom systems, such invariant manifolds appear as c…
▽ More
Invariant manifolds play an important role in organizing global dynamical behaviors. For example, it is found that in multi-well conservative systems where the potential energy wells are connected by index-1 saddles, the motion between potential wells is governed by the invariant manifolds of a periodic orbit around the saddle. In two degree of freedom systems, such invariant manifolds appear as cylindrical conduits which are referred to as transition tubes. In this study, we apply the concept of invariant manifolds to study the transition between potential wells in not only conservative systems, but more realistic dissipative systems, by solving respective proper boundary-value problems. The example system considered is a two mode model of the snap-through buckling of a shallow arch. We define the transition region, $\mathcal{T}_h$, as a set of initial conditions of a given initial Hamiltonian energy $h$ with which the trajectories can escape from one potential well to another, which in the example system corresponds to snap-through buckling of a structure. The numerical results reveal that in the conservative system the boundary of the transition region, $\partial \mathcal{T}_h$, is a cylinder, while in the dissipative system, $\partial \mathcal{T}_h$ is an ellipsoid. The algorithms developed in the current research from the perspective of invariant manifold provides a robust theoretical-computational framework to study escape and transition dynamics.
△ Less
Submitted 19 October, 2020;
originally announced October 2020.
-
Look It Up: Bilingual Dictionaries Improve Neural Machine Translation
Authors:
Xing Jie Zhong,
David Chiang
Abstract:
Despite advances in neural machine translation (NMT) quality, rare words continue to be problematic. For humans, the solution to the rare-word problem has long been dictionaries, but dictionaries cannot be straightforwardly incorporated into NMT. In this paper, we describe a new method for "attaching" dictionary definitions to rare words so that the network can learn the best way to use them. We d…
▽ More
Despite advances in neural machine translation (NMT) quality, rare words continue to be problematic. For humans, the solution to the rare-word problem has long been dictionaries, but dictionaries cannot be straightforwardly incorporated into NMT. In this paper, we describe a new method for "attaching" dictionary definitions to rare words so that the network can learn the best way to use them. We demonstrate improvements of up to 1.8 BLEU using bilingual dictionaries.
△ Less
Submitted 28 January, 2022; v1 submitted 12 October, 2020;
originally announced October 2020.
-
Rapid and sensitive detection of SARS-CoV-2 with functionalized magnetic nanoparticles
Authors:
Jing Zhong,
Enja Laureen Roesch,
Thilo Viereck,
Meinhard Schilling,
Frank Ludwig
Abstract:
The outbreak of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) threatens global medical systems and economies, and rules our daily living life. Controlling the outbreak of SARS-CoV-2 has become one of the most important and urgent strategies throughout the whole world. As of October, 2020, there have not yet been any medicines or therapies to be effective against SARS-CoV-2. Thus…
▽ More
The outbreak of the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) threatens global medical systems and economies, and rules our daily living life. Controlling the outbreak of SARS-CoV-2 has become one of the most important and urgent strategies throughout the whole world. As of October, 2020, there have not yet been any medicines or therapies to be effective against SARS-CoV-2. Thus, rapid and sensitive diagnostics is the most important measures to control the outbreak of SARS-CoV-2. Homogeneous biosensing based on magnetic nanoparticles (MNPs) is one of the most promising approaches for rapid and highly sensitive detection of biomolecules. This paper proposes an approach for rapid and sensitive detection of SARS-CoV-2 with functionalized MNPs via the measurement of their magnetic response in an ac magnetic field. Experimental results demonstrate that the proposed approach allows the rapid detection of mimic SARS-CoV-2 with a limit of detection of 0.084 nM (5.9 fmole). The proposed approach has great potential for designing a low-cost and point-of-care device for rapid and sensitive diagnostics of SARS-CoV-2.
△ Less
Submitted 8 October, 2020;
originally announced October 2020.
-
Discovery of Four New Clusters in the Cygnus Cloud
Authors:
Song-mei Qin,
Jing Li,
Li Chen,
Jing Zhong
Abstract:
We report the discovery of four new open clusters (named as QC 1, QC 2, QC3 and QC 4) in the direction of Cygnus Cloud and select their members based on five astrometric parameters (l, b, \varpi, μ^*_α, μ_δ) of Gaia DR2. We also derive their astrophysical parameters for each new cluster. Structure parameters are generated by fitting the radial density distribution with King's profile. Using solar…
▽ More
We report the discovery of four new open clusters (named as QC 1, QC 2, QC3 and QC 4) in the direction of Cygnus Cloud and select their members based on five astrometric parameters (l, b, \varpi, μ^*_α, μ_δ) of Gaia DR2. We also derive their astrophysical parameters for each new cluster. Structure parameters are generated by fitting the radial density distribution with King's profile. Using solar metallicity, we performed isochrone-fitting on their purified color-magnitude diagrams (CMDs) to achieve the age of the clusters. The known cluster NGC 7062 at adjacent area is chosen to verify our identification process. The estimated distance, reddening and age of NGC 7062 are in good agreement with the literature.
△ Less
Submitted 17 August, 2020;
originally announced August 2020.
-
Modelling unresolved binaries of open clusters in color-magnitude diagram. I. method and application of NGC3532
Authors:
Lu Li,
Zhengyi Shao,
Zhao-Zhou Li,
Jincheng Yu,
Jing Zhong,
Li Chen
Abstract:
The binary properties of open clusters place crucial constraints on star formation theory and clusters' dynamical evolution. We develop a comprehensive approach that models the color-magnitude diagram (CMD) of the cluster members as the mixture of single stars and photometric unresolved binaries. This method enables us to infer the binary properties, including the binary fraction $f_\mathrm{b}$ an…
▽ More
The binary properties of open clusters place crucial constraints on star formation theory and clusters' dynamical evolution. We develop a comprehensive approach that models the color-magnitude diagram (CMD) of the cluster members as the mixture of single stars and photometric unresolved binaries. This method enables us to infer the binary properties, including the binary fraction $f_\mathrm{b}$ and binary mass-ratio distribution index $γ_q$ when a power-law is assumed, with high accuracy and precision, which were unfeasible in conventional methods. We employ a modified Gaussian process to determine the main sequence ridge line and its scatter from the observed CMD as model input. As a first example, we apply the method to the open cluster NGC3532 with the Gaia DR2 photometry. For the cluster members within a magnitude range corresponding to FGK dwarfs, we obtain $f_\mathrm{b} = 0.267\pm0.019$ and $γ_q = - 0.10\pm0.22$ for binaries with mass ratio $q > 0.2$. The $f_\mathrm{b}$ value is consistent with the previous work on NGC3532 and smaller than that of field stars. The close to zero $γ_q$ indicates that the mass ratios of binaries follow a nearly uniform distribution. For the first time, we unveil that the stars with smaller mass or in the inner region tend to have lower $f_\mathrm{b}$ and more positive value of $γ_q$ due to the lack of low mass-ratio binaries. The clear dependences of binary properties on mass and radius are most likely caused by the internal dynamics.
△ Less
Submitted 11 August, 2020;
originally announced August 2020.
-
Improving the Speed and Quality of GAN by Adversarial Training
Authors:
Jiachen Zhong,
Xuanqing Liu,
Cho-Jui Hsieh
Abstract:
Generative adversarial networks (GAN) have shown remarkable results in image generation tasks. High fidelity class-conditional GAN methods often rely on stabilization techniques by constraining the global Lipschitz continuity. Such regularization leads to less expressive models and slower convergence speed; other techniques, such as the large batch training, require unconventional computing power…
▽ More
Generative adversarial networks (GAN) have shown remarkable results in image generation tasks. High fidelity class-conditional GAN methods often rely on stabilization techniques by constraining the global Lipschitz continuity. Such regularization leads to less expressive models and slower convergence speed; other techniques, such as the large batch training, require unconventional computing power and are not widely accessible. In this paper, we develop an efficient algorithm, namely FastGAN (Free AdverSarial Training), to improve the speed and quality of GAN training based on the adversarial training technique. We benchmark our method on CIFAR10, a subset of ImageNet, and the full ImageNet datasets. We choose strong baselines such as SNGAN and SAGAN; the results demonstrate that our training algorithm can achieve better generation quality (in terms of the Inception score and Frechet Inception distance) with less overall training time. Most notably, our training algorithm brings ImageNet training to the broader public by requiring 2-4 GPUs.
△ Less
Submitted 7 August, 2020;
originally announced August 2020.
-
Heat transport scaling and transition in geostrophic rotating convection with varying aspect ratio
Authors:
Hao-Yuan Lu,
Guang-Yu Ding,
Jun-Qiang Shi,
Ke-Qing Xia,
Jin-Qiang Zhong
Abstract:
We present high-precision experimental and numerical studies of the Nusselt number $Nu$ as functions of the Rayleigh number $Ra$ in geostrophic rotating convection with domain aspect ratio $Γ$ varying from 0.4 to 3.8 and the Ekman number Ek from $2.0{\times}10^{-7}$ to $2.7{\times}10^{-5}$. The heat-transport data $Nu(Ra)$ reveal a gradual transition from buoyancy-dominated to geostrophic convecti…
▽ More
We present high-precision experimental and numerical studies of the Nusselt number $Nu$ as functions of the Rayleigh number $Ra$ in geostrophic rotating convection with domain aspect ratio $Γ$ varying from 0.4 to 3.8 and the Ekman number Ek from $2.0{\times}10^{-7}$ to $2.7{\times}10^{-5}$. The heat-transport data $Nu(Ra)$ reveal a gradual transition from buoyancy-dominated to geostrophic convection at large $Ek$, whereas the transition becomes sharp with decreasing $Ek$. We determine the power-law scaling of $Nu{\sim}Ra^γ$, and show that the boundary flows give rise to pronounced enhancement of $Nu$ in a broad range of the geostrophic regime, leading to reduction of the scaling exponent $γ$ in small $Γ$ cells. The present work provides new insight into the heat-transport scaling in geostrophic convection and may explain the discrepancies observed in previous studies.
△ Less
Submitted 26 July, 2020;
originally announced July 2020.
-
Towards a self-organizing pre-symbolic neural model representing sensorimotor primitives
Authors:
Junpei Zhong,
Angelo Cangelosi,
Stefan Wermter
Abstract:
The acquisition of symbolic and linguistic representations of sensorimotor behavior is a cognitive process performed by an agent when it is executing and/or observing own and others' actions. According to Piaget's theory of cognitive development, these representations develop during the sensorimotor stage and the pre-operational stage. We propose a model that relates the conceptualization of the h…
▽ More
The acquisition of symbolic and linguistic representations of sensorimotor behavior is a cognitive process performed by an agent when it is executing and/or observing own and others' actions. According to Piaget's theory of cognitive development, these representations develop during the sensorimotor stage and the pre-operational stage. We propose a model that relates the conceptualization of the higher-level information from visual stimuli to the development of ventral/dorsal visual streams. This model employs neural network architecture incorporating a predictive sensory module based on an RNNPB (Recurrent Neural Network with Parametric Biases) and a horizontal product model. We exemplify this model through a robot passively observing an object to learn its features and movements. During the learning process of observing sensorimotor primitives, i.e. observing a set of trajectories of arm movements and its oriented object features, the pre-symbolic representation is self-organized in the parametric units. These representational units act as bifurcation parameters, guiding the robot to recognize and predict various learned sensorimotor primitives. The pre-symbolic representation also accounts for the learning of sensorimotor primitives in a latent learning context.
△ Less
Submitted 12 July, 2020; v1 submitted 19 June, 2020;
originally announced June 2020.
-
Exploring open cluster properties with Gaia and LAMOST
Authors:
Jing Zhong,
Li Chen,
Di Wu,
Lu Li,
Leya Bai,
Jinliang Hou
Abstract:
In Gaia DR2, the unprecedented high-precision level reached in sub-mas for astrometry and mmag for photometry. Using cluster members identified with these astrometry and photometry in Gaia DR2, we can obtain a reliable determination of cluster properties. However, because of the shortcoming of Gaia spectroscopic observation in dealing with densely crowded cluster region, the number of radial veloc…
▽ More
In Gaia DR2, the unprecedented high-precision level reached in sub-mas for astrometry and mmag for photometry. Using cluster members identified with these astrometry and photometry in Gaia DR2, we can obtain a reliable determination of cluster properties. However, because of the shortcoming of Gaia spectroscopic observation in dealing with densely crowded cluster region, the number of radial velocity and metallicity for cluster member stars from Gaia DR2 is still lacking. In this study, we aim to improve the cluster properties by combining the LAMOST spectra. In particular, we provide the list of cluster members with spectroscopic parameters as an add-value catalog in LAMOST DR5, which can be used to perform detailed study for a better understanding on the stellar properties, by using their spectra and fundamental properties from the host cluster. We cross-matched the spectroscopic catalog in LAMOST DR5 with the identified cluster members in Cantat-Gaudin et al.2018 and then used members with spectroscopic parameters to derive statistical properties of open clusters. We obtained a list of 8811 members with spectroscopic parameters and a catalog of 295 cluster properties. In addition, we study the radial and vertical metallicity gradient and age-metallicity relation with the compiled open clusters as tracers, finding slopes of -0.053$\pm$0.004 dex kpc$^{-1}$, -0.252$\pm$0.039 dex kpc$^{-1}$ and 0.022$\pm$0.008 dex Gyr$^{-1}$, respectively. Both slopes of metallicity distribution relation for young clusters (0.1 Gyr < Age < 2 Gyr) and the age-metallicity relation for clusters within 6 Gyr are consistent with literature results. In order to fully study the chemical evolution history in the disk, more spectroscopic observations for old and distant open clusters are needed for further investigation.
△ Less
Submitted 15 June, 2020; v1 submitted 11 June, 2020;
originally announced June 2020.
-
Investigating Robustness of Adversarial Samples Detection for Automatic Speaker Verification
Authors:
Xu Li,
Na Li,
Jinghua Zhong,
Xixin Wu,
Xunying Liu,
Dan Su,
Dong Yu,
Helen Meng
Abstract:
Recently adversarial attacks on automatic speaker verification (ASV) systems attracted widespread attention as they pose severe threats to ASV systems. However, methods to defend against such attacks are limited. Existing approaches mainly focus on retraining ASV systems with adversarial data augmentation. Also, countermeasure robustness against different attack settings are insufficiently investi…
▽ More
Recently adversarial attacks on automatic speaker verification (ASV) systems attracted widespread attention as they pose severe threats to ASV systems. However, methods to defend against such attacks are limited. Existing approaches mainly focus on retraining ASV systems with adversarial data augmentation. Also, countermeasure robustness against different attack settings are insufficiently investigated. Orthogonal to prior approaches, this work proposes to defend ASV systems against adversarial attacks with a separate detection network, rather than augmenting adversarial data into ASV training. A VGG-like binary classification detector is introduced and demonstrated to be effective on detecting adversarial samples. To investigate detector robustness in a realistic defense scenario where unseen attack settings may exist, we analyze various kinds of unseen attack settings' impact and observe that the detector is robust (6.27\% EER_{det} degradation in the worst case) against unseen substitute ASV systems, but it has weak robustness (50.37\% EER_{det} degradation in the worst case) against unseen perturbation methods. The weak robustness against unseen perturbation methods shows a direction for developing stronger countermeasures.
△ Less
Submitted 7 August, 2020; v1 submitted 11 June, 2020;
originally announced June 2020.
-
LAMOST Medium-Resolution Spectroscopic Survey (LAMOST-MRS): Scientific goals and survey plan
Authors:
Chao Liu,
Jianning Fu,
Jianrong Shi,
Hong Wu,
Zhanwen Han,
Li Chen,
Subo Dong,
Yongheng Zhao,
Jian-Jun Chen,
Haotong Zhang,
Zhong-Rui Bai,
Xuefei Chen,
Wenyuan Cui,
Bing Du,
Chih-Hao Hsia,
Deng-Kai Jiang,
Jinliang Hou,
Wen Hou,
Haining Li,
Jiao Li,
Lifang Li,
Jiaming Liu,
Jifeng Liu,
A-Li Luo,
Juan-Juan Ren
, et al. (16 additional authors not shown)
Abstract:
Since September 2018, LAMOST starts a new 5-year medium-resolution spectroscopic survey (MRS) using bright/gray nights. We present the scientific goals of LAMOST-MRS and propose a near optimistic strategy of the survey. A complete footprint is also provided. Not only the regular medium-resolution survey, but also a time-domain spectroscopic survey is being conducted since 2018 and will be end in 2…
▽ More
Since September 2018, LAMOST starts a new 5-year medium-resolution spectroscopic survey (MRS) using bright/gray nights. We present the scientific goals of LAMOST-MRS and propose a near optimistic strategy of the survey. A complete footprint is also provided. Not only the regular medium-resolution survey, but also a time-domain spectroscopic survey is being conducted since 2018 and will be end in 2023. According to the detailed survey plan, we expect that LAMOST-MRS can observe about 2 million stellar spectra with ~7500 and limiting magnitude of around G=15 mag. Moreover, it will also provide about 200 thousand stars with averagely 60-epoch observations and limiting magnitude of G~14 mag. These high quality spectra will give around 20 elemental abundances, rotational velocities, emission line profiles as well as precise radial velocity with uncertainty less than 1 km/s. With these data, we expect that LAMOST can effectively leverage sciences on stellar physics, e.g. exotic binary stars, detailed observation of many types of variable stars etc., planet host stars, emission nebulae, open clusters, young pre-main-sequence stars etc.
△ Less
Submitted 14 May, 2020;
originally announced May 2020.
-
A Ubiquitous Thermal Conductivity Formula for Liquids, Polymer Glass, and Amorphous Solids
Authors:
Qing Xi,
Jinxin Zhong,
Jixiong He,
Xiangfan Xu,
Tsuneyoshi Nakayama,
Yuanyuan Wang,
Jun Liu,
Jun Zhou,
Baowen Li
Abstract:
The microscopic mechanism of thermal transport in liquids and amorphous solids has been an outstanding problem for a long time. There have been several different approaches to explain the thermal conductivities for these systems, for example, the Bridgman's formula for simple liquids, the concept of the minimum thermal conductivity for amorphous solids, and the thermal resistance network model for…
▽ More
The microscopic mechanism of thermal transport in liquids and amorphous solids has been an outstanding problem for a long time. There have been several different approaches to explain the thermal conductivities for these systems, for example, the Bridgman's formula for simple liquids, the concept of the minimum thermal conductivity for amorphous solids, and the thermal resistance network model for amorphous polymers. Here, we present a ubiquitous formula to explain the thermal conductivities of liquids and amorphous solids in a unified way. The calculated thermal conductivities using this formula without fitting parameters are in excellent agreement with the experimental data for these systems. Our formula is not only providing detailed implications on microscopic mechanisms of heat transfer in these systems, but also solves the discrepancies between existing formulae and experimental data.
△ Less
Submitted 26 September, 2020; v1 submitted 1 May, 2020;
originally announced May 2020.
-
Localization, phases and transitions in the three-dimensional extended Lieb lattices
Authors:
Jie Liu,
Xiaoyu Mao,
Jianxin Zhong,
Rudolf A. Römer
Abstract:
We study the localization properties and the Anderson transition in the 3D Lieb lattice $\mathcal{L}_3(1)$ and its extensions $\mathcal{L}_3(n)$ in the presence of disorder. We compute the positions of the flat bands, the disorder-broadened density of states and the energy-disorder phase diagrams for up to 4 different such Lieb lattices. Via finite-size scaling, we obtain the critical properties s…
▽ More
We study the localization properties and the Anderson transition in the 3D Lieb lattice $\mathcal{L}_3(1)$ and its extensions $\mathcal{L}_3(n)$ in the presence of disorder. We compute the positions of the flat bands, the disorder-broadened density of states and the energy-disorder phase diagrams for up to 4 different such Lieb lattices. Via finite-size scaling, we obtain the critical properties such as critical disorders and energies as well as the universal localization lengths exponent $ν$. We find that the critical disorder $W_c$ decreases from $\sim 16.5$ for the cubic lattice, to $\sim 8.6$ for $\mathcal{L}_3(1)$, $\sim 5.9$ for $\mathcal{L}_3(2)$ and $\sim 4.8$ for $\mathcal{L}_3(3)$. Nevertheless, the value of the critical exponent $ν$ for all Lieb lattices studied here and across disorder and energy transitions agrees within error bars with the generally accepted universal value $ν=1.590 (1.579,1.602)$.
△ Less
Submitted 16 April, 2020;
originally announced April 2020.
-
Bayesian x-vector: Bayesian Neural Network based x-vector System for Speaker Verification
Authors:
Xu Li,
Jinghua Zhong,
Jianwei Yu,
Shoukang Hu,
Xixin Wu,
Xunying Liu,
Helen Meng
Abstract:
Speaker verification systems usually suffer from the mismatch problem between training and evaluation data, such as speaker population mismatch, the channel and environment variations. In order to address this issue, it requires the system to have good generalization ability on unseen data. In this work, we incorporate Bayesian neural networks (BNNs) into the deep neural network (DNN) x-vector spe…
▽ More
Speaker verification systems usually suffer from the mismatch problem between training and evaluation data, such as speaker population mismatch, the channel and environment variations. In order to address this issue, it requires the system to have good generalization ability on unseen data. In this work, we incorporate Bayesian neural networks (BNNs) into the deep neural network (DNN) x-vector speaker verification system to improve the system's generalization ability. With the weight uncertainty modeling provided by BNNs, we expect the system could generalize better on the evaluation data and make verification decisions more accurately. Our experiment results indicate that the DNN x-vector system could benefit from BNNs especially when the mismatch problem is severe for evaluations using out-of-domain data. Specifically, results show that the system could benefit from BNNs by a relative EER decrease of 2.66% and 2.32% respectively for short- and long-utterance in-domain evaluations. Additionally, the fusion of DNN x-vector and Bayesian x-vector systems could achieve further improvement. Moreover, experiments conducted by out-of-domain evaluations, e.g. models trained on Voxceleb1 while evaluated on NIST SRE10 core test, suggest that BNNs could bring a larger relative EER decrease of around 4.69%.
△ Less
Submitted 8 April, 2020;
originally announced April 2020.
-
Compatible Learning for Deep Photonic Neural Network
Authors:
Yong-Liang Xiao,
Rongguang Liang,
Jianxin Zhong,
Xianyu Su,
Zhisheng You
Abstract:
Realization of deep learning with coherent optical field has attracted remarkably attentions presently, which benefits on the fact that optical matrix manipulation can be executed at speed of light with inherent parallel computation as well as low latency. Photonic neural network has a significant potential for prediction-oriented tasks. Yet, real-value Backpropagation behaves somewhat intractably…
▽ More
Realization of deep learning with coherent optical field has attracted remarkably attentions presently, which benefits on the fact that optical matrix manipulation can be executed at speed of light with inherent parallel computation as well as low latency. Photonic neural network has a significant potential for prediction-oriented tasks. Yet, real-value Backpropagation behaves somewhat intractably for coherent photonic intelligent training. We develop a compatible learning protocol in complex space, of which nonlinear activation could be selected efficiently depending on the unveiled compatible condition. Compatibility indicates that matrix representation in complex space covers its real counterpart, which could enable a single channel mingled training in real and complex space as a unified model. The phase logical XOR gate with Mach-Zehnder interferometers and diffractive neural network with optical modulation mechanism, implementing intelligent weight learned from compatible learning, are presented to prove the availability. Compatible learning opens an envisaged window for deep photonic neural network.
△ Less
Submitted 14 March, 2020;
originally announced March 2020.
-
Quantum Hall phase emerging in an array of atoms interacting with photons
Authors:
Alexander V. Poshakinskiy,
Janet Zhong,
Yongguan Ke,
Nikita A. Olekhno,
Chaohong Lee,
Yuri S. Kivshar,
Alexander N. Poddubny
Abstract:
Topological quantum phases underpin many concepts of modern physics. While the existence of disorder-immune topological edge states of electrons usually requires magnetic fields, direct effects of magnetic field on light are very weak. As a result, demonstrations of topological states of photons employ synthetic fields engineered in special complex structures or external time-dependent modulations…
▽ More
Topological quantum phases underpin many concepts of modern physics. While the existence of disorder-immune topological edge states of electrons usually requires magnetic fields, direct effects of magnetic field on light are very weak. As a result, demonstrations of topological states of photons employ synthetic fields engineered in special complex structures or external time-dependent modulations. Here, we reveal that the quantum Hall phase with topological edge states, spectral Landau levels and Hofstadter butterfly can emerge in a simple quantum system, where topological order arises solely from interactions without any fine-tuning. Such systems, arrays of two-level atoms (qubits) coupled to light being described by the classical Dicke model, have recently been realized in experiments with cold atoms and superconducting qubits. We believe that our finding will open new horizons in several disciplines including quantum physics, many-body physics, and nonlinear topological photonics, and it will set an important reference point for experiments on qubit arrays and quantum simulators.
△ Less
Submitted 18 March, 2020;
originally announced March 2020.
-
AVR: Attention based Salient Visual Relationship Detection
Authors:
Jianming Lv,
Qinzhe Xiao,
Jiajie Zhong
Abstract:
Visual relationship detection aims to locate objects in images and recognize the relationships between objects. Traditional methods treat all observed relationships in an image equally, which causes a relatively poor performance in the detection tasks on complex images with abundant visual objects and various relationships. To address this problem, we propose an attention based model, namely AVR,…
▽ More
Visual relationship detection aims to locate objects in images and recognize the relationships between objects. Traditional methods treat all observed relationships in an image equally, which causes a relatively poor performance in the detection tasks on complex images with abundant visual objects and various relationships. To address this problem, we propose an attention based model, namely AVR, to achieve salient visual relationships based on both local and global context of the relationships. Specifically, AVR recognizes relationships and measures the attention on the relationships in the local context of an input image by fusing the visual features, semantic and spatial information of the relationships. AVR then applies the attention to assign important relationships with larger salient weights for effective information filtering. Furthermore, AVR is integrated with the priori knowledge in the global context of image datasets to improve the precision of relationship prediction, where the context is modeled as a heterogeneous graph to measure the priori probability of relationships based on the random walk algorithm. Comprehensive experiments are conducted to demonstrate the effectiveness of AVR in several real-world image datasets, and the results show that AVR outperforms state-of-the-art visual relationship detection methods significantly by up to $87.5\%$ in terms of recall.
△ Less
Submitted 16 March, 2020;
originally announced March 2020.
-
Radiative topological biphoton states in modulated qubit arrays
Authors:
Yongguan Ke,
Janet Zhong,
Alexander V. Poshakinskiy,
Yuri S. Kivshar,
Alexander N. Poddubny,
Chaohong Lee
Abstract:
We study topological properties of bound pairs of photons in spatially-modulated qubit arrays (arrays of two-level atoms) coupled to a waveguide. While bound pairs behave like Bloch waves, they are topologically nontrivial in the parameter space formed by the center-of-mass momentum and the modulation phase, where the latter plays the role of a synthetic dimension. In a superlattice where each uni…
▽ More
We study topological properties of bound pairs of photons in spatially-modulated qubit arrays (arrays of two-level atoms) coupled to a waveguide. While bound pairs behave like Bloch waves, they are topologically nontrivial in the parameter space formed by the center-of-mass momentum and the modulation phase, where the latter plays the role of a synthetic dimension. In a superlattice where each unit cell contains three two-level atoms (qubits), we calculate the Chern numbers for the bound-state photon bands, which are found to be $(1,-2,1)$. For open boundary condition, we find exotic topological bound-pair edge states with radiative losses. Unlike the conventional case of the bulk-edge correspondence, these novel edge modes not only exist in gaps separating the bound-pair bands, but they also may merge with and penetrate into the bands. By joining two structures with different spatial modulations, we find long-lived interface states which may have applications in storage and quantum information processing.
△ Less
Submitted 23 February, 2020;
originally announced February 2020.
-
New structure canditates for the experimentally synthesized heptazine-based and triazine-based two dimensional graphitic carbon nitride
Authors:
Luneng zhao,
Xizhi Shi,
Jin Li,
Tao Ouyang,
Chunxiao Zhang,
Chao Tang,
Chaoyu He,
Jianxin Zhong
Abstract:
The widely used crystal structures for both heptazine-based and triazine-based two-dimensional (2D) graphitic carbon nitride (g-C$_3$N$_4$) are the flat P-6m2 configurations. However, the experimentally synthesized 2D g-C$_3$N$_4$ possess thickness ranging in 0.2-0.5 nm, indicating that the theoretically used flat P-6m2 configurations are not the correct ground states. In this work, we propose thr…
▽ More
The widely used crystal structures for both heptazine-based and triazine-based two-dimensional (2D) graphitic carbon nitride (g-C$_3$N$_4$) are the flat P-6m2 configurations. However, the experimentally synthesized 2D g-C$_3$N$_4$ possess thickness ranging in 0.2-0.5 nm, indicating that the theoretically used flat P-6m2 configurations are not the correct ground states. In this work, we propose three new corrugated structures P321, P3m1 and Pca21 with energies of 66 (86), 77 (87) and 78 (89) meV/atom lower than that of the corresponding heptazine-based (triazine-based) g-C$_3$N$_4$ in flat P-6m2 configuration, respectively. These corrugated structures have very similar periodic patterns to the flat P-6m2 ones and they are difficult to be distinguished from each other according to their top-views. The optimized thicknesses of the three corrugated structures ranging in 1.347-3.142 Å are in good agreement with the experimental results. The first-principles results show that these corrugated structural candidates are also semiconductors with band gaps slightly larger than those of the correspondingly flat P-6m2 ones. Furthermore, they possess also suitable band edge positions for sun-light-driven water-splitting at both $pH=0$ and $pH=7$ environments. Our results show that these three new structures are more promising candidates for the experimentally synthesized g-C$_3$N$_4$.
△ Less
Submitted 25 May, 2020; v1 submitted 17 February, 2020;
originally announced February 2020.
-
Varied fusion reaction probability induced by ion stopping modification in laser-driven plasma with different temperature
Authors:
Yihang Zhang,
Zhe Zhang,
Baojun Zhu,
Weiman Jiang,
Lei Cheng,
Lei Zhao,
Xiaopeng Zhang,
Xu Zhao,
Xiaohui Yuan,
Bowei Tong,
Jiayong Zhong,
Shukai He,
Feng Lu,
Yuchi Wu,
Weimin Zhou,
Faqiang Zhang,
Kainan Zhou,
Na Xie,
Zheng Huang,
Yuqiu Gu,
Suming Weng,
Miaohua Xu,
Yingjun Li,
Yutong Li
Abstract:
The dynamics of nuclear reaction in plasma is a fundamental issue in many high energy density researches, such as the astrophysical reactions and the inertial confinement fusion. The effective reaction cross-sections and ion stopping power in plasma need to be taken into account to analyze the reactivity. In this research, we have experimentally investigated the from D-D reactions from interaction…
▽ More
The dynamics of nuclear reaction in plasma is a fundamental issue in many high energy density researches, such as the astrophysical reactions and the inertial confinement fusion. The effective reaction cross-sections and ion stopping power in plasma need to be taken into account to analyze the reactivity. In this research, we have experimentally investigated the from D-D reactions from interactions between deuteron beams and deuterated polystyrene (CD) plasma, driven by two laser pulses respectively. The neutron yields, plasma density and deuteron energy loss in plasma have been measured, and the plasma temperature and deuteron stopping power have been analyzed from simulations. It is shown that, compared with a cold target, the reaction probability in plasma conditions can be enhanced or suppressed, which is ascribed to the deuteron stopping power modifications in plasma. In hotter CD plasma, the energy loss of moderate energetic deuterons reduces, which leads to higher D-D reaction probability, while the contrary happens in colder plasma. This work provides new understanding of fusion reactions in plasma environment.
△ Less
Submitted 8 February, 2020;
originally announced February 2020.
-
Theoretical prediction of a low-energy Stone-Wales graphene with intrinsic type-III Dirac-cone
Authors:
Zhenghao Gong,
XiZhi Shi,
Jin Li,
ShiFang Li,
Chaoyu He,
Tao Ouyang,
ChunXiao Zhang,
Chao Tang,
JianXin Zhong
Abstract:
Based on first-principles method we predict a new low-energy Stone-Wales graphene SW40, which has an orthorhombic lattice with Pbam symmetry and 40 carbon atoms in its crystalline cell forming well-arranged Stone-Wales patterns. The calculated total energy of SW40 is just about 133 meV higher than that of graphene, indicating its excellent stability exceeds all the previously proposed graphene all…
▽ More
Based on first-principles method we predict a new low-energy Stone-Wales graphene SW40, which has an orthorhombic lattice with Pbam symmetry and 40 carbon atoms in its crystalline cell forming well-arranged Stone-Wales patterns. The calculated total energy of SW40 is just about 133 meV higher than that of graphene, indicating its excellent stability exceeds all the previously proposed graphene allotropes. We find that SW40 processes intrinsic Type-III Dirac-cone (Phys. Rev. Lett., 120, 237403, 2018) formed by band-crossing of a local linear-band and a local flat-band, which can result in highly anisotropic Fermions in the system. Interestingly, such intrinsic type-III Dirac-cone can be effectively tuned by inner-layer strains and it will be transferred into Type-II and Type-I Dirac-cones under tensile and compressed strains, respectively. Finally, a general tight-binding model was constructed to understand the electronic properties nearby the Fermi-level in SW40. The results show that type-III Dirac-cone feature can be well understood by the $π$-electron interactions between adjacent Stone-Wales defects.
△ Less
Submitted 28 January, 2020;
originally announced January 2020.
-
Multi-task self-supervised learning for Robust Speech Recognition
Authors:
Mirco Ravanelli,
Jianyuan Zhong,
Santiago Pascual,
Pawel Swietojanski,
Joao Monteiro,
Jan Trmal,
Yoshua Bengio
Abstract:
Despite the growing interest in unsupervised learning, extracting meaningful knowledge from unlabelled audio remains an open challenge. To take a step in this direction, we recently proposed a problem-agnostic speech encoder (PASE), that combines a convolutional encoder followed by multiple neural networks, called workers, tasked to solve self-supervised problems (i.e., ones that do not require ma…
▽ More
Despite the growing interest in unsupervised learning, extracting meaningful knowledge from unlabelled audio remains an open challenge. To take a step in this direction, we recently proposed a problem-agnostic speech encoder (PASE), that combines a convolutional encoder followed by multiple neural networks, called workers, tasked to solve self-supervised problems (i.e., ones that do not require manual annotations as ground truth). PASE was shown to capture relevant speech information, including speaker voice-print and phonemes. This paper proposes PASE+, an improved version of PASE for robust speech recognition in noisy and reverberant environments. To this end, we employ an online speech distortion module, that contaminates the input signals with a variety of random disturbances. We then propose a revised encoder that better learns short- and long-term speech dynamics with an efficient combination of recurrent and convolutional networks. Finally, we refine the set of workers used in self-supervision to encourage better cooperation. Results on TIMIT, DIRHA and CHiME-5 show that PASE+ significantly outperforms both the previous version of PASE as well as common acoustic features. Interestingly, PASE+ learns transferable representations suitable for highly mismatched acoustic conditions.
△ Less
Submitted 17 April, 2020; v1 submitted 24 January, 2020;
originally announced January 2020.
-
Disorder effects in the two-dimensional Lieb lattice and its extensions
Authors:
Xiaoyu Mao,
Jie Liu,
Jianxin Zhong,
Rudolf A. Römer
Abstract:
We study the localization properties of the two-dimensional Lieb lattice and its extensions in the presence of disorder using transfer matrix method and finite-size scaling. We find that all states in the Lieb lattice and its extensions are localized for $W \geq 1$. Clear differences in the localization properties between disordered flat band and disordered dispersive bands are identified. Our res…
▽ More
We study the localization properties of the two-dimensional Lieb lattice and its extensions in the presence of disorder using transfer matrix method and finite-size scaling. We find that all states in the Lieb lattice and its extensions are localized for $W \geq 1$. Clear differences in the localization properties between disordered flat band and disordered dispersive bands are identified. Our results complement previous experimental studies of clean photonic Lieb lattices and provide information about their stability with respect to disorder.
△ Less
Submitted 2 July, 2020; v1 submitted 16 January, 2020;
originally announced January 2020.
-
Investigation of Growth-Induced Strain in Monolayer MoS2 Grown by Chemical Vapor Deposition
Authors:
Siwei Luo,
Conor P. Cullen,
Gencai Guo,
Jianxin Zhong,
Georg S. Duesberg
Abstract:
Two-dimensional materials such as transitional metal dichalcogenides exhibit unique optical and electrical properties. Here we report on the varying optical properties of CVD grown MoS2 monolayer flakes with different shapes. In particular, it is observed that the perimeter and the central region of the flakes have non-uniform photoluminescence (PL) energy and intensity. We quantified these effect…
▽ More
Two-dimensional materials such as transitional metal dichalcogenides exhibit unique optical and electrical properties. Here we report on the varying optical properties of CVD grown MoS2 monolayer flakes with different shapes. In particular, it is observed that the perimeter and the central region of the flakes have non-uniform photoluminescence (PL) energy and intensity. We quantified these effects systematically and propose that thermally induced strain during growth is the origin. The strain relaxation after transfer of the MoS2 flakes supports this explanation. Detailed investigations of the spatial distribution of the PL energy reveal that depending on the shape of the MoS2 flakes, the width of the strain field is different. Thus, our results help to elucidate the fundamental mechanisms responsible for the differences in PL and Raman signals between the perimeter region and the center region of monolayer MoS2 and suggest that the induced strain plays an important role in the growth of monolayer materials.
△ Less
Submitted 22 December, 2019;
originally announced December 2019.
-
Expected Exit Time for Time-Periodic Stochastic Differential Equations and Applications to Stochastic Resonance
Authors:
Chunrong Feng,
Huaizhong Zhao,
Johnny Zhong
Abstract:
In this paper, we derive a parabolic partial differential equation for the expected exit time of non-autonomous time-periodic non-degenerate stochastic differential equations. This establishes a Feynman-Kac duality between expected exit time of time-periodic stochastic differential equations and time-periodic solutions of parabolic partial differential equations. Casting the time-periodic solution…
▽ More
In this paper, we derive a parabolic partial differential equation for the expected exit time of non-autonomous time-periodic non-degenerate stochastic differential equations. This establishes a Feynman-Kac duality between expected exit time of time-periodic stochastic differential equations and time-periodic solutions of parabolic partial differential equations. Casting the time-periodic solution of the parabolic partial differential equation as a fixed point problem and a convex optimisationproblem, we give sufficient conditions in which the partial differential equation is well-posed in a weak and classical sense. With no known closed formulae for the expected exit time, we show our method can be readily implemented by standard numerical schemes. With relatively weak conditions (e.g. locally Lipschitz coefficients), the method in this paper is applicable to wide range of physical systems including weakly dissipative systems. Particular applications towards stochastic resonance will be discussed.
△ Less
Submitted 11 March, 2021; v1 submitted 11 December, 2019;
originally announced December 2019.
-
Pinning Stabilizer Design for Large-Scale Probabilistic Boolean Networks
Authors:
Lin Lin,
Jinde Cao,
Jianquan Lu,
Jie Zhong
Abstract:
This paper investigates the stabilization of probabilistic Boolean networks (PBNs) via a novel pinning control strategy based on network structure. In a PBN, the evolution equation of each gene switches among a collection of candidate Boolean functions with probability distributions that govern the activation frequency of each Boolean function. Owing to the stochasticity, the uniform state feedbac…
▽ More
This paper investigates the stabilization of probabilistic Boolean networks (PBNs) via a novel pinning control strategy based on network structure. In a PBN, the evolution equation of each gene switches among a collection of candidate Boolean functions with probability distributions that govern the activation frequency of each Boolean function. Owing to the stochasticity, the uniform state feedback controller, independent of switching signal, might be out of work, and in this case, the non-uniform state feedback controller is required. Subsequently, a criterion is derived to determine whether uniform controllers is applicable to achieve stabilization. It is worth pointing out that the pinning control designed in this paper is based on the network structure, which only requires local in-neighbors' information, rather than global information (state transition matrix). Moreover, this pinning control strategy reduces the computational complexity from $O(2^{2n})$ to $O(n2^α)$, and thus it has the ability to handle some large-scale networks, especially the networks with sparse connections. Finally, the mammalian cell-cycle encountering a mutated phenotype is modelled by a PBN to demonstrate the obtained results.
△ Less
Submitted 23 October, 2020; v1 submitted 7 December, 2019;
originally announced December 2019.
-
Sensors Design for Large-Scale Boolean Networks via Pinning Observability
Authors:
Shiyong Zhu,
Jianquan Lu,
Jie Zhong,
Yang Liu,
Jinde Cao
Abstract:
In this paper, a set of sensors is constructed via the pinning observability approach with the help of observability criteria given in [1] and [2], in order to make the given Boolean network (BN) be observable. Given the assumption that system states can be accessible, an efficient pinning control scheme is developed to generate an observable BN by adjusting the network structure rather than just…
▽ More
In this paper, a set of sensors is constructed via the pinning observability approach with the help of observability criteria given in [1] and [2], in order to make the given Boolean network (BN) be observable. Given the assumption that system states can be accessible, an efficient pinning control scheme is developed to generate an observable BN by adjusting the network structure rather than just to check system observability. Accordingly, the sensors are constructed, of which the form is consistent with that of state feedback controllers in the designed pinning control. Since this pinning control approach only utilizes node-to-node message communication instead of global state space information, the time complexity is dramatically reduced from $O(2^{2n})$ to $O(n^2+n2^d)$, where where $n$ and $d$ are respectively the node number of the considered BN and the largest in-degree of vertices in its network structure. Finally, we design the sensors for the reduced D. melanogaster segmentation polarity gene network and the T-cell receptor kinetics, respectively.
△ Less
Submitted 5 March, 2022; v1 submitted 5 December, 2019;
originally announced December 2019.
-
Image-free real-time classification of fast moving objects using 'learned' spatial light modulation and a single-pixel detector
Authors:
Zibang Zhang,
Xiang Li,
Manhong Yao,
Shujun Zheng,
Guoan Zheng,
Jingang Zhong
Abstract:
Objects classification generally relies on image acquisition and analysis. Real-time classification of high-speed moving objects is challenging, as both high temporal resolution in image acquisition and low computational complexity in objects classification algorithms are required. Here we propose and experimentally demonstrate an approach for real-time moving objects classification without image…
▽ More
Objects classification generally relies on image acquisition and analysis. Real-time classification of high-speed moving objects is challenging, as both high temporal resolution in image acquisition and low computational complexity in objects classification algorithms are required. Here we propose and experimentally demonstrate an approach for real-time moving objects classification without image acquisition. As objects classification algorithms rely on the feature information of objects, we propose to use spatial light modulation to acquire the feature information directly rather than performing image acquisition followed by features extraction. A convolutional neural network is designed and trained to learn the spatial features of the target objects. The trained network can generate structured patterns for spatial light modulation. Using the resulting structured patterns for spatial light modulation, the feature information of target objects can be compressively encoded into a short light intensity sequence. The resulting one-dimensional signal is collected by a single-pixel detector and fed to the convolutional neural network for objects classification. As experimentally demonstrated, the proposed approach can achieve accurate and real-time classification of fast moving objects. The proposed method has potential applications in the fields where fast moving objects classification in real time and for long duration is required.
△ Less
Submitted 4 December, 2019; v1 submitted 2 December, 2019;
originally announced December 2019.
-
A New Approach to Pinning Control of Boolean Networks
Authors:
Jie Zhong,
Daniel W. C. Ho,
Jianquan Lu
Abstract:
Boolean networks (BNs) are discrete-time systems where nodes are inter-connected (here we call such connection rule among nodes as network structure), and the dynamics of each gene node is determined by logical functions. In this paper, we propose a new approach on pinning control design for global stabilization of BNs based on BNs' network structure, named as network-structure-based distributed p…
▽ More
Boolean networks (BNs) are discrete-time systems where nodes are inter-connected (here we call such connection rule among nodes as network structure), and the dynamics of each gene node is determined by logical functions. In this paper, we propose a new approach on pinning control design for global stabilization of BNs based on BNs' network structure, named as network-structure-based distributed pinning control. By deleting the minimum number of edges, the network structure becomes acyclic. Then, an efficient distributed pinning control is designed to achieve global stabilization. Compared with existing literature, the design of pinning control is not based on the state transition matrix of BNs. Hence, the computational complexity in this paper is reduced from $O(2^n\times 2^n)$ to $O(2\times 2^K)$, where $n$ is the number of nodes and $K\leq n$ is the largest number of in-neighbors of nodes. In addition, without using state transition matrix, global state information is no longer needed, the design of pinning control is just based on neighbors' local information, which is easier to be implemented. The proposed method is well demonstrated by several biological networks with different sizes. The results are shown to be simple and concise, while the traditional pinning control can not be applied for BNs with such a large dimension.
△ Less
Submitted 30 October, 2020; v1 submitted 2 December, 2019;
originally announced December 2019.
-
A Simple Distortion Calibration method for Wide-Angle Lenses Based on Fringe-pattern Phase Analysis
Authors:
Weishuai Zhou,
Jiawen Weng,
Junzheng Peng,
Jingang Zhong
Abstract:
A distortion calibration method for wide-angle lens is proposed based on fringe-pattern phase analysis. Firstly, according to the experimental result of the radial distortion of the image not related to the recording depth of field, but depending on the field of view angle of the wide-angle lens imaging system, two-dimensional image distortion calibration is need to be considered. Four standard si…
▽ More
A distortion calibration method for wide-angle lens is proposed based on fringe-pattern phase analysis. Firstly, according to the experimental result of the radial distortion of the image not related to the recording depth of field, but depending on the field of view angle of the wide-angle lens imaging system, two-dimensional image distortion calibration is need to be considered. Four standard sinusoidal fringe-patterns with phase shift step of , which are used as calibration templates, are shown on a Liquid Crystal Display screen, and captured by the wide-angle lens imaging system. A four-step phase-shifting method is employed to obtain the radial phase distribution of the distorted fringe-pattern. Wavelet analysis is applied for the analysis of the instantaneous frequency to show the fundamental frequency of the fringe-pattern in the central region being unchanged. Performing numerical calculation by the central 9 points of the central row of the fringe-pattern, we can get the undistorted radial phase distribution, so, the radial modulated phase is computed. Finally, the radial distortion distribution is determined according to the radial modulated phase. By employing a bilinear interpolation algorithm, the wide-angle lens image calibration is achieved. There is no need to establish any kind of image distortion model for the proposed method. There is no projecting system in the experimental apparatus, which avoids projection shadow problems, and no need to align with the center of the template for distortion measurement for the proposed method. Theoretical description, numerical simulation and experimental results show that the proposed method is simple, automatic and effective.
△ Less
Submitted 26 November, 2019;
originally announced November 2019.
-
Zero-Shot Imitating Collaborative Manipulation Plans from YouTube Cooking Videos
Authors:
Hejia Zhang,
Jie Zhong,
Stefanos Nikolaidis
Abstract:
People often watch videos on the web to learn how to cook new recipes, assemble furniture or repair a computer. We wish to enable robots with the very same capability. This is challenging; there is a large variation in manipulation actions and some videos even involve multiple persons, who collaborate by sharing and exchanging objects and tools. Furthermore, the learned representations need to be…
▽ More
People often watch videos on the web to learn how to cook new recipes, assemble furniture or repair a computer. We wish to enable robots with the very same capability. This is challenging; there is a large variation in manipulation actions and some videos even involve multiple persons, who collaborate by sharing and exchanging objects and tools. Furthermore, the learned representations need to be general enough to be transferable to robotic systems. On the other hand, previous work has shown that the space of human manipulation actions has a linguistic, hierarchical structure that relates actions to manipulated objects and tools. Building upon this theory of language for action, we propose a system for understanding and executing demonstrated action sequences from full-length, real-world cooking videos on the web. The system takes as input a new, previously unseen cooking video annotated with object labels and bounding boxes, and outputs a collaborative manipulation action plan for one or more robotic arms. We demonstrate performance of the system in a standardized dataset of 100 YouTube cooking videos, as well as in six full-length Youtube videos that include collaborative actions between two participants. We compare our system with a baseline system that consists of a state-of-the-art action detection baseline and show our system achieves higher action detection accuracy. We additionally propose an open-source platform for executing the learned plans in a simulation environment as well as with an actual robotic arm.
△ Less
Submitted 26 September, 2022; v1 submitted 24 November, 2019;
originally announced November 2019.
-
Photon-mediated localization in two-level qubit arrays
Authors:
Janet Zhong,
Nikita A. Olekhno,
Yongguan Ke,
Alexander V. Poshakinskiy,
Chaohong Lee,
Yuri S. Kivshar,
Alexander N. Poddubny
Abstract:
We predict the existence of a novel interaction-induced spatial localization in a periodic array of qubits coupled to a waveguide. This localization can be described as a quantum analogue of a self-induced optical lattice between two indistinguishable photons, where one photon creates a standing wave that traps the other photon. The localization is caused by the interplay between on-site repulsion…
▽ More
We predict the existence of a novel interaction-induced spatial localization in a periodic array of qubits coupled to a waveguide. This localization can be described as a quantum analogue of a self-induced optical lattice between two indistinguishable photons, where one photon creates a standing wave that traps the other photon. The localization is caused by the interplay between on-site repulsion due to the photon blockade and the waveguide-mediated long-range coupling between the qubits.
△ Less
Submitted 11 November, 2019;
originally announced November 2019.
-
Adversarial Attacks on GMM i-vector based Speaker Verification Systems
Authors:
Xu Li,
Jinghua Zhong,
Xixin Wu,
Jianwei Yu,
Xunying Liu,
Helen Meng
Abstract:
This work investigates the vulnerability of Gaussian Mixture Model (GMM) i-vector based speaker verification systems to adversarial attacks, and the transferability of adversarial samples crafted from GMM i-vector based systems to x-vector based systems. In detail, we formulate the GMM i-vector system as a scoring function of enrollment and testing utterance pairs. Then we leverage the fast gradie…
▽ More
This work investigates the vulnerability of Gaussian Mixture Model (GMM) i-vector based speaker verification systems to adversarial attacks, and the transferability of adversarial samples crafted from GMM i-vector based systems to x-vector based systems. In detail, we formulate the GMM i-vector system as a scoring function of enrollment and testing utterance pairs. Then we leverage the fast gradient sign method (FGSM) to optimize testing utterances for adversarial samples generation. These adversarial samples are used to attack both GMM i-vector and x-vector systems. We measure the system vulnerability by the degradation of equal error rate and false acceptance rate. Experiment results show that GMM i-vector systems are seriously vulnerable to adversarial attacks, and the crafted adversarial samples prove to be transferable and pose threats to neuralnetwork speaker embedding based systems (e.g. x-vector systems).
△ Less
Submitted 12 February, 2020; v1 submitted 8 November, 2019;
originally announced November 2019.
-
RoIMix: Proposal-Fusion among Multiple Images for Underwater Object Detection
Authors:
Wei-Hong Lin,
Jia-Xing Zhong,
Shan Liu,
Thomas Li,
Ge Li
Abstract:
Generic object detection algorithms have proven their excellent performance in recent years. However, object detection on underwater datasets is still less explored. In contrast to generic datasets, underwater images usually have color shift and low contrast; sediment would cause blurring in underwater images. In addition, underwater creatures often appear closely to each other on images due to th…
▽ More
Generic object detection algorithms have proven their excellent performance in recent years. However, object detection on underwater datasets is still less explored. In contrast to generic datasets, underwater images usually have color shift and low contrast; sediment would cause blurring in underwater images. In addition, underwater creatures often appear closely to each other on images due to their living habits. To address these issues, our work investigates augmentation policies to simulate overlapping, occluded and blurred objects, and we construct a model capable of achieving better generalization. We propose an augmentation method called RoIMix, which characterizes interactions among images. Proposals extracted from different images are mixed together. Previous data augmentation methods operate on a single image while we apply RoIMix to multiple images to create enhanced samples as training data. Experiments show that our proposed method improves the performance of region-based object detectors on both Pascal VOC and URPC datasets.
△ Less
Submitted 24 March, 2020; v1 submitted 7 November, 2019;
originally announced November 2019.
-
Stellar chromospheric activity and age relation from open clusters in the LAMOST Survey
Authors:
Jiajun Zhang,
Jingkun Zhao,
Terry D. Oswalt,
Xiangsong Fang,
Gang Zhao,
Xilong Liang,
Xianhao Ye,
Jing Zhong
Abstract:
We identify member stars of more than 90 open clusters in the LAMOST survey. With the method of Fang et al.(2018), the chromospheric activity (CA) indices logR'CaK for 1091 member stars in 82 open clusters and logR'Hα for 1118 member stars in 83 open clusters are calculated. The relations between the average logR'CaK, logR'Hα in each open cluster and its age are investigated in different Teff and…
▽ More
We identify member stars of more than 90 open clusters in the LAMOST survey. With the method of Fang et al.(2018), the chromospheric activity (CA) indices logR'CaK for 1091 member stars in 82 open clusters and logR'Hα for 1118 member stars in 83 open clusters are calculated. The relations between the average logR'CaK, logR'Hα in each open cluster and its age are investigated in different Teff and [Fe/H] ranges. We find that CA starts to decrease slowly from logt = 6.70 to logt = 8.50, and then decreases rapidly until logt = 9.53. The trend becomes clearer for cooler stars. The quadratic functions between logR' and logt with 4000K < Teff < 5500K are constructed, which can be used to roughly estimate ages of field stars with accuracy about 40% for logR'CaK and 60% for logR'Hα.
△ Less
Submitted 30 September, 2019;
originally announced September 2019.
-
Efficient T2 mapping with Blip-up/down EPI and gSlider-SMS (T2-BUDA-gSlider)
Authors:
Xiaozhi Cao,
Congyu Liao,
Zijing Zhang,
Siddharth Srinivasan Iyer,
Kang Wang,
Hongjian He,
Huafeng Liu,
Kawin Setsompop,
Jianhui Zhong,
Berkin Bilgic
Abstract:
Purpose: To rapidly obtain high isotropic-resolution T2 maps with whole-brain coverage and high geometric fidelity.
Methods: A T2 blip-up/down echo planar imaging (EPI) acquisition with generalized Slice-dithered enhanced resolution (T2-BUDA-gSlider) is proposed. A radiofrequency (RF)-encoded multi-slab spin-echo EPI acquisition with multiple echo times (TEs) was developed to obtain high SNR eff…
▽ More
Purpose: To rapidly obtain high isotropic-resolution T2 maps with whole-brain coverage and high geometric fidelity.
Methods: A T2 blip-up/down echo planar imaging (EPI) acquisition with generalized Slice-dithered enhanced resolution (T2-BUDA-gSlider) is proposed. A radiofrequency (RF)-encoded multi-slab spin-echo EPI acquisition with multiple echo times (TEs) was developed to obtain high SNR efficiency with reduced repetition time (TR). This was combined with an interleaved 2-shot EPI acquisition using blip-up/down phase encoding. An estimated field map was incorporated into the joint multi-shot EPI reconstruction with a structured low rank constraint to achieve distortion-free and robust reconstruction for each slab without navigation. A Bloch simulated subspace model was integrated into gSlider reconstruction and utilized for T2 quantification.
Results: In vivo results demonstrated that the T2 values estimated by the proposed method were consistent with gold standard spin-echo acquisition. Compared to the reference 3D fast spin echo (FSE) images, distortion caused by off-resonance and eddy current effects were effectively mitigated.
Conclusion: BUDA-gSlider SE-EPI acquisition and gSlider-subspace joint reconstruction enabled distortion-free whole-brain T2 mapping in 2 min at ~1 mm3 isotropic resolution, which could bring significant benefits to related clinical and neuroscience applications.
△ Less
Submitted 20 September, 2020; v1 submitted 27 September, 2019;
originally announced September 2019.
-
Tracing Kinematic and Chemical Properties of Sagittarius Stream by K-Giants, M-Giants, and BHB stars
Authors:
Chengqun Yang,
Xiang-Xiang Xue,
Jing Li,
Chao Liu,
Bo Zhang,
Hans-Walter Rix,
Lan Zhang,
Gang Zhao,
Hao Tian,
Jing Zhong,
Qianfan Xing,
Yaqian Wu,
Chengdong Li,
Jeffrey L. Carlin,
Jiang Chang
Abstract:
We characterize the kinematic and chemical properties of $\sim$3,000 Sagittarius (Sgr) stream stars, including K-giants, M-giants, and BHBs, select from SEGUE-2, LAMOST, and SDSS separately in Integrals-of-Motion space. The orbit of Sgr stream is quite clear from the velocity vector in $X$-$Z$ plane. Stars traced by K-giants and M-giants present the apogalacticon of trailing steam is $\sim$ 100 kp…
▽ More
We characterize the kinematic and chemical properties of $\sim$3,000 Sagittarius (Sgr) stream stars, including K-giants, M-giants, and BHBs, select from SEGUE-2, LAMOST, and SDSS separately in Integrals-of-Motion space. The orbit of Sgr stream is quite clear from the velocity vector in $X$-$Z$ plane. Stars traced by K-giants and M-giants present the apogalacticon of trailing steam is $\sim$ 100 kpc. The metallicity distributions of Sgr K-, M-giants, and BHBs present that the M-giants are on average the most metal-rich population, followed by K-giants and BHBs. All of the K-, M-giants, and BHBs indicate that the trailing arm is on average more metal-rich than leading arm, and the K-giants show that the Sgr debris is the most metal-poor part. The $α$-abundance of Sgr stars exhibits a similar trend with the Galactic halo stars at lower metallicity ([Fe/H] $<\sim$ $-$1.0 dex), and then evolve down to lower [$α$/Fe] than disk stars at higher metallicity, which is close to the evolution pattern of $α$-element of Milky Way dwarf galaxies. We find $V_Y$ and metallicity of K-giants have gradients along the direction of line-of-sight from the Galactic center in $X$-$Z$ plane, and the K-giants show that $V_Y$ increases with metallicity at [Fe/H] $>\sim-$1.5 dex. After dividing the Sgr stream into bright and faint stream according to their locations in equatorial coordinate, the K-giants and BHBs show that the bright and faint stream present different $V_Y$ and metallicities, the bright stream is on average higher in $V_Y$ and metallicity than the faint stream.
△ Less
Submitted 27 September, 2019;
originally announced September 2019.
-
Streaming controlled by meniscus shape
Authors:
Y. Huang,
C. P. Wolfe,
J. Zhang,
J. -Q. Zhong
Abstract:
Surface waves called meniscus waves often appear in the systems that are close to the capillary length scale. Since the meniscus shape determines the form of the meniscus waves, the resulting streaming circulation has a structure distinct from that caused by other capillary-gravity waves recently reported in the literature. In the present study, we produce symmetric and antisymmetric meniscus shap…
▽ More
Surface waves called meniscus waves often appear in the systems that are close to the capillary length scale. Since the meniscus shape determines the form of the meniscus waves, the resulting streaming circulation has a structure distinct from that caused by other capillary-gravity waves recently reported in the literature. In the present study, we produce symmetric and antisymmetric meniscus shapes by controlling boundary wettability and excite meniscus waves by oscillating the meniscus vertically. The symmetric and antisymmetric configurations produce different surface capillary-gravity wave modes and streaming flow structures. The energy density of the streaming circulation increases at the rate of the fourth power of the forcing amplitude in both configurations. The flow symmetry of streaming circulation is retained under the symmetric meniscus, while it is lost under the antisymmetric meniscus. In our experiments, the streaming circulation primarily originates from the Stokes boundary layer beneath the meniscus and can be successfully explained using the existing streaming theory.
△ Less
Submitted 3 September, 2019;
originally announced September 2019.
-
Properties of Radial Velocities measurement based on LAMOST-II Medium-Resolution Spectroscopic Observations
Authors:
R. Wang,
A. -L. Luo,
J. -J. Chen,
Z. -R. Bai,
L. Chen,
X. -F. Chen,
S. -B. Dong,
B. Du,
J. -N. Fu,
Z. -W. Han,
J. -L. Hou,
Y. -H. Hou,
W. Hou,
D. -K. Jiang,
X. Kong,
L. -F. Li,
C. Liu,
J. -M. Liu,
L. Qin,
J. -R. Shi,
H. Tian,
H. Wu,
C. -J. Wu,
J. -W. Xie,
H. -T. Zhang
, et al. (6 additional authors not shown)
Abstract:
The radial velocity (RV) is a basic physical quantity which can be determined through Doppler shift of the spectrum of a star. The precision of RV measurement depends on the resolution of the spectrum we used and the accuracy of wavelength calibration. In this work, radial velocities of LAMOST-II medium resolution (R ~ 7500) spectra are measured for 1,594,956 spectra (each spectrum has two waveban…
▽ More
The radial velocity (RV) is a basic physical quantity which can be determined through Doppler shift of the spectrum of a star. The precision of RV measurement depends on the resolution of the spectrum we used and the accuracy of wavelength calibration. In this work, radial velocities of LAMOST-II medium resolution (R ~ 7500) spectra are measured for 1,594,956 spectra (each spectrum has two wavebands) through matching with templates. A set of RV standard stars are used to recalibrate the zero point of the measurement, and some reference sets with RVs derived from medium/high-resolution observations are used to evaluate the accuracy of the measurement. Comparing with reference sets, the accuracy of our measurement can get 0.0227 km s/1 with respect to radial velocities standard stars. The intrinsic precision is estimated with the multiple observations of single stars, which can achieve to 1.36 km s/1,1.08 km s/1, 0.91 km s/1 for the spectra at signal-to-noise levels of 10, 20, 50, respectively.
△ Less
Submitted 19 November, 2019; v1 submitted 13 August, 2019;
originally announced August 2019.
-
Value-added catalogs of M type stars in LAMOST DR5
Authors:
Jing Zhong,
Jing Li,
Jeffrey L. Carlin,
Li Chen,
Rene A. Mendez,
Jinliang Hou
Abstract:
In this work, we present new catalogs of M giant and M dwarf stars from the LAMOST DR5. In total, 39,796 M giants and 501,152 M dwarfs are identified from the classification pipeline. The template-fitting results contain M giants with 7 temperature subtypes from M0 to M6, M dwarfs with 18 temperature subtypes from K7.0 to M8.5 and 12 metallicity subclasses from dMr to usdMp. We cross-matched our M…
▽ More
In this work, we present new catalogs of M giant and M dwarf stars from the LAMOST DR5. In total, 39,796 M giants and 501,152 M dwarfs are identified from the classification pipeline. The template-fitting results contain M giants with 7 temperature subtypes from M0 to M6, M dwarfs with 18 temperature subtypes from K7.0 to M8.5 and 12 metallicity subclasses from dMr to usdMp. We cross-matched our M-type catalog with the 2MASS and WISE catalog to obtain infrared magnitude and colors. Adopting the distances derived from the parallaxes in \gaia{} DR2, the M_G vs. (G_bp-G_rp)_0 diagram shows that there are also early-type stars and white dwarf-M dwarf binaries included in our M type stars sample, with a contamination rate of about 4.6% for M giants and 0.48% for M dwarfs. We found that CaH spectral indices are an efficient selection criteria for carbon stars. A total of 289 carbon stars were identified from the M giants sample, and further confirmed by LAMOST spectra.
△ Less
Submitted 3 August, 2019;
originally announced August 2019.
-
Balanced Coherence Times of Mixed-Species Atomic Qubits in a Dual $3\times3$ Magic-Intensity Optical Dipole Trap Array
Authors:
Ruijun Guo,
Xiaodong He,
Cheng Sheng,
Jiaheng Yang,
Peng Xu,
Kunpeng Wang,
Jiaqi Zhong,
Min Liu,
Jin Wang,
Mingsheng Zhan
Abstract:
In this work, we construct a polarization-mediated magic-intensity (MI) optical dipole trap (ODT) array, in which the detrimental effects of light shifts on the mixed-species qubits are efficiently mitigated so that the coherence times of the mixed-species qubits are both substantially enhanced and balanced for the first time. This mixed-species magic trapping technique relies on the tunability of…
▽ More
In this work, we construct a polarization-mediated magic-intensity (MI) optical dipole trap (ODT) array, in which the detrimental effects of light shifts on the mixed-species qubits are efficiently mitigated so that the coherence times of the mixed-species qubits are both substantially enhanced and balanced for the first time. This mixed-species magic trapping technique relies on the tunability of the coefficient of the third-order cross term and ground state hyperpolarizability, which are inherently dependent on the degree of circular polarization of the trap laser. Experimentally, polarization of the ODT array for $^{85}$Rb qubits is finely adjusted to a definite value so that its working magnetic field required for magic trapping amounts to the one required for magically trapping $^{87}$Rb qubits in another ODT array with fully circular polarization. Ultimately, in such a polarization-mediated MI-ODT array, the coherence times of $^{87}$Rb and $^{85}$Rb qubits are respectively enhanced up to 891$\pm$47 ms and 943$\pm$35 ms. Furthermore, a new source of dephasing effect is revealed, which arises from the noise of the elliptic polarization, and the reduction in corresponding dephasing effect on the $^{85}$Rb qubits is attainable by use of shallow magic intensity. It is anticipated that the novel mixed-species MI-ODT array is a versatile platform for building scalable quantum computers with neutral atoms.
△ Less
Submitted 29 July, 2019;
originally announced July 2019.
-
Fine vortex structure and flow transition to the geostrophic regime in rotating Rayleigh-Bénard convection
Authors:
Jun-Qiang Shi,
Hao-Yuan Lu,
Shan-Shan Ding,
Jin-Qiang Zhong
Abstract:
We present spatial-resolved measurements of the columnar vortex structures in rotating Rayleigh-Bénard convection. The scaled radial profiles of the azimuthal velocity $u_φ(r)$ and vertical vorticity $ω(r)$ of the vortices are analyzed and compared with the predictions of the asymptotic theory. The results reveal that the asymptotic theory predicts accurately $u_φ(r)$ and $ω(r)$ in the geostrophic…
▽ More
We present spatial-resolved measurements of the columnar vortex structures in rotating Rayleigh-Bénard convection. The scaled radial profiles of the azimuthal velocity $u_φ(r)$ and vertical vorticity $ω(r)$ of the vortices are analyzed and compared with the predictions of the asymptotic theory. The results reveal that the asymptotic theory predicts accurately $u_φ(r)$ and $ω(r)$ in the geostrophic convection regime, but extension of the theory in the weak rotation regime is needed to interpret the rotation-dependence of the experimental data. Our measurements of the mean velocity, vorticity of the vortices, and the strength of the vortex shield structure all indicate a flow transition from weekly rotating convection to geostrophic convection. Results of the parameter values for the transition are in agreement with the scaling relationship obtained from previous heat-transfer measurements.
△ Less
Submitted 24 July, 2019;
originally announced July 2019.
-
Geometry of escape and transition dynamics in the presence of dissipative and gyroscopic forces in two degree of freedom systems
Authors:
Jun Zhong,
Shane D. Ross
Abstract:
Escape from a potential well can occur in different physical systems, such as capsize of ships, resonance transitions in celestial mechanics, and dynamic snap-through of arches and shells, as well as molecular reconfigurations in chemical reactions. The criteria and routes of escape in one-degree of freedom systems has been well studied theoretically with reasonable agreement with experiment. The…
▽ More
Escape from a potential well can occur in different physical systems, such as capsize of ships, resonance transitions in celestial mechanics, and dynamic snap-through of arches and shells, as well as molecular reconfigurations in chemical reactions. The criteria and routes of escape in one-degree of freedom systems has been well studied theoretically with reasonable agreement with experiment. The trajectory can only transit from the hilltop of the one-dimensional potential energy surface. The situation becomes more complicated when the system has higher degrees of freedom since it has multiple routes to escape through an equilibrium of saddle-type, specifically, an index-1 saddle. This paper summarizes the geometry of escape across a saddle in some widely known physical systems with two degrees of freedom and establishes the criteria of escape providing both a methodology and results under the conceptual framework known as tube dynamics. These problems are classified into two categories based on whether the saddle projection and focus projection in the symplectic eigenspace are coupled or not when damping and/or gyroscopic effects are considered. To simplify the process, only the linearized system around the saddle points are analyzed. We define a transition region, $\mathcal{T}_h$, as the region of initial conditions of a given initial energy $h$ which transit from one side of a saddle to the other. We find that in conservative systems, the boundary of the transition region, $\partial \mathcal{T}_h$, is a cylinder, while in dissipative systems, $\partial \mathcal{T}_h$ is an ellipsoid.
△ Less
Submitted 3 October, 2019; v1 submitted 24 July, 2019;
originally announced July 2019.
-
Nanoscale ice fracture by molecular dynamics simulations
Authors:
A. Afshar,
J. Zhong,
D. S. Thompson,
D. Meng
Abstract:
In this work, we conducted molecular dynamics simulations to study the fracture mechanism of ice crystals in a bulk phase and at ice-ice interfaces at the atomistic scale. We show that there exists a narrow disordered interfacial layer between two Ih ice structures. The width of the interfacial layer is determined to be about the size of two water molecules. Upon deformation, the stress response o…
▽ More
In this work, we conducted molecular dynamics simulations to study the fracture mechanism of ice crystals in a bulk phase and at ice-ice interfaces at the atomistic scale. We show that there exists a narrow disordered interfacial layer between two Ih ice structures. The width of the interfacial layer is determined to be about the size of two water molecules. Upon deformation, the stress response of ice at interface show significantly anisotropic behaviors depending on the direction of deformation. Bulk-like behavior is observed when direction of deformation being orthogonal to the direction of interfacial plane. Significantly smaller fracture stress and yield strain occurs if the deformation is along interfacial plane. This result illustrates the dominant role played by the small amount of disordered water molecules at interface in altering mechanical strength of an interfacial structure.
△ Less
Submitted 21 July, 2019;
originally announced July 2019.
-
Predicting Customer Call Intent by Analyzing Phone Call Transcripts based on CNN for Multi-Class Classification
Authors:
Junmei Zhong,
William Li
Abstract:
Auto dealerships receive thousands of calls daily from customers who are interested in sales, service, vendors and jobseekers. With so many calls, it is very important for auto dealers to understand the intent of these calls to provide positive customer experiences that ensure customer satisfaction, deep customer engagement to boost sales and revenue, and optimum allocation of agents or customer s…
▽ More
Auto dealerships receive thousands of calls daily from customers who are interested in sales, service, vendors and jobseekers. With so many calls, it is very important for auto dealers to understand the intent of these calls to provide positive customer experiences that ensure customer satisfaction, deep customer engagement to boost sales and revenue, and optimum allocation of agents or customer service representatives across the business. In this paper, we define the problem of customer phone call intent as a multi-class classification problem stemming from the large database of recorded phone call transcripts. To solve this problem, we develop a convolutional neural network (CNN)-based supervised learning model to classify the customer calls into four intent categories: sales, service, vendor and jobseeker. Experimental results show that with the thrust of our scalable data labeling method to provide sufficient training data, the CNN-based predictive model performs very well on long text classification according to the quantitative metrics of F1-Score, precision, recall, and accuracy.
△ Less
Submitted 8 July, 2019;
originally announced July 2019.
-
A fourth-order compact solver for fractional-in-time fourth-order diffusion equations
Authors:
Jialing Zhong,
Hong-lin Liao,
Bingquan Ji,
Luming Zhang
Abstract:
A fourth-order compact scheme is proposed for a fourth-order subdiffusion equation with the first Dirichlet boundary conditions. The fourth-order problem is firstly reduced into a couple of spatially second-order system and we use an averaged operator to construct a fourth-order spatial approximation. This averaged operator is compact since it involves only two grid points for the derivative bound…
▽ More
A fourth-order compact scheme is proposed for a fourth-order subdiffusion equation with the first Dirichlet boundary conditions. The fourth-order problem is firstly reduced into a couple of spatially second-order system and we use an averaged operator to construct a fourth-order spatial approximation. This averaged operator is compact since it involves only two grid points for the derivative boundary conditions. The L1 formula on irregular mesh is considered for the Caputo fractional derivative, so we can resolve the initial singularity of solution by putting more grid points near the initial time. The stability and convergence are established by using three theoretical tools: a complementary discrete convolution kernel, a discrete fractional Gronwall inequality and an error convolution structure. Some numerical experiments are reported to demonstrate the accuracy and efficiency of our method.
△ Less
Submitted 2 July, 2019;
originally announced July 2019.
-
ARMIN: Towards a More Efficient and Light-weight Recurrent Memory Network
Authors:
Zhangheng Li,
Jia-Xing Zhong,
Jingjia Huang,
Tao Zhang,
Thomas Li,
Ge Li
Abstract:
In recent years, memory-augmented neural networks(MANNs) have shown promising power to enhance the memory ability of neural networks for sequential processing tasks. However, previous MANNs suffer from complex memory addressing mechanism, making them relatively hard to train and causing computational overheads. Moreover, many of them reuse the classical RNN structure such as LSTM for memory proces…
▽ More
In recent years, memory-augmented neural networks(MANNs) have shown promising power to enhance the memory ability of neural networks for sequential processing tasks. However, previous MANNs suffer from complex memory addressing mechanism, making them relatively hard to train and causing computational overheads. Moreover, many of them reuse the classical RNN structure such as LSTM for memory processing, causing inefficient exploitations of memory information. In this paper, we introduce a novel MANN, the Auto-addressing and Recurrent Memory Integrating Network (ARMIN) to address these issues. The ARMIN only utilizes hidden state ht for automatic memory addressing, and uses a novel RNN cell for refined integration of memory information. Empirical results on a variety of experiments demonstrate that the ARMIN is more light-weight and efficient compared to existing memory networks. Moreover, we demonstrate that the ARMIN can achieve much lower computational overhead than vanilla LSTM while keeping similar performances. Codes are available on github.com/zoharli/armin.
△ Less
Submitted 28 June, 2019;
originally announced June 2019.
-
Anomalous vortex motion induced by asymmetric vorticity distribution in rapidly rotating thermal convection
Authors:
Shan-Shan Ding,
Kai Leong Chong,
Jun-Qiang Shi,
Guang-Yu Ding,
Hao-Yuan Lu,
Ke-Qing Xia,
Jin-Qiang Zhong
Abstract:
In rotating Rayleigh-Bénard convection, columnar vortices advect horizontally in a stochastic manner. When the centrifugal buoyancy is present the vortices exhibit radial motions that can be explained through a Langevin-type stochastic model. Surprisingly, anomalous outward motion of cyclones is observed in a centrifugation-dominant flow regime, which is contrary to the well-known centrifugal effe…
▽ More
In rotating Rayleigh-Bénard convection, columnar vortices advect horizontally in a stochastic manner. When the centrifugal buoyancy is present the vortices exhibit radial motions that can be explained through a Langevin-type stochastic model. Surprisingly, anomalous outward motion of cyclones is observed in a centrifugation-dominant flow regime, which is contrary to the well-known centrifugal effect. We interpret this phenomenon as a symmetry-breaking of both the population and vorticity magnitude of the vortices brought about by the centrifugal buoyancy. Consequently, the cyclones submit to the collective vortex motion dominated by the strong anticyclones. Our study provides new understanding of vortex motions that are widely present in many natural systems.
△ Less
Submitted 9 July, 2019; v1 submitted 4 June, 2019;
originally announced June 2019.
-
Robust diffusion parametric mapping of motion-corrupted data with a three-dimensional convolutional neural network
Authors:
Ting Gong,
Qiqi Tong,
Hongjian He,
Zhiwei Li,
Jianhui Zhong
Abstract:
Head motion is inevitable in the acquisition of diffusion-weighted images, especially for certain motion-prone subjects and for data gathering of advanced diffusion models with prolonged scan times. Deficient accuracy of motion correction cause deterioration in the quality of diffusion model reconstruction, thus affecting the derived measures. This results in either loss of data, or introducing bi…
▽ More
Head motion is inevitable in the acquisition of diffusion-weighted images, especially for certain motion-prone subjects and for data gathering of advanced diffusion models with prolonged scan times. Deficient accuracy of motion correction cause deterioration in the quality of diffusion model reconstruction, thus affecting the derived measures. This results in either loss of data, or introducing bias in outcomes from data of different motion levels, or both. Hence minimizing motion effects and reutilizing motion-contaminated data becomes vital to quantitative studies. We have previously developed a 3-dimensional hierarchical convolution neural network (3D H-CNN) for robust diffusion kurtosis mapping from under-sampled data. In this study, we propose to extend this method to motion-contaminated data for robust recovery of diffusion model-derived measures with a process of motion assessment and corrupted volume rejection. We validate the proposed pipeline in two in-vivo datasets. Results from the first dataset of individual subjects show that all the diffusion tensor and kurtosis tensor-derived measures from the new pipeline are minimally sensitive to motion effects, and are comparable to the motion-free reference with as few as eight volumes retained from the motion-contaminated data. Results from the second dataset of a group of children with attention deficit hyperactivity disorder demonstrate the ability of our approach in ameliorating spurious group differences due to head motion. This method shows great potential for exploiting some valuable but motion-corrupted DWI data which are likely to be discarded otherwise, and applying to data with different motion level thus improving their utilization and statistic power.
△ Less
Submitted 30 May, 2019;
originally announced May 2019.