-
Fluctuations-Induced Quantum Radiation and Reaction from an Atom in a Squeezed Quantum Field
Authors:
Matthew Bravo,
Jen-Tsung Hsiang,
Bei-Lok Hu
Abstract:
In this third of a series on quantum radiation, we explore the feasibility of using the memories kept in a quantum field to decipher certain information about the early universe. As a model study, we let a massless quantum field be subjected to a parametric process for a finite time interval such that the mode frequency of the field transits from one constant value to another. This configuration m…
▽ More
In this third of a series on quantum radiation, we explore the feasibility of using the memories kept in a quantum field to decipher certain information about the early universe. As a model study, we let a massless quantum field be subjected to a parametric process for a finite time interval such that the mode frequency of the field transits from one constant value to another. This configuration mimics a statically-bounded universe, but not a continuously evolving one. The field squeezed by this process should contain information of the process itself. If an atom is coupled to the field after the parametric process, its response will depend on the squeezing, and any quantum radiation emitted by the atom will carry this information away so that an observer at a much later time may still identify it. Our analyses show that 1) a remote observer cannot measure the generated squeezing via the radiation energy flux from the atom because the net radiation energy flux is canceled. However, 2) there is a chance to identify squeezing by measuring the constant radiation energy density at late times. The only restriction is that this energy density is of the near-field nature. The second part of this paper focuses on 3) the dependence of squeezing on the functional form of the parametric process. Via several examples we demonstrate that the behavior of squeezing reflect essential properties of the parametric process. In fact, striking features may show up in complicated processes involving various scales. These analyses allow us to establish the connection between properties of a squeezed quantum field and the parametric process which does the squeezing. Therefore, 4) one can construct templates to reconstitute the unknown parametric processes from the data of measurable quantities subjected to squeezing. In a sequel paper these results will be applied to a study of quantum radiations in cosmology.
△ Less
Submitted 12 March, 2023;
originally announced March 2023.
-
Forecasts of CMB lensing reconstruction of AliCPT-1 from the foreground cleaned polarization data
Authors:
Jiakang Han,
Bin Hu,
Shamik Ghosh,
Siyu Li,
Jiazheng Dou,
Jacques Delabrouille,
Jing Jin,
Hong Li,
Yang Liu,
Mathieu Remazeilles,
Wen Zhao,
Pengjie Zhang,
Zheng-Wei Li,
Cong-Zhan Liu,
Yong-jie Zhang,
Chao-Lin Kuo,
Xinmin Zhang
Abstract:
Cosmic microwave background radiation (CMB) observations are unavoidably contaminated by emission from various extra-galactic foregrounds, which must be removed to obtain reliable measurements of the cosmological signal. In this paper, we demonstrate CMB lensing reconstruction in AliCPT-1 after foreground removal, combine the two bands of AliCPT-1 (90 and 150~GHz) with Planck HFI bands (100, 143,…
▽ More
Cosmic microwave background radiation (CMB) observations are unavoidably contaminated by emission from various extra-galactic foregrounds, which must be removed to obtain reliable measurements of the cosmological signal. In this paper, we demonstrate CMB lensing reconstruction in AliCPT-1 after foreground removal, combine the two bands of AliCPT-1 (90 and 150~GHz) with Planck HFI bands (100, 143, 217 and 353~GHz) and with the WMAP-K band (23~GHz). In order to balance contamination by instrumental noise and foreground residual bias, we adopt the Needlet Internal Linear Combination (NILC) method to clean the E-map and the constrained Internal Linear Combination (cILC) method to clean the B-map. The latter utilizes additional constraints on average frequency scaling of the dust and synchrotron to remove foregrounds at the expense of somewhat noisier maps. Assuming 4 modules observing 1 season from simulation data, the resulting effective residual noise in E- and B-map are roughly $15~μ{\rm K}\cdot{\rm arcmin}$ and $25~μ{\rm K}\cdot{\rm arcmin}$, respectively. As a result, the CMB lensing reconstruction signal-to-noise ratio (SNR) from polarization data is about SNR$\,\approx\,$4.5. This lensing reconstruction capability is comparable to that of other stage-III small aperture millimeter CMB telescopes.
△ Less
Submitted 10 March, 2023;
originally announced March 2023.
-
The JUNO experiment Top Tracker
Authors:
JUNO Collaboration,
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Abid Aleem,
Tsagkarakis Alexandros,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Weidong Bai,
Nikita Balashov,
Wander Baldini,
Andrea Barresi,
Davide Basilico,
Eric Baussan,
Marco Bellato
, et al. (592 additional authors not shown)
Abstract:
The main task of the Top Tracker detector of the neutrino reactor experiment Jiangmen Underground Neutrino Observatory (JUNO) is to reconstruct and extrapolate atmospheric muon tracks down to the central detector. This muon tracker will help to evaluate the contribution of the cosmogenic background to the signal. The Top Tracker is located above JUNO's water Cherenkov Detector and Central Detector…
▽ More
The main task of the Top Tracker detector of the neutrino reactor experiment Jiangmen Underground Neutrino Observatory (JUNO) is to reconstruct and extrapolate atmospheric muon tracks down to the central detector. This muon tracker will help to evaluate the contribution of the cosmogenic background to the signal. The Top Tracker is located above JUNO's water Cherenkov Detector and Central Detector, covering about 60% of the surface above them. The JUNO Top Tracker is constituted by the decommissioned OPERA experiment Target Tracker modules. The technology used consists in walls of two planes of plastic scintillator strips, one per transverse direction. Wavelength shifting fibres collect the light signal emitted by the scintillator strips and guide it to both ends where it is read by multianode photomultiplier tubes. Compared to the OPERA Target Tracker, the JUNO Top Tracker uses new electronics able to cope with the high rate produced by the high rock radioactivity compared to the one in Gran Sasso underground laboratory. This paper will present the new electronics and mechanical structure developed for the Top Tracker of JUNO along with its expected performance based on the current detector simulation.
△ Less
Submitted 9 March, 2023;
originally announced March 2023.
-
JUNO sensitivity to $^7$Be, $pep$, and CNO solar neutrinos
Authors:
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Abid Aleem,
Tsagkarakis Alexandros,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Weidong Bai,
Nikita Balashov,
Wander Baldini,
Andrea Barresi,
Davide Basilico,
Eric Baussan,
Marco Bellato,
Marco Beretta
, et al. (592 additional authors not shown)
Abstract:
The Jiangmen Underground Neutrino Observatory (JUNO), the first multi-kton liquid scintillator detector, which is under construction in China, will have a unique potential to perform a real-time measurement of solar neutrinos well below the few MeV threshold typical for Water Cherenkov detectors. JUNO's large target mass and excellent energy resolution are prerequisites for reaching unprecedented…
▽ More
The Jiangmen Underground Neutrino Observatory (JUNO), the first multi-kton liquid scintillator detector, which is under construction in China, will have a unique potential to perform a real-time measurement of solar neutrinos well below the few MeV threshold typical for Water Cherenkov detectors. JUNO's large target mass and excellent energy resolution are prerequisites for reaching unprecedented levels of precision. In this paper, we provide estimation of the JUNO sensitivity to 7Be, pep, and CNO solar neutrinos that can be obtained via a spectral analysis above the 0.45 MeV threshold. This study is performed assuming different scenarios of the liquid scintillator radiopurity, ranging from the most opti mistic one corresponding to the radiopurity levels obtained by the Borexino experiment, up to the minimum requirements needed to perform the neutrino mass ordering determination with reactor antineutrinos - the main goal of JUNO. Our study shows that in most scenarios, JUNO will be able to improve the current best measurements on 7Be, pep, and CNO solar neutrino fluxes. We also perform a study on the JUNO capability to detect periodical time variations in the solar neutrino flux, such as the day-night modulation induced by neutrino flavor regeneration in Earth, and the modulations induced by temperature changes driven by helioseismic waves.
△ Less
Submitted 7 March, 2023;
originally announced March 2023.
-
A Unified Algebraic Perspective on Lipschitz Neural Networks
Authors:
Alexandre Araujo,
Aaron Havens,
Blaise Delattre,
Alexandre Allauzen,
Bin Hu
Abstract:
Important research efforts have focused on the design and training of neural networks with a controlled Lipschitz constant. The goal is to increase and sometimes guarantee the robustness against adversarial attacks. Recent promising techniques draw inspirations from different backgrounds to design 1-Lipschitz neural networks, just to name a few: convex potential layers derive from the discretizati…
▽ More
Important research efforts have focused on the design and training of neural networks with a controlled Lipschitz constant. The goal is to increase and sometimes guarantee the robustness against adversarial attacks. Recent promising techniques draw inspirations from different backgrounds to design 1-Lipschitz neural networks, just to name a few: convex potential layers derive from the discretization of continuous dynamical systems, Almost-Orthogonal-Layer proposes a tailored method for matrix rescaling. However, it is today important to consider the recent and promising contributions in the field under a common theoretical lens to better design new and improved layers. This paper introduces a novel algebraic perspective unifying various types of 1-Lipschitz neural networks, including the ones previously mentioned, along with methods based on orthogonality and spectral methods. Interestingly, we show that many existing techniques can be derived and generalized via finding analytical solutions of a common semidefinite programming (SDP) condition. We also prove that AOL biases the scaled weight to the ones which are close to the set of orthogonal matrices in a certain mathematical manner. Moreover, our algebraic condition, combined with the Gershgorin circle theorem, readily leads to new and diverse parameterizations for 1-Lipschitz network layers. Our approach, called SDP-based Lipschitz Layers (SLL), allows us to design non-trivial yet efficient generalization of convex potential layers. Finally, the comprehensive set of experiments on image classification shows that SLLs outperform previous approaches on certified robust accuracy. Code is available at https://github.com/araujoalexandre/Lipschitz-SLL-Networks.
△ Less
Submitted 26 October, 2023; v1 submitted 6 March, 2023;
originally announced March 2023.
-
Bounds for the Tracking Error and Dynamic Regret of Inexact Online Optimization Methods: A General Analysis via Sequential Semidefinite Programs
Authors:
Usman Syed,
Emiliano Dall'Anese,
Bin Hu
Abstract:
In this paper, we develop a unified framework for analyzing the tracking error and dynamic regret of inexact online optimization methods under a variety of settings. Specifically, we leverage the quadratic constraint approach from control theory to formulate sequential semidefinite programs (SDPs) whose feasible points naturally correspond to tracking error bounds of various inexact online optimiz…
▽ More
In this paper, we develop a unified framework for analyzing the tracking error and dynamic regret of inexact online optimization methods under a variety of settings. Specifically, we leverage the quadratic constraint approach from control theory to formulate sequential semidefinite programs (SDPs) whose feasible points naturally correspond to tracking error bounds of various inexact online optimization methods including the inexact online gradient descent (OGD) method, the online gradient descent-ascent method, the online stochastic gradient method, and the inexact proximal online gradient method. We provide exact analytical solutions for our proposed sequential SDPs, and obtain fine-grained tracking error bounds for the online algorithms studied in this paper. We also provide a simple routine to convert the obtained tracking error bounds into dynamic regret bounds. The main novelty of our analysis is that we derive exact analytical solutions for our proposed sequential SDPs under various inexact oracle assumptions in a unified manner.
△ Less
Submitted 1 March, 2023;
originally announced March 2023.
-
Multi-Feature Integration for Perception-Dependent Examination-Bias Estimation
Authors:
Xiaoshu Chen,
Xiangsheng Li,
Kunliang Wei,
Bin Hu,
Lei Jiang,
Zeqian Huang,
Zhanhui Kang
Abstract:
Eliminating examination bias accurately is pivotal to apply click-through data to train an unbiased ranking model. However, most examination-bias estimators are limited to the hypothesis of Position-Based Model (PBM), which supposes that the calculation of examination bias only depends on the rank of the document. Recently, although some works introduce information such as clicks in the same query…
▽ More
Eliminating examination bias accurately is pivotal to apply click-through data to train an unbiased ranking model. However, most examination-bias estimators are limited to the hypothesis of Position-Based Model (PBM), which supposes that the calculation of examination bias only depends on the rank of the document. Recently, although some works introduce information such as clicks in the same query list and contextual information when calculating the examination bias, they still do not model the impact of document representation on search engine result pages (SERPs) that seriously affects one's perception of document relevance to a query when examining. Therefore, we propose a Multi-Feature Integration Model (MFIM) where the examination bias depends on the representation of document except the rank of it. Furthermore, we mine a key factor slipoff counts that can indirectly reflects the influence of all perception-bias factors. Real world experiments on Baidu-ULTR dataset demonstrate the superior effectiveness and robustness of the new approach. The source code is available at \href{https://github.com/lixsh6/Tencent_wsdm_cup2023/tree/main/pytorch_unbias}{https://github.com/lixsh6/Tencent\_wsdm\_cup2023}
△ Less
Submitted 27 February, 2023;
originally announced February 2023.
-
Pretraining De-Biased Language Model with Large-scale Click Logs for Document Ranking
Authors:
Xiangsheng Li,
Xiaoshu Chen,
Kunliang Wei,
Bin Hu,
Lei Jiang,
Zeqian Huang,
Zhanhui Kang
Abstract:
Pre-trained language models have achieved great success in various large-scale information retrieval tasks. However, most of pretraining tasks are based on counterfeit retrieval data where the query produced by the tailored rule is assumed as the user's issued query on the given document or passage. Therefore, we explore to use large-scale click logs to pretrain a language model instead of replyin…
▽ More
Pre-trained language models have achieved great success in various large-scale information retrieval tasks. However, most of pretraining tasks are based on counterfeit retrieval data where the query produced by the tailored rule is assumed as the user's issued query on the given document or passage. Therefore, we explore to use large-scale click logs to pretrain a language model instead of replying on the simulated queries. Specifically, we propose to use user behavior features to pretrain a debiased language model for document ranking. Extensive experiments on Baidu desensitization click logs validate the effectiveness of our method. Our team on WSDM Cup 2023 Pre-training for Web Search won the 1st place with a Discounted Cumulative Gain @ 10 (DCG@10) score of 12.16525 on the final leaderboard.
△ Less
Submitted 26 February, 2023;
originally announced February 2023.
-
Investigating galactic double white dwarfs for sub-milliHz gravitational wave mission ASTROD-GW
Authors:
Gang Wang,
Zhen Yan,
Bin Hu,
Wei-Tou Ni
Abstract:
A large number of galactic binary systems emit gravitational waves (GW) continuously with frequencies below $\sim$10 mHz. The LISA mission could identify tens of thousands of binaries over years of observation and will be subject to the confusion noise around 1 mHz yielded by the unresolved sources. Beyond LISA, there are several missions have been proposed to observe GWs in the sub-mHz range wher…
▽ More
A large number of galactic binary systems emit gravitational waves (GW) continuously with frequencies below $\sim$10 mHz. The LISA mission could identify tens of thousands of binaries over years of observation and will be subject to the confusion noise around 1 mHz yielded by the unresolved sources. Beyond LISA, there are several missions have been proposed to observe GWs in the sub-mHz range where the galactic foreground is expected to be overwhelming the instrumental noises. In this study, we investigate the detectability of sub-mHz GW missions to detect the galactic double white dwarf (DWD) binaries and evaluate the confusion noise produced by the undistinguished DWDs. This confusion noise could also be viewed as a stochastic GW foreground and be effectively observed in the sub-mHz band. The parameter determinations for the modeled foreground are examined by employing different detector sensitivities and population models. By assuming the determined foregrounds could be subtracted from the data, we evaluate the residuals which are expected to have power spectral densities two orders of magnitude lower than the originals data.
△ Less
Submitted 25 May, 2023; v1 submitted 15 February, 2023;
originally announced February 2023.
-
Quadrupole and octupole states in $^{152}$Sm using the proton-neutron interacting boson model
Authors:
Bao-Yue Hu,
Yu Zhang,
Gui-Xiu Na,
Sheng-Nan Wang,
Wei Teng
Abstract:
A scheme of solving the proton-neutron interacting boson model (IBM-2) in terms of the SU(3) basis is introduced, by which the IBM-2 coupled with an octupole boson is applied to describe the low-energy structure of the critical point nucleus, $^{152}$Sm. The results indicate that the spectral properties of both the positive-parity bands and negative-parity bands in this nucleus can be well capture…
▽ More
A scheme of solving the proton-neutron interacting boson model (IBM-2) in terms of the SU(3) basis is introduced, by which the IBM-2 coupled with an octupole boson is applied to describe the low-energy structure of the critical point nucleus, $^{152}$Sm. The results indicate that the spectral properties of both the positive-parity bands and negative-parity bands in this nucleus can be well captured by the IBM-2 calculations through a simple Hamiltonian, thus providing an example of the IBM-2 in a unified description of quadrupole and octupole states in a transitional system. In addition, a statistical analysis of the low-spin states in the model is also provided.
△ Less
Submitted 15 February, 2023;
originally announced February 2023.
-
Towards a Field-Theory based Relativistic Quantum Information
Authors:
Charis Anastopoulos,
Bei-Lok Hu,
Konstantina Savvidou
Abstract:
We present our program for the development of quantum informational concepts in relativistic systems in terms of the unequal-time correlation functions of quantum fields. We employ two formalisms that provide the basis for further developments. (i) The Quantum Temporal Probabilities (QTP) Method for quantum field measurements and (ii) the Closed- Time-Path (CTP) formalism for causal time evolution…
▽ More
We present our program for the development of quantum informational concepts in relativistic systems in terms of the unequal-time correlation functions of quantum fields. We employ two formalisms that provide the basis for further developments. (i) The Quantum Temporal Probabilities (QTP) Method for quantum field measurements and (ii) the Closed- Time-Path (CTP) formalism for causal time evolutions. We present the main ideas of QTP and show how it relates to the CTP formalism, allowing one to express concepts of measurement theory in terms of path-integrals. We also present many links of our program to non-equilibrium quantum field theories. Details can be found in a recent paper by the authors (arxiv:2208.03696).
△ Less
Submitted 11 February, 2023;
originally announced February 2023.
-
Uniform tensor clustering by jointly exploring sample affinities of various orders
Authors:
Hongmin Cai,
Fei Qi,
Junyu Li,
Yu Hu,
Yue Zhang,
Yiu-ming Cheung,
Bin Hu
Abstract:
Conventional clustering methods based on pairwise affinity usually suffer from the concentration effect while processing huge dimensional features yet low sample sizes data, resulting in inaccuracy to encode the sample proximity and suboptimal performance in clustering. To address this issue, we propose a unified tensor clustering method (UTC) that characterizes sample proximity using multiple sam…
▽ More
Conventional clustering methods based on pairwise affinity usually suffer from the concentration effect while processing huge dimensional features yet low sample sizes data, resulting in inaccuracy to encode the sample proximity and suboptimal performance in clustering. To address this issue, we propose a unified tensor clustering method (UTC) that characterizes sample proximity using multiple samples' affinity, thereby supplementing rich spatial sample distributions to boost clustering. Specifically, we find that the triadic tensor affinity can be constructed via the Khari-Rao product of two affinity matrices. Furthermore, our early work shows that the fourth-order tensor affinity is defined by the Kronecker product. Therefore, we utilize arithmetical products, Khatri-Rao and Kronecker products, to mathematically integrate different orders of affinity into a unified tensor clustering framework. Thus, the UTC jointly learns a joint low-dimensional embedding to combine various orders. Finally, a numerical scheme is designed to solve the problem. Experiments on synthetic datasets and real-world datasets demonstrate that 1) the usage of high-order tensor affinity could provide a supplementary characterization of sample proximity to the popular affinity matrix; 2) the proposed method of UTC is affirmed to enhance clustering by exploiting different order affinities when processing high-dimensional data.
△ Less
Submitted 3 February, 2023;
originally announced February 2023.
-
Learning the Kalman Filter with Fine-Grained Sample Complexity
Authors:
Xiangyuan Zhang,
Bin Hu,
Tamer Başar
Abstract:
We develop the first end-to-end sample complexity of model-free policy gradient (PG) methods in discrete-time infinite-horizon Kalman filtering. Specifically, we introduce the receding-horizon policy gradient (RHPG-KF) framework and demonstrate $\tilde{\mathcal{O}}(ε^{-2})$ sample complexity for RHPG-KF in learning a stabilizing filter that is $ε$-close to the optimal Kalman filter. Notably, the p…
▽ More
We develop the first end-to-end sample complexity of model-free policy gradient (PG) methods in discrete-time infinite-horizon Kalman filtering. Specifically, we introduce the receding-horizon policy gradient (RHPG-KF) framework and demonstrate $\tilde{\mathcal{O}}(ε^{-2})$ sample complexity for RHPG-KF in learning a stabilizing filter that is $ε$-close to the optimal Kalman filter. Notably, the proposed RHPG-KF framework does not require the system to be open-loop stable nor assume any prior knowledge of a stabilizing filter. Our results shed light on applying model-free PG methods to control a linear dynamical system where the state measurements could be corrupted by statistical noises and other (possibly adversarial) disturbances.
△ Less
Submitted 27 February, 2023; v1 submitted 29 January, 2023;
originally announced January 2023.
-
RDesign: Hierarchical Data-efficient Representation Learning for Tertiary Structure-based RNA Design
Authors:
Cheng Tan,
Yijie Zhang,
Zhangyang Gao,
Bozhen Hu,
Siyuan Li,
Zicheng Liu,
Stan Z. Li
Abstract:
While artificial intelligence has made remarkable strides in revealing the relationship between biological macromolecules' primary sequence and tertiary structure, designing RNA sequences based on specified tertiary structures remains challenging. Though existing approaches in protein design have thoroughly explored structure-to-sequence dependencies in proteins, RNA design still confronts difficu…
▽ More
While artificial intelligence has made remarkable strides in revealing the relationship between biological macromolecules' primary sequence and tertiary structure, designing RNA sequences based on specified tertiary structures remains challenging. Though existing approaches in protein design have thoroughly explored structure-to-sequence dependencies in proteins, RNA design still confronts difficulties due to structural complexity and data scarcity. Moreover, direct transplantation of protein design methodologies into RNA design fails to achieve satisfactory outcomes although sharing similar structural components. In this study, we aim to systematically construct a data-driven RNA design pipeline. We crafted a large, well-curated benchmark dataset and designed a comprehensive structural modeling approach to represent the complex RNA tertiary structure. More importantly, we proposed a hierarchical data-efficient representation learning framework that learns structural representations through contrastive learning at both cluster-level and sample-level to fully leverage the limited data. By constraining data representations within a limited hyperspherical space, the intrinsic relationships between data points could be explicitly imposed. Moreover, we incorporated extracted secondary structures with base pairs as prior knowledge to facilitate the RNA design process. Extensive experiments demonstrate the effectiveness of our proposed method, providing a reliable baseline for future RNA design tasks. The source code and benchmark dataset are available at https://github.com/A4Bio/RDesign.
△ Less
Submitted 6 March, 2024; v1 submitted 25 January, 2023;
originally announced January 2023.
-
Graviton noise on tidal forces and geodesic congruences
Authors:
Hing-Tong Cho,
Bei-Lok Hu
Abstract:
In this work we continue with our recent study, using the Feynman-Vernon worldline influence action and the Schwinger-Keldysh closed-time-path formalism, to consider the effects of quantum noise of gravitons on the motion of point masses. This effect can be regarded as due to a stochastic tensorial force whose correlator is given by the graviton noise kernel associated with the Hadamard function o…
▽ More
In this work we continue with our recent study, using the Feynman-Vernon worldline influence action and the Schwinger-Keldysh closed-time-path formalism, to consider the effects of quantum noise of gravitons on the motion of point masses. This effect can be regarded as due to a stochastic tensorial force whose correlator is given by the graviton noise kernel associated with the Hadamard function of the quantized gravitational field. Solving the Langevin equation governing the motion of the separation of two masses, the fluctuations of the separation due to the graviton noise can be obtained for various states of the quantum field. Since this force has the stretching and compressing effects like the tidal force, we can view it as one. We therefore derive the expressions for, and estimate the magnitude of, this tidal force for the cases of the Minkowski and the squeezed vacua. The influence of this force on the evolution of the geodesic congruence through the Raychaudhuri equation is then studied and the effects of quantum graviton noise on the shear and rotation tensors presented.
△ Less
Submitted 16 January, 2023;
originally announced January 2023.
-
Microlensing sheds light on the detection of strong lensing gravitational waves
Authors:
Xikai Shan,
Xuechun Chen,
Bin Hu,
Rong-Gen Cai
Abstract:
The strong lensing gravitational wave (SLGW) is a promising transient phenomenon that encompasses a wealth of physics. However, the long-wave nature of gravitational waves (GW) poses a significant challenge in identification of its host galaxy. To tackle this challenge, we propose a multi-messenger method triggered by the wave optics effect of microlensing. The microlensing diffraction/interferenc…
▽ More
The strong lensing gravitational wave (SLGW) is a promising transient phenomenon that encompasses a wealth of physics. However, the long-wave nature of gravitational waves (GW) poses a significant challenge in identification of its host galaxy. To tackle this challenge, we propose a multi-messenger method triggered by the wave optics effect of microlensing. The microlensing diffraction/interference fringes introduce frequency-dependent fluctuations in the waveform. Our method has three steps. First, we reconstruct the GW waveforms by using the template-independent and template-dependent methods. The mismatch of two reconstructions serves as an indicator of SLGWs. This step can identify $10\%$ SLGWs. Second, we pair the SLGW's multi-signals by employing the sky localization overlapping. Third, we find the host galaxy by requiring the consistency of time delays between Galaxy-Galaxy strong lensing (GGSL) and SLGW. With the help of CSST and JWST, one can identify $1$ quadruple-image system in roughly $3$ years.
△ Less
Submitted 29 October, 2023; v1 submitted 15 January, 2023;
originally announced January 2023.
-
Ab initio descriptions of $A=16$ mirror nuclei with resonance and continuum coupling
Authors:
S. Zhang,
F. R. Xu,
J. G. Li,
B. S. Hu,
Z. H. Cheng,
N. Michel,
Y. Z. Ma,
Q. Yuan,
Y. H. Zhang
Abstract:
We have used an {\it ab initio} Gamow shell model to study the isospin symmetry breaking in the $A=16$ mirror nuclei of $^{16}$F, $^{16}$N, $^{16}$Ne and $^{16}$C. Starting from a chiral interaction with two-nucleon force (2NF) at N$^3$LO and three-nucleon force (3NF) at N$^2$LO, a complex-momentum ${\it psd}$-shell Hamiltonian was constructed by employing the many-body perturbation theory in the…
▽ More
We have used an {\it ab initio} Gamow shell model to study the isospin symmetry breaking in the $A=16$ mirror nuclei of $^{16}$F, $^{16}$N, $^{16}$Ne and $^{16}$C. Starting from a chiral interaction with two-nucleon force (2NF) at N$^3$LO and three-nucleon force (3NF) at N$^2$LO, a complex-momentum ${\it psd}$-shell Hamiltonian was constructed by employing the many-body perturbation theory in the Gamow Hartree-Fock basis which includes bound, resonant and continuum states self-consistently. Such an elaborated {\it ab initio} Gamow shell model with both continuum coupling and 3NF included can properly treat the many-body correlations of weakly bound and unbound nuclei. The mirror partners of $^{16}$F and $^{16}$N exhibit different level orders in their excitation spectra, which can be well explained by the inclusion of 3NF in the calculation. The isospin asymmetry between the mirror partners $^{16}$Ne and $^{16}$C was studied in detail by insight into their configuration structures. The interplay between 3NF and the continuum coupling is discussed in the weakly bound and unbound nuclear states.
△ Less
Submitted 27 December, 2023; v1 submitted 5 January, 2023;
originally announced January 2023.
-
Thermo-optic phase shifter based on hydrogen-doped indium oxide microheater
Authors:
Weiyu Tong,
Erqi Yang,
Yu Pang,
Haobo Yang,
Xin Qian,
Ronggui Yang,
Bin Hu,
Jianji Dong,
Xinliang Zhang
Abstract:
Thermo-optic (TO) phase shifters are very fundamental units in large-scale active silicon photonic integrated circuits (PICs). However, due to the limitation of microheater materials with a trade-off between heating efficiency and absorption loss, designs reported so far typically suffer from slow response time, high power consumption, low yields, and so on. Here, we demonstrate an energy-efficien…
▽ More
Thermo-optic (TO) phase shifters are very fundamental units in large-scale active silicon photonic integrated circuits (PICs). However, due to the limitation of microheater materials with a trade-off between heating efficiency and absorption loss, designs reported so far typically suffer from slow response time, high power consumption, low yields, and so on. Here, we demonstrate an energy-efficient, fast-response, and low-loss TO phase shifter by introducing hydrogen-doped indium oxide (IHO) films as microheater, and the optimized electron concentration with enhanced mobility endows the IHO high conductivity as well as high near-infrared (NIR) transparency, which allow it to directly contact the silicon waveguide without any insulating layer for efficient tuning and fast response. The TO phase shifter achieves a sub-microsecond response time (970 ns/980 ns) with a π phase shift power consumption of 9.6 mW. And the insertion loss introduced by the IHO microheater is ~ 0.5 dB. The proposed IHO-based microheaters with compatible processing technology illustrate the great potential of such material in the application of large-scale silicon PICs.
△ Less
Submitted 2 January, 2023;
originally announced January 2023.
-
Hot entanglement? -- Parametrically coupled quantum oscillators in two heat baths: instability, squeezing and driving
Authors:
Onat Arısoy,
Jen-Tsung Hsiang,
Bei-Lok Hu
Abstract:
Entanglement being a foundational cornerstone of quantum sciences and the primary resource in quantum information processing, understanding its dynamical evolution in realistic conditions is essential. Unfortunately, numerous model studies show that degradation of entanglement from a quantum system's environment, especially thermal noise, is almost unavoidable. Thus the appellation `hot entangleme…
▽ More
Entanglement being a foundational cornerstone of quantum sciences and the primary resource in quantum information processing, understanding its dynamical evolution in realistic conditions is essential. Unfortunately, numerous model studies show that degradation of entanglement from a quantum system's environment, especially thermal noise, is almost unavoidable. Thus the appellation `hot entanglement' appears like a contradiction, until Galve et al [Phys. Rev. Lett. \textbf{105} 180501 (2010)] announced that entanglement can be kept at high temperatures if one considers a quantum system with time-dependent coupling between the two parties, each interacting with its individual bath. With the goal of understanding the sustenance of entanglement at high temperatures, working with the same model and set up as Galve et al, namely, parametrically-driven coupled harmonic oscillators interacting with their own Markovian baths, this work probes into the feasibility of `hot entanglement' from three aspects listed in the subtitle. Our findings show that 1) hot entanglement functions only in the unstable regimes, 2) instability is a necessary but not sufficient condition, and 3) the power intake required by the drive operating in the unstable regime to sustain entanglement increases exponentially. The last factor indicates that hot entanglement under this modeling is theoretically untenable and its actual implementation likely unattainable.
△ Less
Submitted 31 December, 2022;
originally announced January 2023.
-
White dwarf binary modulation can help stochastic gravitational wave background search
Authors:
Shijie Lin,
Bin Hu,
Xue-Hao Zhang,
Yu-Xiao Liu
Abstract:
For the stochastic gravitational wave backgrounds (SGWBs) search centred at the milli-Hz band, the galactic foreground produced by white dwarf binaries (WDBs) within the Milky Way contaminates the extra-galactic signal severely. Because of the anisotropic distribution pattern of the WDBs and the motion of the spaceborne gravitational wave interferometer constellation, the time-domain data stream w…
▽ More
For the stochastic gravitational wave backgrounds (SGWBs) search centred at the milli-Hz band, the galactic foreground produced by white dwarf binaries (WDBs) within the Milky Way contaminates the extra-galactic signal severely. Because of the anisotropic distribution pattern of the WDBs and the motion of the spaceborne gravitational wave interferometer constellation, the time-domain data stream will show an annual modulation. This property is fundamentally different from those of the SGWBs. In this Letter, we propose a new filtering method for the data vector based on the annual modulation phenomenon. We apply the resulted inverse variance filter to the LISA data challenge. The result shows that for the weaker SGWB signal, such as energy density $Ω_{\rm astro}=1\times10^{-12}$, the filtering method can enhance the posterior distribution peak prominently. For the stronger signal, such as $Ω_{\rm astro}=3\times10^{-12}$, the method can improve the Bayesian evidence from `substantial' to `strong' against null hypotheses. This method is model-independent and self-contained. It does not ask for other types of information besides the gravitational wave data.
△ Less
Submitted 9 May, 2023; v1 submitted 29 December, 2022;
originally announced December 2022.
-
Enhancing Multi-modal and Multi-hop Question Answering via Structured Knowledge and Unified Retrieval-Generation
Authors:
Qian Yang,
Qian Chen,
Wen Wang,
Baotian Hu,
Min Zhang
Abstract:
Multi-modal multi-hop question answering involves answering a question by reasoning over multiple input sources from different modalities. Existing methods often retrieve evidences separately and then use a language model to generate an answer based on the retrieved evidences, and thus do not adequately connect candidates and are unable to model the interdependent relations during retrieval. Moreo…
▽ More
Multi-modal multi-hop question answering involves answering a question by reasoning over multiple input sources from different modalities. Existing methods often retrieve evidences separately and then use a language model to generate an answer based on the retrieved evidences, and thus do not adequately connect candidates and are unable to model the interdependent relations during retrieval. Moreover, the pipelined approaches of retrieval and generation might result in poor generation performance when retrieval performance is low. To address these issues, we propose a Structured Knowledge and Unified Retrieval-Generation (SKURG) approach. SKURG employs an Entity-centered Fusion Encoder to align sources from different modalities using shared entities. It then uses a unified Retrieval-Generation Decoder to integrate intermediate retrieval results for answer generation and also adaptively determine the number of retrieval steps. Extensive experiments on two representative multi-modal multi-hop QA datasets MultimodalQA and WebQA demonstrate that SKURG outperforms the state-of-the-art models in both source retrieval and answer generation performance with fewer parameters. Our code is available at https://github.com/HITsz-TMG/SKURG.
△ Less
Submitted 6 August, 2023; v1 submitted 16 December, 2022;
originally announced December 2022.
-
JUNO Sensitivity on Proton Decay $p\to \barνK^+$ Searches
Authors:
JUNO Collaboration,
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Nikita Balashov,
Wander Baldini,
Andrea Barresi,
Davide Basilico,
Eric Baussan,
Marco Bellato,
Antonio Bergnoli,
Thilo Birkenfeld,
Sylvie Blin
, et al. (586 additional authors not shown)
Abstract:
The Jiangmen Underground Neutrino Observatory (JUNO) is a large liquid scintillator detector designed to explore many topics in fundamental physics. In this paper, the potential on searching for proton decay in $p\to \barνK^+$ mode with JUNO is investigated.The kaon and its decay particles feature a clear three-fold coincidence signature that results in a high efficiency for identification. Moreov…
▽ More
The Jiangmen Underground Neutrino Observatory (JUNO) is a large liquid scintillator detector designed to explore many topics in fundamental physics. In this paper, the potential on searching for proton decay in $p\to \barνK^+$ mode with JUNO is investigated.The kaon and its decay particles feature a clear three-fold coincidence signature that results in a high efficiency for identification. Moreover, the excellent energy resolution of JUNO permits to suppress the sizable background caused by other delayed signals. Based on these advantages, the detection efficiency for the proton decay via $p\to \barνK^+$ is 36.9% with a background level of 0.2 events after 10 years of data taking. The estimated sensitivity based on 200 kton-years exposure is $9.6 \times 10^{33}$ years, competitive with the current best limits on the proton lifetime in this channel.
△ Less
Submitted 26 October, 2023; v1 submitted 16 December, 2022;
originally announced December 2022.
-
Bifurcation analysis of a free boundary model of vascular tumor growth with a necrotic core and chemotaxis
Authors:
Min-Jhe Lu,
Wenrui Hao,
Bei Hu,
Shuwang Li
Abstract:
A considerable number of research works has been devoted to the study of tumor models. Several biophysical factors, such as cell proliferation, apoptosis, chemotaxis, angiogenesis and necrosis, have been discovered to have an impact on the complicated biological system of tumors. An indicator of the aggressiveness of tumor development is the instability of the shape of the tumor boundary. Complex…
▽ More
A considerable number of research works has been devoted to the study of tumor models. Several biophysical factors, such as cell proliferation, apoptosis, chemotaxis, angiogenesis and necrosis, have been discovered to have an impact on the complicated biological system of tumors. An indicator of the aggressiveness of tumor development is the instability of the shape of the tumor boundary. Complex patterns of tumor morphology have been explored by Lu, Min-Jhe et al. [Nonlinear simulation of vascular tumor growth with chemotaxis and the control of necrosis, Journal of Computational Physics 459 (2022): 111153]. In this paper, we continue to carry out a bifurcation analysis on such a vascular tumor model with a controlled necrotic core and chemotaxis. This bifurcation analysis, to the parameter of cell proliferation, is built on the explicit formulas of radially symmetric steady-state solutions. By perturbing the tumor free boundary and establishing rigorous estimates of the free boundary system, %applying the Hanzawa transformation, we prove the existence of the bifurcation branches with Crandall-Rabinowitz theorem. The parameter of chemotaxis is found to influence the monotonicity of the bifurcation point as the mode $l$ increases both theoretically and numerically.
△ Less
Submitted 13 December, 2022;
originally announced December 2022.
-
The Normalized Cross Density Functional: A Framework to Quantify Statistical Dependence for Random Processes
Authors:
Bo Hu,
Jose C. Principe
Abstract:
This paper presents a novel approach to measuring statistical dependence between two random processes (r.p.) using a positive-definite function called the Normalized Cross Density (NCD). NCD is derived directly from the probability density functions of two r.p. and constructs a data-dependent Hilbert space, the Normalized Cross-Density Hilbert Space (NCD-HS). By Mercer's Theorem, the NCD norm can…
▽ More
This paper presents a novel approach to measuring statistical dependence between two random processes (r.p.) using a positive-definite function called the Normalized Cross Density (NCD). NCD is derived directly from the probability density functions of two r.p. and constructs a data-dependent Hilbert space, the Normalized Cross-Density Hilbert Space (NCD-HS). By Mercer's Theorem, the NCD norm can be decomposed into its eigenspectrum, which we name the Multivariate Statistical Dependence (MSD) measure, and their sum, the Total Dependence Measure (TSD). Hence, the NCD-HS eigenfunctions serve as a novel embedded feature space, suitable for quantifying r.p. statistical dependence. In order to apply NCD directly to r.p. realizations, we introduce an architecture with two multiple-output neural networks, a cost function, and an algorithm named the Functional Maximal Correlation Algorithm (FMCA). With FMCA, the two networks learn concurrently by approximating each other's outputs, extending the Alternating Conditional Expectation (ACE) for multivariate functions. We mathematically prove that FMCA learns the dominant eigenvalues and eigenfunctions of NCD directly from realizations. Preliminary results with synthetic data and medium-sized image datasets corroborate the theory. Different strategies for applying NCD are proposed and discussed, demonstrating the method's versatility and stability beyond supervised learning. Specifically, when the two r.p. are high-dimensional real-world images and a white uniform noise process, FMCA learns factorial codes, i.e., the occurrence of a code guarantees that a specific training set image was present, which is important for feature learning.
△ Less
Submitted 20 February, 2024; v1 submitted 8 December, 2022;
originally announced December 2022.
-
Improving End-to-end Speech Translation by Leveraging Auxiliary Speech and Text Data
Authors:
Yuhao Zhang,
Chen Xu,
Bojie Hu,
Chunliang Zhang,
Tong Xiao,
Jingbo Zhu
Abstract:
We present a method for introducing a text encoder into pre-trained end-to-end speech translation systems. It enhances the ability of adapting one modality (i.e., source-language speech) to another (i.e., source-language text). Thus, the speech translation model can learn from both unlabeled and labeled data, especially when the source-language text data is abundant. Beyond this, we present a deno…
▽ More
We present a method for introducing a text encoder into pre-trained end-to-end speech translation systems. It enhances the ability of adapting one modality (i.e., source-language speech) to another (i.e., source-language text). Thus, the speech translation model can learn from both unlabeled and labeled data, especially when the source-language text data is abundant. Beyond this, we present a denoising method to build a robust text encoder that can deal with both normal and noisy text data. Our system sets new state-of-the-arts on the MuST-C En-De, En-Fr, and LibriSpeech En-Fr tasks.
△ Less
Submitted 4 December, 2022;
originally announced December 2022.
-
Protein Language Models and Structure Prediction: Connection and Progression
Authors:
Bozhen Hu,
Jun Xia,
Jiangbin Zheng,
Cheng Tan,
Yufei Huang,
Yongjie Xu,
Stan Z. Li
Abstract:
The prediction of protein structures from sequences is an important task for function prediction, drug design, and related biological processes understanding. Recent advances have proved the power of language models (LMs) in processing the protein sequence databases, which inherit the advantages of attention networks and capture useful information in learning representations for proteins. The past…
▽ More
The prediction of protein structures from sequences is an important task for function prediction, drug design, and related biological processes understanding. Recent advances have proved the power of language models (LMs) in processing the protein sequence databases, which inherit the advantages of attention networks and capture useful information in learning representations for proteins. The past two years have witnessed remarkable success in tertiary protein structure prediction (PSP), including evolution-based and single-sequence-based PSP. It seems that instead of using energy-based models and sampling procedures, protein language model (pLM)-based pipelines have emerged as mainstream paradigms in PSP. Despite the fruitful progress, the PSP community needs a systematic and up-to-date survey to help bridge the gap between LMs in the natural language processing (NLP) and PSP domains and introduce their methodologies, advancements and practical applications. To this end, in this paper, we first introduce the similarities between protein and human languages that allow LMs extended to pLMs, and applied to protein databases. Then, we systematically review recent advances in LMs and pLMs from the perspectives of network architectures, pre-training strategies, applications, and commonly-used protein databases. Next, different types of methods for PSP are discussed, particularly how the pLM-based architectures function in the process of protein folding. Finally, we identify challenges faced by the PSP community and foresee promising research directions along with the advances of pLMs. This survey aims to be a hands-on guide for researchers to understand PSP methods, develop pLMs and tackle challenging problems in this field for practical purposes.
△ Less
Submitted 29 November, 2022;
originally announced November 2022.
-
Intra-class Adaptive Augmentation with Neighbor Correction for Deep Metric Learning
Authors:
Zheren Fu,
Zhendong Mao,
Bo Hu,
An-An Liu,
Yongdong Zhang
Abstract:
Deep metric learning aims to learn an embedding space, where semantically similar samples are close together and dissimilar ones are repelled against. To explore more hard and informative training signals for augmentation and generalization, recent methods focus on generating synthetic samples to boost metric learning losses. However, these methods just use the deterministic and class-independent…
▽ More
Deep metric learning aims to learn an embedding space, where semantically similar samples are close together and dissimilar ones are repelled against. To explore more hard and informative training signals for augmentation and generalization, recent methods focus on generating synthetic samples to boost metric learning losses. However, these methods just use the deterministic and class-independent generations (e.g., simple linear interpolation), which only can cover the limited part of distribution spaces around original samples. They have overlooked the wide characteristic changes of different classes and can not model abundant intra-class variations for generations. Therefore, generated samples not only lack rich semantics within the certain class, but also might be noisy signals to disturb training. In this paper, we propose a novel intra-class adaptive augmentation (IAA) framework for deep metric learning. We reasonably estimate intra-class variations for every class and generate adaptive synthetic samples to support hard samples mining and boost metric learning losses. Further, for most datasets that have a few samples within the class, we propose the neighbor correction to revise the inaccurate estimations, according to our correlation discovery where similar classes generally have similar variation distributions. Extensive experiments on five benchmarks show our method significantly improves and outperforms the state-of-the-art methods on retrieval performances by 3%-6%. Our code is available at https://github.com/darkpromise98/IAA
△ Less
Submitted 29 November, 2022;
originally announced November 2022.
-
Precision measurement of reactor antineutrino oscillation at kilometer-scale baselines by Daya Bay
Authors:
Daya Bay collaboration,
F. P. An,
W. D. Bai,
A. B. Balantekin,
M. Bishai,
S. Blyth,
G. F. Cao,
J. Cao,
J. F. Chang,
Y. Chang,
H. S. Chen,
H. Y. Chen,
S. M. Chen,
Y. Chen,
Y. X. Chen,
Z. Y. Chen,
J. Cheng,
Z. K. Cheng,
J. J. Cherwinka,
M. C. Chu,
J. P. Cummings,
O. Dalager,
F. S. Deng,
Y. Y. Ding,
X. Y. Ding
, et al. (176 additional authors not shown)
Abstract:
We present a new determination of the smallest neutrino mixing angle $θ_{13}$ and the mass-squared difference $Δ{\rm m}^{2}_{32}$ using a final sample of $5.55 \times 10^{6}$ inverse beta-decay (IBD) candidates with the final-state neutron captured on gadolinium. This sample was selected from the complete data set obtained by the Daya Bay reactor neutrino experiment in 3158 days of operation. Comp…
▽ More
We present a new determination of the smallest neutrino mixing angle $θ_{13}$ and the mass-squared difference $Δ{\rm m}^{2}_{32}$ using a final sample of $5.55 \times 10^{6}$ inverse beta-decay (IBD) candidates with the final-state neutron captured on gadolinium. This sample was selected from the complete data set obtained by the Daya Bay reactor neutrino experiment in 3158 days of operation. Compared to the previous Daya Bay results, selection of IBD candidates has been optimized, energy calibration refined, and treatment of backgrounds further improved. The resulting oscillation parameters are ${\rm sin}^{2}2θ_{13} = 0.0851 \pm 0.0024$, $Δ{\rm m}^{2}_{32} = (2.466 \pm 0.060) \times 10^{-3}{\rm eV}^{2}$ for the normal mass ordering or $Δ{\rm m}^{2}_{32} = -(2.571 \pm 0.060) \times 10^{-3} {\rm eV}^{2}$ for the inverted mass ordering.
△ Less
Submitted 27 November, 2022;
originally announced November 2022.
-
One for All, All for One: Learning and Transferring User Embeddings for Cross-Domain Recommendation
Authors:
Chenglin Li,
Yuanzhen Xie,
Chenyun Yu,
Bo Hu,
Zang li,
Guoqiang Shu,
Xiaohu Qie,
Di Niu
Abstract:
Cross-domain recommendation is an important method to improve recommender system performance, especially when observations in target domains are sparse. However, most existing techniques focus on single-target or dual-target cross-domain recommendation (CDR) and are hard to be generalized to CDR with multiple target domains. In addition, the negative transfer problem is prevalent in CDR, where the…
▽ More
Cross-domain recommendation is an important method to improve recommender system performance, especially when observations in target domains are sparse. However, most existing techniques focus on single-target or dual-target cross-domain recommendation (CDR) and are hard to be generalized to CDR with multiple target domains. In addition, the negative transfer problem is prevalent in CDR, where the recommendation performance in a target domain may not always be enhanced by knowledge learned from a source domain, especially when the source domain has sparse data. In this study, we propose CAT-ART, a multi-target CDR method that learns to improve recommendations in all participating domains through representation learning and embedding transfer. Our method consists of two parts: a self-supervised Contrastive AuToencoder (CAT) framework to generate global user embeddings based on information from all participating domains, and an Attention-based Representation Transfer (ART) framework which transfers domain-specific user embeddings from other domains to assist with target domain recommendation. CAT-ART boosts the recommendation performance in any target domain through the combined use of the learned global user representation and knowledge transferred from other domains, in addition to the original user embedding in the target domain. We conducted extensive experiments on a collected real-world CDR dataset spanning 5 domains and involving a million users. Experimental results demonstrate the superiority of the proposed method over a range of prior arts. We further conducted ablation studies to verify the effectiveness of the proposed components. Our collected dataset will be open-sourced to facilitate future research in the field of multi-domain recommender systems and user modeling.
△ Less
Submitted 21 November, 2022;
originally announced November 2022.
-
NeRF-RPN: A general framework for object detection in NeRFs
Authors:
Benran Hu,
Junkai Huang,
Yichen Liu,
Yu-Wing Tai,
Chi-Keung Tang
Abstract:
This paper presents the first significant object detection framework, NeRF-RPN, which directly operates on NeRF. Given a pre-trained NeRF model, NeRF-RPN aims to detect all bounding boxes of objects in a scene. By exploiting a novel voxel representation that incorporates multi-scale 3D neural volumetric features, we demonstrate it is possible to regress the 3D bounding boxes of objects in NeRF dir…
▽ More
This paper presents the first significant object detection framework, NeRF-RPN, which directly operates on NeRF. Given a pre-trained NeRF model, NeRF-RPN aims to detect all bounding boxes of objects in a scene. By exploiting a novel voxel representation that incorporates multi-scale 3D neural volumetric features, we demonstrate it is possible to regress the 3D bounding boxes of objects in NeRF directly without rendering the NeRF at any viewpoint. NeRF-RPN is a general framework and can be applied to detect objects without class labels. We experimented NeRF-RPN with various backbone architectures, RPN head designs and loss functions. All of them can be trained in an end-to-end manner to estimate high quality 3D bounding boxes. To facilitate future research in object detection for NeRF, we built a new benchmark dataset which consists of both synthetic and real-world data with careful labeling and clean up. Code and dataset are available at https://github.com/lyclyc52/NeRF_RPN.
△ Less
Submitted 27 March, 2023; v1 submitted 21 November, 2022;
originally announced November 2022.
-
Distinctive Self-Similar Object Detection
Authors:
Zeyu Shangguan,
Bocheng Hu,
Guohua Dai,
Yuyu Liu,
Darun Tang,
Xingqun Jiang
Abstract:
Deep learning-based object detection has demonstrated a significant presence in the practical applications of artificial intelligence. However, objects such as fire and smoke, pose challenges to object detection because of their non-solid and various shapes, and consequently difficult to truly meet requirements in practical fire prevention and control. In this paper, we propose that the distinctiv…
▽ More
Deep learning-based object detection has demonstrated a significant presence in the practical applications of artificial intelligence. However, objects such as fire and smoke, pose challenges to object detection because of their non-solid and various shapes, and consequently difficult to truly meet requirements in practical fire prevention and control. In this paper, we propose that the distinctive fractal feature of self-similar in fire and smoke can relieve us from struggling with their various shapes. To our best knowledge, we are the first to discuss this problem. In order to evaluate the self-similarity of the fire and smoke and improve the precision of object detection, we design a semi-supervised method that use Hausdorff distance to describe the resemblance between instances. Besides, based on the concept of self-similar, we have devised a novel methodology for evaluating this particular task in a more equitable manner. We have meticulously designed our network architecture based on well-established and representative baseline networks such as YOLO and Faster R-CNN. Our experiments have been conducted on publicly available fire and smoke detection datasets, which we have thoroughly verified to ensure the validity of our approach. As a result, we have observed significant improvements in the detection accuracy.
△ Less
Submitted 25 August, 2023; v1 submitted 20 November, 2022;
originally announced November 2022.
-
Linear RNNs Provably Learn Linear Dynamic Systems
Authors:
Lifu Wang,
Tianyu Wang,
Shengwei Yi,
Bo Shen,
Bo Hu,
Xing Cao
Abstract:
We study the learning ability of linear recurrent neural networks with Gradient Descent. We prove the first theoretical guarantee on linear RNNs to learn any stable linear dynamic system using any a large type of loss functions. For an arbitrary stable linear system with a parameter $ρ_C$ related to the transition matrix $C$, we show that despite the non-convexity of the parameter optimization los…
▽ More
We study the learning ability of linear recurrent neural networks with Gradient Descent. We prove the first theoretical guarantee on linear RNNs to learn any stable linear dynamic system using any a large type of loss functions. For an arbitrary stable linear system with a parameter $ρ_C$ related to the transition matrix $C$, we show that despite the non-convexity of the parameter optimization loss if the width of the RNN is large enough (and the required width in hidden layers does not rely on the length of the input sequence), a linear RNN can provably learn any stable linear dynamic system with the sample and time complexity polynomial in $\frac{1}{1-ρ_C}$. Our results provide the first theoretical guarantee to learn a linear RNN and demonstrate how can the recurrent structure help to learn a dynamic system.
△ Less
Submitted 22 October, 2023; v1 submitted 18 November, 2022;
originally announced November 2022.
-
First wide field-of-view X-ray observations by a lobster eye focusing telescope in orbit
Authors:
C. Zhang,
Z. X. Ling,
X. J. Sun,
S. L. Sun,
Y. Liu,
Z. D. Li,
Y. L. Xue,
Y. F. Chen,
Y. F. Dai,
Z. Q. Jia,
H. Y. Liu,
X. F. Zhang,
Y. H. Zhang,
S. N. Zhang,
F. S. Chen,
Z. W. Cheng,
W. Fu,
Y. X. Han,
H. Li,
J. F. Li,
Y. Li,
P. R. Liu,
X. H. Ma,
Y. J. Tang,
C. B. Wang
, et al. (53 additional authors not shown)
Abstract:
As a novel X-ray focusing technology, lobster eye micro-pore optics (MPO) feature both a wide observing field of view and true imaging capability, promising sky monitoring with significantly improved sensitivity and spatial resolution in soft X-rays. Since first proposed by Angel (1979), the optics have been extensively studied, developed and trialed over the past decades. In this Letter, we repor…
▽ More
As a novel X-ray focusing technology, lobster eye micro-pore optics (MPO) feature both a wide observing field of view and true imaging capability, promising sky monitoring with significantly improved sensitivity and spatial resolution in soft X-rays. Since first proposed by Angel (1979), the optics have been extensively studied, developed and trialed over the past decades. In this Letter, we report on the first-light results from a flight experiment of the Lobster Eye Imager for Astronomy ($LEIA$), a pathfinder of the wide-field X-ray telescope of the Einstein Probe mission. The piggyback imager, launched in July 2022, has a mostly un-vignetted field of view of $18.6^\circ \times 18.6^\circ $. Its spatial resolution is in the range of 4$-$7 arcmin in FWHM and the focal spot effective area is 2$-$3 cm$^2$, both showing only mild fluctuations across the field of view. We present images of the Galactic center region, Sco X-1 and the diffuse Cygnus Loop nebular taken in snapshot observations over 0.5$-$4 keV. These are truly wide-field X-ray images of celestial bodies observed, for the first time, by a focusing imaging telescope. Initial analyses of the in-flight data show excellent agreement between the observed images and the on-ground calibration and simulations. The instrument and its characterization are briefly described, as well as the flight experiment. The results provide a solid basis for the development of the present and proposed wide-field X-ray missions using lobster eye MPO.
△ Less
Submitted 17 November, 2022;
originally announced November 2022.
-
Entanglement dynamics of coupled quantum oscillators in independent nonMarkovian baths
Authors:
Jen-Tsung Hsiang,
Onat Arısoy,
Bei-Lok Hu
Abstract:
This work strives to better understand how the entanglement in an open quantum system, here represented by two coupled Brownian oscillators, is affected by a nonMarkovian environment (with memories), here represented by two independent baths each oscillator separately interacts with. We consider two settings, a `symmetric' configuration wherein the parameters of both oscillators and their baths ar…
▽ More
This work strives to better understand how the entanglement in an open quantum system, here represented by two coupled Brownian oscillators, is affected by a nonMarkovian environment (with memories), here represented by two independent baths each oscillator separately interacts with. We consider two settings, a `symmetric' configuration wherein the parameters of both oscillators and their baths are identical, and an `asymmetric' configuration wherein they are different, in particular, a `hybrid' configuration, where one of the two coupled oscillators interacts with a nonMarkovian bath and the other with a Markovian bath. We ask two groups of questions: Q1) Which time regime does the bath's nonMarkovianity benefit the system's entanglement most? The answers we get from detailed numerical studies suggest that A1) For an initially entangled pair of oscillators, we see that in the intermediate time range, the duration of entanglement is proportional to the memory time, and it lasts a fraction of the relaxation time, but at late times when the dynamics reaches a steady state, the value of the symplectic eigenvalue of the partially transposed covariance matrix barely benefit from the bath nonMarkovianity. For the second group of questions: Q2)Can the memory of one nonMarkovian bath be passed on to another Markovian bath? And if so, does this memory transfer help to sustain the system's entanglement dynamics? Our results from numerical studies of the asymmetric hybrid configuration indicate that A2) A system with a short memory time can acquire improvement when it is coupled to another system with a long memory time, but, at a cost of the latter. The sustainability of the bipartite entanglement is determined by the party which breaks off entanglement most easily.
△ Less
Submitted 14 November, 2022;
originally announced November 2022.
-
Calibration Meets Explanation: A Simple and Effective Approach for Model Confidence Estimates
Authors:
Dongfang Li,
Baotian Hu,
Qingcai Chen
Abstract:
Calibration strengthens the trustworthiness of black-box models by producing better accurate confidence estimates on given examples. However, little is known about if model explanations can help confidence calibration. Intuitively, humans look at important features attributions and decide whether the model is trustworthy. Similarly, the explanations can tell us when the model may or may not know.…
▽ More
Calibration strengthens the trustworthiness of black-box models by producing better accurate confidence estimates on given examples. However, little is known about if model explanations can help confidence calibration. Intuitively, humans look at important features attributions and decide whether the model is trustworthy. Similarly, the explanations can tell us when the model may or may not know. Inspired by this, we propose a method named CME that leverages model explanations to make the model less confident with non-inductive attributions. The idea is that when the model is not highly confident, it is difficult to identify strong indications of any class, and the tokens accordingly do not have high attribution scores for any class and vice versa. We conduct extensive experiments on six datasets with two popular pre-trained language models in the in-domain and out-of-domain settings. The results show that CME improves calibration performance in all settings. The expected calibration errors are further reduced when combined with temperature scaling. Our findings highlight that model explanations can help calibrate posterior estimates.
△ Less
Submitted 6 November, 2022;
originally announced November 2022.
-
Prompt-based Text Entailment for Low-Resource Named Entity Recognition
Authors:
Dongfang Li,
Baotian Hu,
Qingcai Chen
Abstract:
Pre-trained Language Models (PLMs) have been applied in NLP tasks and achieve promising results. Nevertheless, the fine-tuning procedure needs labeled data of the target domain, making it difficult to learn in low-resource and non-trivial labeled scenarios. To address these challenges, we propose Prompt-based Text Entailment (PTE) for low-resource named entity recognition, which better leverages k…
▽ More
Pre-trained Language Models (PLMs) have been applied in NLP tasks and achieve promising results. Nevertheless, the fine-tuning procedure needs labeled data of the target domain, making it difficult to learn in low-resource and non-trivial labeled scenarios. To address these challenges, we propose Prompt-based Text Entailment (PTE) for low-resource named entity recognition, which better leverages knowledge in the PLMs. We first reformulate named entity recognition as the text entailment task. The original sentence with entity type-specific prompts is fed into PLMs to get entailment scores for each candidate. The entity type with the top score is then selected as final label. Then, we inject tagging labels into prompts and treat words as basic units instead of n-gram spans to reduce time complexity in generating candidates by n-grams enumeration. Experimental results demonstrate that the proposed method PTE achieves competitive performance on the CoNLL03 dataset, and better than fine-tuned counterparts on the MIT Movie and Few-NERD dataset in low-resource settings.
△ Less
Submitted 6 November, 2022;
originally announced November 2022.
-
Continual Learning of Neural Machine Translation within Low Forgetting Risk Regions
Authors:
Shuhao Gu,
Bojie Hu,
Yang Feng
Abstract:
This paper considers continual learning of large-scale pretrained neural machine translation model without accessing the previous training data or introducing model separation. We argue that the widely used regularization-based methods, which perform multi-objective learning with an auxiliary loss, suffer from the misestimate problem and cannot always achieve a good balance between the previous an…
▽ More
This paper considers continual learning of large-scale pretrained neural machine translation model without accessing the previous training data or introducing model separation. We argue that the widely used regularization-based methods, which perform multi-objective learning with an auxiliary loss, suffer from the misestimate problem and cannot always achieve a good balance between the previous and new tasks. To solve the problem, we propose a two-stage training method based on the local features of the real loss. We first search low forgetting risk regions, where the model can retain the performance on the previous task as the parameters are updated, to avoid the catastrophic forgetting problem. Then we can continually train the model within this region only with the new training data to fit the new task. Specifically, we propose two methods to search the low forgetting risk regions, which are based on the curvature of loss and the impacts of the parameters on the model output, respectively. We conduct experiments on domain adaptation and more challenging language adaptation tasks, and the experimental results show that our method can achieve significant improvements compared with several strong baselines.
△ Less
Submitted 3 November, 2022; v1 submitted 2 November, 2022;
originally announced November 2022.
-
Knowing the Past to Predict the Future: Reinforcement Virtual Learning
Authors:
Peng Zhang,
Yawen Huang,
Bingzhang Hu,
Shizheng Wang,
Haoran Duan,
Noura Al Moubayed,
Yefeng Zheng,
Yang Long
Abstract:
Reinforcement Learning (RL)-based control system has received considerable attention in recent decades. However, in many real-world problems, such as Batch Process Control, the environment is uncertain, which requires expensive interaction to acquire the state and reward values. In this paper, we present a cost-efficient framework, such that the RL model can evolve for itself in a Virtual Space us…
▽ More
Reinforcement Learning (RL)-based control system has received considerable attention in recent decades. However, in many real-world problems, such as Batch Process Control, the environment is uncertain, which requires expensive interaction to acquire the state and reward values. In this paper, we present a cost-efficient framework, such that the RL model can evolve for itself in a Virtual Space using the predictive models with only historical data. The proposed framework enables a step-by-step RL model to predict the future state and select optimal actions for long-sight decisions. The main focuses are summarized as: 1) how to balance the long-sight and short-sight rewards with an optimal strategy; 2) how to make the virtual model interacting with real environment to converge to a final learning policy. Under the experimental settings of Fed-Batch Process, our method consistently outperforms the existing state-of-the-art methods.
△ Less
Submitted 2 November, 2022;
originally announced November 2022.
-
An Efficient Memory-Augmented Transformer for Knowledge-Intensive NLP Tasks
Authors:
Yuxiang Wu,
Yu Zhao,
Baotian Hu,
Pasquale Minervini,
Pontus Stenetorp,
Sebastian Riedel
Abstract:
Access to external knowledge is essential for many natural language processing tasks, such as question answering and dialogue. Existing methods often rely on a parametric model that stores knowledge in its parameters, or use a retrieval-augmented model that has access to an external knowledge source. Parametric and retrieval-augmented models have complementary strengths in terms of computational e…
▽ More
Access to external knowledge is essential for many natural language processing tasks, such as question answering and dialogue. Existing methods often rely on a parametric model that stores knowledge in its parameters, or use a retrieval-augmented model that has access to an external knowledge source. Parametric and retrieval-augmented models have complementary strengths in terms of computational efficiency and predictive accuracy. To combine the strength of both approaches, we propose the Efficient Memory-Augmented Transformer (EMAT) -- it encodes external knowledge into a key-value memory and exploits the fast maximum inner product search for memory querying. We also introduce pre-training tasks that allow EMAT to encode informative key-value representations, and to learn an implicit strategy to integrate multiple memory slots into the transformer. Experiments on various knowledge-intensive tasks such as question answering and dialogue datasets show that, simply augmenting parametric models (T5-base) using our method produces more accurate results (e.g., 25.8 -> 44.3 EM on NQ) while retaining a high throughput (e.g., 1000 queries/s on NQ). Compared to retrieval-augmented models, EMAT runs substantially faster across the board and produces more accurate results on WoW and ELI5. Our code and datasets are available at https://github. com/uclnlp/EMAT.
△ Less
Submitted 30 October, 2022;
originally announced October 2022.
-
Global Convergence of Direct Policy Search for State-Feedback $\mathcal{H}_\infty$ Robust Control: A Revisit of Nonsmooth Synthesis with Goldstein Subdifferential
Authors:
Xingang Guo,
Bin Hu
Abstract:
Direct policy search has been widely applied in modern reinforcement learning and continuous control. However, the theoretical properties of direct policy search on nonsmooth robust control synthesis have not been fully understood. The optimal $\mathcal{H}_\infty$ control framework aims at designing a policy to minimize the closed-loop $\mathcal{H}_\infty$ norm, and is arguably the most fundamenta…
▽ More
Direct policy search has been widely applied in modern reinforcement learning and continuous control. However, the theoretical properties of direct policy search on nonsmooth robust control synthesis have not been fully understood. The optimal $\mathcal{H}_\infty$ control framework aims at designing a policy to minimize the closed-loop $\mathcal{H}_\infty$ norm, and is arguably the most fundamental robust control paradigm. In this work, we show that direct policy search is guaranteed to find the global solution of the robust $\mathcal{H}_\infty$ state-feedback control design problem. Notice that policy search for optimal $\mathcal{H}_\infty$ control leads to a constrained nonconvex nonsmooth optimization problem, where the nonconvex feasible set consists of all the policies stabilizing the closed-loop dynamics. We show that for this nonsmooth optimization problem, all Clarke stationary points are global minimum. Next, we identify the coerciveness of the closed-loop $\mathcal{H}_\infty$ objective function, and prove that all the sublevel sets of the resultant policy search problem are compact. Based on these properties, we show that Goldstein's subgradient method and its implementable variants can be guaranteed to stay in the nonconvex feasible set and eventually find the global optimal solution of the $\mathcal{H}_\infty$ state-feedback synthesis problem. Our work builds a new connection between nonconvex nonsmooth optimization theory and robust control, leading to an interesting global convergence result for direct policy search on optimal $\mathcal{H}_\infty$ synthesis.
△ Less
Submitted 20 October, 2022;
originally announced October 2022.
-
The Commutator of the Bergman Projection on Strongly Pseudoconvex Domains with Minimal Smoothness
Authors:
Bingyang Hu,
Zhenghui Huo,
Loredana Lanzani,
Kevin Palencia,
Nathan A. Wagner
Abstract:
Consider a bounded, strongly pseudoconvex domain $D\subset \mathbb C^n$ with minimal smoothness (namely, the class $C^2$) and let $b$ be a locally integrable function on $D$. We characterize boundedness (resp., compactness) in $L^p(D), p > 1$, of the commutator $[b, P]$ of the Bergman projection $P$ in terms of an appropriate bounded (resp. vanishing) mean oscillation requirement on $b$. We also e…
▽ More
Consider a bounded, strongly pseudoconvex domain $D\subset \mathbb C^n$ with minimal smoothness (namely, the class $C^2$) and let $b$ be a locally integrable function on $D$. We characterize boundedness (resp., compactness) in $L^p(D), p > 1$, of the commutator $[b, P]$ of the Bergman projection $P$ in terms of an appropriate bounded (resp. vanishing) mean oscillation requirement on $b$. We also establish the equivalence of such notion of BMO (resp., VMO) with other BMO and VMO spaces given in the literature. Our proofs use a dyadic analog of the Berezin transform and holomorphic integral representations going back (for smooth domains) to N. Kerzman & E. M. Stein, and E. Ligocka.
△ Less
Submitted 27 November, 2023; v1 submitted 19 October, 2022;
originally announced October 2022.
-
Tenrec: A Large-scale Multipurpose Benchmark Dataset for Recommender Systems
Authors:
Guanghu Yuan,
Fajie Yuan,
Yudong Li,
Beibei Kong,
Shujie Li,
Lei Chen,
Min Yang,
Chenyun Yu,
Bo Hu,
Zang Li,
Yu Xu,
Xiaohu Qie
Abstract:
Existing benchmark datasets for recommender systems (RS) either are created at a small scale or involve very limited forms of user feedback. RS models evaluated on such datasets often lack practical values for large-scale real-world applications. In this paper, we describe Tenrec, a novel and publicly available data collection for RS that records various user feedback from four different recommend…
▽ More
Existing benchmark datasets for recommender systems (RS) either are created at a small scale or involve very limited forms of user feedback. RS models evaluated on such datasets often lack practical values for large-scale real-world applications. In this paper, we describe Tenrec, a novel and publicly available data collection for RS that records various user feedback from four different recommendation scenarios. To be specific, Tenrec has the following five characteristics: (1) it is large-scale, containing around 5 million users and 140 million interactions; (2) it has not only positive user feedback, but also true negative feedback (vs. one-class recommendation); (3) it contains overlapped users and items across four different scenarios; (4) it contains various types of user positive feedback, in forms of clicks, likes, shares, and follows, etc; (5) it contains additional features beyond the user IDs and item IDs. We verify Tenrec on ten diverse recommendation tasks by running several classical baseline models per task. Tenrec has the potential to become a useful benchmark dataset for a majority of popular recommendation tasks.
△ Less
Submitted 4 June, 2023; v1 submitted 13 October, 2022;
originally announced October 2022.
-
Increasing Visual Awareness in Multimodal Neural Machine Translation from an Information Theoretic Perspective
Authors:
Baijun Ji,
Tong Zhang,
Yicheng Zou,
Bojie Hu,
Si Shen
Abstract:
Multimodal machine translation (MMT) aims to improve translation quality by equipping the source sentence with its corresponding image. Despite the promising performance, MMT models still suffer the problem of input degradation: models focus more on textual information while visual information is generally overlooked. In this paper, we endeavor to improve MMT performance by increasing visual aware…
▽ More
Multimodal machine translation (MMT) aims to improve translation quality by equipping the source sentence with its corresponding image. Despite the promising performance, MMT models still suffer the problem of input degradation: models focus more on textual information while visual information is generally overlooked. In this paper, we endeavor to improve MMT performance by increasing visual awareness from an information theoretic perspective. In detail, we decompose the informative visual signals into two parts: source-specific information and target-specific information. We use mutual information to quantify them and propose two methods for objective optimization to better leverage visual signals. Experiments on two datasets demonstrate that our approach can effectively enhance the visual awareness of MMT model and achieve superior results against strong baselines.
△ Less
Submitted 16 October, 2022;
originally announced October 2022.
-
Model Independent Approach of the JUNO $^8$B Solar Neutrino Program
Authors:
JUNO Collaboration,
Jie Zhao,
Baobiao Yue,
Haoqi Lu,
Yufeng Li,
Jiajie Ling,
Zeyuan Yu,
Angel Abusleme,
Thomas Adam,
Shakeel Ahmad,
Rizwan Ahmed,
Sebastiano Aiello,
Muhammad Akram,
Abid Aleem,
Tsagkarakis Alexandros,
Fengpeng An,
Qi An,
Giuseppe Andronico,
Nikolay Anfimov,
Vito Antonelli,
Tatiana Antoshkina,
Burin Asavapibhop,
João Pedro Athayde Marcondes de André,
Didier Auguste,
Weidong Bai
, et al. (579 additional authors not shown)
Abstract:
The physics potential of detecting $^8$B solar neutrinos will be exploited at the Jiangmen Underground Neutrino Observatory (JUNO), in a model independent manner by using three distinct channels of the charged-current (CC), neutral-current (NC) and elastic scattering (ES) interactions. Due to the largest-ever mass of $^{13}$C nuclei in the liquid-scintillator detectors and the {expected} low backg…
▽ More
The physics potential of detecting $^8$B solar neutrinos will be exploited at the Jiangmen Underground Neutrino Observatory (JUNO), in a model independent manner by using three distinct channels of the charged-current (CC), neutral-current (NC) and elastic scattering (ES) interactions. Due to the largest-ever mass of $^{13}$C nuclei in the liquid-scintillator detectors and the {expected} low background level, $^8$B solar neutrinos would be observable in the CC and NC interactions on $^{13}$C for the first time. By virtue of optimized event selections and muon veto strategies, backgrounds from the accidental coincidence, muon-induced isotopes, and external backgrounds can be greatly suppressed. Excellent signal-to-background ratios can be achieved in the CC, NC and ES channels to guarantee the $^8$B solar neutrino observation. From the sensitivity studies performed in this work, we show that JUNO, with ten years of data, can reach the {1$σ$} precision levels of 5%, 8% and 20% for the $^8$B neutrino flux, $\sin^2θ_{12}$, and $Δm^2_{21}$, respectively. It would be unique and helpful to probe the details of both solar physics and neutrino physics. In addition, when combined with SNO, the world-best precision of 3% is expected for the $^8$B neutrino flux measurement.
△ Less
Submitted 6 March, 2024; v1 submitted 15 October, 2022;
originally announced October 2022.
-
ACRNet: Attention Cube Regression Network for Multi-view Real-time 3D Human Pose Estimation in Telemedicine
Authors:
Boce Hu,
Chenfei Zhu,
Xupeng Ai,
Sunil K. Agrawal
Abstract:
Human pose estimation (HPE) for 3D skeleton reconstruction in telemedicine has long received attention. Although the development of deep learning has made HPE methods in telemedicine simpler and easier to use, addressing low accuracy and high latency remains a big challenge. In this paper, we propose a novel multi-view Attention Cube Regression Network (ACRNet), which regresses the 3D position of…
▽ More
Human pose estimation (HPE) for 3D skeleton reconstruction in telemedicine has long received attention. Although the development of deep learning has made HPE methods in telemedicine simpler and easier to use, addressing low accuracy and high latency remains a big challenge. In this paper, we propose a novel multi-view Attention Cube Regression Network (ACRNet), which regresses the 3D position of joints in real time by aggregating informative attention points on each cube surface. More specially, a cube whose each surface contains uniformly distributed attention points with specific coordinate values is first created to wrap the target from the main view. Then, our network regresses the 3D position of each joint by summing and averaging the coordinates of attention points on each surface after being weighted. To verify our method, we first tested ACRNet on the open-source ITOP dataset; meanwhile, we collected a new multi-view upper body movement dataset (UBM) on the trunk support trainer (TruST) to validate the capability of our model in real rehabilitation scenarios. Experimental results demonstrate the superiority of ACRNet compared with other state-of-the-art methods. We also validate the efficacy of each module in ACRNet. Furthermore, Our work analyzes the performance of ACRNet under the medical monitoring indicator. Because of the high accuracy and running speed, our model is suitable for real-time telemedicine settings. The source code is available at https://github.com/BoceHu/ACRNet
△ Less
Submitted 11 October, 2022;
originally announced October 2022.
-
Towards a Theoretical Foundation of Policy Optimization for Learning Control Policies
Authors:
Bin Hu,
Kaiqing Zhang,
Na Li,
Mehran Mesbahi,
Maryam Fazel,
Tamer Başar
Abstract:
Gradient-based methods have been widely used for system design and optimization in diverse application domains. Recently, there has been a renewed interest in studying theoretical properties of these methods in the context of control and reinforcement learning. This article surveys some of the recent developments on policy optimization, a gradient-based iterative approach for feedback control synt…
▽ More
Gradient-based methods have been widely used for system design and optimization in diverse application domains. Recently, there has been a renewed interest in studying theoretical properties of these methods in the context of control and reinforcement learning. This article surveys some of the recent developments on policy optimization, a gradient-based iterative approach for feedback control synthesis, popularized by successes of reinforcement learning. We take an interdisciplinary perspective in our exposition that connects control theory, reinforcement learning, and large-scale optimization. We review a number of recently-developed theoretical results on the optimization landscape, global convergence, and sample complexity of gradient-based methods for various continuous control problems such as the linear quadratic regulator (LQR), $\mathcal{H}_\infty$ control, risk-sensitive control, linear quadratic Gaussian (LQG) control, and output feedback synthesis. In conjunction with these optimization results, we also discuss how direct policy optimization handles stability and robustness concerns in learning-based control, two main desiderata in control engineering. We conclude the survey by pointing out several challenges and opportunities at the intersection of learning and control.
△ Less
Submitted 10 October, 2022;
originally announced October 2022.
-
Sampling of Correlated Bandlimited Continuous Signals by Joint Time-vertex Graph Fourier Transform
Authors:
Zhongyi Ni,
Feng Ji,
Hang Sheng,
Hui Feng,
Bo Hu
Abstract:
When sampling multiple signals, the correlation between the signals can be exploited to reduce the overall number of samples. In this paper, we study the sampling theory of multiple correlated signals, using correlation to sample them at the lowest sampling rate. Based on the correlation between signal sources, we model multiple continuous-time signals as continuous time-vertex graph signals. The…
▽ More
When sampling multiple signals, the correlation between the signals can be exploited to reduce the overall number of samples. In this paper, we study the sampling theory of multiple correlated signals, using correlation to sample them at the lowest sampling rate. Based on the correlation between signal sources, we model multiple continuous-time signals as continuous time-vertex graph signals. The graph signals are projected onto orthogonal bases to remove spatial correlation and reduce dimensions by graph Fourier transform. When the bandwidths of the original signals and the reduced dimension signals are given, we prove the minimum sampling rate required for recovery of the original signals, and propose a feasible sampling scheme.
△ Less
Submitted 10 October, 2022;
originally announced October 2022.
-
Can One Perturb the Equatorial Zone on a Sphere with Larger Mean Curvature?
Authors:
Baichuan Hu,
Xiang Ma,
Shengyang Wang
Abstract:
We consider the mean curvature rigidity problem of an equatorial zone on a sphere which is symmetric about the equator with width $2w$. There are two different notions on rigidity, i.e. strong rigidity and local rigidity. We prove that for each kind of these rigidity problems, there exists a critical value such that the rigidity holds true if, and only if, the zone width is smaller than that value…
▽ More
We consider the mean curvature rigidity problem of an equatorial zone on a sphere which is symmetric about the equator with width $2w$. There are two different notions on rigidity, i.e. strong rigidity and local rigidity. We prove that for each kind of these rigidity problems, there exists a critical value such that the rigidity holds true if, and only if, the zone width is smaller than that value. For the rigidity part, we used the tangency principle and a specific lemma (the trap-slice lemma we established before). For the non-rigidity part, we construct the nontrivial perturbations by a gluing procedure called the round-corner lemma using the Delaunay surfaces.
△ Less
Submitted 3 October, 2022; v1 submitted 3 October, 2022;
originally announced October 2022.
-
Applying a Chemical Structure Teaching Method in the Pharmaceutical Analysis Curriculum to Improve Student Engagement and Learning
Authors:
Hui Zhenga,
Binjing Hu,
Qiang Sun,
Jun Cao,
Fangmin Liu
Abstract:
Pharmaceutical analysis, as the core curriculum of chemistry, chemical engineering and pharmaceutical engineering, contains broad and in-depth knowledge that leads to massive learning & teaching loads. There are more than 100 analytical methods of medicines in this course. As such, this subject is a big challenge for both students and lecturers. A novel chemical structure teaching (CST) method was…
▽ More
Pharmaceutical analysis, as the core curriculum of chemistry, chemical engineering and pharmaceutical engineering, contains broad and in-depth knowledge that leads to massive learning & teaching loads. There are more than 100 analytical methods of medicines in this course. As such, this subject is a big challenge for both students and lecturers. A novel chemical structure teaching (CST) method was developed based on our long-term teaching experience to cope with these challenges. It has been shown in practice that this CST method can significantly unload the stress of students and lecturers simultaneously. The survey about the improvement of students' interests was carried out and listed in the form of questionnaire. The outcome of CST also indicates that it can help them to form abilities of critical and logical thinking as independent learners, motivate them to discuss with their peers and lecturers, and eventually improve average grades. Furthermore, CST can be beneficial for lecturers who instruct other relevant curriculum in chemical or pharmaceutical engineering to improve the teaching outcome, such as organic chemistry, spectrum analysis, pharmaceutical synthesis and medicinal chemistry. This CST model can also help students cultivate lifelong learning ability as active learners and habit from the cognitive perspective view.
△ Less
Submitted 19 September, 2022;
originally announced September 2022.
-
Weight-based Channel-model Matrix Framework provides a reasonable solution for EEG-based cross-dataset emotion recognition
Authors:
Huayu Chen,
Huanhuan He,
Jing Zhu,
Shuting Sun,
Jianxiu Li,
Xuexiao Shao,
Junxiang Li,
Xiaowei Li,
Bin Hu
Abstract:
Cross-dataset emotion recognition as an extremely challenging task in the field of EEG-based affective computing is influenced by many factors, which makes the universal models yield unsatisfactory results. Facing the situation that lacks EEG information decoding research, we first analyzed the impact of different EEG information(individual, session, emotion and trial) for emotion recognition by s…
▽ More
Cross-dataset emotion recognition as an extremely challenging task in the field of EEG-based affective computing is influenced by many factors, which makes the universal models yield unsatisfactory results. Facing the situation that lacks EEG information decoding research, we first analyzed the impact of different EEG information(individual, session, emotion and trial) for emotion recognition by sample space visualization, sample aggregation phenomena quantification, and energy pattern analysis on five public datasets. Based on these phenomena and patterns, we provided the processing methods and interpretable work of various EEG differences. Through the analysis of emotional feature distribution patterns, the Individual Emotional Feature Distribution Difference(IEFDD) was found, which was also considered as the main factor of the stability for emotion recognition. After analyzing the limitations of traditional modeling approach suffering from IEFDD, the Weight-based Channel-model Matrix Framework(WCMF) was proposed. To reasonably characterize emotional feature distribution patterns, four weight extraction methods were designed, and the optimal was the correction T-test(CT) weight extraction method. Finally, the performance of WCMF was validated on cross-dataset tasks in two kinds of experiments that simulated different practical scenarios, and the results showed that WCMF had more stable and better emotion recognition ability.
△ Less
Submitted 3 November, 2022; v1 submitted 13 September, 2022;
originally announced September 2022.