Search | arXiv e-print repository

Towards Open-world Cross-Domain Sequential Recommendation: A Model-Agnostic Contrastive Denoising Approach

Authors: Wujiang Xu, Xuying Ning, Wenfang Lin, Mingming Ha, Qiongxu Ma, Qianqiao Liang, Xuewen Tao, Linxun Chen, Bing Han, Minnan Luo

Abstract: Cross-domain sequential recommendation (CDSR) aims to address the data sparsity problems that exist in traditional sequential recommendation (SR) systems. The existing approaches aim to design a specific cross-domain unit that can transfer and propagate information across multiple domains by relying on overlapping users with abundant behaviors. However, in real-world recommender systems, CDSR scenarios usually consist of a majority of long-tailed users with sparse behaviors and cold-start users who only exist in one domain. This leads to a drop in the performance of existing CDSR methods in the real-world industry platform. Therefore, improving the consistency and effectiveness of models in open-world CDSR scenarios is crucial for constructing CDSR models (1st CH). Recently, some SR approaches have utilized auxiliary behaviors to complement the information for long-tailed users. However, these multi-behavior SR methods cannot deliver promising performance in CDSR, as they overlook the semantic gap between target and auxiliary behaviors, as well as user interest deviation across domains (2nd CH).

Submitted 5 June, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

arXiv:2311.02608 [pdf, other]

doi 10.1016/j.displa.2023.102456

Deep Learning-based 3D Point Cloud Classification: A Systematic Survey and Outlook

Authors: Huang Zhang, Changshuo Wang, Shengwei Tian, Baoli Lu, Liping Zhang, Xin Ning, Xiao Bai

Abstract: In recent years, point cloud representation has become one of the research hotspots in the field of computer vision, and has been widely used in many fields, such as autonomous driving, virtual reality, robotics, etc. Although deep learning techniques have achieved great success in processing regular structured 2D grid image data, there are still great challenges in processing irregular, unstructured point cloud data. Point cloud classification is the basis of point cloud analysis, and many deep learning-based methods have been widely used in this task. Therefore, the purpose of this paper is to provide researchers in this field with the latest research progress and future trends. First, we introduce point cloud acquisition, characteristics, and challenges. Second, we review 3D data representations, storage formats, and commonly used datasets for point cloud classification. We then summarize deep learning-based methods for point cloud classification and complement recent research work. Next, we compare and analyze the performance of the main methods. Finally, we discuss some challenges and future directions for point cloud classification.

Submitted 5 November, 2023; originally announced November 2023.

Journal ref: Displays 102456 (2023)

arXiv:2311.00603 [pdf, other]

Occluded Person Re-Identification with Deep Learning: A Survey and Perspectives

Authors: Enhao Ning, Changshuo Wang, Huang Zhang, Xin Ning, Prayag Tiwari

Abstract: Person re-identification (Re-ID) technology plays an increasingly crucial role in intelligent surveillance systems. Widespread occlusion significantly impacts the performance of person Re-ID. Occluded person Re-ID refers to a pedestrian matching method that deals with challenges such as pedestrian information loss, noise interference, and perspective misalignment. It has garnered extensive attention from researchers. Over the past few years, several occlusion-solving person Re-ID methods have been proposed, tackling various sub-problems arising from occlusion. However, there is a lack of comprehensive studies that compare, summarize, and evaluate the potential of occluded person Re-ID methods in detail. In this review, we start by providing a detailed overview of the datasets and evaluation scheme used for occluded person Re-ID. Next, we scientifically classify and analyze existing deep learning-based occluded person Re-ID methods from various perspectives, summarizing them concisely. Furthermore, we conduct a systematic comparison among these methods, identify the state-of-the-art approaches, and present an outlook on the future development of occluded person Re-ID.

Submitted 1 November, 2023; originally announced November 2023.

arXiv:2310.15211 [pdf, other]

Modeling Path Importance for Effective Alzheimer's Disease Drug Repurposing

Authors: Shunian Xiang, Patrick J. Lawrence, Bo Peng, ChienWei Chiang, Dokyoon Kim, Li Shen, Xia Ning

Abstract: Recently, drug repurposing has emerged as an effective and resource-efficient paradigm for AD drug discovery. Among various methods for drug repurposing, network-based methods have shown promising results as they are capable of leveraging complex networks that integrate multiple interaction types, such as protein-protein interactions, to more effectively identify candidate drugs. However, existing approaches typically assume paths of the same length in the network have equal importance in identifying the therapeutic effect of drugs. Other domains have found that same length paths do not necessarily have the same importance. Thus, relying on this assumption may be deleterious to drug repurposing attempts. In this work, we propose MPI (Modeling Path Importance), a novel network-based method for AD drug repurposing. MPI is unique in that it prioritizes important paths via learned node embeddings, which can effectively capture a network's rich structural information. Thus, leveraging learned embeddings allows MPI to effectively differentiate the importance among paths. We evaluate MPI against a commonly used baseline method that identifies anti-AD drug candidates primarily based on the shortest paths between drugs and AD in the network. We observe that among the top-50 ranked drugs, MPI prioritizes 20.0% more drugs with anti-AD evidence compared to the baseline. Finally, Cox proportional-hazard models produced from insurance claims data aid us in identifying the use of etodolac, nicotine, and BBB-crossing ACE-INHs as having a reduced risk of AD, suggesting such drugs may be viable candidates for repurposing and should be explored further in future studies.

Submitted 27 October, 2023; v1 submitted 23 October, 2023; originally announced October 2023.

Comments: 16 pages, 3 figures, 2 tables, 1 supplementary figure, 5 supplementary tables, Preprint of an article accepted for publication in Pacific Symposium on Biocomputing ©2023 World Scientific Publishing Co., Singapore, http://psb.stanford.edu/

arXiv:2310.13725 [pdf]

Enhancing drug and cell line representations via contrastive learning for improved anti-cancer drug prioritization

Authors: Patrick J. Lawrence, Xia Ning

Abstract: Due to cancer's complex nature and variable response to therapy, precision oncology informed by omics sequence analysis has become the current standard of care. However, the amount of data produced for each patient makes it difficult to quickly identify the best treatment regimen. Moreover, limited data availability has hindered computational methods' abilities to learn patterns associated with effective drug-cell line pairs. In this work, we propose the use of contrastive learning to improve learned drug and cell line representations by preserving relationship structures associated with drug mechanism of action and cell line cancer types. In addition to achieving enhanced performance relative to a state-of-the-art method, we find that classifiers using our learned representations exhibit a more balanced reliance on drug- and cell line-derived features when making predictions. This facilitates more personalized drug prioritizations that are informed by signals related to drug resistance.

Submitted 27 October, 2023; v1 submitted 20 October, 2023; originally announced October 2023.

Comments: 60 pages, 4 figures, 4 tables, 11 supplementary tables, 1 supplementary note, submitted to Nature Communications

arXiv:2310.04763 [pdf, other]

Orbital diffusion, polarization and swapping in centrosymmetric metals

Authors: Xiaobai Ning, A. Pezo, Kyoung-Whan Kim, Weisheng Zhao, Kyung-Jin Lee, Aurelien Manchon

Abstract: We propose a general theory of charge, spin, and orbital diffusion based on Keldysh formalism. Our findings indicate that the diffusivity of orbital angular momentum in metals is much lower than that of spin or charge due to the strong orbital intermixing in crystals. Furthermore, our theory introduces the concept of spin-orbit polarization by which a pure orbital (spin) current induces a longitudinal spin (orbital) current, a process as efficient as spin polarization in ferromagnets. Finally, we find that orbital currents undergo momentum swapping, even in the absence of spin-orbit coupling. This theory establishes several key parameters for orbital transport of direct importance to experiments.

Submitted 24 June, 2024; v1 submitted 7 October, 2023; originally announced October 2023.

arXiv:2310.01612 [pdf, other]

Towards Efficient and Effective Adaptation of Large Language Models for Sequential Recommendation

Authors: Bo Peng, Ben Burns, Ziqi Chen, Srinivasan Parthasarathy, Xia Ning

Abstract: In recent years, with large language models (LLMs) achieving state-of-the-art performance in context understanding, increasing efforts have been dedicated to developing LLM-enhanced sequential recommendation (SR) methods. Considering that most existing LLMs are not specifically optimized for recommendation tasks, adapting them for SR becomes a critical step in LLM-enhanced SR methods. Though numerous adaptation methods have been developed, it still remains a significant challenge to adapt LLMs for SR both efficiently and effectively. To address this challenge, in this paper, we introduce a novel side sequential network adaptation method, denoted as SSNA, for LLM enhanced SR. SSNA features three key designs to allow both efficient and effective LLM adaptation. First, SSNA learns adapters separate from LLMs, while fixing all the pre-trained parameters within LLMs to allow efficient adaptation. In addition, SSNA adapts the top-a layers of LLMs jointly, and integrates adapters sequentially for enhanced effectiveness (i.e., recommendation performance). We compare SSNA against five state-of-the-art baseline methods on five benchmark datasets using three LLMs. The experimental results demonstrate that SSNA significantly outperforms all the baseline methods in terms of recommendation performance, and achieves substantial improvement over the best-performing baseline methods at both run-time and memory efficiency during training. Our analysis shows the effectiveness of integrating adapters in a sequential manner. Our parameter study demonstrates the effectiveness of jointly adapting the top-a layers of LLMs.

Submitted 2 October, 2023; originally announced October 2023.

arXiv:2309.11075 [pdf, other]

Large-scale Kinetic Simulations of Colliding Plasmas within a Hohlraum of Indirect Drive Inertial Confinement Fusions

Authors: Tianyi Liang, Dong Wu, Xiaochuan Ning, Lianqiang Shan, Zongqiang Yuan, Hongbo Cai, Zhengmao Sheng, Xiantu He

Abstract: The National Ignition Facility has recently achieved successful burning plasma and ignition using the inertial confinement fusion (ICF) approach. However, there are still many fundamental physics phenomena that are not well understood, including the kinetic processes in the hohlraum. Shan et al. [Phys. Rev. Lett, 120, 195001, 2018] utilized the energy spectra of neutrons to investigate the kinetic colliding plasma in a hohlraum of indirect drive ICF. However, due to the typical large spatial-temporal scales, this experiment could not be well simulated by using available codes at that time. Utilizing our advanced high-order implicit PIC code, LAPINS, we were able to successfully reproduce the experiment on a large scale of both spatial and temporal dimensions, in which the original computational scale was increased by approximately 7 to 8 orders of magnitude. When gold plasmas expand into deuterium plasmas, a kinetic shock is generated and propagates within deuterium plasmas. Simulations allow us to observe the entire progression of a strong shock wave, including its initial formation and steady propagation. Although both electrons and gold ions are collisional (on a small scale compared to the shock wave), deuterium ions seem to be collisionless. This is because a quasi-monoenergetic spectrum of deuterium ions can be generated by reflecting ions from the shock front, which then leads to the production of neutrons with unusual broadening due to beam-target nuclear reactions. This work displays an unprecedented kinetic analysis of an existing experiment, shedding light on the mechanisms behind shock wave formation. It also serves as a reference for benchmark simulations of upcoming new simulation codes and may be relevant for future research on mixtures and entropy increments at plasma interfaces.

Submitted 20 September, 2023; originally announced September 2023.

arXiv:2309.10195 [pdf, other]

Multi-modality Meets Re-learning: Mitigating Negative Transfer in Sequential Recommendation

Authors: Bo Peng, Srinivasan Parthasarathy, Xia Ning

Abstract: Learning effective recommendation models from sparse user interactions represents a fundamental challenge in developing sequential recommendation methods. Recently, pre-training-based methods have been developed to tackle this challenge. Though promising, in this paper, we show that existing methods suffer from the notorious negative transfer issue, where the model adapted from the pre-trained model results in worse performance compared to the model learned from scratch in the task of interest (i.e., target task). To address this issue, we develop a method, denoted as ANT, for transferable sequential recommendation. ANT mitigates negative transfer by 1) incorporating multi-modality item information, including item texts, images and prices, to effectively learn more transferable knowledge from related tasks (i.e., auxiliary tasks); and 2) better capturing task-specific knowledge in the target task using a re-learning-based adaptation strategy. We evaluate ANT against eight state-of-the-art baseline methods on five target tasks. Our experimental results demonstrate that ANT does not suffer from the negative transfer issue on any of the target tasks. The results also demonstrate that ANT substantially outperforms baseline methods in the target tasks with an improvement of as much as 15.2%. Our analysis highlights the superior effectiveness of our re-learning-based strategy compared to fine-tuning on the target tasks.

Submitted 20 September, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

arXiv:2309.04967 [pdf, other]

Towards Fully Decoupled End-to-End Person Search

Authors: Pengcheng Zhang, Xiao Bai, Jin Zheng, Xin Ning

Abstract: End-to-end person search aims to jointly detect and re-identify a target person in raw scene images with a unified model. The detection task unifies all persons while the re-id task discriminates different identities, resulting in conflicting optimal objectives. Existing works proposed to decouple end-to-end person search to alleviate such conflict. Yet these methods are still sub-optimal on one or two of the sub-tasks due to their partially decoupled models, which limits the overall person search performance. In this paper, we propose to fully decouple person search towards optimal person search. A task-incremental person search network is proposed to incrementally construct an end-to-end model for the detection and re-id sub-tasks, which decouples the model architecture for the two sub-tasks. The proposed task-incremental network allows task-incremental training for the two conflicting tasks. This enables independent learning for different objectives, thus fully decoupling the model for person search. Comprehensive experimental evaluations demonstrate the effectiveness of the proposed fully decoupled models for end-to-end person search.

Submitted 10 March, 2024; v1 submitted 10 September, 2023; originally announced September 2023.

Comments: DICTA 2023 Best Student Paper

arXiv:2309.02671 [pdf, other]

RLSynC: Offline-Online Reinforcement Learning for Synthon Completion

Authors: Frazier N. Baker, Ziqi Chen, Daniel Adu-Ampratwum, Xia Ning

Abstract: Retrosynthesis is the process of determining the set of reactant molecules that can react to form a desired product. Semi-template-based retrosynthesis methods, which imitate the reverse logic of synthesis reactions, first predict the reaction centers in the products, and then complete the resulting synthons back into reactants. We develop a new offline-online reinforcement learning method RLSynC for synthon completion in semi-template-based methods. RLSynC assigns one agent to each synthon, all of which complete the synthons by conducting actions step by step in a synchronized fashion. RLSynC learns the policy from both offline training episodes and online interactions, which allows RLSynC to explore new reaction spaces. RLSynC uses a standalone forward synthesis model to evaluate the likelihood of the predicted reactants in synthesizing a product, and thus guides the action search. Our results demonstrate that RLSynC can outperform state-of-the-art synthon completion methods with improvements as high as 14.9%, highlighting its potential in synthesis planning.

Submitted 29 March, 2024; v1 submitted 5 September, 2023; originally announced September 2023.

Comments: 32 pages, 5 figures, 4 tables

arXiv:2308.13144 [pdf]

Giant orbit-to-charge conversion induced via the inverse orbital Hall effect

Authors: Renyou Xu, Hui Zhang, Yuhao Jiang, Houyi Cheng, Yunfei Xie, Yuxuan Yao, Danrong Xiong, Zhaozhao Zhu, Xiaobai Ning, Runze Chen, Yan Huang, Shijie Xu, Jianwang Cai, Yong Xu, Tao Liu, Weisheng Zhao

Abstract: We investigate the orbit-to-charge conversion in YIG/Pt/nonmagnetic material (NM) trilayer heterostructures. With the additional Ru layer on the top of YIG/Pt stacks, the charge current signal increases nearly an order of magnitude in both longitudinal spin Seebeck effect (SSE) and spin pumping (SP) measurements. Through thickness dependence studies of the Ru metal layer and theoretical model, we quantitatively clarify different contributions of the increased SSE signal that mainly comes from the inverse orbital Hall effect (IOHE) of Ru, and partially comes from the orbital sink effect in the Ru layer. A similar enhancement of SSE (SP) signals is also observed when Ru is replaced by other materials (Ta, W, and Cu), implying the universality of the IOHE in transition metals. Our findings not only suggest a more efficient generation of the charge current via the orbital angular momentum channel but also provide crucial insights into the interplay among charge, spin, and orbit.

Submitted 24 August, 2023; originally announced August 2023.

arXiv:2308.11890 [pdf, other]

Shape-conditioned 3D Molecule Generation via Equivariant Diffusion Models

Authors: Ziqi Chen, Bo Peng, Srinivasan Parthasarathy, Xia Ning

Abstract: Ligand-based drug design aims to identify novel drug candidates of similar shapes with known active molecules. In this paper, we formulated an in silico shape-conditioned molecule generation problem to generate 3D molecule structures conditioned on the shape of a given molecule. To address this problem, we developed a translation- and rotation-equivariant shape-guided generative model ShapeMol. ShapeMol consists of an equivariant shape encoder that maps molecular surface shapes into latent embeddings, and an equivariant diffusion model that generates 3D molecules based on these embeddings. Experimental results show that ShapeMol can generate novel, diverse, drug-like molecules that retain 3D molecular shapes similar to the given shape condition. These results demonstrate the potential of ShapeMol in designing drug candidates of desired 3D shapes binding to protein target pockets.

Submitted 16 October, 2023; v1 submitted 22 August, 2023; originally announced August 2023.

arXiv:2308.01540 [pdf, other]

doi 10.1103/PhysRevLett.131.191002

Search for Dark-Matter-Nucleon Interactions with a Dark Mediator in PandaX-4T

Authors: Di Huang, Abdusalam Abdukerim, Zihao Bo, Wei Chen, Xun Chen, Yunhua Chen, Chen Cheng, Zhaokan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Changbo Fu, Mengting Fu, Lisheng Geng, Karl Giboni, Linhui Gu, Xuyuan Guo, Chencheng Han, Ke Han, Changda He, Jinrong He, Yanlin Huang, Zhou Huang, Ruquan Hou, Xiangdong Ji, et al. (70 additional authors not shown)

Abstract: We report results of a search for dark-matter-nucleon interactions via a dark mediator using optimized low-energy data from the PandaX-4T liquid xenon experiment. With the ionization-signal-only data and utilizing the Migdal effect, we set the most stringent limits on the cross section for dark matter masses ranging from 30~$\rm{MeV/c^2}$ to 2~$\rm{GeV/c^2}$. Under the assumption that the dark mediator is a dark photon that decays into scalar dark matter pairs in the early Universe, we rule out significant parameter space of such thermal relic dark-matter model.

Submitted 18 December, 2023; v1 submitted 3 August, 2023; originally announced August 2023.

Comments: 6 pages, 4 figures

Journal ref: Phys. Rev. Lett. 131, 191002 (2023)

arXiv:2307.15337 [pdf, other]

Skeleton-of-Thought: Prompting LLMs for Efficient Parallel Generation

Authors: Xuefei Ning, Zinan Lin, Zixuan Zhou, Zifu Wang, Huazhong Yang, Yu Wang

Abstract: This work aims at decreasing the end-to-end generation latency of large language models (LLMs). One of the major causes of the high generation latency is the sequential decoding approach adopted by almost all state-of-the-art LLMs. In this work, motivated by the thinking and writing process of humans, we propose Skeleton-of-Thought (SoT), which first guides LLMs to generate the skeleton of the answer, and then conducts parallel API calls or batched decoding to complete the contents of each skeleton point in parallel. Not only does SoT provide considerable speed-ups across 12 LLMs, but it can also potentially improve the answer quality on several question categories. SoT is an initial attempt at data-centric optimization for inference efficiency, and showcases the potential of eliciting high-quality answers by explicitly planning the answer structure in language.

Submitted 1 March, 2024; v1 submitted 28 July, 2023; originally announced July 2023.

Comments: In ICLR'24

arXiv:2307.08209 [pdf, other]

Ada3D: Exploiting the Spatial Redundancy with Adaptive Inference for Efficient 3D Object Detection

Authors: Tianchen Zhao, Xuefei Ning, Ke Hong, Zhongyuan Qiu, Pu Lu, Yali Zhao, Linfeng Zhang, Lipu Zhou, Guohao Dai, Huazhong Yang, Yu Wang

Abstract: Voxel-based methods have achieved state-of-the-art performance for 3D object detection in autonomous driving. However, their significant computational and memory costs pose a challenge for their application to resource-constrained vehicles. One reason for this high resource consumption is the presence of a large number of redundant background points in Lidar point clouds, resulting in spatial redundancy in both 3D voxel and dense BEV map representations. To address this issue, we propose an adaptive inference framework called Ada3D, which focuses on exploiting the input-level spatial redundancy. Ada3D adaptively filters the redundant input, guided by a lightweight importance predictor and the unique properties of the Lidar point cloud. Additionally, we utilize the BEV features' intrinsic sparsity by introducing the Sparsity Preserving Batch Normalization. With Ada3D, we achieve 40% reduction for 3D voxels and decrease the density of 2D BEV feature maps from 100% to 20% without sacrificing accuracy. Ada3D reduces the model computational and memory cost by 5x, and achieves 1.52x/1.45x end-to-end GPU latency and 1.5x/4.5x GPU peak memory optimization for the 3D and 2D backbone respectively.

Submitted 8 August, 2023; v1 submitted 16 July, 2023; originally announced July 2023.

Comments: Accepted at ICCV2023

arXiv:2307.03416 [pdf, other]

Learning Adversarial Semantic Embeddings for Zero-Shot Recognition in Open Worlds

Authors: Tianqi Li, Guansong Pang, Xiao Bai, Jin Zheng, Lei Zhou, Xin Ning

Abstract: Zero-Shot Learning (ZSL) focuses on classifying samples of unseen classes with only their side semantic information presented during training. It cannot handle real-life, open-world scenarios where there are test samples of unknown classes for which neither samples (e.g., images) nor their side semantic information is known during training. Open-Set Recognition (OSR) is dedicated to addressing the unknown class issue, but existing OSR methods are not designed to model the semantic information of the unseen classes. To tackle this combined ZSL and OSR problem, we consider the case of "Zero-Shot Open-Set Recognition" (ZS-OSR), where a model is trained under the ZSL setting but it is required to accurately classify samples from the unseen classes while being able to reject samples from the unknown classes during inference. We perform large experiments on combining existing state-of-the-art ZSL and OSR models for the ZS-OSR task on four widely used datasets adapted from the ZSL task, and reveal that ZS-OSR is a non-trivial task as the simply combined solutions perform badly in distinguishing the unseen-class and unknown-class samples. We further introduce a novel approach specifically designed for ZS-OSR, in which our model learns to generate adversarial semantic embeddings of the unknown classes to train an unknowns-informed ZS-OSR classifier. Extensive empirical results show that our method 1) substantially outperforms the combined solutions in detecting the unknown classes while retaining the classification accuracy on the unseen classes and 2) achieves similar superiority under generalized ZS-OSR settings.

Submitted 7 July, 2023; originally announced July 2023.

ACM Class: I.4; I.5

arXiv:2306.17771 [pdf, other]

Precision Anti-Cancer Drug Selection via Neural Ranking

Authors: Vishal Dey, Xia Ning

Abstract: Personalized cancer treatment requires a thorough understanding of complex interactions between drugs and cancer cell lines in varying genetic and molecular contexts. To address this, high-throughput screening has been used to generate large-scale drug response data, facilitating data-driven computational models. Such models can capture complex drug-cell line interactions across various contexts in a fully data-driven manner. However, accurately prioritizing the most sensitive drugs for each cell line remains a significant challenge. To address this, we developed neural ranking approaches that leverage large-scale drug response data across multiple cell lines from diverse cancer types. Unlike existing approaches that primarily utilize regression and classification techniques for drug response prediction, we formulated the objective of drug selection and prioritization as a drug ranking problem. In this work, we proposed two neural listwise ranking methods that learn latent representations of drugs and cell lines, and then use those representations to score drugs in each cell line via a learnable scoring function. Specifically, we developed a neural listwise ranking method, List-One, on top of the existing method ListNet. Additionally, we proposed a novel listwise ranking method, List-All, that focuses on all the sensitive drugs instead of only the top sensitive drug, unlike List-One. Our results demonstrate that List-All outperforms the best baseline with significant improvements of as much as 8.6% in hit@20 across 50% of test cell lines. Furthermore, our analyses suggest that the learned latent spaces from our proposed methods demonstrate informative clustering structures and capture relevant underlying biological features. Moreover, our comprehensive empirical evaluation provides a thorough and objective comparison of the performance of different methods (including our proposed ones).
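
For readers unfamiliar with listwise ranking, the following is a minimal sketch of the top-one probability loss used by ListNet, the existing method the abstract says List-One builds on. It is not the authors' List-One or List-All objective; the function names and the toy scores are assumptions made for illustration only.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def listnet_top_one_loss(predicted_scores, true_scores):
    """Cross-entropy between the top-one distributions of the two score lists.

    Each score list is turned into a probability of each item being ranked
    first; the loss is smallest when the two distributions agree.
    """
    p_true = softmax(true_scores)
    p_pred = softmax(predicted_scores)
    return -sum(t * math.log(p) for t, p in zip(p_true, p_pred))


# Hypothetical sensitivity scores for three drugs in one cell line.
true_scores = [3.0, 2.0, 1.0]
loss_matching = listnet_top_one_loss([3.0, 2.0, 1.0], true_scores)
loss_reversed = listnet_top_one_loss([1.0, 2.0, 3.0], true_scores)
```

In this toy case the loss for the matching ordering is lower than for the reversed ordering, which is the signal a listwise ranker is trained on.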

Submitted 30 June, 2023; originally announced June 2023.

Comments: Accepted in BioKDD '23

arXiv:2306.08860 [pdf, other]

OMS-DPM: Optimizing the Model Schedule for Diffusion Probabilistic Models

Authors: Enshu Liu, Xuefei Ning, Zinan Lin, Huazhong Yang, Yu Wang

Abstract: Diffusion probabilistic models (DPMs) are a new class of generative models that have achieved state-of-the-art generation quality in various domains. Despite the promise, one major drawback of DPMs is the slow generation speed due to the large number of neural network evaluations required in the generation process. In this paper, we reveal an overlooked dimension -- model schedule -- for optimizing the trade-off between generation quality and speed. More specifically, we observe that small models, though having worse generation quality when used alone, could outperform large models in certain generation steps. Therefore, unlike the traditional way of using a single model, using different models in different generation steps in a carefully designed model schedule could potentially improve generation quality and speed simultaneously. We design OMS-DPM, a predictor-based search algorithm, to optimize the model schedule given an arbitrary generation time budget and a set of pre-trained models. We demonstrate that OMS-DPM can find model schedules that achieve better generation quality and speed than prior state-of-the-art methods across CIFAR-10, CelebA, ImageNet, and LSUN datasets. When applied to the public checkpoints of the Stable Diffusion model, we are able to accelerate the sampling by 2$\times$ while maintaining the generation quality.

Submitted 15 June, 2023; originally announced June 2023.

Comments: Accepted by ICML2023

arXiv:2305.06666 [pdf, other]

doi 10.1088/1367-2630/acd4de

Diagnosis of Fast Electron Transport by Coherent Transition Radiation

Authors: Yangchun Liu, Xiaochuan Ning, Dong Wu, Tianyi Liang, Peng Liu, Shujun Liu, Xu Liu, Zhengmao Sheng, Wei Hong, Yuqiu Gu, Xiantu He

Abstract: Transport of fast electrons in overdense plasmas is of key importance in high energy density physics. However, it is challenging to diagnose fast electron transport in experiments. In this article, we study coherent transition radiation (CTR) generated by fast electrons on the back surface of the target by using 2D and 3D first-principle particle-in-cell (PIC) simulations. In our simulations, an aluminium target of 2.7 g/cc is simulated in two different situations by using a newly developed high-order implicit PIC code. Compared with realistic simulations containing collision and ionization effects, artificial simulations that neglect these effects significantly underestimate the energy loss of the electron beam as it is transported through the target, and thus fail to describe the complete characteristics of the CTR produced by the electron beam on the back surface of the target. Realistic simulations indicate that the diameter of the CTR increases when the thickness of the target is increased. This is attributed to synergetic energy losses of high-flux fast electrons due to Ohmic heating and collisional drag, which appear quite significant even when the thickness of the solid target differs by only micrometers. In particular, when the diagnosing position is fixed, we find that the intensity distribution of the CTR is also a function of time, with the diameter increasing with time. As the diameter of the CTR is related to the speed of electrons passing through the back surface of the target, our finding may be used as a new tool to diagnose the electron energy spectra near the surface of solid-density plasmas.

Submitted 11 May, 2023; originally announced May 2023.

Comments: accepted by New Journal of Physics

arXiv:2304.08357 [pdf, other]

A high-efficiency proton-boron fusion scheme taking into account the effects of quantum degeneracy

Authors: S. J. Liu, D. Wu, T. X. Hu, T. Y. Liang, X. C. Ning, J. H. Liang, Y. C. Liu, P. Liu, X. Liu, Z. M. Sheng, Y. T. Zhao, D. H. H. Hoffmann, X. T. He, J. Zhang

Abstract: The proton-boron (p-$^{11}$B) reaction is regarded as the holy grail of advanced fusion fuels, since the primary reaction produces three $α$ particles with few neutrons and induced radio-activities from second order reactions. Compared to the Deuterium-Tritium reaction a much higher reaction temperature is required. Moreover, bremsstrahlung energy losses due to the high nuclear charge of boron deem it seemingly apparent that a fusion reactor based on Deuterium-Tritium plasma in equilibrium is to say the least very difficult. It is becoming more appealing to collide intense laser beams or accelerated proton beams with a boron target to produce p-$^{11}$B reactions. The fusion yield of p-$^{11}$B reactions is closely related to proton beam parameters and boron target conditions such as density, temperature, and ingredients. Quantum degeneracy will increase fusion yields by reducing the stopping power of injected protons. In this work, we suggest a high-efficiency scheme for beam-target p-$^{11}$B fusions via injecting a MeV proton beam into a highly compressed quantum degenerated boron target. Such a boron target can be achieved via quasi-isentropic compression of solid boron by using precisely shaped laser pulses. Our results indicate that for densities ranging from $10^3$ to $10^4 ρ_s$, where $ρ_s$ is the density of solid boron, contributions of bound and free electrons to the stopping of protons can be completely disregarded and dramatically reduced, respectively. The result is an increase in fusion yield by orders of magnitude. Furthermore, in order to achieve a multiplication factor $F$ greater than one, with $F$ defined as the ratio of output fusion energy to the energy of injected protons, it is found that there exists a minimum possible density of the boron target, which is $2.15 \times 10^4 ρ_s$ when the kinetic energy of injected protons is $0.8$ MeV.

Submitted 17 April, 2023; originally announced April 2023.

arXiv:2304.08130 [pdf, other]

A Survey on Few-Shot Class-Incremental Learning

Authors: Songsong Tian, Lusi Li, Weijun Li, Hang Ran, Xin Ning, Prayag Tiwari

Abstract: Large deep learning models are impressive, but they struggle when real-time data is not available. Few-shot class-incremental learning (FSCIL) poses a significant challenge for deep neural networks to learn new tasks from just a few labeled samples without forgetting the previously learned ones. This setup easily leads to catastrophic forgetting and overfitting problems, severely affecting model performance. Studying FSCIL helps overcome deep learning model limitations on data volume and acquisition time, while improving practicality and adaptability of machine learning models. This paper provides a comprehensive survey on FSCIL. Unlike previous surveys, we aim to synthesize few-shot learning and incremental learning, focusing on introducing FSCIL from two perspectives, while reviewing over 30 theoretical research studies and more than 20 applied research studies. From the theoretical perspective, we provide a novel categorization approach that divides the field into five subcategories, including traditional machine learning methods, meta-learning based methods, feature and feature space-based methods, replay-based methods, and dynamic network structure-based methods. We also evaluate the performance of recent theoretical research on benchmark datasets of FSCIL. From the application perspective, FSCIL has achieved impressive results in various fields of computer vision such as image classification, object detection, and image segmentation, as well as in natural language processing and graph. We summarize the important applications. Finally, we point out potential future research directions, including applications, problem setups, and theory development. Overall, this paper offers a comprehensive analysis of the latest advances in FSCIL from a methodological, performance, and application perspective.

Submitted 23 October, 2023; v1 submitted 17 April, 2023; originally announced April 2023.

arXiv:2303.02308 [pdf, other]

A Physics-based and Data-driven Approach for Localized Statistical Channel Modeling

Authors: Shutao Zhang, Xinzhi Ning, Xi Zheng, Qingjiang Shi, Tsung-Hui Chang, Zhi-Quan Luo

Abstract: Localized channel modeling is crucial for offline performance optimization of 5G cellular networks, but the existing channel models are for general scenarios and do not capture local geographical structures. In this paper, we propose a novel physics-based and data-driven localized statistical channel modeling (LSCM), which is capable of sensing the physical geographical structures of the targeted cellular environment. The proposed channel modeling solely relies on the reference signal receiving power (RSRP) of the user equipment, unlike the traditional methods which use full channel impulse response matrices. The key is to build the relationship between the RSRP and the channel's angular power spectrum. Based on it, we formulate the task of channel modeling as a sparse recovery problem where the non-zero entries of the sparse vector indicate the channel paths' powers and angles of departure. A computationally efficient weighted non-negative orthogonal matching pursuit (WNOMP) algorithm is devised for solving the formulated problem. Finally, experiments based on synthetic and real RSRP measurements are presented to examine the performance of the proposed method.

Submitted 3 March, 2023; originally announced March 2023.

Comments: the 34th International Teletraffic Congress (ITC), Shenzhen, China, 2022

arXiv:2303.02162 [pdf, other]

T-Cell Receptor Optimization with Reinforcement Learning and Mutation Policies for Precision Immunotherapy

Authors: Ziqi Chen, Martin Renqiang Min, Hongyu Guo, Chao Cheng, Trevor Clancy, Xia Ning

Abstract: T cells monitor the health status of cells by identifying foreign peptides displayed on their surface. T-cell receptors (TCRs), which are protein complexes found on the surface of T cells, are able to bind to these peptides. This process is known as TCR recognition and constitutes a key step for immune response. Optimizing TCR sequences for TCR recognition represents a fundamental step towards the development of personalized treatments to trigger immune responses killing cancerous or virus-infected cells. In this paper, we formulated the search for these optimized TCRs as a reinforcement learning (RL) problem, and presented a framework TCRPPO with a mutation policy using proximal policy optimization. TCRPPO mutates TCRs into effective ones that can recognize given peptides. TCRPPO leverages a reward function that combines the likelihoods of mutated sequences being valid TCRs measured by a new scoring function based on deep autoencoders, with the probabilities of mutated sequences recognizing peptides from a peptide-TCR interaction predictor. We compared TCRPPO with multiple baseline methods and demonstrated that TCRPPO significantly outperforms all the baseline methods to generate positive binding and valid TCRs. These results demonstrate the potential of TCRPPO for both precision immunotherapy and peptide-recognizing TCR motif discovery.

Submitted 2 March, 2023; originally announced March 2023.

arXiv:2302.05666 [pdf, other]

Jaccard Metric Losses: Optimizing the Jaccard Index with Soft Labels

Authors: Zifu Wang, Xuefei Ning, Matthew B. Blaschko

Abstract: Intersection over Union (IoU) losses are surrogates that directly optimize the Jaccard index. Leveraging IoU losses as part of the loss function has demonstrated superior performance in semantic segmentation tasks compared to optimizing pixel-wise losses such as the cross-entropy loss alone. However, we identify a lack of flexibility in these losses to support vital training techniques like label smoothing, knowledge distillation, and semi-supervised learning, mainly due to their inability to process soft labels. To address this, we introduce Jaccard Metric Losses (JMLs), which are identical to the soft Jaccard loss in standard settings with hard labels but are fully compatible with soft labels. We apply JMLs to three prominent use cases of soft labels: label smoothing, knowledge distillation and semi-supervised learning, and demonstrate their potential to enhance model accuracy and calibration. Our experiments show consistent improvements over the cross-entropy loss across 4 semantic segmentation datasets (Cityscapes, PASCAL VOC, ADE20K, DeepGlobe Land) and 13 architectures, including classic CNNs and recent vision transformers. Remarkably, our straightforward approach significantly outperforms state-of-the-art knowledge distillation and semi-supervised learning methods. The code is available at https://github.com/zifuwanggg/JDTLosses.
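
The soft-label issue mentioned in this abstract can be illustrated with a small sketch. The first function is the standard soft Jaccard loss. The second is an L1-norm-based form that agrees with it on hard labels and reaches zero when prediction and soft label coincide. Treating the second function as the paper's exact JML definition is an assumption, as are the function names and toy inputs; both functions assume non-negative inputs that are not all zero.

```python
def soft_jaccard_loss(pred, label):
    """Standard soft Jaccard loss: 1 - <p, y> / (|p| + |y| - <p, y>)."""
    inter = sum(p * y for p, y in zip(pred, label))
    union = sum(pred) + sum(label) - inter
    return 1.0 - inter / union


def l1_jaccard_loss(pred, label):
    """L1-norm form: 1 - (|p + y|_1 - |p - y|_1) / (|p + y|_1 + |p - y|_1)."""
    sum_norm = sum(abs(p + y) for p, y in zip(pred, label))
    diff_norm = sum(abs(p - y) for p, y in zip(pred, label))
    return 1.0 - (sum_norm - diff_norm) / (sum_norm + diff_norm)


# Hard labels: the two losses coincide.
hard_pred, hard_label = [0.8, 0.3, 0.6], [1.0, 0.0, 1.0]
hard_soft = soft_jaccard_loss(hard_pred, hard_label)
hard_l1 = l1_jaccard_loss(hard_pred, hard_label)

# Soft labels equal to the prediction: only the L1 form reaches zero.
same = [0.5, 0.5]
soft_on_same = soft_jaccard_loss(same, same)
l1_on_same = l1_jaccard_loss(same, same)
```

With soft labels, the soft Jaccard loss stays positive even for a perfect prediction, which is one concrete way to see why it is a poor fit for label smoothing or distillation targets.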

Submitted 20 March, 2024; v1 submitted 11 February, 2023; originally announced February 2023.

Comments: NeurIPS 2023

arXiv:2302.00932 [pdf, other]

Dynamic Ensemble of Low-fidelity Experts: Mitigating NAS "Cold-Start"

Authors: Junbo Zhao, Xuefei Ning, Enshu Liu, Binxin Ru, Zixuan Zhou, Tianchen Zhao, Chen Chen, Jiajin Zhang, Qingmin Liao, Yu Wang

Abstract: Predictor-based Neural Architecture Search (NAS) employs an architecture performance predictor to improve the sample efficiency. However, predictor-based NAS suffers from the severe "cold-start" problem, since a large amount of architecture-performance data is required to get a working predictor. In this paper, we focus on exploiting information in cheaper-to-obtain performance estimations (i.e., low-fidelity information) to mitigate the large data requirements of predictor training. Despite the intuitiveness of this idea, we observe that using inappropriate low-fidelity information even damages the prediction ability and different search spaces have different preferences for low-fidelity information types. To solve the problem and better fuse beneficial information provided by different types of low-fidelity information, we propose a novel dynamic ensemble predictor framework that comprises two steps. In the first step, we train different sub-predictors on different types of available low-fidelity information to extract beneficial knowledge as low-fidelity experts. In the second step, we learn a gating network to dynamically output a set of weighting coefficients conditioned on each input neural architecture, which will be used to combine the predictions of different low-fidelity experts in a weighted sum. The overall predictor is optimized on a small set of actual architecture-performance data to fuse the knowledge from different low-fidelity experts to make the final prediction. We conduct extensive experiments across five search spaces with different architecture encoders under various experimental settings. Our method can easily be incorporated into existing predictor-based NAS frameworks to discover better architectures.
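
The second step described in this abstract, combining expert predictions in a weighted sum with coefficients produced per input, can be sketched as follows. The softmax normalization of the gate outputs, the function names, and the numbers are assumptions for illustration; the abstract does not specify how the coefficients are normalized or how the gating network is built.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def ensemble_predict(expert_predictions, gate_logits):
    """Weighted sum of expert predictions for one architecture.

    gate_logits stands in for the output of a gating network evaluated on
    that architecture (hypothetical here; no network is actually trained).
    """
    weights = softmax(gate_logits)
    return sum(w * p for w, p in zip(weights, expert_predictions))


# Two hypothetical low-fidelity experts scoring the same architecture.
uniform = ensemble_predict([0.2, 0.8], [0.0, 0.0])
first_expert_dominates = ensemble_predict([0.2, 0.8], [10.0, -10.0])
```

Equal gate logits reduce to a plain average of the experts, while a strongly peaked gate effectively selects a single expert for that input.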

Submitted 2 February, 2023; originally announced February 2023.

arXiv:2301.03010 [pdf, other]

doi 10.1103/PhysRevLett.131.041001

Search for light dark matter from atmosphere in PandaX-4T

Authors: Xuyang Ning, Abdusalam Abdukerim, Zihao Bo, Wei Chen, Xun Chen, Yunhua Chen, Chen Cheng, Zhaokan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Changbo Fu, Mengting Fu, Lisheng Geng, Karl Giboni, Linhui Gu, Xuyuan Guo, Chencheng Han, Ke Han, Changda He, Jinrong He, Di Huang, Yanlin Huang, Zhou Huang, Ruquan Hou , et al. (70 additional authors not shown)

Abstract: We report a search for light dark matter produced through the cascading decay of $η$ mesons, which are created as a result of inelastic collisions between cosmic rays and Earth's atmosphere. We introduce a new and general framework, publicly accessible, designed to address boosted dark matter specifically, with which a full and dedicated simulation including both elastic and quasi-elastic processes of Earth attenuation effect on the dark matter particles arriving at the detector is performed. In the PandaX-4T commissioning data of 0.63 tonne$\cdot$year exposure, no significant excess over background is observed. The first constraints on the interaction between light dark matter generated in the atmosphere and nucleus through a light scalar mediator are obtained. The lowest excluded cross-section is set at $5.9 \times 10^{-37}{\rm cm^2}$ for dark matter mass of $0.1$ MeV$/c^2$ and mediator mass of 300 MeV$/c^2$. The lowest upper limit of $η$ to dark matter decay branching ratio is $1.6 \times 10^{-7}$.

Submitted 25 July, 2023; v1 submitted 8 January, 2023; originally announced January 2023.

Comments: 6 pages, 3 figures

arXiv:2211.00407 [pdf, other]

Missing data interpolation in integrative multi-cohort analysis with disparate covariate information

Authors: Ekaterina Smirnova, Yongqi Zhong, Rasha Alsaadawi, Xu Ning, Amii Kress, Jordan Kuiper, Mingyu Zhang, Kristen Lyall, Sheenas Martenies, Akram Alshawabkeh, Catherine Bulka, Carlos Camargo, Jaeun Choi, Elena Colicino, Anne Dunlop, Michael Elliott, Assiamira Ferrara, Tebeb Gebrestadik, Jiang Gui, Kylie Harrall, Tina Hartert, Barry Lester, Andrew Manigault, Justin Manjourides, Yu Ni , et al. (4 additional authors not shown)

Abstract: Integrative analysis of datasets generated by multiple cohorts is a widely-used approach for increasing sample size, precision of population estimators, and generalizability of analysis results in epidemiological studies. However, often each individual cohort dataset does not have all variables of interest for an integrative analysis collected as a part of an original study. Such cohort-level missingness poses methodological challenges to the integrative analysis since missing variables have traditionally: (1) been removed from the data for complete case analysis; or (2) been completed by missing data interpolation techniques using data with the same covariate distribution from other studies. In most integrative-analysis studies, neither approach is optimal, as it leads to either losing the majority of study covariates or challenges in specifying the cohorts following the same distributions. We propose a novel approach to identify the studies with the same distributions that could be used for completing the cohort-level missing information. Our methodology relies on (1) identifying sub-groups of cohorts with similar covariate distributions using cohort identity random forest prediction models followed by clustering; and then (2) applying a recursive pairwise distribution test for high dimensional data to these sub-groups. Extensive simulation studies show that cohorts with the same distribution are correctly grouped together in almost all simulation settings. Our methods' application to two ECHO-wide Cohort Studies reveals that the cohorts grouped together reflect the similarities in study design. The methods are implemented in R software package relate.

Submitted 1 November, 2022; originally announced November 2022.

arXiv:2210.16736 [pdf]

Equation of state for tungsten predicted by ensemble theory

Authors: Yue-Yue Tian, Bo-Yuan Ning, X. -D. Xiang, Hui-Fen Zhang, Xi-Jing Ning

Abstract: Equation of state (EOS) for bcc tungsten at 300 K (or 3000 K) up to 1000 GPa (or 300 GPa) was predicted for the first time by solving the partition function via a direct integral approach (DIA) with ab initio calculations of the atoms' interactions. Compared with available experiments under static compressions up to 150 GPa (or 35 GPa) for room temperature (or 1673 K), all the calculated results are within the experimental uncertainty achieved very recently. Furthermore, the same procedure was performed to investigate the shock wave experiments on the EOS up to 400 GPa and 10000 K, and the calculated average pressure deviates from the experimental measurements by only 2.0%. These facts suggest that the other calculated results of DIA for the EOS are reliable, and DIA as a universal method without any artificial parameters could be widely applied to predict EOS of various materials under various conditions.

Submitted 1 November, 2022; v1 submitted 29 October, 2022; originally announced October 2022.

Comments: 5 figures, 14 figures

arXiv:2209.07997 [pdf, other]

Recursive Attentive Methods with Reused Item Representations for Sequential Recommendation

Authors: Bo Peng, Srinivasan Parthasarathy, Xia Ning

Abstract: Sequential recommendation aims to recommend the next item of users' interest based on their historical interactions. Recently, the self-attention mechanism has been adapted for sequential recommendation, and demonstrated state-of-the-art performance. However, in this manuscript, we show that the self-attention-based sequential recommendation methods could suffer from the localization-deficit issue. As a consequence, in these methods, over the first few blocks, the item representations may quickly diverge from their original representations, and thus impair the learning in the following blocks. To mitigate this issue, in this manuscript, we develop a recursive attentive method with reused item representations (RAM) for sequential recommendation. We compare RAM with five state-of-the-art baseline methods on six public benchmark datasets. Our experimental results demonstrate that RAM significantly outperforms the baseline methods on benchmark datasets, with an improvement of as much as 11.3%. Our stability analysis shows that RAM could enable deeper and wider models for better performance. Our run-time performance comparison signifies that RAM could also be more efficient on benchmark datasets.

Submitted 16 September, 2022; originally announced September 2022.

arXiv:2208.03626 [pdf, other]

doi 10.1016/j.physletb.2022.137487

Constraints on the axial-vector and pseudo-scalar mediated WIMP-nucleus interactions from PandaX-4T experiment

Authors: Zhou Huang, Chencheng Han, Abdusalam Abdukerim, Zihao Bo, Wei Chen, Xun Chen, Yunhua Chen, Chen Cheng, Zhaokan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Changbo Fu, Mengting Fu, Lisheng Geng, Karl Giboni, Linhui Gu, Xuyuan Guo, Ke Han, Changda He, Jinrong He, Di Huang, Yanlin Huang, Ruquan Hou, Xiangdong Ji , et al. (66 additional authors not shown)

Abstract: We present the constraints on the axial-vector and pseudo-scalar mediated WIMP-nucleus interactions from the PandaX-4T experiment, using the data set corresponding to a total exposure of 0.63 tonne$\cdot$year. No significant signal excess is observed, and the most stringent upper limits to date on the spin-dependent WIMP-neutron scattering cross section are set at 90% confidence level with the minimum WIMP-neutron scattering cross section of $5.8 \times 10^{-42}$ cm$^2$ for WIMP mass of 40 GeV/$c^2$. Exclusion limits on the axial-vector and pseudo-scalar simplified models are also derived.

Submitted 26 August, 2024; v1 submitted 6 August, 2022; originally announced August 2022.

Journal ref: Physics Letters B 834 (2022) 137487

arXiv:2207.07942 [pdf]

doi 10.1021/acsnano.2c03622

Observation of magnetism induced topological edge state in antiferromagnetic topological insulator MnBi4Te7

Authors: HaoKe Xu, Mingqiang Gu, Fucong Fei, YiSheng Gu, Dang Liu, QiaoYan Yu, ShaSha Xue, XuHui Ning, Bo Chen, Hangkai Xie, Zhen Zhu, Dandan Guan, Shiyong Wang, Yaoyi Li, Canhua Liu, Qihang Liu, Fengqi Song, Hao Zheng, Jinfeng Jia

Abstract: Breaking time reversal symmetry in a topological insulator may lead to quantum anomalous Hall effect and axion insulator phase. MnBi4Te7 is a recently discovered antiferromagnetic topological insulator with TN ~12.5 K, which is constituted of alternately stacked magnetic layers (MnBi2Te4) and non-magnetic layers (Bi2Te3). By means of scanning tunneling spectroscopy, we clearly observe the electronic state present at a step edge of a magnetic MnBi2Te4 layer but absent at non-magnetic Bi2Te3 layers at 4.5 K. Furthermore, we find that as the temperature rises above TN, the edge state vanishes, while the point-defect-induced state persists as the temperature increases. These results confirm the observation of magnetism induced edge states. Our analysis based on an axion insulator theory reveals the nontrivial topological nature of the observed edge state.

Submitted 16 July, 2022; originally announced July 2022.

arXiv:2207.07868 [pdf, other]

CLOSE: Curriculum Learning On the Sharing Extent Towards Better One-shot NAS

Authors: Zixuan Zhou, Xuefei Ning, Yi Cai, Jiashu Han, Yiping Deng, Yuhan Dong, Huazhong Yang, Yu Wang

Abstract: One-shot Neural Architecture Search (NAS) has been widely used to discover architectures due to its efficiency. However, previous studies reveal that one-shot performance estimations of architectures might not be well correlated with their performances in stand-alone training because of the excessive sharing of operation parameters (i.e., large sharing extent) between architectures. Thus, recent methods construct even more over-parameterized supernets to reduce the sharing extent. But these improved methods introduce a large number of extra parameters and thus cause an undesirable trade-off between the training costs and the ranking quality. To alleviate the above issues, we propose to apply Curriculum Learning On Sharing Extent (CLOSE) to train the supernet both efficiently and effectively. Specifically, we train the supernet with a large sharing extent (an easier curriculum) at the beginning and gradually decrease the sharing extent of the supernet (a harder curriculum). To support this training strategy, we design a novel supernet (CLOSENet) that decouples the parameters from operations to realize a flexible sharing scheme and adjustable sharing extent. Extensive experiments demonstrate that CLOSE can obtain a better ranking quality across different computational budget constraints than other one-shot supernets, and is able to discover superior architectures when combined with various search strategies. Code is available at https://github.com/walkerning/aw_nas.

Submitted 16 July, 2022; originally announced July 2022.

Comments: accepted by ECCV 2022 (14 pages main texts)

arXiv:2207.04883 [pdf, other]

doi 10.1103/PhysRevLett.130.021802

A First Search for Solar $^8$B Neutrino in the PandaX-4T Experiment using Neutrino-Nucleus Coherent Scattering

Authors: Wenbo Ma, Abdusalam Abdukerim, Chen Cheng, Zihao Bo, Wei Chen, Xun Chen, Yunhua Chen, Zhaokan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Changbo Fu, Mengting Fu, Lisheng Geng, Karl Giboni, Linhui Gu, Xuyuan Guo, Chencheng Han, Ke Han, Changda He, Jinrong He, Di Huang, Yanlin Huang, Zhou Huang, Ruquan Hou , et al. (66 additional authors not shown)

Abstract: A search for interactions from solar $^8$B neutrinos elastically scattering off xenon nuclei using PandaX-4T commissioning data is reported. The energy threshold of this search is further lowered compared with the previous search for dark matter, with various techniques utilized to suppress the background that emerges from data with the lowered threshold. A blind analysis is performed on the data with an effective exposure of 0.48 tonne$\cdot$year, and no significant excess of events is observed. Among results obtained using the neutrino-nucleus coherent scattering, our results give the best constraint on the solar $^8$B neutrino flux. We further provide a more stringent limit on the cross section between dark matter and nucleon in the mass range from 3 to 9 GeV/c$^2$.

Submitted 13 January, 2023; v1 submitted 11 July, 2022; originally announced July 2022.

Comments: 5 pages, 4 figures

Journal ref: Physical Review Letters 130, 021802 (2023)

arXiv:2206.06087 [pdf, other]

doi 10.1088/1674-1137/ac8539

Neutron-induced nuclear recoil background in the PandaX-4T experiment

Authors: Zhou Huang, Guofang Shen, Qiuhong Wang, Abdusalam Abdukerim, Zihao Bo, Wei Chen, Xun Chen, Yunhua Chen, Chen Cheng, Yunshan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Changbo Fu, Mengting Fu, Lisheng Geng, Karl Giboni, Linhui Gu, Xuyuan Guo, Chencheng Han, Ke Han, Changda He, Jinrong He, Di Huang, Yanlin Huang , et al. (55 additional authors not shown)

Abstract: Neutron-induced nuclear recoil background is critical to the dark matter searches in the PandaX-4T liquid xenon experiment. This paper studies the features of the neutron background in liquid xenon and evaluates its contribution to the single scattering nuclear recoil events through three methods. The first method is fully Monte Carlo simulation based. The last two are data-driven methods that also use the multiple scattering signals and high energy signals in the data, respectively. In the PandaX-4T commissioning data with an exposure of 0.63 tonne-year, all these methods give a consistent result of $1.15\pm0.57$ neutron-induced background events in the dark matter signal region within an approximate nuclear recoil energy window between 5 and 100 keV.

Submitted 29 July, 2022; v1 submitted 13 June, 2022; originally announced June 2022.

Comments: 14 pages, 14 figures, 6 tables

arXiv:2206.04882 [pdf, other]

doi 10.1038/s42004-023-00897-3

$\mathsf{G^2Retro}$ as a Two-Step Graph Generative Models for Retrosynthesis Prediction

Authors: Ziqi Chen, Oluwatosin R. Ayinde, James R. Fuchs, Huan Sun, Xia Ning

Abstract: Retrosynthesis is a procedure where a target molecule is transformed into potential reactants and thus the synthesis routes can be identified. Recently, computational approaches have been developed to accelerate the design of synthesis routes. In this paper, we develop a generative framework $\mathsf{G^2Retro}$ for one-step retrosynthesis prediction. $\mathsf{G^2Retro}$ imitates the reversed logic of synthetic reactions. It first predicts the reaction centers in the target molecules (products), identifies the synthons needed to assemble the products, and transforms these synthons into reactants. $\mathsf{G^2Retro}$ defines a comprehensive set of reaction center types, and learns from the molecular graphs of the products to predict potential reaction centers. To complete synthons into reactants, $\mathsf{G^2Retro}$ considers all the involved synthon structures and the product structures to identify the optimal completion paths, and accordingly attaches small substructures sequentially to the synthons. Here we show that $\mathsf{G^2Retro}$ is able to better predict the reactants for given products in the benchmark dataset than the state-of-the-art methods.

Submitted 5 June, 2023; v1 submitted 10 June, 2022; originally announced June 2022.

Journal ref: Commun Chem 6, 102 (2023)

arXiv:2206.02339 [pdf, other]

doi 10.1103/PhysRevLett.129.161804

A Search for Light Fermionic Dark Matter Absorption on Electrons in PandaX-4T

Authors: Dan Zhang, Abdusalam Abdukerim, Zihao Bo, Wei Chen, Xun Chen, Yunhua Chen, Chen Cheng, Zhaokan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Changbo Fu, Mengting Fu, Lisheng Geng, Karl Giboni, Linhui Gu, Xuyuan Guo, Ke Han, Changda He, Jinrong He, Di Huang, Yanlin Huang, Zhou Huang, Ruquan Hou, Xiangdong Ji , et al. (67 additional authors not shown)

Abstract: We report a search on a sub-MeV fermionic dark matter absorbed by electrons with an outgoing active neutrino using the 0.63 tonne-year exposure collected by PandaX-4T liquid xenon experiment. No significant signals are observed over the expected background. The data are interpreted into limits to the effective couplings between such dark matter and electrons. For axial-vector or vector interactions, our sensitivity is competitive in comparison to existing astrophysical bounds on the decay of such dark matter into photon final states. In particular, we present the first direct detection limits for an axial-vector (vector) interaction which are the strongest in the mass range from 25 to 45 (35 to 50) keV/c$^2$.

Submitted 4 July, 2023; v1 submitted 5 June, 2022; originally announced June 2022.

Journal ref: PhysRevLett (2022) 129.161804

arXiv:2206.01875 [pdf, other]

Prospective Preference Enhanced Mixed Attentive Model for Session-based Recommendation

Authors: Bo Peng, Chang-Yu Tai, Srinivasan Parthasarathy, Xia Ning

Abstract: Session-based recommendation aims to generate recommendations for the next item of users' interest based on a given session. In this manuscript, we develop prospective preference enhanced mixed attentive model (P2MAM) to generate session-based recommendations using two important factors: temporal patterns and estimates of users' prospective preferences. Unlike existing methods, P2MAM models the temporal patterns using a lightweight yet effective position-sensitive attention mechanism. In P2MAM, we also leverage the estimate of users' prospective preferences to signify important items, and generate better recommendations. Our experimental results demonstrate that P2MAM models significantly outperform the state-of-the-art methods in six benchmark datasets, with an improvement of as much as 19.2%. In addition, our run-time performance comparison demonstrates that during testing, P2MAM models are much more efficient than the best baseline method, with a significant average speedup of 47.7-fold.

Submitted 3 June, 2022; originally announced June 2022.

Comments: Under review by IEEE Transactions on Knowledge and Data Engineering (TKDE)

Journal ref: Springer Data Mining and Knowledge Discovery (DMKD) 2024

arXiv:2205.15771 [pdf, other]

doi 10.1103/PhysRevLett.129.161803

First search for the absorption of fermionic dark matter with the PandaX-4T experiment

Authors: Linhui Gu, Abdusalam Abdukerim, Zihao Bo, Wei Chen, Xun Chen, Yunhua Chen, Chen Cheng, Yunshan Cheng, Zhaokan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Changbo Fu, Mengting Fu, Lisheng Geng, Karl Giboni, Xuyuan Guo, Ke Han, Changda He, Jinrong He, Di Huang, Yanlin Huang, Zhou Huang, Ruquan Hou, Xiangdong Ji , et al. (64 additional authors not shown)

Abstract: Compared with the signature of dark matter elastic scattering off nuclei, the absorption of fermionic dark matter by nuclei opens up a new searching channel for light dark matter with a characteristic monoenergetic signal. In this Letter, we explore the $95.0$-day data from the PandaX-4T commissioning run and report the first dedicated searching results of the fermionic dark matter absorption signal through a neutral current process. No significant signal was found, and the lowest limit on the dark matter-nucleon interaction cross section is set to be $1.5\times10^{-50}$ cm$^2$ for a fermionic dark matter mass of $40$ MeV/$c^2$ with 90\% confidence level.

Submitted 14 October, 2022; v1 submitted 31 May, 2022; originally announced May 2022.

Comments: 6 pages, 6 figures

arXiv:2205.12809 [pdf, other]

doi 10.34133/2022/9798721

Measurement of Double Beta Decay Half-life of $^{136}$Xe with the PandaX-4T Detector

Authors: PandaX Collaboration, Lin Si, Zhaokan Cheng, Abdusalam Abdukerim, Zihao Bo, Wei Chen, Xun Chen, Yunhua Chen, Chen Cheng, Yunshan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Changbo Fu, Mengting Fu, Lisheng Geng, Karl Giboni, Linhui Gu, Xuyuan Guo, Ke Han, Changda He, Jinrong He, Di Huang, Yanlin Huang, Zhou Huang , et al. (63 additional authors not shown)

Abstract: Precise measurement of two-neutrino double beta decay (DBD) half-life is an important step for the searches of Majorana neutrinos with neutrinoless double beta decay. We report the measurement of DBD half-life of $^{136}$Xe using the PandaX-4T dual-phase Time Projection Chamber (TPC) with 3.7-tonne natural xenon and the first 94.9-day physics data release. The background model in the fiducial volume is well constrained in situ by events in the outer active region. With a $^{136}$Xe exposure of 15.5 kg-year, we establish the half-life as $2.27 \pm 0.03 (stat.)\pm 0.10 (syst.)\times 10^{21}$ years. This is the first DBD half-life measurement with natural xenon and demonstrates the physics capability of a large-scale liquid xenon TPC in the field of rare event searches.

Submitted 12 December, 2022; v1 submitted 25 May, 2022; originally announced May 2022.

Comments: 6 pages, 4 figures

Journal ref: Research, vol. 2022, Article ID 9798721, 2022

arXiv:2205.08066 [pdf, other]

doi 10.1016/j.physletb.2022.137254

A search for two-component Majorana dark matter in a simplified model using the full exposure data of PandaX-II experiment

Authors: Ying Yuan, Abdusalam Abdukerim, Zihao Bo, Wei Chen, Xun Chen, Yunhua Chen, Chen Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Changbo Fu, Mengting Fu, Lisheng Geng, Karl Giboni, Linhui Gu, Xuyuan Guo, Ke Han, Changda He, Jinrong He, Di Huang, Yanlin Huang, Zhou Huang, Ruquan Hou, Xiangdong Ji, Yonglin Ju , et al. (53 additional authors not shown)

Abstract: In the two-component Majorana dark matter model, one dark matter particle can scatter off the target nuclei, and turn into a slightly heavier component. In the framework of a simplified model with a vector boson mediator, both the tree-level and loop-level processes contribute to the signal in direct detection experiment. In this paper, we report the search results for such dark matter from PandaX-II experiment, using total data of the full 100.7 tonne$\cdot$day exposure. No significant excess is observed, so strong constraints on the combined parameter space of mediator mass and dark matter mass are derived. With the complementary search results from collider experiments, a large range of parameter space can be excluded.

Submitted 16 May, 2022; originally announced May 2022.

arXiv:2204.11175 [pdf, other]

doi 10.1088/1674-1137/ac7cd8

Study of background from accidental coincidence signals in the PandaX-II experiment

Authors: PandaX-II Collaboration, Abdusalam Abdukerim, Wei Chen, Xun Chen, Yunhua Chen, Chen Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Changbo Fu, Mengting Fu, Lisheng Geng, Karl Giboni, Linhui Gu, Xuyuan Guo, Ke Han, Changda He, Di Huang, Yan Huang, Yanlin Huang, Zhou Huang, Xiangdong Ji, Yonglin Ju, Shuaijie Li , et al. (42 additional authors not shown)

Abstract: The PandaX-II experiment employed a 580 kg liquid xenon detector to search for the interactions between dark matter particles and the target xenon atoms. The accidental coincidences of isolated signals result in a dangerous background which mimics the signature of the dark matter. We performed a detailed study on the accidental coincidence background in PandaX-II, including the possible origin of the isolated signals, the background level and corresponding background suppression method. With a boosted-decision-tree algorithm, the accidental coincidence background is reduced by 70% in the dark matter signal region, thus the sensitivity of dark matter search at PandaX-II is improved.

Submitted 1 July, 2022; v1 submitted 23 April, 2022; originally announced April 2022.

Comments: 20 pages, 12 figures in main text and 5 figures in the appendix. Accepted by Chinese Physics C

arXiv:2204.01942 [pdf, other]

Fault-Tolerant Deep Learning: A Hierarchical Perspective

Authors: Cheng Liu, Zhen Gao, Siting Liu, Xuefei Ning, Huawei Li, Xiaowei Li

Abstract: With the rapid advancements of deep learning in the past decade, it can be foreseen that deep learning will be continuously deployed in more and more safety-critical applications such as autonomous driving and robotics. In this context, reliability turns out to be critical to the deployment of deep learning in these applications and gradually becomes a first-class citizen among the major design metrics like performance and energy efficiency. Nevertheless, the black-box deep learning models combined with the diverse underlying hardware faults make resilient deep learning extremely challenging. In this special session, we conduct a comprehensive survey of fault-tolerant deep learning design approaches with a hierarchical perspective and investigate these approaches from model layer, architecture layer, circuit layer, and cross layer respectively.

Submitted 4 April, 2022; originally announced April 2022.

Comments: Special session submitted to VTS'22

ACM Class: B.2.3; B.8.1

arXiv:2203.09887 [pdf, other]

CodedVTR: Codebook-based Sparse Voxel Transformer with Geometric Guidance

Authors: Tianchen Zhao, Niansong Zhang, Xuefei Ning, He Wang, Li Yi, Yu Wang

Abstract: Transformers have gained much attention by outperforming convolutional neural networks in many 2D vision tasks. However, they are known to have generalization problems and rely on massive-scale pre-training and sophisticated training techniques. When applied to 3D tasks, the irregular data structure and limited data scale add to the difficulty of the transformer's application. We propose CodedVTR (Codebook-based Voxel TRansformer), which improves data efficiency and generalization ability for 3D sparse voxel transformers. On the one hand, we propose the codebook-based attention that projects an attention space into its subspace represented by the combination of "prototypes" in a learnable codebook. It regularizes attention learning and improves generalization. On the other hand, we propose geometry-aware self-attention that utilizes geometric information (geometric pattern, density) to guide attention learning. CodedVTR could be embedded into existing sparse convolution-based methods, and bring consistent performance improvements for indoor and outdoor 3D semantic segmentation tasks.

Submitted 27 March, 2022; v1 submitted 18 March, 2022; originally announced March 2022.

Comments: Published at CVPR2022

arXiv:2203.02125 [pdf, ps, other]

doi 10.1088/1361-648X/ac8907

Pressure-induced structural phase transition of vanadium: A revisit from the perspective of ensemble theory

Authors: Bo-Yuan Ning, Xi-Jing Ning

Abstract: For realistic crystals, the free energy strictly formulated in ensemble theory can hardly be obtained because of the difficulty in solving the high-dimension integral of the partition function, the dilemma of which makes it even a doubt if the rigorous ensemble theory is applicable to phase transitions of condensed matters. In the present work, the partition function of crystal vanadium under compression up to $320$ GPa at room temperature is solved by an approach developed very recently, and the derived equation of state is in a good agreement with all the experimental measurements, especially the latest one covering the widest pressure range up to $300$ GPa. Furthermore, the derived Gibbs free energy proves the very argument to understand most of the experiments reported in the past decade on the pressure-induced phase transition, and, especially, a novel phase transition sequence concerning three different phases observed very recently and the measured angles of two phases agree with our theoretical results excellently.

Submitted 3 March, 2022; originally announced March 2022.

Comments: 5pages, 5figures

Journal ref: J.Phys.Condens.Matter 34 425404 (2022)

arXiv:2202.09711 [pdf, other]

Compact polarized X-ray source based on all-optical inverse Compton scattering

Authors: Yue Ma, Jianfei Hua, Dexiang Liu, Yunxiao He, Tianliang Zhang, Jiucheng Chen, Fan Yang, Xiaonan Ning, Hongze Zhang, Yingchao Du, Wei Lu

Abstract: Polarized X-ray source is an important probe for many fields such as fluorescence imaging, magnetic microscopy, and nuclear physics research. All-optical inverse Compton scattering source (AOCS) based on laser wakefield accelerator (LWFA) has drawn great attention in recent years due to its compact scale and high performance, especially its potential to generate polarized X-rays. Here, polarization-tunable X-rays are generated by a plasma-mirror-based AOCS scheme. The linearly and circularly polarized AOCS pulses are achieved with the mean photon energy of 60($\pm$5)/64($\pm$3) keV and the single-shot photon yield of $\sim$1.1/1.3$\times10^7$. A Compton polarimeter is designed to diagnose the photon polarization states, demonstrating AOCS's polarization-tunable property, and indicating the average polarization degree of the linearly polarized AOCS is 75($\pm$3)%.

Submitted 19 February, 2022; originally announced February 2022.

arXiv:2112.08957 [pdf, other]

doi 10.1103/PhysRevLett.128.171801

A Search for the Cosmic Ray Boosted Sub-GeV Dark Matter at the PandaX-II Experiment

Authors: Xiangyi Cui, Abdusalam Abdukerim, Zihao Bo, Wei Chen, Xun Chen, Yunhua Chen, Chen Cheng, Yunshan Cheng, Yingjie Fan, Deqing Fang, Changbo Fu, Mengting Fu, Lisheng Geng, Karl Giboni, Linhui Gu, Xuyuan Guo, Ke Han, Changda He, Jinrong He, Di Huang, Yanlin Huang, Zhou Huang, Ruquan Hou, Xiangdong Ji, Yonglin Ju , et al. (54 additional authors not shown)

Abstract: We report a novel search for the cosmic ray boosted dark matter using the 100 tonne$\cdot$day full data set of the PandaX-II detector located at the China Jinping Underground Laboratory. With the extra energy gained from the cosmic rays, sub-GeV dark matter particles can produce visible recoil signals in the detector. The diurnal modulations in rate and energy spectrum are utilized to further enhance the signal sensitivity. Our result excludes the dark matter-nucleon elastic scattering cross section between 10$^{-31}$cm$^{2}$ and 10$^{-28}$cm$^{2}$ for dark matter masses from 0.1 MeV/$c^2$ to 0.1 GeV/$c^2$, with a large parameter space previously unexplored by experimental collaborations.

Submitted 11 April, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

Comments: 8 pages, 5 figures, 1 table. New constraints adopted a CRDM energy cutoff at 0.2 GeV, and some uncertainties from the cosmic ray propagation model and dark matter density are studied

arXiv:2112.06185 [pdf, other]

Multi-Agent Vulnerability Discovery for Autonomous Driving with Hazard Arbitration Reward

Authors: Weilin Liu, Ye Mu, Chao Yu, Xuefei Ning, Zhong Cao, Yi Wu, Shuang Liang, Huazhong Yang, Yu Wang

Abstract: Discovering hazardous scenarios is crucial in testing and further improving driving policies. However, conducting efficient driving policy testing faces two key challenges. On the one hand, the probability of naturally encountering hazardous scenarios is low when testing a well-trained autonomous driving strategy. Thus, discovering these scenarios by purely real-world road testing is extremely costly. On the other hand, a proper determination of accident responsibility is necessary for this task. Collecting scenarios with wrong-attributed responsibilities will lead to an overly conservative autonomous driving strategy. To be more specific, we aim to discover hazardous scenarios that are autonomous-vehicle responsible (AV-responsible), i.e., the vulnerabilities of the under-test driving policy. To this end, this work proposes a Safety Test framework by finding Av-Responsible Scenarios (STARS) based on multi-agent reinforcement learning. STARS guides other traffic participants to produce Av-Responsible Scenarios and make the under-test driving policy misbehave via introducing Hazard Arbitration Reward (HAR). HAR enables our framework to discover diverse, complex, and AV-responsible hazardous scenarios. Experimental results against four different driving policies in three environments demonstrate that STARS can effectively discover AV-responsible hazardous scenarios. These scenarios indeed correspond to the vulnerabilities of the under-test driving policies, thus are meaningful for their further improvements.

Submitted 12 December, 2021; originally announced December 2021.

arXiv:2112.02892 [pdf, other]

doi 10.1007/JHEP06(2022)147

Low Radioactive Material Screening and Background Control for the PandaX-4T Experiment

Authors: Zhicheng Qian, Lin Si, Abdusalam Abdukerim, Zihao Bo, Wei Chen, Xun Chen, Yunhua Chen, Chen Cheng, Yunshan Cheng, Xiangyi Cui, Yingjie Fan, Deqing Fang, Changbo Fu, Mengting Fu, Lisheng Geng, Karl Giboni, Linhui Gu, Xuyuan Guo, Ke Han, Changda He, Jinrong He, Di Huang, Yanlin Huang, Zhou Huang, Ruquan Hou , et al. (54 additional authors not shown)

Abstract: PandaX-4T is a ton-scale dark matter direct detection experiment using a dual-phase TPC technique at the China Jinping Underground Laboratory. Various ultra-low background technologies have been developed and applied to material screening for PandaX-4T, including HPGe gamma spectroscopy, ICP-MS, NAA, radon emanation measurement system, krypton assay station, and alpha detection system. Low background materials were selected to assemble the detector. Surface treatment procedures were investigated to further suppress radioactive background. Combining measured results and Monte Carlo simulation, the total material background rates of PandaX-4T in the energy region of 1-25 keV$\rm{}_{ee}$ are estimated to be (9.9 $\pm$ 1.9) $\times \ 10^{-3}$ mDRU for electron recoil and (2.8 $\pm$ 0.6) $\times \ 10^{-4}$ mDRU for nuclear recoil. In addition, $^{nat}$Kr in the detector is estimated to be <8 ppt.

Submitted 23 April, 2022; v1 submitted 6 December, 2021; originally announced December 2021.

Comments: 19 pages, 7 figures, 12 tables

arXiv:2111.07439 [pdf, other]

doi 10.1021/acsomega.1c06805

Improving Compound Activity Classification via Deep Transfer and Representation Learning

Authors: Vishal Dey, Raghu Machiraju, Xia Ning

Abstract: Recent advances in molecular machine learning, especially deep neural networks such as Graph Neural Networks (GNNs) for predicting structure activity relationships (SAR), have shown tremendous potential in computer-aided drug discovery. However, the applicability of such deep neural networks is limited by the requirement of large amounts of training data. In order to cope with limited training data for a target task, transfer learning for SAR modeling has been recently adopted to leverage information from data of related tasks. In this work, in contrast to the popular parameter-based transfer learning such as pretraining, we develop novel deep transfer learning methods TAc and TAc-fc to leverage source domain data and transfer useful information to the target domain. TAc learns to generate effective molecular features that can generalize well from one domain to another, and increase the classification performance in the target domain. Additionally, TAc-fc extends TAc by incorporating novel components to selectively learn feature-wise and compound-wise transferability. We used the bioassay screening data from PubChem, and identified 120 pairs of bioassays such that the active compounds in each pair are more similar to each other compared to its inactive compounds. Our experiments clearly demonstrate that TAc achieves significant improvement over all baselines across a large number of target tasks. Furthermore, although TAc-fc achieves slightly worse ROC-AUC on average compared to TAc, TAc-fc still achieves the best performance on more tasks in terms of PR-AUC and F1 compared to other methods. In summary, TAc-fc is also found to be a strong model with competitive or even better performance than TAc on a notable number of target tasks.

Submitted 8 March, 2022; v1 submitted 14 November, 2021; originally announced November 2021.

Comments: This manuscript has been accepted at ACS Omega

Journal ref: ACS Omega 2022

Showing 51–100 of 172 results for author: Ning, X