-
Design of a large-scale superconducting dipole magnet for the CEE spectrometer
Authors:
Yuquan Chen,
Wei You,
Jiaqi Lu,
Yujin Tong,
Luncai Zhou,
Beimin Wu,
Enming Mei,
Wentian Feng,
Xianjin Ou,
Wei Wu,
Qinggao Yao,
Peng Yang,
Yuhong Yu,
Zhiyu Sun
Abstract:
The CSR External-target Experiment (CEE) is a large-scale spectrometer under construction at the Heavy Ion Research Facility in Lanzhou (HIRFL) for studying the phase structure of nuclear matter at high baryon density and the equation of states of nuclear matter at supra-saturation densities. One of the key components is a large acceptance dipole magnet with a central field of 0.5 T and the homoge…
▽ More
The CSR External-target Experiment (CEE) is a large-scale spectrometer under construction at the Heavy Ion Research Facility in Lanzhou (HIRFL) for studying the phase structure of nuclear matter at high baryon density and the equation of states of nuclear matter at supra-saturation densities. One of the key components is a large acceptance dipole magnet with a central field of 0.5 T and the homogeneity of 5% within a 1 m long, 1.2 m wide, and 0.9 m high aperture. Detectors will be installed within this aperture. An innovative design for the superconducting detector magnet is proposed that goes beyond the conventional approach. The magnet is designed as a coil-dominant type, with conductors discretized on a racetrack-shaped cross-section to generate the necessary fields. A warm iron yoke is used to enhance the central field and minimize the stray field. The magnet has overall dimensions of 3.4 meters in length, 2.7 meters in height, and 4.3 meters in width. The coils will be wound using a 19-strand rope cable comprised of 12 NbTi superconducting wires and 7 copper wires. The ratio of copper to superconductor of the cable is 6.9. The keel supports serve as the primary structural support for the coils to withstand the electromagnetic force. The coils will be indirectly cooled by liquid helium within three external helium vessels. To ensure reliable protection of the magnet during a quench, an active protection method combined with quench-back effect is employed. In this paper, we mainly present the detailed design of the magnetic field, structure, quench protection and cryostat for the spectrometer magnet.
△ Less
Submitted 3 September, 2024;
originally announced September 2024.
-
Tunable interfacial Rashba spin-orbit coupling in asymmetric Al$_x$In$_{1-x}$Sb/InSb/CdTe quantum well heterostructures
Authors:
Hanzhi Ruan,
Zhenghang Zhi,
Yuyang Wu,
Jiuming Liu,
Puyang Huang,
Shan Yao,
Xinqi Liu,
Chenjia Tang,
Qi Yao,
Lu Sun,
Yifan Zhang,
Yujie Xiao,
Renchao Che,
Xufeng Kou
Abstract:
The manipulation of Rashba-type spin-orbit coupling (SOC) in molecular beam epitaxy-grown Al$_x$In$_{1-x}$Sb/InSb/CdTe quantum well heterostructures is reported. The effective band bending provides robust two-dimensional quantum confinement, while the unidirectional built-in electric field from the asymmetric hetero-interfaces results in pronounced Rashba SOC strength. By tuning the Al concentrati…
▽ More
The manipulation of Rashba-type spin-orbit coupling (SOC) in molecular beam epitaxy-grown Al$_x$In$_{1-x}$Sb/InSb/CdTe quantum well heterostructures is reported. The effective band bending provides robust two-dimensional quantum confinement, while the unidirectional built-in electric field from the asymmetric hetero-interfaces results in pronounced Rashba SOC strength. By tuning the Al concentration in the top Al$_x$In$_{1-x}$Sb barrier layer, the optimal structure with $x = 0.15$ shows the largest Rashba coefficient of 0.23 eV-Angstrom. and the highest low-temperature electron mobility of 4400 cm$^2$/Vs . Quantitative investigations of the weak anti-localization effect further confirm the dominant D'yakonov-Perel (DP) spin relaxation mechanism during charge-to-spin conversion. These findings highlight the significance of quantum well engineering in shaping magneto-resistance responses, and narrow bandgap semiconductor-based heterostructures may offer a reliable platform for energy-efficient spintronic applications.
△ Less
Submitted 19 August, 2024;
originally announced August 2024.
-
HySparK: Hybrid Sparse Masking for Large Scale Medical Image Pre-Training
Authors:
Fenghe Tang,
Ronghao Xu,
Qingsong Yao,
Xueming Fu,
Quan Quan,
Heqin Zhu,
Zaiyi Liu,
S. Kevin Zhou
Abstract:
The generative self-supervised learning strategy exhibits remarkable learning representational capabilities. However, there is limited attention to end-to-end pre-training methods based on a hybrid architecture of CNN and Transformer, which can learn strong local and global representations simultaneously. To address this issue, we propose a generative pre-training strategy called Hybrid Sparse mas…
▽ More
The generative self-supervised learning strategy exhibits remarkable learning representational capabilities. However, there is limited attention to end-to-end pre-training methods based on a hybrid architecture of CNN and Transformer, which can learn strong local and global representations simultaneously. To address this issue, we propose a generative pre-training strategy called Hybrid Sparse masKing (HySparK) based on masked image modeling and apply it to large-scale pre-training on medical images. First, we perform a bottom-up 3D hybrid masking strategy on the encoder to keep consistency masking. Then we utilize sparse convolution for the top CNNs and encode unmasked patches for the bottom vision Transformers. Second, we employ a simple hierarchical decoder with skip-connections to achieve dense multi-scale feature reconstruction. Third, we implement our pre-training method on a collection of multiple large-scale 3D medical imaging datasets. Extensive experiments indicate that our proposed pre-training strategy demonstrates robust transfer-ability in supervised downstream tasks and sheds light on HySparK's promising prospects. The code is available at https://github.com/FengheTan9/HySparK
△ Less
Submitted 11 August, 2024;
originally announced August 2024.
-
Ultrabright-entanglement-based quantum key distribution over a 404-km-long optical fiber
Authors:
Shi-Chang Zhuang,
Bo Li,
Ming-Yang Zheng,
Yi-Xi Zeng,
Hui-Nan Wu,
Guang-Bing Li,
Quan Yao,
Xiu-Ping Xie,
Yu-Huai Li,
Hao Qin,
Li-Xing You,
Fei-Hu Xu,
Juan Yin,
Yuan Cao,
Qiang Zhang,
Cheng-Zhi Peng,
Jian-Wei Pan
Abstract:
The entangled photons are crucial resources for quantum communications and networking. Here, we present an ultra-bright polarization-entangled photon source based on a periodically poled lithium niobate waveguide designed for practical quantum communication networks. Using a 780 nm pump laser, the source achieves a pair generation rate of 2.4 $\times 10^{10}$ pairs/s/mW. This work has achieved a d…
▽ More
The entangled photons are crucial resources for quantum communications and networking. Here, we present an ultra-bright polarization-entangled photon source based on a periodically poled lithium niobate waveguide designed for practical quantum communication networks. Using a 780 nm pump laser, the source achieves a pair generation rate of 2.4 $\times 10^{10}$ pairs/s/mW. This work has achieved a directly measured power of 17.9 nW in entangled photon generation with a 3.2 mW pump power. Based on this, we demonstrate the practicality of the source by conducting quantum key distribution experiments over long-distance fiber links, achieving the applicable secure key rates of up to 440.80 bits/s over 200 km with 62 dB loss and reaching a maximum secure key generation distance of 404 km. These results demonstrate the potential of wavelength-multiplexed polarization-entangled photon sources for high-speed, long-distance quantum communication, positioning them as key components for future large-scale quantum networks.
△ Less
Submitted 8 August, 2024; v1 submitted 8 August, 2024;
originally announced August 2024.
-
UniMoT: Unified Molecule-Text Language Model with Discrete Token Representation
Authors:
Juzheng Zhang,
Yatao Bian,
Yongqiang Chen,
Quanming Yao
Abstract:
The remarkable success of Large Language Models (LLMs) across diverse tasks has driven the research community to extend their capabilities to molecular applications. However, most molecular LLMs employ adapter-based architectures that do not treat molecule and text modalities equally and lack a supervision signal for the molecule modality. To address these issues, we introduce UniMoT, a Unified Mo…
▽ More
The remarkable success of Large Language Models (LLMs) across diverse tasks has driven the research community to extend their capabilities to molecular applications. However, most molecular LLMs employ adapter-based architectures that do not treat molecule and text modalities equally and lack a supervision signal for the molecule modality. To address these issues, we introduce UniMoT, a Unified Molecule-Text LLM adopting a tokenizer-based architecture that expands the vocabulary of LLM with molecule tokens. Specifically, we introduce a Vector Quantization-driven tokenizer that incorporates a Q-Former to bridge the modality gap between molecule and text. This tokenizer transforms molecules into sequences of molecule tokens with causal dependency, encapsulating high-level molecular and textual information. Equipped with this tokenizer, UniMoT can unify molecule and text modalities under a shared token representation and an autoregressive training paradigm, enabling it to interpret molecules as a foreign language and generate them as text. Following a four-stage training scheme, UniMoT emerges as a multi-modal generalist capable of performing both molecule-to-text and text-to-molecule tasks. Extensive experiments demonstrate that UniMoT achieves state-of-the-art performance across a wide range of molecule comprehension and generation tasks.
△ Less
Submitted 1 August, 2024;
originally announced August 2024.
-
Warming Up Cold-Start CTR Prediction by Learning Item-Specific Feature Interactions
Authors:
Yaqing Wang,
Hongming Piao,
Daxiang Dong,
Quanming Yao,
Jingbo Zhou
Abstract:
In recommendation systems, new items are continuously introduced, initially lacking interaction records but gradually accumulating them over time. Accurately predicting the click-through rate (CTR) for these items is crucial for enhancing both revenue and user experience. While existing methods focus on enhancing item ID embeddings for new items within general CTR models, they tend to adopt a glob…
▽ More
In recommendation systems, new items are continuously introduced, initially lacking interaction records but gradually accumulating them over time. Accurately predicting the click-through rate (CTR) for these items is crucial for enhancing both revenue and user experience. While existing methods focus on enhancing item ID embeddings for new items within general CTR models, they tend to adopt a global feature interaction approach, often overshadowing new items with sparse data by those with abundant interactions. Addressing this, our work introduces EmerG, a novel approach that warms up cold-start CTR prediction by learning item-specific feature interaction patterns. EmerG utilizes hypernetworks to generate an item-specific feature graph based on item characteristics, which is then processed by a Graph Neural Network (GNN). This GNN is specially tailored to provably capture feature interactions at any order through a customized message passing mechanism. We further design a meta learning strategy that optimizes parameters of hypernetworks and GNN across various item CTR prediction tasks, while only adjusting a minimal set of item-specific parameters within each task. This strategy effectively reduces the risk of overfitting when dealing with limited data. Extensive experiments on benchmark datasets validate that EmerG consistently performs the best given no, a few and sufficient instances of new items.
△ Less
Submitted 14 July, 2024;
originally announced July 2024.
-
FairMedFM: Fairness Benchmarking for Medical Imaging Foundation Models
Authors:
Ruinan Jin,
Zikang Xu,
Yuan Zhong,
Qiongsong Yao,
Qi Dou,
S. Kevin Zhou,
Xiaoxiao Li
Abstract:
The advent of foundation models (FMs) in healthcare offers unprecedented opportunities to enhance medical diagnostics through automated classification and segmentation tasks. However, these models also raise significant concerns about their fairness, especially when applied to diverse and underrepresented populations in healthcare applications. Currently, there is a lack of comprehensive benchmark…
▽ More
The advent of foundation models (FMs) in healthcare offers unprecedented opportunities to enhance medical diagnostics through automated classification and segmentation tasks. However, these models also raise significant concerns about their fairness, especially when applied to diverse and underrepresented populations in healthcare applications. Currently, there is a lack of comprehensive benchmarks, standardized pipelines, and easily adaptable libraries to evaluate and understand the fairness performance of FMs in medical imaging, leading to considerable challenges in formulating and implementing solutions that ensure equitable outcomes across diverse patient populations. To fill this gap, we introduce FairMedFM, a fairness benchmark for FM research in medical imaging.FairMedFM integrates with 17 popular medical imaging datasets, encompassing different modalities, dimensionalities, and sensitive attributes. It explores 20 widely used FMs, with various usages such as zero-shot learning, linear probing, parameter-efficient fine-tuning, and prompting in various downstream tasks -- classification and segmentation. Our exhaustive analysis evaluates the fairness performance over different evaluation metrics from multiple perspectives, revealing the existence of bias, varied utility-fairness trade-offs on different FMs, consistent disparities on the same datasets regardless FMs, and limited effectiveness of existing unfairness mitigation methods. Checkout FairMedFM's project page and open-sourced codebase, which supports extendible functionalities and applications as well as inclusive for studies on FMs in medical imaging over the long term.
△ Less
Submitted 3 July, 2024; v1 submitted 1 July, 2024;
originally announced July 2024.
-
Knowledge-Aware Parsimony Learning: A Perspective from Relational Graphs
Authors:
Quanming Yao,
Yongqi Zhang,
Yaqing Wang,
Nan Yin,
James Kwok,
Qiang Yang
Abstract:
The scaling law, a strategy that involves the brute-force scaling of the training dataset and learnable parameters, has become a prevalent approach for developing stronger learning models. In this paper, we examine its rationale in terms of learning from relational graphs. We demonstrate that directly adhering to such a scaling law does not necessarily yield stronger models due to architectural in…
▽ More
The scaling law, a strategy that involves the brute-force scaling of the training dataset and learnable parameters, has become a prevalent approach for developing stronger learning models. In this paper, we examine its rationale in terms of learning from relational graphs. We demonstrate that directly adhering to such a scaling law does not necessarily yield stronger models due to architectural incompatibility and representation bottlenecks. To tackle this challenge, we propose a novel framework for learning from relational graphs via knowledge-aware parsimony learning. Our method draws inspiration from the duality between data and knowledge inherent in these graphs. Specifically, we first extract knowledge (like symbolic logic and physical laws) during the learning process, and then apply combinatorial generalization to the task at hand. This extracted knowledge serves as the ``building blocks'' for achieving parsimony learning. By applying this philosophy to architecture, parameters, and inference, we can effectively achieve versatile, sample-efficient, and interpretable learning. Experimental results show that our proposed framework surpasses methods that strictly follow the traditional scaling-up roadmap. This highlights the importance of incorporating knowledge in the development of next-generation learning technologies.
△ Less
Submitted 29 June, 2024;
originally announced July 2024.
-
Wave functions in the Critical Phase: a Planar \textit{Sierpiński} Fractal Lattice
Authors:
Qi Yao,
Xiaotian Yang,
Askar A. Iliasov,
Mikhail I. Katsnelson,
Shengjun Yuan
Abstract:
Electronic states play a crucial role in many quantum systems of moire superlattices, quasicrystals, and fractals. As recently reported in \textit{Sierpiński} lattices [Phys. Rev. B 107, 115424 (2023)], the critical states are revealed by the energy level-correlation spectra, which are caused by the interplay between aperiodicity and determined self-similarity characters. In the case of the \texti…
▽ More
Electronic states play a crucial role in many quantum systems of moire superlattices, quasicrystals, and fractals. As recently reported in \textit{Sierpiński} lattices [Phys. Rev. B 107, 115424 (2023)], the critical states are revealed by the energy level-correlation spectra, which are caused by the interplay between aperiodicity and determined self-similarity characters. In the case of the \textit{Sierpiński Carpet}, our results further demonstrate that there is some degree of spatial overlap between these electronic states. These states could be strongly affected by its `seed lattice' of the $generator$, and slightly modulated by the dilation pattern and the geometrical self-similarity level. These electronic states are multifractal by scaling the $q$-order inverse participation ratio or fractal dimension, which correlates with the subdiffusion behavior. In the $gene$ pattern, the averaged state-based multifractal dimension of second-order would increase as its \textit{Hausdoff dimension} increases. Our findings could potentially contribute to understanding quantum transports and single-particle quantum dynamics in fractals.
△ Less
Submitted 23 June, 2024;
originally announced June 2024.
-
HIGHT: Hierarchical Graph Tokenization for Graph-Language Alignment
Authors:
Yongqiang Chen,
Quanming Yao,
Juzheng Zhang,
James Cheng,
Yatao Bian
Abstract:
Recently there has been a surge of interest in extending the success of large language models (LLMs) to graph modality, such as social networks and molecules. As LLMs are predominantly trained with 1D text data, most existing approaches adopt a graph neural network to represent a graph as a series of node tokens and feed these tokens to LLMs for graph-language alignment. Despite achieving some suc…
▽ More
Recently there has been a surge of interest in extending the success of large language models (LLMs) to graph modality, such as social networks and molecules. As LLMs are predominantly trained with 1D text data, most existing approaches adopt a graph neural network to represent a graph as a series of node tokens and feed these tokens to LLMs for graph-language alignment. Despite achieving some successes, existing approaches have overlooked the hierarchical structures that are inherent in graph data. Especially, in molecular graphs, the high-order structural information contains rich semantics of molecular functional groups, which encode crucial biochemical functionalities of the molecules. We establish a simple benchmark showing that neglecting the hierarchical information in graph tokenization will lead to subpar graph-language alignment and severe hallucination in generated outputs. To address this problem, we propose a novel strategy called HIerarchical GrapH Tokenization (HIGHT). HIGHT employs a hierarchical graph tokenizer that extracts and encodes the hierarchy of node, motif, and graph levels of informative tokens to improve the graph perception of LLMs. HIGHT also adopts an augmented graph-language supervised fine-tuning dataset, enriched with the hierarchical graph information, to further enhance the graph-language alignment. Extensive experiments on 7 molecule-centric benchmarks confirm the effectiveness of HIGHT in reducing hallucination by 40%, as well as significant improvements in various molecule-language downstream tasks.
△ Less
Submitted 20 June, 2024;
originally announced June 2024.
-
Bose-Einstein condensation of polaritons at room temperature in a GaAs/AlGaAs structure
Authors:
Hassan Alnatah,
Qi Yao,
Qiaochu Wan,
Jonathan Beaumariage,
Ken West,
Kirk Baldwin,
Loren N. Pfeiffer,
David W. Snoke
Abstract:
We report the canonical properties of Bose-Einstein condensation of polaritons, seen previously in many low-temperature experiments, at room temperature in a GaAs/AlGaAs structure. These effects include a nonlinear energy shift of the polaritons, showing that they are not non-interacting photons, and dramatic line narrowing due to coherence, giving coherent emission with spectral width of 0.24 meV…
▽ More
We report the canonical properties of Bose-Einstein condensation of polaritons, seen previously in many low-temperature experiments, at room temperature in a GaAs/AlGaAs structure. These effects include a nonlinear energy shift of the polaritons, showing that they are not non-interacting photons, and dramatic line narrowing due to coherence, giving coherent emission with spectral width of 0.24 meV at room temperature with no external stabilization. This opens up the possibility of room temperature nonlinear optical devices based on polariton condensation.
△ Less
Submitted 19 June, 2024;
originally announced June 2024.
-
Measurement of exciton fraction of microcavity exciton-polaritons using transfer-matrix modeling
Authors:
Jonathan Beaumariage,
Zheng Sun,
Hassan Alnatah,
Qi Yao,
David M. Myers,
Mark Steger,
Ken West,
Kirk Baldwin,
Loren N. Pfeiffer,
Man Chun Alan Tam,
Zbig R. Wailewski,
David W. Snoke
Abstract:
We present a careful calibration of the exciton fraction of polaritons in high-$Q$ ($\sim 300,000$), long-lifetime ($\sim 300$ ps), GaAs/AlGaAs microcavities.This is a crucial parameter for many-body theories which include the polariton-polariton interactions.It is much harder to establish this number in high-$Q$ structures compared to low-$Q$ structures, because the upper polariton is nearly invi…
▽ More
We present a careful calibration of the exciton fraction of polaritons in high-$Q$ ($\sim 300,000$), long-lifetime ($\sim 300$ ps), GaAs/AlGaAs microcavities.This is a crucial parameter for many-body theories which include the polariton-polariton interactions.It is much harder to establish this number in high-$Q$ structures compared to low-$Q$ structures, because the upper polariton is nearly invisible in high-$Q$ cavities.We present a combination of photoluminescence, photoluminescence excitation, and reflectivity measurements to highly constrain the fit model, and compare the results of this model to the results from low-$Q$ structures.We present a fitted curve of exciton fraction as a function of the lower polariton energy for multiple samples which have been used in prior experiments.
△ Less
Submitted 17 June, 2024;
originally announced June 2024.
-
Heuristic Learning with Graph Neural Networks: A Unified Framework for Link Prediction
Authors:
Juzheng Zhang,
Lanning Wei,
Zhen Xu,
Quanming Yao
Abstract:
Link prediction is a fundamental task in graph learning, inherently shaped by the topology of the graph. While traditional heuristics are grounded in graph topology, they encounter challenges in generalizing across diverse graphs. Recent research efforts have aimed to leverage the potential of heuristics, yet a unified formulation accommodating both local and global heuristics remains undiscovered…
▽ More
Link prediction is a fundamental task in graph learning, inherently shaped by the topology of the graph. While traditional heuristics are grounded in graph topology, they encounter challenges in generalizing across diverse graphs. Recent research efforts have aimed to leverage the potential of heuristics, yet a unified formulation accommodating both local and global heuristics remains undiscovered. Drawing insights from the fact that both local and global heuristics can be represented by adjacency matrix multiplications, we propose a unified matrix formulation to accommodate and generalize various heuristics. We further propose the Heuristic Learning Graph Neural Network (HL-GNN) to efficiently implement the formulation. HL-GNN adopts intra-layer propagation and inter-layer connections, allowing it to reach a depth of around 20 layers with lower time complexity than GCN. Extensive experiments on the Planetoid, Amazon, and OGB datasets underscore the effectiveness and efficiency of HL-GNN. It outperforms existing methods by a large margin in prediction performance. Additionally, HL-GNN is several orders of magnitude faster than heuristic-inspired methods while requiring only a few trainable parameters. The case study further demonstrates that the generalized heuristics and learned weights are highly interpretable.
△ Less
Submitted 14 June, 2024; v1 submitted 12 June, 2024;
originally announced June 2024.
-
Testing common invariant subspace of multilayer networks
Authors:
Mingao Yuan,
Qianqian Yao
Abstract:
Graph (or network) is a mathematical structure that has been widely used to model relational data. As real-world systems get more complex, multilayer (or multiple) networks are employed to represent diverse patterns of relationships among the objects in the systems. One active research problem in multilayer networks analysis is to study the common invariant subspace of the networks, because such c…
▽ More
Graph (or network) is a mathematical structure that has been widely used to model relational data. As real-world systems get more complex, multilayer (or multiple) networks are employed to represent diverse patterns of relationships among the objects in the systems. One active research problem in multilayer networks analysis is to study the common invariant subspace of the networks, because such common invariant subspace could capture the fundamental structural patterns and interactions across all layers. Many methods have been proposed to estimate the common invariant subspace. However, whether real-world multilayer networks share the same common subspace remains unknown. In this paper, we first attempt to answer this question by means of hypothesis testing. The null hypothesis states that the multilayer networks share the same subspace, and under the alternative hypothesis, there exist at least two networks that do not have the same subspace. We propose a Weighted Degree Difference Test, derive the limiting distribution of the test statistic and provide an analytical analysis of the power. Simulation study shows that the proposed test has satisfactory performance, and a real data application is provided.
△ Less
Submitted 7 June, 2024;
originally announced June 2024.
-
Explore then Determine: A GNN-LLM Synergy Framework for Reasoning over Knowledge Graph
Authors:
Guangyi Liu,
Yongqi Zhang,
Yong Li,
Quanming Yao
Abstract:
The task of reasoning over Knowledge Graphs (KGs) poses a significant challenge for Large Language Models (LLMs) due to the complex structure and large amounts of irrelevant information. Existing LLM reasoning methods overlook the importance of compositional learning on KG to supply with precise knowledge. Besides, the fine-tuning and frequent interaction with LLMs incur substantial time and resou…
▽ More
The task of reasoning over Knowledge Graphs (KGs) poses a significant challenge for Large Language Models (LLMs) due to the complex structure and large amounts of irrelevant information. Existing LLM reasoning methods overlook the importance of compositional learning on KG to supply with precise knowledge. Besides, the fine-tuning and frequent interaction with LLMs incur substantial time and resource costs. This paper focuses on the Question Answering over Knowledge Graph (KGQA) task and proposes an Explore-then-Determine (EtD) framework that synergizes LLMs with graph neural networks (GNNs) for reasoning over KGs. The Explore stage employs a lightweight GNN to explore promising candidates and relevant fine-grained knowledge to the questions, while the Determine stage utilizes the explored information to construct a knowledge-enhanced multiple-choice prompt, guiding a frozen LLM to determine the final answer. Extensive experiments on three benchmark KGQA datasets demonstrate that EtD achieves state-of-the-art performance and generates faithful reasoning results.
△ Less
Submitted 3 June, 2024;
originally announced June 2024.
-
On the modelling and prediction of high-dimensional functional time series
Authors:
Jinyuan Chang,
Qin Fang,
Xinghao Qiao,
Qiwei Yao
Abstract:
We propose a two-step procedure to model and predict high-dimensional functional time series, where the number of function-valued time series $p$ is large in relation to the length of time series $n$. Our first step performs an eigenanalysis of a positive definite matrix, which leads to a one-to-one linear transformation for the original high-dimensional functional time series, and the transformed…
▽ More
We propose a two-step procedure to model and predict high-dimensional functional time series, where the number of function-valued time series $p$ is large in relation to the length of time series $n$. Our first step performs an eigenanalysis of a positive definite matrix, which leads to a one-to-one linear transformation for the original high-dimensional functional time series, and the transformed curve series can be segmented into several groups such that any two subseries from any two different groups are uncorrelated both contemporaneously and serially. Consequently in our second step those groups are handled separately without the information loss on the overall linear dynamic structure. The second step is devoted to establishing a finite-dimensional dynamical structure for all the transformed functional time series within each group. Furthermore the finite-dimensional structure is represented by that of a vector time series. Modelling and forecasting for the original high-dimensional functional time series are realized via those for the vector time series in all the groups. We investigate the theoretical properties of our proposed methods, and illustrate the finite-sample performance through both extensive simulation and two real datasets.
△ Less
Submitted 2 June, 2024;
originally announced June 2024.
-
Autoregressive Networks with Dependent Edges
Authors:
Jinyuan Chang,
Qin Fang,
Eric D. Kolaczyk,
Peter W. MacDonald,
Qiwei Yao
Abstract:
We propose an autoregressive framework for modelling dynamic networks with dependent edges. It encompasses the models which accommodate, for example, transitivity, density-dependent and other stylized features often observed in real network data. By assuming the edges of network at each time are independent conditionally on their lagged values, the models, which exhibit a close connection with tem…
▽ More
We propose an autoregressive framework for modelling dynamic networks with dependent edges. It encompasses the models which accommodate, for example, transitivity, density-dependent and other stylized features often observed in real network data. By assuming the edges of network at each time are independent conditionally on their lagged values, the models, which exhibit a close connection with temporal ERGMs, facilitate both simulation and the maximum likelihood estimation in the straightforward manner. Due to the possible large number of parameters in the models, the initial MLEs may suffer from slow convergence rates. An improved estimator for each component parameter is proposed based on an iteration based on the projection which mitigates the impact of the other parameters (Chang et al., 2021, 2023). Based on a martingale difference structure, the asymptotic distribution of the improved estimator is derived without the stationarity assumption. The limiting distribution is not normal in general, and it reduces to normal when the underlying process satisfies some mixing conditions. Illustration with a transitivity model was carried out in both simulation and a real network data set.
△ Less
Submitted 24 April, 2024;
originally announced April 2024.
-
Electronic states and quantum transport in bilayer graphene Sierpinski-carpet fractals
Authors:
Xiaotian Yang,
Weiqing Zhou,
Qi Yao,
Yunhai Li,
Yunhua Wang,
Shengjun Yuan
Abstract:
We construct Sierpinski-carpet (SC) based on AA or AB bilayer graphene by atom vacancies, namely, SC-AA and SC-AB, to investigate the effects of interlayer coupling on the electronic properties of fractals. Compared with monolayer graphene SC, their density of states have similar features, such as Van-Hove singularities and edge states corresponding to the central peaks near zero energy, but remar…
▽ More
We construct Sierpinski-carpet (SC) based on AA or AB bilayer graphene by atom vacancies, namely, SC-AA and SC-AB, to investigate the effects of interlayer coupling on the electronic properties of fractals. Compared with monolayer graphene SC, their density of states have similar features, such as Van-Hove singularities and edge states corresponding to the central peaks near zero energy, but remarkable energy broadening of edge states emerges in SC-AA(AB). Calculated conductance spectrum shows that the conductance fluctuations still hold the Hausdorff fractal dimension behavior even with the interlayer coupling. Thus, the high correlation between quantum conductance and fractal geometry dimension is not affected by the interlayer coupling in bilayer graphene SC. We further reveal the quasi-eigenstates in fractal-like pressure-modulated bilayer graphene, namely, SC-pAA and SC-pAB. Numerical results show that the density of states of SC-pAA(pAB) show an asymptotic behavior to those of SC-AA(AB) especially for high energy quasi-eigenstates. Within a certain energy range, stronger pressure can lead to stronger localization, forming an efficient fractal space.
△ Less
Submitted 17 April, 2024; v1 submitted 16 April, 2024;
originally announced April 2024.
-
Bioinformatics and Biomedical Informatics with ChatGPT: Year One Review
Authors:
Jinge Wang,
Zien Cheng,
Qiuming Yao,
Li Liu,
Dong Xu,
Gangqing Hu
Abstract:
The year 2023 marked a significant surge in the exploration of applying large language model (LLM) chatbots, notably ChatGPT, across various disciplines. We surveyed the applications of ChatGPT in bioinformatics and biomedical informatics throughout the year, covering omics, genetics, biomedical text mining, drug discovery, biomedical image understanding, bioinformatics programming, and bioinforma…
▽ More
The year 2023 marked a significant surge in the exploration of applying large language model (LLM) chatbots, notably ChatGPT, across various disciplines. We surveyed the applications of ChatGPT in bioinformatics and biomedical informatics throughout the year, covering omics, genetics, biomedical text mining, drug discovery, biomedical image understanding, bioinformatics programming, and bioinformatics education. Our survey delineates the current strengths and limitations of this chatbot in bioinformatics and offers insights into potential avenues for future developments.
△ Less
Submitted 12 June, 2024; v1 submitted 22 March, 2024;
originally announced March 2024.
-
Knowledge-Enhanced Recommendation with User-Centric Subgraph Network
Authors:
Guangyi Liu,
Quanming Yao,
Yongqi Zhang,
Lei Chen
Abstract:
Recommendation systems, as widely implemented nowadays on various platforms, recommend relevant items to users based on their preferences. The classical methods which rely on user-item interaction matrices has limitations, especially in scenarios where there is a lack of interaction data for new items. Knowledge graph (KG)-based recommendation systems have emerged as a promising solution. However,…
▽ More
Recommendation systems, as widely implemented nowadays on various platforms, recommend relevant items to users based on their preferences. The classical methods which rely on user-item interaction matrices has limitations, especially in scenarios where there is a lack of interaction data for new items. Knowledge graph (KG)-based recommendation systems have emerged as a promising solution. However, most KG-based methods adopt node embeddings, which do not provide personalized recommendations for different users and cannot generalize well to the new items. To address these limitations, we propose Knowledge-enhanced User-Centric subgraph Network (KUCNet), a subgraph learning approach with graph neural network (GNN) for effective recommendation. KUCNet constructs a U-I subgraph for each user-item pair that captures both the historical information of user-item interactions and the side information provided in KG. An attention-based GNN is designed to encode the U-I subgraphs for recommendation. Considering efficiency, the pruned user-centric computation graph is further introduced such that multiple U-I subgraphs can be simultaneously computed and that the size can be pruned by Personalized PageRank. Our proposed method achieves accurate, efficient, and interpretable recommendations especially for new items. Experimental results demonstrate the superiority of KUCNet over state-of-the-art KG-based and collaborative filtering (CF)-based methods.
△ Less
Submitted 21 March, 2024;
originally announced March 2024.
-
Graph Unitary Message Passing
Authors:
Haiquan Qiu,
Yatao Bian,
Quanming Yao
Abstract:
Message passing mechanism contributes to the success of GNNs in various applications, but also brings the oversquashing problem. Recent works combat oversquashing by improving the graph spectrums with rewiring techniques, disrupting the structural bias in graphs, and having limited improvement on oversquashing in terms of oversquashing measure. Motivated by unitary RNN, we propose Graph Unitary Me…
▽ More
Message passing mechanism contributes to the success of GNNs in various applications, but also brings the oversquashing problem. Recent works combat oversquashing by improving the graph spectrums with rewiring techniques, disrupting the structural bias in graphs, and having limited improvement on oversquashing in terms of oversquashing measure. Motivated by unitary RNN, we propose Graph Unitary Message Passing (GUMP) to alleviate oversquashing in GNNs by applying unitary adjacency matrix for message passing. To design GUMP, a transformation is first proposed to make general graphs have unitary adjacency matrix and keep its structural bias. Then, unitary adjacency matrix is obtained with a unitary projection algorithm, which is implemented by utilizing the intrinsic structure of unitary adjacency matrix and allows GUMP to be permutation-equivariant. Experimental results show the effectiveness of GUMP in improving the performance on various graph learning tasks.
△ Less
Submitted 17 March, 2024;
originally announced March 2024.
-
Less is More: One-shot Subgraph Reasoning on Large-scale Knowledge Graphs
Authors:
Zhanke Zhou,
Yongqi Zhang,
Jiangchao Yao,
Quanming Yao,
Bo Han
Abstract:
To deduce new facts on a knowledge graph (KG), a link predictor learns from the graph structure and collects local evidence to find the answer to a given query. However, existing methods suffer from a severe scalability problem due to the utilization of the whole KG for prediction, which hinders their promise on large scale KGs and cannot be directly addressed by vanilla sampling methods. In this…
▽ More
To deduce new facts on a knowledge graph (KG), a link predictor learns from the graph structure and collects local evidence to find the answer to a given query. However, existing methods suffer from a severe scalability problem due to the utilization of the whole KG for prediction, which hinders their promise on large scale KGs and cannot be directly addressed by vanilla sampling methods. In this work, we propose the one-shot-subgraph link prediction to achieve efficient and adaptive prediction. The design principle is that, instead of directly acting on the whole KG, the prediction procedure is decoupled into two steps, i.e., (i) extracting only one subgraph according to the query and (ii) predicting on this single, query dependent subgraph. We reveal that the non-parametric and computation-efficient heuristics Personalized PageRank (PPR) can effectively identify the potential answers and supporting evidence. With efficient subgraph-based prediction, we further introduce the automated searching of the optimal configurations in both data and model spaces. Empirically, we achieve promoted efficiency and leading performances on five large-scale benchmarks. The code is publicly available at: https://github.com/tmlr-group/one-shot-subgraph.
△ Less
Submitted 15 March, 2024;
originally announced March 2024.
-
APPLE: Adversarial Privacy-aware Perturbations on Latent Embedding for Unfairness Mitigation
Authors:
Zikang Xu,
Fenghe Tang,
Quan Quan,
Qingsong Yao,
S. Kevin Zhou
Abstract:
Ensuring fairness in deep-learning-based segmentors is crucial for health equity. Much effort has been dedicated to mitigating unfairness in the training datasets or procedures. However, with the increasing prevalence of foundation models in medical image analysis, it is hard to train fair models from scratch while preserving utility. In this paper, we propose a novel method, Adversarial Privacy-a…
▽ More
Ensuring fairness in deep-learning-based segmentors is crucial for health equity. Much effort has been dedicated to mitigating unfairness in the training datasets or procedures. However, with the increasing prevalence of foundation models in medical image analysis, it is hard to train fair models from scratch while preserving utility. In this paper, we propose a novel method, Adversarial Privacy-aware Perturbations on Latent Embedding (APPLE), that can improve the fairness of deployed segmentors by introducing a small latent feature perturber without updating the weights of the original model. By adding perturbation to the latent vector, APPLE decorates the latent vector of segmentors such that no fairness-related features can be passed to the decoder of the segmentors while preserving the architecture and parameters of the segmentor. Experiments on two segmentation datasets and five segmentors (three U-Net-like and two SAM-like) illustrate the effectiveness of our proposed method compared to several unfairness mitigation methods.
△ Less
Submitted 8 March, 2024;
originally announced March 2024.
-
Loss-aware Curriculum Learning for Heterogeneous Graph Neural Networks
Authors:
Zhen Hao Wong,
Hansi Yang,
Xiaoyi Fu,
Quanming Yao
Abstract:
Heterogeneous Graph Neural Networks (HGNNs) are a class of deep learning models designed specifically for heterogeneous graphs, which are graphs that contain different types of nodes and edges. This paper investigates the application of curriculum learning techniques to improve the performance and robustness of Heterogeneous Graph Neural Networks (GNNs). To better classify the quality of the data,…
▽ More
Heterogeneous Graph Neural Networks (HGNNs) are a class of deep learning models designed specifically for heterogeneous graphs, which are graphs that contain different types of nodes and edges. This paper investigates the application of curriculum learning techniques to improve the performance and robustness of Heterogeneous Graph Neural Networks (GNNs). To better classify the quality of the data, we design a loss-aware training schedule, named LTS that measures the quality of every nodes of the data and incorporate the training dataset into the model in a progressive manner that increases difficulty step by step. LTS can be seamlessly integrated into various frameworks, effectively reducing bias and variance, mitigating the impact of noisy data, and enhancing overall accuracy. Our findings demonstrate the efficacy of curriculum learning in enhancing HGNNs capabilities for analyzing complex graph-structured data. The code is public at https://github.com/LARS-research/CLGNN/.
△ Less
Submitted 29 February, 2024;
originally announced February 2024.
-
CARZero: Cross-Attention Alignment for Radiology Zero-Shot Classification
Authors:
Haoran Lai,
Qingsong Yao,
Zihang Jiang,
Rongsheng Wang,
Zhiyang He,
Xiaodong Tao,
S. Kevin Zhou
Abstract:
The advancement of Zero-Shot Learning in the medical domain has been driven forward by using pre-trained models on large-scale image-text pairs, focusing on image-text alignment. However, existing methods primarily rely on cosine similarity for alignment, which may not fully capture the complex relationship between medical images and reports. To address this gap, we introduce a novel approach call…
▽ More
The advancement of Zero-Shot Learning in the medical domain has been driven forward by using pre-trained models on large-scale image-text pairs, focusing on image-text alignment. However, existing methods primarily rely on cosine similarity for alignment, which may not fully capture the complex relationship between medical images and reports. To address this gap, we introduce a novel approach called Cross-Attention Alignment for Radiology Zero-Shot Classification (CARZero). Our approach innovatively leverages cross-attention mechanisms to process image and report features, creating a Similarity Representation that more accurately reflects the intricate relationships in medical semantics. This representation is then linearly projected to form an image-text similarity matrix for cross-modality alignment. Additionally, recognizing the pivotal role of prompt selection in zero-shot learning, CARZero incorporates a Large Language Model-based prompt alignment strategy. This strategy standardizes diverse diagnostic expressions into a unified format for both training and inference phases, overcoming the challenges of manual prompt design. Our approach is simple yet effective, demonstrating state-of-the-art performance in zero-shot classification on five official chest radiograph diagnostic test sets, including remarkable results on datasets with long-tail distributions of rare diseases. This achievement is attributed to our new image-text alignment strategy, which effectively addresses the complex relationship between medical images and reports. Code and models are available at https://github.com/laihaoran/CARZero.
△ Less
Submitted 24 March, 2024; v1 submitted 27 February, 2024;
originally announced February 2024.
-
Blockchain for Finance: A Survey
Authors:
Hanjie Wu,
Qian Yao,
Zhenguang Liu,
Butian Huang,
Yuan Zhuang,
Huayun Tang,
Erwu Liu
Abstract:
As an innovative technology for enhancing authenticity, security, and risk management, blockchain is being widely adopted in trade and finance systems. The unique capabilities of blockchain, such as immutability and transparency, enable new business models of distributed data storage, point-to-point transactions, and decentralized autonomous organizations. In this paper, we focus on blockchain-bas…
▽ More
As an innovative technology for enhancing authenticity, security, and risk management, blockchain is being widely adopted in trade and finance systems. The unique capabilities of blockchain, such as immutability and transparency, enable new business models of distributed data storage, point-to-point transactions, and decentralized autonomous organizations. In this paper, we focus on blockchain-based securities trading, in which blockchain technology plays a vital role in financial services as it ultimately lifts trust and frees the need for third-party verification by using consensus-based verification. We investigate the 12 most popular blockchain platforms and elaborate on 6 platforms that are related to finance, seeking to provide a panorama of securities trading practices. Meanwhile, this survey provides a comprehensive summary of blockchain-based securities trading applications. We gather numerous practical applications of blockchain-based securities trading and categorize them into four distinct categories. For each category, we introduce a typical example and explain how blockchain contributes to solving the key problems faced by FinTech companies and researchers. Finally, we provide interesting observations ranging from mainstream blockchain-based financial institutions to security issues of decentralized finance applications, aiming to picture the current blockchain ecosystem in finance.
△ Less
Submitted 27 February, 2024;
originally announced February 2024.
-
Towards Versatile Graph Learning Approach: from the Perspective of Large Language Models
Authors:
Lanning Wei,
Jun Gao,
Huan Zhao,
Quanming Yao
Abstract:
Graph-structured data are the commonly used and have wide application scenarios in the real world. For these diverse applications, the vast variety of learning tasks, graph domains, and complex graph learning procedures present challenges for human experts when designing versatile graph learning approaches. Facing these challenges, large language models (LLMs) offer a potential solution due to the…
▽ More
Graph-structured data are the commonly used and have wide application scenarios in the real world. For these diverse applications, the vast variety of learning tasks, graph domains, and complex graph learning procedures present challenges for human experts when designing versatile graph learning approaches. Facing these challenges, large language models (LLMs) offer a potential solution due to the extensive knowledge and the human-like intelligence. This paper proposes a novel conceptual prototype for designing versatile graph learning methods with LLMs, with a particular focus on the "where" and "how" perspectives. From the "where" perspective, we summarize four key graph learning procedures, including task definition, graph data feature engineering, model selection and optimization, deployment and serving. We then explore the application scenarios of LLMs in these procedures across a wider spectrum. In the "how" perspective, we align the abilities of LLMs with the requirements of each procedure. Finally, we point out the promising directions that could better leverage the strength of LLMs towards versatile graph learning methods.
△ Less
Submitted 23 February, 2024; v1 submitted 18 February, 2024;
originally announced February 2024.
-
Limit Properties of Record Numbers in Random walks
Authors:
Penghui Lu,
Yuqiang Li,
Qiang Yao
Abstract:
In this paper, we systematically summarize and enhance the understanding of weak convergence and functional limits of record numbers in discrete-time random walks under Spitzer's condition, and extend these findings to $σ$--record numbers using similar methods. Additionally, we identify a sufficient condition for the existence of functional limits for record numbers in continuous-time random walks…
▽ More
In this paper, we systematically summarize and enhance the understanding of weak convergence and functional limits of record numbers in discrete-time random walks under Spitzer's condition, and extend these findings to $σ$--record numbers using similar methods. Additionally, we identify a sufficient condition for the existence of functional limits for record numbers in continuous-time random walks. Finally, we derive corresponding results for large deviations, moderate deviations, and laws of the iterated logarithm pertaining to record numbers in discrete-time random walks.
△ Less
Submitted 27 January, 2024;
originally announced January 2024.
-
ECAMP: Entity-centered Context-aware Medical Vision Language Pre-training
Authors:
Rongsheng Wang,
Qingsong Yao,
Haoran Lai,
Zhiyang He,
Xiaodong Tao,
Zihang Jiang,
S. Kevin Zhou
Abstract:
Despite significant advancements in medical vision-language pre-training, existing methods have largely overlooked the inherent entity-specific context within radiology reports and the complex cross-modality contextual relationships between text and images. To close this gap, we propose a novel Entity-centered Context-aware Medical Vision-language Pre-training (ECAMP) framework, which is designed…
▽ More
Despite significant advancements in medical vision-language pre-training, existing methods have largely overlooked the inherent entity-specific context within radiology reports and the complex cross-modality contextual relationships between text and images. To close this gap, we propose a novel Entity-centered Context-aware Medical Vision-language Pre-training (ECAMP) framework, which is designed to enable a more entity-centered and context-sensitive interpretation of medical data. Utilizing the recent powerful large language model, we distill entity-centered context from medical reports, which enables ECAMP to gain more effective supervision from the text modality. By further pre-training our model with carefully designed entity-aware, context-enhanced masked language modeling and context-guided super-resolution tasks, ECAMP significantly refines the interplay between text and image modalities, leading to an enhanced ability to extract entity-centered contextual features. Besides, our proposed multi-scale context fusion design also improves the semantic integration of both coarse and fine-level image representations, prompting better performance for multi-scale downstream applications. Combining these components leads to significant performance leaps over current state-of-the-art methods and establishes a new standard for cross-modality learning in medical imaging, whose effectiveness is demonstrated by our extensive experiments on various tasks including classification, segmentation, and detection across several public datasets. Code and models are available at https://github.com/ToniChopp/ECAMP.
△ Less
Submitted 19 March, 2024; v1 submitted 20 December, 2023;
originally announced December 2023.
-
Robust Communicative Multi-Agent Reinforcement Learning with Active Defense
Authors:
Lebin Yu,
Yunbo Qiu,
Quanming Yao,
Yuan Shen,
Xudong Zhang,
Jian Wang
Abstract:
Communication in multi-agent reinforcement learning (MARL) has been proven to effectively promote cooperation among agents recently. Since communication in real-world scenarios is vulnerable to noises and adversarial attacks, it is crucial to develop robust communicative MARL technique. However, existing research in this domain has predominantly focused on passive defense strategies, where agents…
▽ More
Communication in multi-agent reinforcement learning (MARL) has been proven to effectively promote cooperation among agents recently. Since communication in real-world scenarios is vulnerable to noises and adversarial attacks, it is crucial to develop robust communicative MARL technique. However, existing research in this domain has predominantly focused on passive defense strategies, where agents receive all messages equally, making it hard to balance performance and robustness. We propose an active defense strategy, where agents automatically reduce the impact of potentially harmful messages on the final decision. There are two challenges to implement this strategy, that are defining unreliable messages and adjusting the unreliable messages' impact on the final decision properly. To address them, we design an Active Defense Multi-Agent Communication framework (ADMAC), which estimates the reliability of received messages and adjusts their impact on the final decision accordingly with the help of a decomposable decision structure. The superiority of ADMAC over existing methods is validated by experiments in three communication-critical tasks under four types of attacks.
△ Less
Submitted 16 December, 2023;
originally announced December 2023.
-
Wave Propagation in Pure-Time Modulated Step Media With Applications to Temporal-Aiming
Authors:
Mourad Sini,
Haibing Wang,
Qingyun Yao
Abstract:
We analyze the propagation of an incident electromagnetic wave in a purely-time modulated medium. Precisely, we assume that the permeability is unchanged while the permittivity has a multiple-step profile in time and uniformly constant in space. For this, we use time-dispersive Lorenz's model with time-dependent plasma frequency with highly concentrated values near the centers of the steps' interv…
▽ More
We analyze the propagation of an incident electromagnetic wave in a purely-time modulated medium. Precisely, we assume that the permeability is unchanged while the permittivity has a multiple-step profile in time and uniformly constant in space. For this, we use time-dispersive Lorenz's model with time-dependent plasma frequency with highly concentrated values near the centers of the steps' intervals. Under certain regimes linking the number of steps to the contrasts of the permittivity, we can generate effective permittivity having positive and high values on a finite interval of time which behaves as a 'wall' that kills the forward waves and keeps only the backward waves (i.e. enabling full reflection). But this happens if these high values are away from a discrete set. If these high values are close to the elements of this discrete set, then the effective medium behaves as a 'well' that absorbs all the forward waves and hence there is no backward waves (i.e. enabling full transmission). Such results are reminiscent to the 'wall' and 'well' effects known in the quantum mechanics theory.
△ Less
Submitted 15 December, 2023;
originally announced December 2023.
-
Adversarial Medical Image with Hierarchical Feature Hiding
Authors:
Qingsong Yao,
Zecheng He,
Yuexiang Li,
Yi Lin,
Kai Ma,
Yefeng Zheng,
S. Kevin Zhou
Abstract:
Deep learning based methods for medical images can be easily compromised by adversarial examples (AEs), posing a great security flaw in clinical decision-making. It has been discovered that conventional adversarial attacks like PGD which optimize the classification logits, are easy to distinguish in the feature space, resulting in accurate reactive defenses. To better understand this phenomenon an…
▽ More
Deep learning based methods for medical images can be easily compromised by adversarial examples (AEs), posing a great security flaw in clinical decision-making. It has been discovered that conventional adversarial attacks like PGD which optimize the classification logits, are easy to distinguish in the feature space, resulting in accurate reactive defenses. To better understand this phenomenon and reassess the reliability of the reactive defenses for medical AEs, we thoroughly investigate the characteristic of conventional medical AEs. Specifically, we first theoretically prove that conventional adversarial attacks change the outputs by continuously optimizing vulnerable features in a fixed direction, thereby leading to outlier representations in the feature space. Then, a stress test is conducted to reveal the vulnerability of medical images, by comparing with natural images. Interestingly, this vulnerability is a double-edged sword, which can be exploited to hide AEs. We then propose a simple-yet-effective hierarchical feature constraint (HFC), a novel add-on to conventional white-box attacks, which assists to hide the adversarial feature in the target feature distribution. The proposed method is evaluated on three medical datasets, both 2D and 3D, with different modalities. The experimental results demonstrate the superiority of HFC, \emph{i.e.,} it bypasses an array of state-of-the-art adversarial medical AE detectors more efficiently than competing adaptive attacks, which reveals the deficiencies of medical reactive defense and allows to develop more robust defenses in future.
△ Less
Submitted 4 December, 2023;
originally announced December 2023.
-
Long-tailed multi-label classification with noisy label of thoracic diseases from chest X-ray
Authors:
Haoran Lai,
Qingsong Yao,
Zhiyang He,
Xiaodong Tao,
S Kevin Zhou
Abstract:
Chest X-rays (CXR) often reveal rare diseases, demanding precise diagnosis. However, current computer-aided diagnosis (CAD) methods focus on common diseases, leading to inadequate detection of rare conditions due to the absence of comprehensive datasets. To overcome this, we present a novel benchmark for long-tailed multi-label classification in CXRs, encapsulating both common and rare thoracic di…
▽ More
Chest X-rays (CXR) often reveal rare diseases, demanding precise diagnosis. However, current computer-aided diagnosis (CAD) methods focus on common diseases, leading to inadequate detection of rare conditions due to the absence of comprehensive datasets. To overcome this, we present a novel benchmark for long-tailed multi-label classification in CXRs, encapsulating both common and rare thoracic diseases. Our approach includes developing the "LTML-MIMIC-CXR" dataset, an augmentation of MIMIC-CXR with 26 additional rare diseases. We propose a baseline method for this classification challenge, integrating adaptive negative regularization to address negative logits' over-suppression in tail classes, and a large loss reconsideration strategy for correcting noisy labels from automated annotations. Our evaluation on LTML-MIMIC-CXR demonstrates significant advancements in rare disease detection. This work establishes a foundation for robust CAD methods, achieving a balance in identifying a spectrum of thoracic diseases in CXRs. Access to our code and dataset is provided at:https://github.com/laihaoran/LTML-MIMIC-CXR.
△ Less
Submitted 28 November, 2023;
originally announced November 2023.
-
Understanding the role of rock heterogeneity in controlling fault strength and stability
Authors:
Shaobo Han,
Xiaoying Zhuang,
Quanzhou Yao,
Qianlong Zhou,
Xiaodong Hu
Abstract:
The rock heterogeneity exists widely in fault zones; however, the intrinsic mechanism of how it affects the mechanical behavior of faults is poorly understood. To develop a quantitative understanding of the effect of the rock heterogeneity on the strength and stability of faults, here we investigate a pore-pressure model based on rate- and-state friction in the manner of two-degree-of-freedom spri…
▽ More
The rock heterogeneity exists widely in fault zones; however, the intrinsic mechanism of how it affects the mechanical behavior of faults is poorly understood. To develop a quantitative understanding of the effect of the rock heterogeneity on the strength and stability of faults, here we investigate a pore-pressure model based on rate- and-state friction in the manner of two-degree-of-freedom spring-sliders and analyze the reasons of fault weakening and the conditions of frictional instability by carrying out nonlinear simulations and a linear stability analysis. We find that the strength of heterogeneous faults depends largely on the compaction difference (or differential compaction) between the two gouges (e.g. quartz and clay), and the stability is affected by the proportion of the two gouges patches. Our model implies that the rock heterogeneity is likely to weaken faults and reduce the stability of faults.
△ Less
Submitted 28 November, 2023;
originally announced November 2023.
-
Accurate and interpretable drug-drug interaction prediction enabled by knowledge subgraph learning
Authors:
Yaqing Wang,
Zaifei Yang,
Quanming Yao
Abstract:
Background: Discovering potential drug-drug interactions (DDIs) is a long-standing challenge in clinical treatments and drug developments. Recently, deep learning techniques have been developed for DDI prediction. However, they generally require a huge number of samples, while known DDIs are rare.
Methods: In this work, we present KnowDDI, a graph neural network-based method that addresses the a…
▽ More
Background: Discovering potential drug-drug interactions (DDIs) is a long-standing challenge in clinical treatments and drug developments. Recently, deep learning techniques have been developed for DDI prediction. However, they generally require a huge number of samples, while known DDIs are rare.
Methods: In this work, we present KnowDDI, a graph neural network-based method that addresses the above challenge. KnowDDI enhances drug representations by adaptively leveraging rich neighborhood information from large biomedical knowledge graphs. Then, it learns a knowledge subgraph for each drug-pair to interpret the predicted DDI, where each of the edges is associated with a connection strength indicating the importance of a known DDI or resembling strength between a drug-pair whose connection is unknown. Thus, the lack of DDIs is implicitly compensated by the enriched drug representations and propagated drug similarities.
Results: We evaluate KnowDDI on two benchmark DDI datasets. Results show that KnowDDI obtains the state-of-the-art prediction performance with better interpretability. We also find that KnowDDI suffers less than existing works given a sparser knowledge graph. This indicates that the propagated drug similarities play a more important role in compensating for the lack of DDIs when the drug representations are less enriched.
Conclusions: KnowDDI nicely combines the efficiency of deep learning techniques and the rich prior knowledge in biomedical knowledge graphs. As an original open-source tool, KnowDDI can help detect possible interactions in a broad range of relevant interaction prediction tasks, such as protein-protein interactions, drug-target interactions and disease-gene interactions, eventually promoting the development of biomedicine and healthcare.
△ Less
Submitted 19 March, 2024; v1 submitted 25 November, 2023;
originally announced November 2023.
-
Interplay between moment-dependent and field-driven unidirectional magnetoresistance in CoFeB/InSb/CdTe heterostructures
Authors:
Jiuming Liu,
Liyang Liao,
Bin Rong,
Yuyang Wu,
Yu Zhang,
Hanzhi Ruan,
Zhenghang Zhi,
Puyang Huang,
Shan Yao,
Xinyu Cai,
Chenjia Tang,
Qi Yao,
Lu Sun,
Yumeng Yang,
Guoqiang Yu,
Renchao Che,
Xufeng Kou
Abstract:
Magnetoresistance effects are crucial for understanding the charge/spin transport as well as propelling the advancement of spintronic applications. Here we report the coexistence of magnetic moment-dependent (MD) and magnetic field-driven (FD) unidirectional magnetoresistance (UMR) effects in CoFeB/InSb/CdTe heterostructures. The strong spin-orbital coupling of InSb and the matched impedance at th…
▽ More
Magnetoresistance effects are crucial for understanding the charge/spin transport as well as propelling the advancement of spintronic applications. Here we report the coexistence of magnetic moment-dependent (MD) and magnetic field-driven (FD) unidirectional magnetoresistance (UMR) effects in CoFeB/InSb/CdTe heterostructures. The strong spin-orbital coupling of InSb and the matched impedance at the CoFeB/InSb interface warrant a distinct MD-UMR effect at room temperature, while the interaction between the in-plane magnetic field and the Rashba effect at the InSb/CdTe interface induces the marked FD-UMR signal that dominates the high-field region. Moreover, owning to the different spin transport mechanisms, these two types of nonreciprocal charge transport show opposite polarities with respect to the magnetic field direction, which further enable an effective phase modulation of the angular-dependent magnetoresistance. Besides, the demonstrations of both the tunable UMR response and two-terminal spin-orbit torque-driven magnetization switching validate our CoFeB/InSb/CdTe system as a suitable integrated building block for multifunctional spintronic device design.
△ Less
Submitted 20 November, 2023;
originally announced November 2023.
-
Emerging Drug Interaction Prediction Enabled by Flow-based Graph Neural Network with Biomedical Network
Authors:
Yongqi Zhang,
Quanming Yao,
Ling Yue,
Xian Wu,
Ziheng Zhang,
Zhenxi Lin,
Yefeng Zheng
Abstract:
Accurately predicting drug-drug interactions (DDI) for emerging drugs, which offer possibilities for treating and alleviating diseases, with computational methods can improve patient care and contribute to efficient drug development. However, many existing computational methods require large amounts of known DDI information, which is scarce for emerging drugs. In this paper, we propose EmerGNN, a…
▽ More
Accurately predicting drug-drug interactions (DDI) for emerging drugs, which offer possibilities for treating and alleviating diseases, with computational methods can improve patient care and contribute to efficient drug development. However, many existing computational methods require large amounts of known DDI information, which is scarce for emerging drugs. In this paper, we propose EmerGNN, a graph neural network (GNN) that can effectively predict interactions for emerging drugs by leveraging the rich information in biomedical networks. EmerGNN learns pairwise representations of drugs by extracting the paths between drug pairs, propagating information from one drug to the other, and incorporating the relevant biomedical concepts on the paths. The different edges on the biomedical network are weighted to indicate the relevance for the target DDI prediction. Overall, EmerGNN has higher accuracy than existing approaches in predicting interactions for emerging drugs and can identify the most relevant information on the biomedical network.
△ Less
Submitted 15 November, 2023;
originally announced November 2023.
-
Combating Bilateral Edge Noise for Robust Link Prediction
Authors:
Zhanke Zhou,
Jiangchao Yao,
Jiaxu Liu,
Xiawei Guo,
Quanming Yao,
Li He,
Liang Wang,
Bo Zheng,
Bo Han
Abstract:
Although link prediction on graphs has achieved great success with the development of graph neural networks (GNNs), the potential robustness under the edge noise is still less investigated. To close this gap, we first conduct an empirical study to disclose that the edge noise bilaterally perturbs both input topology and target label, yielding severe performance degradation and representation colla…
▽ More
Although link prediction on graphs has achieved great success with the development of graph neural networks (GNNs), the potential robustness under the edge noise is still less investigated. To close this gap, we first conduct an empirical study to disclose that the edge noise bilaterally perturbs both input topology and target label, yielding severe performance degradation and representation collapse. To address this dilemma, we propose an information-theory-guided principle, Robust Graph Information Bottleneck (RGIB), to extract reliable supervision signals and avoid representation collapse. Different from the basic information bottleneck, RGIB further decouples and balances the mutual dependence among graph topology, target labels, and representation, building new learning objectives for robust representation against the bilateral noise. Two instantiations, RGIB-SSL and RGIB-REP, are explored to leverage the merits of different methodologies, i.e., self-supervised learning and data reparameterization, for implicit and explicit data denoising, respectively. Extensive experiments on six datasets and three GNNs with diverse noisy scenarios verify the effectiveness of our RGIB instantiations. The code is publicly available at: https://github.com/tmlr-group/RGIB.
△ Less
Submitted 2 November, 2023;
originally announced November 2023.
-
An inverse source problem for the stochastic multi-term time-fractional diffusion-wave equation
Authors:
Xiaoli Feng,
Qiang Yao,
Peijun Li,
Xu Wang
Abstract:
In this paper, we study both the direct and inverse random source problems associated with the multi-term time-fractional diffusion-wave equation driven by a fractional Brownian motion. Regarding the direct problem, the well-posedness is established and the regularity of the solution is characterized for the equation. In the context of the inverse problem, the uniqueness and instability are invest…
▽ More
In this paper, we study both the direct and inverse random source problems associated with the multi-term time-fractional diffusion-wave equation driven by a fractional Brownian motion. Regarding the direct problem, the well-posedness is established and the regularity of the solution is characterized for the equation. In the context of the inverse problem, the uniqueness and instability are investigated on the determination of the random source. Furthermore, a reconstruction formula is provided for the phaseless Fourier modes of the diffusion coefficient in the random source, based on the variance of the boundary data. To reconstruct the time-dependent source function from its phaseless Fourier modes, the PhaseLift method, combined with a spectral cut-off regularization technique, is employed to tackle the phase retrieval problem. The effectiveness of the proposed method is demonstrated through a series of numerical experiments.
△ Less
Submitted 2 November, 2023;
originally announced November 2023.
-
Ensemble Learning for Graph Neural Networks
Authors:
Zhen Hao Wong,
Ling Yue,
Quanming Yao
Abstract:
Graph Neural Networks (GNNs) have shown success in various fields for learning from graph-structured data. This paper investigates the application of ensemble learning techniques to improve the performance and robustness of Graph Neural Networks (GNNs). By training multiple GNN models with diverse initializations or architectures, we create an ensemble model named ELGNN that captures various aspec…
▽ More
Graph Neural Networks (GNNs) have shown success in various fields for learning from graph-structured data. This paper investigates the application of ensemble learning techniques to improve the performance and robustness of Graph Neural Networks (GNNs). By training multiple GNN models with diverse initializations or architectures, we create an ensemble model named ELGNN that captures various aspects of the data and uses the Tree-Structured Parzen Estimator algorithm to determine the ensemble weights. Combining the predictions of these models enhances overall accuracy, reduces bias and variance, and mitigates the impact of noisy data. Our findings demonstrate the efficacy of ensemble learning in enhancing GNN capabilities for analyzing complex graph-structured data. The code is public at https://github.com/wongzhenhao/ELGNN.
△ Less
Submitted 21 October, 2023;
originally announced October 2023.
-
Positive-Unlabeled Node Classification with Structure-aware Graph Learning
Authors:
Hansi Yang,
Yongqi Zhang,
Quanming Yao,
James Kwok
Abstract:
Node classification on graphs is an important research problem with many applications. Real-world graph data sets may not be balanced and accurate as assumed by most existing works. A challenging setting is positive-unlabeled (PU) node classification, where labeled nodes are restricted to positive nodes. It has diverse applications, e.g., pandemic prediction or network anomaly detection. Existing…
▽ More
Node classification on graphs is an important research problem with many applications. Real-world graph data sets may not be balanced and accurate as assumed by most existing works. A challenging setting is positive-unlabeled (PU) node classification, where labeled nodes are restricted to positive nodes. It has diverse applications, e.g., pandemic prediction or network anomaly detection. Existing works on PU node classification overlook information in the graph structure, which can be critical. In this paper, we propose to better utilize graph structure for PU node classification. We first propose a distance-aware PU loss that uses homophily in graphs to introduce more accurate supervision. We also propose a regularizer to align the model with graph structure. Theoretical analysis shows that minimizing the proposed loss also leads to minimizing the expected loss with both positive and negative labels. Extensive empirical evaluation on diverse graph data sets demonstrates its superior performance over existing state-of-the-art methods.
△ Less
Submitted 20 October, 2023;
originally announced October 2023.
-
Relation-aware Ensemble Learning for Knowledge Graph Embedding
Authors:
Ling Yue,
Yongqi Zhang,
Quanming Yao,
Yong Li,
Xian Wu,
Ziheng Zhang,
Zhenxi Lin,
Yefeng Zheng
Abstract:
Knowledge graph (KG) embedding is a fundamental task in natural language processing, and various methods have been proposed to explore semantic patterns in distinctive ways. In this paper, we propose to learn an ensemble by leveraging existing methods in a relation-aware manner. However, exploring these semantics using relation-aware ensemble leads to a much larger search space than general ensemb…
▽ More
Knowledge graph (KG) embedding is a fundamental task in natural language processing, and various methods have been proposed to explore semantic patterns in distinctive ways. In this paper, we propose to learn an ensemble by leveraging existing methods in a relation-aware manner. However, exploring these semantics using relation-aware ensemble leads to a much larger search space than general ensemble methods. To address this issue, we propose a divide-search-combine algorithm RelEns-DSC that searches the relation-wise ensemble weights independently. This algorithm has the same computation cost as general ensemble methods but with much better performance. Experimental results on benchmark datasets demonstrate the effectiveness of the proposed method in efficiently searching relation-aware ensemble weights and achieving state-of-the-art embedding performance. The code is public at https://github.com/LARS-research/RelEns.
△ Less
Submitted 13 October, 2023;
originally announced October 2023.
-
PACIA: Parameter-Efficient Adapter for Few-Shot Molecular Property Prediction
Authors:
Shiguang Wu,
Yaqing Wang,
Quanming Yao
Abstract:
Molecular property prediction (MPP) plays a crucial role in biomedical applications, but it often encounters challenges due to a scarcity of labeled data. Existing works commonly adopt gradient-based strategy to update a large amount of parameters for task-level adaptation. However, the increase of adaptive parameters can lead to overfitting and poor performance. Observing that graph neural networ…
▽ More
Molecular property prediction (MPP) plays a crucial role in biomedical applications, but it often encounters challenges due to a scarcity of labeled data. Existing works commonly adopt gradient-based strategy to update a large amount of parameters for task-level adaptation. However, the increase of adaptive parameters can lead to overfitting and poor performance. Observing that graph neural network (GNN) performs well as both encoder and predictor, we propose PACIA, a parameter-efficient GNN adapter for few-shot MPP. We design a unified adapter to generate a few adaptive parameters to modulate the message passing process of GNN. We then adopt a hierarchical adaptation mechanism to adapt the encoder at task-level and the predictor at query-level by the unified GNN adapter. Extensive results show that PACIA obtains the state-of-the-art performance in few-shot MPP problems, and our proposed hierarchical adaptation mechanism is rational and effective.
△ Less
Submitted 8 May, 2024; v1 submitted 1 October, 2023;
originally announced October 2023.
-
Music- and Lyrics-driven Dance Synthesis
Authors:
Wenjie Yin,
Qingyuan Yao,
Yi Yu,
Hang Yin,
Danica Kragic,
Mårten Björkman
Abstract:
Lyrics often convey information about the songs that are beyond the auditory dimension, enriching the semantic meaning of movements and musical themes. Such insights are important in the dance choreography domain. However, most existing dance synthesis methods mainly focus on music-to-dance generation, without considering the semantic information. To complement it, we introduce JustLMD, a new mult…
▽ More
Lyrics often convey information about the songs that are beyond the auditory dimension, enriching the semantic meaning of movements and musical themes. Such insights are important in the dance choreography domain. However, most existing dance synthesis methods mainly focus on music-to-dance generation, without considering the semantic information. To complement it, we introduce JustLMD, a new multimodal dataset of 3D dance motion with music and lyrics. To the best of our knowledge, this is the first dataset with triplet information including dance motion, music, and lyrics. Additionally, we showcase a cross-modal diffusion-based network designed to generate 3D dance motion conditioned on music and lyrics. The proposed JustLMD dataset encompasses 4.6 hours of 3D dance motion in 1867 sequences, accompanied by musical tracks and their corresponding English lyrics.
△ Less
Submitted 30 September, 2023;
originally announced October 2023.
-
A moving least square immersed boundary method for SPH with thin-walled structures
Authors:
ZhuoLin Wang,
Zichao Jiang,
Yi Zhang,
Gengchao Yang,
Trevor Hocksun Kwan,
Yuhui Chen,
Qinghe Yao
Abstract:
This paper presents a novel method for smoothed particle hydrodynamics (SPH) with thin-walled structures. Inspired by the direct forcing immersed boundary method, this method employs a moving least square method to guarantee the smoothness of velocity near the structure surface. It simplifies thin-walled structure simulations by eliminating the need for multiple layers of boundary particles, and i…
▽ More
This paper presents a novel method for smoothed particle hydrodynamics (SPH) with thin-walled structures. Inspired by the direct forcing immersed boundary method, this method employs a moving least square method to guarantee the smoothness of velocity near the structure surface. It simplifies thin-walled structure simulations by eliminating the need for multiple layers of boundary particles, and improves computational accuracy and stability in three-dimensional scenarios. Supportive three-dimensional numerical results are provided, including the impulsively started plate and the flow past a cylinder. Results of the impulsively started test demonstrate that the proposed method obtains smooth velocity and pressure in the, as well as a good match to the references results of the vortex wake development. In addition, results of the flow past cylinder test show that the proposed method avoids mutual interference on both side of the boundary, remains stable for three-dimensional simulations while accurately calculating the forces acting on structure.
△ Less
Submitted 8 October, 2023; v1 submitted 19 September, 2023;
originally announced September 2023.
-
Stochastic Learning of Semiparametric Monotone Index Models with Large Sample Size
Authors:
Qingsong Yao
Abstract:
I study the estimation of semiparametric monotone index models in the scenario where the number of observation points $n$ is extremely large and conventional approaches fail to work due to heavy computational burdens. Motivated by the mini-batch gradient descent algorithm (MBGD) that is widely used as a stochastic optimization tool in the machine learning field, I proposes a novel subsample- and i…
▽ More
I study the estimation of semiparametric monotone index models in the scenario where the number of observation points $n$ is extremely large and conventional approaches fail to work due to heavy computational burdens. Motivated by the mini-batch gradient descent algorithm (MBGD) that is widely used as a stochastic optimization tool in the machine learning field, I proposes a novel subsample- and iteration-based estimation procedure. In particular, starting from any initial guess of the true parameter, I progressively update the parameter using a sequence of subsamples randomly drawn from the data set whose sample size is much smaller than $n$. The update is based on the gradient of some well-chosen loss function, where the nonparametric component is replaced with its Nadaraya-Watson kernel estimator based on subsamples. My proposed algorithm essentially generalizes MBGD algorithm to the semiparametric setup. Compared with full-sample-based method, the new method reduces the computational time by roughly $n$ times if the subsample size and the kernel function are chosen properly, so can be easily applied when the sample size $n$ is large. Moreover, I show that if I further conduct averages across the estimators produced during iterations, the difference between the average estimator and full-sample-based estimator will be $1/\sqrt{n}$-trivial. Consequently, the average estimator is $1/\sqrt{n}$-consistent and asymptotically normally distributed. In other words, the new estimator substantially improves the computational speed, while at the same time maintains the estimation accuracy.
△ Less
Submitted 27 October, 2023; v1 submitted 12 September, 2023;
originally announced September 2023.
-
A Versatile Graph Learning Approach through LLM-based Agent
Authors:
Lanning Wei,
Huan Zhao,
Xiaohan Zheng,
Zhiqiang He,
Quanming Yao
Abstract:
Designing versatile graph learning approaches is important, considering the diverse graphs and tasks existing in real-world applications. Existing methods have attempted to achieve this target through automated machine learning techniques, pre-training and fine-tuning strategies, and large language models. However, these methods are not versatile enough for graph learning, as they work on either l…
▽ More
Designing versatile graph learning approaches is important, considering the diverse graphs and tasks existing in real-world applications. Existing methods have attempted to achieve this target through automated machine learning techniques, pre-training and fine-tuning strategies, and large language models. However, these methods are not versatile enough for graph learning, as they work on either limited types of graphs or a single task. In this paper, we propose to explore versatile graph learning approaches with LLM-based agents, and the key insight is customizing the graph learning procedures for diverse graphs and tasks. To achieve this, we develop several LLM-based agents, equipped with diverse profiles, tools, functions and human experience. They collaborate to configure each procedure with task and data-specific settings step by step towards versatile solutions, and the proposed method is dubbed GL-Agent. By evaluating on diverse tasks and graphs, the correct results of the agent and its comparable performance showcase the versatility of the proposed method, especially in complex scenarios.The low resource cost and the potential to use open-source LLMs highlight the efficiency of GL-Agent.
△ Less
Submitted 1 September, 2024; v1 submitted 8 September, 2023;
originally announced September 2023.
-
Kinkless electronic junction along one dimensional electronic channel
Authors:
Qirong Yao,
Jae Whan Park,
Choongjae Won,
Sang-Wook Cheong,
Han Woong Yeom
Abstract:
Here we report the formation of type-A and type-B electronic junctions without any structural discontinuity along a well-defined 1-nm-wide one-dimensional electronic channel within a van der Waals layer. We employ scanning tunneling microscopy and spectroscopy techniques to investigate the atomic and electronic structure along peculiar domain walls formed on the charge-density-wave phase of 1T-TaS…
▽ More
Here we report the formation of type-A and type-B electronic junctions without any structural discontinuity along a well-defined 1-nm-wide one-dimensional electronic channel within a van der Waals layer. We employ scanning tunneling microscopy and spectroscopy techniques to investigate the atomic and electronic structure along peculiar domain walls formed on the charge-density-wave phase of 1T-TaS2. We find distinct kinds of abrupt electronic junctions with discontinuities of the band gap along the domain walls, which do not have any structural kinks and defects. Our density-functional calculations reveal a novel mechanism of the electronic junction formation; they are formed by a kinked domain wall in the layer underneath through substantial electronic interlayer coupling. This work demonstrates that the interlayer electronic coupling can be an effective control knob over several-nanometer-scale electronic property of two-dimensional atomic monolayers.
△ Less
Submitted 4 September, 2023;
originally announced September 2023.
-
Observation of gamma rays up to 320 TeV from the middle-aged TeV pulsar wind nebula HESS J1849$-$000
Authors:
M. Amenomori,
S. Asano,
Y. W. Bao,
X. J. Bi,
D. Chen,
T. L. Chen,
W. Y. Chen,
Xu Chen,
Y. Chen,
Cirennima,
S. W. Cui,
Danzengluobu,
L. K. Ding,
J. H. Fang,
K. Fang,
C. F. Feng,
Zhaoyang Feng,
Z. Y. Feng,
Qi Gao,
A. Gomi,
Q. B. Gou,
Y. Q. Guo,
Y. Y. Guo,
Y. Hayashi,
H. H. He
, et al. (93 additional authors not shown)
Abstract:
Gamma rays from HESS J1849$-$000, a middle-aged TeV pulsar wind nebula (PWN), are observed by the Tibet air shower array and the muon detector array. The detection significance of gamma rays reaches $4.0\, σ$ and $4.4\, σ$ levels above 25 TeV and 100 TeV, respectively, in units of Gaussian standard deviation $σ$. The energy spectrum measured between $40\, {\rm TeV} < E < 320\, {\rm TeV}$ for the f…
▽ More
Gamma rays from HESS J1849$-$000, a middle-aged TeV pulsar wind nebula (PWN), are observed by the Tibet air shower array and the muon detector array. The detection significance of gamma rays reaches $4.0\, σ$ and $4.4\, σ$ levels above 25 TeV and 100 TeV, respectively, in units of Gaussian standard deviation $σ$. The energy spectrum measured between $40\, {\rm TeV} < E < 320\, {\rm TeV}$ for the first time is described with a simple power-law function of ${\rm d}N/{\rm d}E = (2.86 \pm 1.44) \times 10^{-16}(E/40\, {\rm TeV})^{-2.24 \pm 0.41}\, {\rm TeV}^{-1}\, {\rm cm}^{-2}\, {\rm s}^{-1}$. The gamma-ray energy spectrum from the sub-TeV ($E < 1\, {\rm TeV}$) to sub-PeV ($100\, {\rm TeV} < E < 1\, {\rm PeV}$) ranges including the results of previous studies can be modeled with the leptonic scenario, inverse Compton scattering by high-energy electrons accelerated by the PWN of PSR J1849$-$0001. On the other hand, the gamma-ray energy spectrum can also be modeled with the hadronic scenario in which gamma rays are generated from the decay of neutral pions produced by collisions between accelerated cosmic-ray protons and the ambient molecular cloud found in the gamma-ray emitting region. The cutoff energy of cosmic-ray protons $E_{\rm p\, cut}$, cut is estimated at ${\rm log}_{10}(E_{\rm p,\, cut}/{\rm TeV}) = 3.73^{+2.98}_{-0.66}$, suggesting that protons are accelerated up to the PeV energy range. Our study thus proposes that HESS J1849$-$000 should be further investigated as a new candidate for a Galactic PeV cosmic-ray accelerator, PeVatron.
△ Less
Submitted 26 August, 2023;
originally announced August 2023.
-
Measurement of the Gamma-Ray Energy Spectrum beyond 100 TeV from the HESS J1843$-$033 Region
Authors:
M. Amenomori,
S. Asano,
Y. W. Bao,
X. J. Bi,
D. Chen,
T. L. Chen,
W. Y. Chen,
Xu Chen,
Y. Chen,
Cirennima,
S. W. Cui,
Danzengluobu,
L. K. Ding,
J. H. Fang,
K. Fang,
C. F. Feng,
Zhaoyang Feng,
Z. Y. Feng,
Qi Gao,
A. Gomi,
Q. B. Gou,
Y. Q. Guo,
Y. Y. Guo,
H. H. He,
Z. T. He
, et al. (91 additional authors not shown)
Abstract:
HESS J1843$-$033 is a very-high-energy gamma-ray source whose origin remains unidentified. This work presents, for the first time, the energy spectrum of gamma rays beyond $100\, {\rm TeV}$ from the HESS J1843$-$033 region using the data recorded by the Tibet air shower array and its underground muon detector array. A gamma-ray source with an extension of $0.34^{\circ} \pm 0.12^{\circ}$ is success…
▽ More
HESS J1843$-$033 is a very-high-energy gamma-ray source whose origin remains unidentified. This work presents, for the first time, the energy spectrum of gamma rays beyond $100\, {\rm TeV}$ from the HESS J1843$-$033 region using the data recorded by the Tibet air shower array and its underground muon detector array. A gamma-ray source with an extension of $0.34^{\circ} \pm 0.12^{\circ}$ is successfully detected above $25\, {\rm TeV}$ at $(α,\, δ) = (281.09^{\circ}\pm 0.10^{\circ},\, -3.76^{\circ}\pm 0.09^{\circ})$ near HESS J1843$-$033 with a statistical significance of $6.2\, σ$, and the source is named TASG J1844$-$038. The position of TASG J1844$-$038 is consistent with those of HESS J1843$-$033, eHWC J1842$-$035, and LHAASO J1843$-$0338. The measured gamma-ray energy spectrum in $25\, {\rm TeV} < E < 130\, {\rm TeV}$ is described with ${\rm d}N/{\rm d}E = (9.70\pm 1.89)\times 10^{-16} (E/40\, {\rm TeV})^{-3.26\pm 0.30}\, {\rm TeV}^{-1} {\rm cm}^{-2} {\rm s}^{-1}$, and the spectral fit to the combined spectra of HESS J1843$-$033, LHAASO J1843$-$0338, and TASG J1844$-$038 implies the existence of a cutoff at $49.5\pm 9.0\, {\rm TeV}$. Associations of TASG J1844-038 with SNR G28.6$-$0.1 and PSR J1844-0346 are also discussed in detail for the first time.
△ Less
Submitted 26 August, 2023;
originally announced August 2023.