-
PepHarmony: A Multi-View Contrastive Learning Framework for Integrated Sequence and Structure-Based Peptide Encoding
Authors:
Ruochi Zhang,
Haoran Wu,
Chang Liu,
Huaping Li,
Yuqian Wu,
Kewei Li,
Yifan Wang,
Yifan Deng,
Jiahui Chen,
Fengfeng Zhou,
Xin Gao
Abstract:
Recent advances in protein language models have catalyzed significant progress in peptide sequence representation. Despite extensive exploration in this field, pre-trained models tailored for peptide-specific needs remain largely unaddressed due to the difficulty in capturing the complex and sometimes unstable structures of peptides. This study introduces a novel multi-view contrastive learning fr…
▽ More
Recent advances in protein language models have catalyzed significant progress in peptide sequence representation. Despite extensive exploration in this field, pre-trained models tailored for peptide-specific needs remain largely unaddressed due to the difficulty in capturing the complex and sometimes unstable structures of peptides. This study introduces a novel multi-view contrastive learning framework PepHarmony for the sequence-based peptide encoding task. PepHarmony innovatively combines both sequence- and structure-level information into a sequence-level encoding module through contrastive learning. We carefully select datasets from the Protein Data Bank (PDB) and AlphaFold database to encompass a broad spectrum of peptide sequences and structures. The experimental data highlights PepHarmony's exceptional capability in capturing the intricate relationship between peptide sequences and structures compared with the baseline and fine-tuned models. The robustness of our model is confirmed through extensive ablation studies, which emphasize the crucial roles of contrastive loss and strategic data sorting in enhancing predictive performance. The proposed PepHarmony framework serves as a notable contribution to peptide representations, and offers valuable insights for future applications in peptide drug discovery and peptide engineering. We have made all the source code utilized in this study publicly accessible via GitHub at https://github.com/zhangruochi/PepHarmony or http://www.healthinformaticslab.org/supp/.
△ Less
Submitted 20 January, 2024;
originally announced January 2024.
-
Rethinking Impersonation and Dodging Attacks on Face Recognition Systems
Authors:
Fengfan Zhou,
Qianyu Zhou,
Bangjie Yin,
Hui Zheng,
Xuequan Lu,
Lizhuang Ma,
Hefei Ling
Abstract:
Face Recognition (FR) systems can be easily deceived by adversarial examples that manipulate benign face images through imperceptible perturbations. Adversarial attacks on FR encompass two types: impersonation (targeted) attacks and dodging (untargeted) attacks. Previous methods often achieve a successful impersonation attack on FR, however, it does not necessarily guarantee a successful dodging a…
▽ More
Face Recognition (FR) systems can be easily deceived by adversarial examples that manipulate benign face images through imperceptible perturbations. Adversarial attacks on FR encompass two types: impersonation (targeted) attacks and dodging (untargeted) attacks. Previous methods often achieve a successful impersonation attack on FR, however, it does not necessarily guarantee a successful dodging attack on FR in the black-box setting. In this paper, our key insight is that the generation of adversarial examples should perform both impersonation and dodging attacks simultaneously. To this end, we propose a novel attack method termed as Adversarial Pruning (Adv-Pruning), to fine-tune existing adversarial examples to enhance their dodging capabilities while preserving their impersonation capabilities. Adv-Pruning consists of Priming, Pruning, and Restoration stages. Concretely, we propose Adversarial Priority Quantification to measure the region-wise priority of original adversarial perturbations, identifying and releasing those with minimal impact on absolute model output variances. Then, Biased Gradient Adaptation is presented to adapt the adversarial examples to traverse the decision boundaries of both the attacker and victim by adding perturbations favoring dodging attacks on the vacated regions, preserving the prioritized features of the original perturbations while boosting dodging performance. As a result, we can maintain the impersonation capabilities of original adversarial examples while effectively enhancing dodging capabilities. Comprehensive experiments demonstrate the superiority of our method compared with state-of-the-art adversarial attack methods.
△ Less
Submitted 17 August, 2024; v1 submitted 16 January, 2024;
originally announced January 2024.
-
CLIPRerank: An Extremely Simple Method for Improving Ad-hoc Video Search
Authors:
Aozhu Chen,
Fangming Zhou,
Ziyuan Wang,
Xirong Li
Abstract:
Ad-hoc Video Search (AVS) enables users to search for unlabeled video content using on-the-fly textual queries. Current deep learning-based models for AVS are trained to optimize holistic similarity between short videos and their associated descriptions. However, due to the diversity of ad-hoc queries, even for a short video, its truly relevant part w.r.t. a given query can be of shorter duration.…
▽ More
Ad-hoc Video Search (AVS) enables users to search for unlabeled video content using on-the-fly textual queries. Current deep learning-based models for AVS are trained to optimize holistic similarity between short videos and their associated descriptions. However, due to the diversity of ad-hoc queries, even for a short video, its truly relevant part w.r.t. a given query can be of shorter duration. In such a scenario, the holistic similarity becomes suboptimal. To remedy the issue, we propose in this paper CLIPRerank, a fine-grained re-scoring method. We compute cross-modal similarities between query and video frames using a pre-trained CLIP model, with multi-frame scores aggregated by max pooling. The fine-grained score is weightedly added to the initial score for search result reranking. As such, CLIPRerank is agnostic to the underlying video retrieval models and extremely simple, making it a handy plug-in for boosting AVS. Experiments on the challenging TRECVID AVS benchmarks (from 2016 to 2021) justify the effectiveness of the proposed strategy. CLIPRerank consistently improves the TRECVID top performers and multiple existing models including SEA, W2VV++, Dual Encoding, Dual Task, LAFF, CLIP2Video, TS2-Net and X-CLIP. Our method also works when substituting BLIP-2 for CLIP.
△ Less
Submitted 16 January, 2024;
originally announced January 2024.
-
Composition method for chromatic symmetric functions: Neat noncommutative analogs
Authors:
David G. L. Wang,
James Z. F. Zhou
Abstract:
This work is inspired by Shareshian and Wachs's exquisite formula for the chromatic symmetric function of paths. We develop a composition method to unearth neat noncommutative analogs of chromatic symmetric functions. A symmetric function is $e$-positive if and only if it has a $Λ$-positive noncommutative analog. We bring to light short and sweet $Λ$-positive noncommutative analogs for the chromat…
▽ More
This work is inspired by Shareshian and Wachs's exquisite formula for the chromatic symmetric function of paths. We develop a composition method to unearth neat noncommutative analogs of chromatic symmetric functions. A symmetric function is $e$-positive if and only if it has a $Λ$-positive noncommutative analog. We bring to light short and sweet $Λ$-positive noncommutative analogs for the chromatic symmetric functions of tadpoles and barbells. Using these elegant formulas and the composition method, we discover a new family of $e$-positive graphs and call it hat graphs, which are the unicyclic graphs obtained by adding an edge to a path. We also obtain a compact ribbon Schur analog for cycles.
△ Less
Submitted 10 January, 2024; v1 submitted 1 January, 2024;
originally announced January 2024.
-
DB-GPT: Empowering Database Interactions with Private Large Language Models
Authors:
Siqiao Xue,
Caigao Jiang,
Wenhui Shi,
Fangyin Cheng,
Keting Chen,
Hongjun Yang,
Zhiping Zhang,
Jianshan He,
Hongyang Zhang,
Ganglin Wei,
Wang Zhao,
Fan Zhou,
Danrui Qi,
Hong Yi,
Shaodong Liu,
Faqiang Chen
Abstract:
The recent breakthroughs in large language models (LLMs) are positioned to transition many areas of software. Database technologies particularly have an important entanglement with LLMs as efficient and intuitive database interactions are paramount. In this paper, we present DB-GPT, a revolutionary and production-ready project that integrates LLMs with traditional database systems to enhance user…
▽ More
The recent breakthroughs in large language models (LLMs) are positioned to transition many areas of software. Database technologies particularly have an important entanglement with LLMs as efficient and intuitive database interactions are paramount. In this paper, we present DB-GPT, a revolutionary and production-ready project that integrates LLMs with traditional database systems to enhance user experience and accessibility. DB-GPT is designed to understand natural language queries, provide context-aware responses, and generate complex SQL queries with high accuracy, making it an indispensable tool for users ranging from novice to expert. The core innovation in DB-GPT lies in its private LLM technology, which is fine-tuned on domain-specific corpora to maintain user privacy and ensure data security while offering the benefits of state-of-the-art LLMs. We detail the architecture of DB-GPT, which includes a novel retrieval augmented generation (RAG) knowledge system, an adaptive learning mechanism to continuously improve performance based on user feedback and a service-oriented multi-model framework (SMMF) with powerful data-driven agents. Our extensive experiments and user studies confirm that DB-GPT represents a paradigm shift in database interactions, offering a more natural, efficient, and secure way to engage with data repositories. The paper concludes with a discussion of the implications of DB-GPT framework on the future of human-database interaction and outlines potential avenues for further enhancements and applications in the field. The project code is available at https://github.com/eosphoros-ai/DB-GPT. Experience DB-GPT for yourself by installing it with the instructions https://github.com/eosphoros-ai/DB-GPT#install and view a concise 10-minute video at https://www.youtube.com/watch?v=KYs4nTDzEhk.
△ Less
Submitted 3 January, 2024; v1 submitted 28 December, 2023;
originally announced December 2023.
-
Diffusive Limit of the Vlasov-Poisson-Boltzmann System without Angular Cutoff
Authors:
Yuan Xu,
Fujun Zhou,
Yongsheng Li
Abstract:
Diffusive limit of the Vlasov-Poisson-Boltzmann system without angular cutoff in the framework of perturbation around global Maxwellian still remains open. By employing the weighted energy method with a newly introduced weight function $w_l(α,β)$ and some novel treatments, we solve this problem for the full range of non-cutoff potentials $γ>-3$ and $0<s<1$. Uniform estimate with respect to the Knu…
▽ More
Diffusive limit of the Vlasov-Poisson-Boltzmann system without angular cutoff in the framework of perturbation around global Maxwellian still remains open. By employing the weighted energy method with a newly introduced weight function $w_l(α,β)$ and some novel treatments, we solve this problem for the full range of non-cutoff potentials $γ>-3$ and $0<s<1$. Uniform estimate with respect to the Knudsen number $\varepsilon \in (0,1]$ is established globally in time, which eventually leads to the global existence of solutions to the Vlasov-Poisson-Boltzmann system without angular cutoff for the full range of non-cutoff potentials and hydrodynamic limit to the two-fluid incompressible Navier-Stokes-Fourier-Poisson system with Ohm's law. As a byproduct, this approach also extends the global existence results of previous studies on the Vlasov-Poisson-Boltzmann system without angular cutoff to the full range of non-cutoff potentials $γ>-3$ and $0<s<1$.
△ Less
Submitted 14 May, 2024; v1 submitted 27 December, 2023;
originally announced December 2023.
-
Lifting by Image -- Leveraging Image Cues for Accurate 3D Human Pose Estimation
Authors:
Feng Zhou,
Jianqin Yin,
Peiyang Li
Abstract:
The "lifting from 2D pose" method has been the dominant approach to 3D Human Pose Estimation (3DHPE) due to the powerful visual analysis ability of 2D pose estimators. Widely known, there exists a depth ambiguity problem when estimating solely from 2D pose, where one 2D pose can be mapped to multiple 3D poses. Intuitively, the rich semantic and texture information in images can contribute to a mor…
▽ More
The "lifting from 2D pose" method has been the dominant approach to 3D Human Pose Estimation (3DHPE) due to the powerful visual analysis ability of 2D pose estimators. Widely known, there exists a depth ambiguity problem when estimating solely from 2D pose, where one 2D pose can be mapped to multiple 3D poses. Intuitively, the rich semantic and texture information in images can contribute to a more accurate "lifting" procedure. Yet, existing research encounters two primary challenges. Firstly, the distribution of image data in 3D motion capture datasets is too narrow because of the laboratorial environment, which leads to poor generalization ability of methods trained with image information. Secondly, effective strategies for leveraging image information are lacking. In this paper, we give new insight into the cause of poor generalization problems and the effectiveness of image features. Based on that, we propose an advanced framework. Specifically, the framework consists of two stages. First, we enable the keypoints to query and select the beneficial features from all image patches. To reduce the keypoints attention to inconsequential background features, we design a novel Pose-guided Transformer Layer, which adaptively limits the updates to unimportant image patches. Then, through a designed Adaptive Feature Selection Module, we prune less significant image patches from the feature map. In the second stage, we allow the keypoints to further emphasize the retained critical image features. This progressive learning approach prevents further training on insignificant image features. Experimental results show that our model achieves state-of-the-art performance on both the Human3.6M dataset and the MPI-INF-3DHP dataset.
△ Less
Submitted 25 December, 2023;
originally announced December 2023.
-
Power Allocation and Beamforming Design for IRS-aided Secure Directional Modulation Network
Authors:
Rongen Dong,
Feng Shu,
Fuhui Zhou,
Yongpeng Wu,
Jiangzhou Wang
Abstract:
With the aim of boosting the security of the conventional directional modulation (DM) network, a secure DM network assisted by intelligent reflecting surface (IRS) is investigated in this paper. To maximize the secrecy rate (SR), we jointly optimize the power allocation (PA) factor, confidential message (CM) beamforming, artificial noise (AN) beamforming, and IRS reflected beamforming. To tackle t…
▽ More
With the aim of boosting the security of the conventional directional modulation (DM) network, a secure DM network assisted by intelligent reflecting surface (IRS) is investigated in this paper. To maximize the secrecy rate (SR), we jointly optimize the power allocation (PA) factor, confidential message (CM) beamforming, artificial noise (AN) beamforming, and IRS reflected beamforming. To tackle the formulated problem, a maximizing SR with high-performance (Max-SR-HP) scheme is proposed, where the PA factor, CM beamforming, AN beamforming, and IRS phase shift matrix are derived by the derivative operation, generalized Rayleigh-Ritz, generalized power iteration, and semidefinite relaxation criteria, respectively. Given that the high complexity of the above scheme, a maximizing SR with low-complexity (Max-SR-LC) scheme is proposed, which employs the generalized leakage and successive convex approximation algorithms to derive the variables. Simulation results show that both the proposed schemes can significantly boost the SR performance, and are better than the equal PA, no IRS and random phase shift IRS schemes.
△ Less
Submitted 4 March, 2024; v1 submitted 24 December, 2023;
originally announced December 2023.
-
SMC-NCA: Semantic-guided Multi-level Contrast for Semi-supervised Temporal Action Segmentation
Authors:
Feixiang Zhou,
Zheheng Jiang,
Huiyu Zhou,
Xuelong Li
Abstract:
Semi-supervised temporal action segmentation (SS-TA) aims to perform frame-wise classification in long untrimmed videos, where only a fraction of videos in the training set have labels. Recent studies have shown the potential of contrastive learning in unsupervised representation learning using unlabelled data. However, learning the representation of each frame by unsupervised contrastive learning…
▽ More
Semi-supervised temporal action segmentation (SS-TA) aims to perform frame-wise classification in long untrimmed videos, where only a fraction of videos in the training set have labels. Recent studies have shown the potential of contrastive learning in unsupervised representation learning using unlabelled data. However, learning the representation of each frame by unsupervised contrastive learning for action segmentation remains an open and challenging problem. In this paper, we propose a novel Semantic-guided Multi-level Contrast scheme with a Neighbourhood-Consistency-Aware unit (SMC-NCA) to extract strong frame-wise representations for SS-TAS. Specifically, for representation learning, SMC is first used to explore intra- and inter-information variations in a unified and contrastive way, based on action-specific semantic information and temporal information highlighting relations between actions. Then, the NCA module, which is responsible for enforcing spatial consistency between neighbourhoods centered at different frames to alleviate over-segmentation issues, works alongside SMC for semi-supervised learning (SSL). Our SMC outperforms the other state-of-the-art methods on three benchmarks, offering improvements of up to 17.8% and 12.6% in terms of Edit distance and accuracy, respectively. Additionally, the NCA unit results in significantly better segmentation performance in the presence of only 5% labelled videos. We also demonstrate the generalizability and effectiveness of the proposed method on our Parkinson Disease's Mouse Behaviour (PDMB) dataset. Code is available at https://github.com/FeixiangZhou/SMC-NCA.
△ Less
Submitted 19 July, 2024; v1 submitted 19 December, 2023;
originally announced December 2023.
-
Mitigating Label Bias in Machine Learning: Fairness through Confident Learning
Authors:
Yixuan Zhang,
Boyu Li,
Zenan Ling,
Feng Zhou
Abstract:
Discrimination can occur when the underlying unbiased labels are overwritten by an agent with potential bias, resulting in biased datasets that unfairly harm specific groups and cause classifiers to inherit these biases. In this paper, we demonstrate that despite only having access to the biased labels, it is possible to eliminate bias by filtering the fairest instances within the framework of con…
▽ More
Discrimination can occur when the underlying unbiased labels are overwritten by an agent with potential bias, resulting in biased datasets that unfairly harm specific groups and cause classifiers to inherit these biases. In this paper, we demonstrate that despite only having access to the biased labels, it is possible to eliminate bias by filtering the fairest instances within the framework of confident learning. In the context of confident learning, low self-confidence usually indicates potential label errors; however, this is not always the case. Instances, particularly those from underrepresented groups, might exhibit low confidence scores for reasons other than labeling errors. To address this limitation, our approach employs truncation of the confidence score and extends the confidence interval of the probabilistic threshold. Additionally, we incorporate with co-teaching paradigm for providing a more robust and reliable selection of fair instances and effectively mitigating the adverse effects of biased labels. Through extensive experimentation and evaluation of various datasets, we demonstrate the efficacy of our approach in promoting fairness and reducing the impact of label bias in machine learning models.
△ Less
Submitted 24 December, 2023; v1 submitted 14 December, 2023;
originally announced December 2023.
-
Concept-centric Personalization with Large-scale Diffusion Priors
Authors:
Pu Cao,
Lu Yang,
Feng Zhou,
Tianrui Huang,
Qing Song
Abstract:
Despite large-scale diffusion models being highly capable of generating diverse open-world content, they still struggle to match the photorealism and fidelity of concept-specific generators. In this work, we present the task of customizing large-scale diffusion priors for specific concepts as concept-centric personalization. Our goal is to generate high-quality concept-centric images while maintai…
▽ More
Despite large-scale diffusion models being highly capable of generating diverse open-world content, they still struggle to match the photorealism and fidelity of concept-specific generators. In this work, we present the task of customizing large-scale diffusion priors for specific concepts as concept-centric personalization. Our goal is to generate high-quality concept-centric images while maintaining the versatile controllability inherent to open-world models, enabling applications in diverse tasks such as concept-centric stylization and image translation. To tackle these challenges, we identify catastrophic forgetting of guidance prediction from diffusion priors as the fundamental issue. Consequently, we develop a guidance-decoupled personalization framework specifically designed to address this task. We propose Generalized Classifier-free Guidance (GCFG) as the foundational theory for our framework. This approach extends Classifier-free Guidance (CFG) to accommodate an arbitrary number of guidances, sourced from a variety of conditions and models. Employing GCFG enables us to separate conditional guidance into two distinct components: concept guidance for fidelity and control guidance for controllability. This division makes it feasible to train a specialized model for concept guidance, while ensuring both control and unconditional guidance remain intact. We then present a null-text Concept-centric Diffusion Model as a concept-specific generator to learn concept guidance without the need for text annotations. Code will be available at https://github.com/PRIV-Creation/Concept-centric-Personalization.
△ Less
Submitted 13 December, 2023;
originally announced December 2023.
-
Spectroscopy-Guided Discovery of Three-Dimensional Structures of Disordered Materials with Diffusion Models
Authors:
Hyuna Kwon,
Tim Hsu,
Wenyu Sun,
Wonseok Jeong,
Fikret Aydin,
James Chapman,
Xiao Chen,
Matthew R. Carbone,
Deyu Lu,
Fei Zhou,
Tuan Anh Pham
Abstract:
The ability to rapidly develop materials with desired properties has a transformative impact on a broad range of emerging technologies. In this work, we introduce a new framework based on the diffusion model, a recent generative machine learning method to predict 3D structures of disordered materials from a target property. For demonstration, we apply the model to identify the atomic structures of…
▽ More
The ability to rapidly develop materials with desired properties has a transformative impact on a broad range of emerging technologies. In this work, we introduce a new framework based on the diffusion model, a recent generative machine learning method to predict 3D structures of disordered materials from a target property. For demonstration, we apply the model to identify the atomic structures of amorphous carbons ($a$-C) as a representative material system from the target X-ray absorption near edge structure (XANES) spectra--a common experimental technique to probe atomic structures of materials. We show that conditional generation guided by XANES spectra reproduces key features of the target structures. Furthermore, we show that our model can steer the generative process to tailor atomic arrangements for a specific XANES spectrum. Finally, our generative model exhibits a remarkable scale-agnostic property, thereby enabling generation of realistic, large-scale structures through learning from a small-scale dataset (i.e., with small unit cells). Our work represents a significant stride in bridging the gap between materials characterization and atomic structure determination; in addition, it can be leveraged for materials discovery in exploring various material properties as targeted.
△ Less
Submitted 9 December, 2023;
originally announced December 2023.
-
Adaptive Resource Allocation for Semantic Communication Networks
Authors:
Lingyi Wang,
Wei Wu,
Fuhui Zhou,
Zhaohui Yang,
Zhijin Qin
Abstract:
Semantic communication, recognized as a promising technology for future intelligent applications, has received widespread research attention. Despite the potential of semantic communication to enhance transmission reliability, especially in low signal-to-noise (SNR) environments, the critical issue of resource allocation and compatibility in the dynamic wireless environment remains largely unexplo…
▽ More
Semantic communication, recognized as a promising technology for future intelligent applications, has received widespread research attention. Despite the potential of semantic communication to enhance transmission reliability, especially in low signal-to-noise (SNR) environments, the critical issue of resource allocation and compatibility in the dynamic wireless environment remains largely unexplored. In this paper, we propose an adaptive semantic resource allocation paradigm with semantic-bit quantization (SBQ) compatibly for existing wireless communications, where the inaccurate environment perception introduced by the additional mapping relationship between semantic metrics and transmission metrics is solved. In order to investigate the performance of semantic communication networks, the quality of service for semantic communication (SC-QoS), including the semantic quantization efficiency (SQE) and transmission latency, is proposed for the first time. A problem of maximizing the overall effective SC-QoS is formulated by jointly optimizing the transmit beamforming of the base station, the bits for semantic representation, the subchannel assignment, and the bandwidth resource allocation. To address the non-convex formulated problem, an intelligent resource allocation scheme is proposed based on a hybrid deep reinforcement learning (DRL) algorithm, where the intelligent agent can perceive both semantic tasks and dynamic wireless environments. Simulation results demonstrate that our design can effectively combat semantic noise and achieve superior performance in wireless communications compared to several benchmark schemes. Furthermore, compared to mapping-guided paradigm based resource allocation schemes, our proposed adaptive scheme can achieve up to 13% performance improvement in terms of SC-QoS.
△ Less
Submitted 2 December, 2023;
originally announced December 2023.
-
Hybrid Hierarchical DRL Enabled Resource Allocation for Secure Transmission in Multi-IRS-Assisted Sensing-Enhanced Spectrum Sharing Networks
Authors:
Lingyi Wang,
Wei Wu,
Fuhui Zhou,
Qihui Wu,
Octavia A. Dobre,
Tony Q. S. Quek
Abstract:
Secure communications are of paramount importance in spectrum sharing networks due to the allocation and sharing characteristics of spectrum resources. To further explore the potential of intelligent reflective surfaces (IRSs) in enhancing spectrum sharing and secure transmission performance, a multiple intelligent reflection surface (multi-IRS)-assisted sensing-enhanced wideband spectrum sharing…
▽ More
Secure communications are of paramount importance in spectrum sharing networks due to the allocation and sharing characteristics of spectrum resources. To further explore the potential of intelligent reflective surfaces (IRSs) in enhancing spectrum sharing and secure transmission performance, a multiple intelligent reflection surface (multi-IRS)-assisted sensing-enhanced wideband spectrum sharing network is investigated by considering physical layer security techniques. An intelligent resource allocation scheme based on double deep Q networks (D3QN) algorithm and soft Actor-Critic (SAC) algorithm is proposed to maximize the secure transmission rate of the secondary network by jointly optimizing IRS pairings, subchannel assignment, transmit beamforming of the secondary base station, reflection coefficients of IRSs and the sensing time. To tackle the sparse reward problem caused by a significant amount of reflection elements of multiple IRSs, the method of hierarchical reinforcement learning is exploited. An alternative optimization (AO)-based conventional mathematical scheme is introduced to verify the computational complexity advantage of our proposed intelligent scheme. Simulation results demonstrate the efficiency of our proposed intelligent scheme as well as the superiority of multi-IRS design in enhancing secrecy rate and spectrum utilization. It is shown that inappropriate deployment of IRSs can reduce the security performance with the presence of multiple eavesdroppers (Eves), and the arrangement of IRSs deserves further consideration.
△ Less
Submitted 2 December, 2023;
originally announced December 2023.
-
Linear normalised hash function for clustering gene sequences and identifying reference sequences from multiple sequence alignments
Authors:
Manal Helal,
Fanrong Kong,
Sharon C-A Chen,
Fei Zhou,
Dominic E Dwyer,
John Potter,
Vitali Sintchenko
Abstract:
The aim of this study was to develop a method that would identify the cluster centroids and the optimal number of clusters for a given sensitivity level and could work equally well for the different sequence datasets. A novel method that combines the linear mapping hash function and multiple sequence alignment (MSA) was developed. This method takes advantage of the already sorted by similarity seq…
▽ More
The aim of this study was to develop a method that would identify the cluster centroids and the optimal number of clusters for a given sensitivity level and could work equally well for the different sequence datasets. A novel method that combines the linear mapping hash function and multiple sequence alignment (MSA) was developed. This method takes advantage of the already sorted by similarity sequences from the MSA output, and identifies the optimal number of clusters, clusters cut-offs, and clusters centroids that can represent reference gene vouchers for the different species. The linear mapping hash function can map an already ordered by similarity distance matrix to indices to reveal gaps in the values around which the optimal cut-offs of the different clusters can be identified. The method was evaluated using sets of closely related (16S rRNA gene sequences of Nocardia species) and highly variable (VP1 genomic region of Enterovirus 71) sequences and outperformed existing unsupervised machine learning clustering methods and dimensionality reduction methods. This method does not require prior knowledge of the number of clusters or the distance between clusters, handles clusters of different sizes and shapes, and scales linearly with the dataset. The combination of MSA with the linear mapping hash function is a computationally efficient way of gene sequence clustering and can be a valuable tool for the assessment of similarity, clustering of different microbial genomes, identifying reference sequences, and for the study of evolution of bacteria and viruses.
△ Less
Submitted 29 November, 2023;
originally announced November 2023.
-
Multireference covariant density-functional theory for the low-lying states of odd-mass nuclei
Authors:
E. F. Zhou,
X. Y. Wu,
J. M. Yao
Abstract:
We extend multireference covariant density-functional theory (MR-CDFT) based on a relativistic point-coupling energy functional to describe the low-lying states of odd-mass nuclei. The nuclear wave function is constructed as a superposition of quadrupole-octupole deformed mean-field configurations, with projection onto angular momentum, particle numbers, and parity within the framework of the gene…
▽ More
We extend multireference covariant density-functional theory (MR-CDFT) based on a relativistic point-coupling energy functional to describe the low-lying states of odd-mass nuclei. The nuclear wave function is constructed as a superposition of quadrupole-octupole deformed mean-field configurations, with projection onto angular momentum, particle numbers, and parity within the framework of the generator coordinate method. Using $^{25}$Mg as an example, we calculate the energy spectrum, electric multipole, and magnetic dipole transition strengths based on three different schemes for the mean-field configurations of odd-mass nuclei. We find that the low-energy structure of $^{25}$Mg is reasonably reproduced in all three schemes. In particular, the effect of octupole correlation is illustrated in the application to the low-lying parity doublets of $^{21}$Ne. This work demonstrates the success of the MR-CDFT for the low-lying states of odd-mass nuclei with possible strong quadruple-octupole correlations.
△ Less
Submitted 8 January, 2024; v1 submitted 26 November, 2023;
originally announced November 2023.
-
Hessian Aware Low-Rank Perturbation for Order-Robust Continual Learning
Authors:
Jiaqi Li,
Yuanhao Lai,
Rui Wang,
Changjian Shui,
Sabyasachi Sahoo,
Charles X. Ling,
Shichun Yang,
Boyu Wang,
Christian Gagné,
Fan Zhou
Abstract:
Continual learning aims to learn a series of tasks sequentially without forgetting the knowledge acquired from the previous ones. In this work, we propose the Hessian Aware Low-Rank Perturbation algorithm for continual learning. By modeling the parameter transitions along the sequential tasks with the weight matrix transformation, we propose to apply the low-rank approximation on the task-adaptive…
▽ More
Continual learning aims to learn a series of tasks sequentially without forgetting the knowledge acquired from the previous ones. In this work, we propose the Hessian Aware Low-Rank Perturbation algorithm for continual learning. By modeling the parameter transitions along the sequential tasks with the weight matrix transformation, we propose to apply the low-rank approximation on the task-adaptive parameters in each layer of the neural networks. Specifically, we theoretically demonstrate the quantitative relationship between the Hessian and the proposed low-rank approximation. The approximation ranks are then globally determined according to the marginal increment of the empirical loss estimated by the layer-specific gradient and low-rank approximation error. Furthermore, we control the model capacity by pruning less important parameters to diminish the parameter growth. We conduct extensive experiments on various benchmarks, including a dataset with large-scale tasks, and compare our method against some recent state-of-the-art methods to demonstrate the effectiveness and scalability of our proposed method. Empirical results show that our method performs better on different benchmarks, especially in achieving task order robustness and handling the forgetting issue. The source code is at https://github.com/lijiaqi/HALRP.
△ Less
Submitted 7 July, 2024; v1 submitted 25 November, 2023;
originally announced November 2023.
-
Crash-Stop Failures in Asynchronous Multiparty Session Types
Authors:
Adam D. Barwell,
Ping Hou,
Nobuko Yoshida,
Fangyi Zhou
Abstract:
Session types provide a typing discipline for message-passing systems. However, their theory often assumes an ideal world: one in which everything is reliable and without failures. Yet this is in stark contrast with distributed systems in the real world. To address this limitation, we introduce a new asynchronous multiparty session types (MPST) theory with crash-stop failures, where processes may…
▽ More
Session types provide a typing discipline for message-passing systems. However, their theory often assumes an ideal world: one in which everything is reliable and without failures. Yet this is in stark contrast with distributed systems in the real world. To address this limitation, we introduce a new asynchronous multiparty session types (MPST) theory with crash-stop failures, where processes may crash arbitrarily and cease to interact after crashing. We augment asynchronous MPST and processes with crash handling branches, and integrate crash-stop failure semantics into types and processes. Our approach requires no user-level syntax extensions for global types, and features a formalisation of global semantics, which captures complex behaviours induced by crashed/crash handling processes. Our new theory covers the entire spectrum, ranging from the ideal world of total reliability to entirely unreliable scenarios where any process may crash, using optional reliability assumptions. Under these assumptions, we demonstrate the sound and complete correspondence between global and local type semantics, which guarantee deadlock-freedom, protocol conformance, and liveness of well-typed processes by construction, even in the presence of crashes.
△ Less
Submitted 21 August, 2024; v1 submitted 20 November, 2023;
originally announced November 2023.
-
Scaling User Modeling: Large-scale Online User Representations for Ads Personalization in Meta
Authors:
Wei Zhang,
Dai Li,
Chen Liang,
Fang Zhou,
Zhongke Zhang,
Xuewei Wang,
Ru Li,
Yi Zhou,
Yaning Huang,
Dong Liang,
Kai Wang,
Zhangyuan Wang,
Zhengxing Chen,
Fenggang Wu,
Minghai Chen,
Huayu Li,
Yunnan Wu,
Zhan Shu,
Mindi Yuan,
Sri Reddy
Abstract:
Effective user representations are pivotal in personalized advertising. However, stringent constraints on training throughput, serving latency, and memory, often limit the complexity and input feature set of online ads ranking models. This challenge is magnified in extensive systems like Meta's, which encompass hundreds of models with diverse specifications, rendering the tailoring of user represe…
▽ More
Effective user representations are pivotal in personalized advertising. However, stringent constraints on training throughput, serving latency, and memory, often limit the complexity and input feature set of online ads ranking models. This challenge is magnified in extensive systems like Meta's, which encompass hundreds of models with diverse specifications, rendering the tailoring of user representation learning for each model impractical. To address these challenges, we present Scaling User Modeling (SUM), a framework widely deployed in Meta's ads ranking system, designed to facilitate efficient and scalable sharing of online user representation across hundreds of ads models. SUM leverages a few designated upstream user models to synthesize user embeddings from massive amounts of user features with advanced modeling techniques. These embeddings then serve as inputs to downstream online ads ranking models, promoting efficient representation sharing. To adapt to the dynamic nature of user features and ensure embedding freshness, we designed SUM Online Asynchronous Platform (SOAP), a latency free online serving system complemented with model freshness and embedding stabilization, which enables frequent user model updates and online inference of user embeddings upon each user request. We share our hands-on deployment experiences for the SUM framework and validate its superiority through comprehensive experiments. To date, SUM has been launched to hundreds of ads ranking models in Meta, processing hundreds of billions of user requests daily, yielding significant online metric gains and improved infrastructure efficiency.
△ Less
Submitted 22 May, 2024; v1 submitted 15 November, 2023;
originally announced November 2023.
-
Throughput Maximization in Multi-Band Optical Networks with Column Generation
Authors:
Cao Chen,
Shilin Xiao,
Fen Zhou,
Massimo Tornatore
Abstract:
Multi-band transmission is a promising technical direction for spectrum and capacity expansion of existing optical networks. Due to the increase in the number of usable wavelengths in multi-band optical networks, the complexity of resource allocation problems becomes a major concern. Moreover, the transmission performance, spectrum width, and cost constraint across optical bands may be heterogeneo…
▽ More
Multi-band transmission is a promising technical direction for spectrum and capacity expansion of existing optical networks. Due to the increase in the number of usable wavelengths in multi-band optical networks, the complexity of resource allocation problems becomes a major concern. Moreover, the transmission performance, spectrum width, and cost constraint across optical bands may be heterogeneous. Assuming a worst-case transmission margin in U, L, and C-bands, this paper investigates the problem of throughput maximization in multi-band optical networks, including the optimization of route, wavelength, and band assignment. We propose a low-complexity decomposition approach based on Column Generation (CG) to address the scalability issue faced by traditional methodologies. We numerically compare the results obtained by our CG-based approach to an integer linear programming model, confirming the near-optimal network throughput. Our results also demonstrate the scalability of the CG-based approach when the number of wavelengths increases, with the computation time in the magnitude order of 10 s for cases varying from 75 to 1200 wavelength channels per link in a 14-node network. Code of this publication is available at github.com/cchen000/CG-Multi-Band.
△ Less
Submitted 27 March, 2024; v1 submitted 13 November, 2023;
originally announced November 2023.
-
Single-Layer Digitized-Counterdiabatic Quantum Optimization for $p$-spin Models
Authors:
Huijie Guan,
Fei Zhou,
Francisco Albarrán-Arriagada,
Xi Chen,
Enrique Solano,
Narendra N. Hegade,
He-Liang Huang
Abstract:
Quantum computing holds the potential for quantum advantage in optimization problems, which requires advances in quantum algorithms and hardware specifications. Adiabatic quantum optimization is conceptually a valid solution that suffers from limited hardware coherence times. In this sense, counterdiabatic quantum protocols provide a shortcut to this process, steering the system along its ground s…
▽ More
Quantum computing holds the potential for quantum advantage in optimization problems, which requires advances in quantum algorithms and hardware specifications. Adiabatic quantum optimization is conceptually a valid solution that suffers from limited hardware coherence times. In this sense, counterdiabatic quantum protocols provide a shortcut to this process, steering the system along its ground state with fast-changing Hamiltonian. In this work, we take full advantage of a digitized-counterdiabatic quantum optimization (DCQO) algorithm to find an optimal solution of the $p$-spin model up to 4-local interactions. We choose a suitable scheduling function and initial Hamiltonian such that a single-layer quantum circuit suffices to produce a good ground-state overlap. By further optimizing parameters using variational methods, we solve with unit accuracy 2-spin, 3-spin, and 4-spin problems for $100\%$, $93\%$, and $83\%$ of instances, respectively. As a particular case of the latter, we also solve factorization problems involving 5, 9, and 12 qubits. Due to the low computational overhead, our compact approach may become a valuable tool towards quantum advantage in the NISQ era.
△ Less
Submitted 11 November, 2023;
originally announced November 2023.
-
PepLand: a large-scale pre-trained peptide representation model for a comprehensive landscape of both canonical and non-canonical amino acids
Authors:
Ruochi Zhang,
Haoran Wu,
Yuting Xiu,
Kewei Li,
Ningning Chen,
Yu Wang,
Yan Wang,
Xin Gao,
Fengfeng Zhou
Abstract:
In recent years, the scientific community has become increasingly interested on peptides with non-canonical amino acids due to their superior stability and resistance to proteolytic degradation. These peptides present promising modifications to biological, pharmacological, and physiochemical attributes in both endogenous and engineered peptides. Notwithstanding their considerable advantages, the s…
▽ More
In recent years, the scientific community has become increasingly interested on peptides with non-canonical amino acids due to their superior stability and resistance to proteolytic degradation. These peptides present promising modifications to biological, pharmacological, and physiochemical attributes in both endogenous and engineered peptides. Notwithstanding their considerable advantages, the scientific community exhibits a conspicuous absence of an effective pre-trained model adept at distilling feature representations from such complex peptide sequences. We herein propose PepLand, a novel pre-training architecture for representation and property analysis of peptides spanning both canonical and non-canonical amino acids. In essence, PepLand leverages a comprehensive multi-view heterogeneous graph neural network tailored to unveil the subtle structural representations of peptides. Empirical validations underscore PepLand's effectiveness across an array of peptide property predictions, encompassing protein-protein interactions, permeability, solubility, and synthesizability. The rigorous evaluation confirms PepLand's unparalleled capability in capturing salient synthetic peptide features, thereby laying a robust foundation for transformative advances in peptide-centric research domains. We have made all the source code utilized in this study publicly accessible via GitHub at https://github.com/zhangruochi/pepland
△ Less
Submitted 7 November, 2023;
originally announced November 2023.
-
Fast Sparse 3D Convolution Network with VDB
Authors:
Fangjun Zhou,
Anyong Mao,
Eftychios Sifakis
Abstract:
We proposed a new Convolution Neural Network implementation optimized for sparse 3D data inference. This implementation uses NanoVDB as the data structure to store the sparse tensor. It leaves a relatively small memory footprint while maintaining high performance. We demonstrate that this architecture is around 20 times faster than the state-of-the-art dense CNN model on a high-resolution 3D objec…
▽ More
We proposed a new Convolution Neural Network implementation optimized for sparse 3D data inference. This implementation uses NanoVDB as the data structure to store the sparse tensor. It leaves a relatively small memory footprint while maintaining high performance. We demonstrate that this architecture is around 20 times faster than the state-of-the-art dense CNN model on a high-resolution 3D object classification network.
△ Less
Submitted 14 November, 2023; v1 submitted 5 November, 2023;
originally announced November 2023.
-
Pore size estimation in axon-mimicking microfibres with diffusion-relaxation MRI
Authors:
Erick J. Canales-Rodríguez,
Marco Pizzolato,
Feng-Lei Zhou,
Muhamed Barakovic,
Jean-Philippe Thiran,
Derek K. Jones,
Geoffrey J. M. Parker,
Tim B. Dyrby
Abstract:
Purpose: This study aims to evaluate two distinct approaches for fibre radius estimation using diffusion-relaxation MRI data acquired in biomimetic microfibre phantoms that mimic hollow axons. The methods considered are the spherical mean power-law approach and a T2-based pore size estimation technique. Theory and Methods: A general diffusion-relaxation theoretical model for the spherical mean sig…
▽ More
Purpose: This study aims to evaluate two distinct approaches for fibre radius estimation using diffusion-relaxation MRI data acquired in biomimetic microfibre phantoms that mimic hollow axons. The methods considered are the spherical mean power-law approach and a T2-based pore size estimation technique. Theory and Methods: A general diffusion-relaxation theoretical model for the spherical mean signal from water molecules within a distribution of cylinders with varying radii was introduced, encompassing the evaluated models as particular cases. Additionally, a new numerical approach was presented for estimating effective radii (i.e., MRI-visible mean radii) from the ground truth radii distributions, not reliant on previous theoretical approximations and adaptable to various acquisition sequences. The ground truth radii were obtained from Scanning Electron Microscope images. Results: Both methods show a linear relationship between effective radii estimated from MRI data and ground-truth radii distributions, though some discrepancies were observed. The spherical mean power-law method overestimated fibre radii. Conversely, the T2-based method exhibited higher sensitivity to smaller fibre radii but faced limitations in accurately estimating the radius in one particular phantom, possibly due to material-specific relaxation changes. Conclusion: The study demonstrates the feasibility of both techniques to predict pore sizes of hollow microfibres. The T2-based technique, unlike the spherical mean power-law method, does not demand ultra-high diffusion gradients but requires calibration with known radius distributions. This research contributes to the ongoing development and evaluation of neuroimaging techniques for fibre radius estimation, highlights the advantages and limitations of both methods and provides datasets for reproducible research.
△ Less
Submitted 21 December, 2023; v1 submitted 5 November, 2023;
originally announced November 2023.
-
Optimal Treatment Allocation for Efficient Policy Evaluation in Sequential Decision Making
Authors:
Ting Li,
Chengchun Shi,
Jianing Wang,
Fan Zhou,
Hongtu Zhu
Abstract:
A/B testing is critical for modern technological companies to evaluate the effectiveness of newly developed products against standard baselines. This paper studies optimal designs that aim to maximize the amount of information obtained from online experiments to estimate treatment effects accurately. We propose three optimal allocation strategies in a dynamic setting where treatments are sequentia…
▽ More
A/B testing is critical for modern technological companies to evaluate the effectiveness of newly developed products against standard baselines. This paper studies optimal designs that aim to maximize the amount of information obtained from online experiments to estimate treatment effects accurately. We propose three optimal allocation strategies in a dynamic setting where treatments are sequentially assigned over time. These strategies are designed to minimize the variance of the treatment effect estimator when data follow a non-Markov decision process or a (time-varying) Markov decision process. We further develop estimation procedures based on existing off-policy evaluation (OPE) methods and conduct extensive experiments in various environments to demonstrate the effectiveness of the proposed methodologies. In theory, we prove the optimality of the proposed treatment allocation design and establish upper bounds for the mean squared errors of the resulting treatment effect estimators.
△ Less
Submitted 4 November, 2023;
originally announced November 2023.
-
Cooperative Network Learning for Large-Scale and Decentralized Graphs
Authors:
Qiang Wu,
Yiming Huang,
Yujie Zeng,
Yijie Teng,
Fang Zhou,
Linyuan Lü
Abstract:
Graph research, the systematic study of interconnected data points represented as graphs, plays a vital role in capturing intricate relationships within networked systems. However, in the real world, as graphs scale up, concerns about data security among different data-owning agencies arise, hindering information sharing and, ultimately, the utilization of graph data. Therefore, establishing a mut…
▽ More
Graph research, the systematic study of interconnected data points represented as graphs, plays a vital role in capturing intricate relationships within networked systems. However, in the real world, as graphs scale up, concerns about data security among different data-owning agencies arise, hindering information sharing and, ultimately, the utilization of graph data. Therefore, establishing a mutual trust mechanism among graph agencies is crucial for unlocking the full potential of graphs. Here, we introduce a Cooperative Network Learning (CNL) framework to ensure secure graph computing for various graph tasks. Essentially, this CNL framework unifies the local and global perspectives of GNN computing with distributed data for an agency by virtually connecting all participating agencies as a global graph without a fixed central coordinator. Inter-agency computing is protected by various technologies inherent in our framework, including homomorphic encryption and secure transmission. Moreover, each agency has a fair right to design or employ various graph learning models from its local or global perspective. Thus, CNL can collaboratively train GNN models based on decentralized graphs inferred from local and global graphs. Experiments on contagion dynamics prediction and traditional graph tasks (i.e., node classification and link prediction) demonstrate that our CNL architecture outperforms state-of-the-art GNNs developed at individual sites, revealing that CNL can provide a reliable, fair, secure, privacy-preserving, and global perspective to build effective and personalized models for network applications. We hope this framework will address privacy concerns in graph-related research and integrate decentralized graph data structures to benefit the network research community in cooperation and innovation.
△ Less
Submitted 7 November, 2023; v1 submitted 2 November, 2023;
originally announced November 2023.
-
Ultrawide color gamut single-pixel dynamic color manipulation based on yarn muscles-graphene MEMS
Authors:
Hongxu Li,
Bo Long,
Tao Wang,
Feng Zhou,
Zhengping Zhang
Abstract:
This work investigated the single pixel color modulation in a composite structure of yarn muscles graphene mechanical system and photonic crystal multimode microcavity. The position of graphene in the microcavity is modified by changing the yarn muscles stretching using different current levels. This helps in adjusting the light absorption of graphene to different colors. Hence, red, green, blue,…
▽ More
This work investigated the single pixel color modulation in a composite structure of yarn muscles graphene mechanical system and photonic crystal multimode microcavity. The position of graphene in the microcavity is modified by changing the yarn muscles stretching using different current levels. This helps in adjusting the light absorption of graphene to different colors. Hence, red, green, blue, and their mixed colors can be displayed using a single pixel; color gamut of this system can reach 96.5% of RGB. The proposed system can avoid the spontaneous oscillation caused by large strain energy. This solution can provide insights into the design of low power, ultrahigh resolution, and ultrawide color gamut interferometric modulator display technologies.
△ Less
Submitted 1 November, 2023;
originally announced November 2023.
-
Twin-field quantum key distribution with local frequency reference
Authors:
Jiu-Peng Chen,
Fei Zhou,
Chi Zhang,
Cong Jiang,
Fa-Xi Chen,
Jia Huang,
Hao Li,
Li-Xing You,
Xiang-Bin Wang,
Yang Liu,
Qiang Zhang,
Jian-Wei Pan
Abstract:
Twin-field quantum key distribution (TF-QKD) overcomes the linear rate-loss limit, which promises a boost of secure key rate over long distance. However, the complexity of eliminating the frequency differences between the independent laser sources hinders its practical application. Here, taking the saturated absorption spectroscopy of acetylene as an absolute reference, we propose and demonstrate…
▽ More
Twin-field quantum key distribution (TF-QKD) overcomes the linear rate-loss limit, which promises a boost of secure key rate over long distance. However, the complexity of eliminating the frequency differences between the independent laser sources hinders its practical application. Here, taking the saturated absorption spectroscopy of acetylene as an absolute reference, we propose and demonstrate a simple and practical approach to realize TF-QKD without requiring relative frequency control of the independent laser sources. Adopting the 4-intensity sending-or-not-sending TF-QKD protocol, we experimentally demonstrate the TF-QKD over 502 km, 301 km and 201 km ultra-low loss optical fiber respectively. We expect this high-performance scheme will find widespread usage in future intercity and free-space quantum communication networks.
△ Less
Submitted 27 October, 2023;
originally announced October 2023.
-
Realizing attractive interacting topological surface fermions: A resonating TI- thin film hybrid platform
Authors:
Saran Vijayan,
Fei Zhou
Abstract:
In this article, we propose a practical way to realize topological surface Dirac fermions with tunable attractive interaction between them. The approach involves coating the surface of a topological insulator with a thin film metal and utilizing the strong-electron phonon coupling in the metal to induce interaction between the surface fermions. We found that for a given TI and thin film, the attra…
▽ More
In this article, we propose a practical way to realize topological surface Dirac fermions with tunable attractive interaction between them. The approach involves coating the surface of a topological insulator with a thin film metal and utilizing the strong-electron phonon coupling in the metal to induce interaction between the surface fermions. We found that for a given TI and thin film, the attractive interaction between the surface fermions can be maximally enhanced when the Dirac point of the TI surface resonates with one of the quasi-2D quantum-well bands of the thin film. This effect can be considered to be an example of 'quantum-well resonance'. We also demonstrate that the superconductivity of the resonating surface fermions can be further enhanced by choosing a strongly interacting thin film metal or by tuning the spin-orbit coupling of the TI. This TI-thin film hybrid configuration holds promise for applications in Majorana-based quantum computations and for the study of quantum critical physics of strongly attractively interacting surface topological matter with emergent supersymmetry.
△ Less
Submitted 26 October, 2023;
originally announced October 2023.
-
Accelerate Microstructure Evolution Simulation Using Graph Neural Networks with Adaptive Spatiotemporal Resolution
Authors:
Shaoxun Fan,
Andrew L. Hitt,
Ming Tang,
Babak Sadigh,
Fei Zhou
Abstract:
Surrogate models driven by sizeable datasets and scientific machine-learning methods have emerged as an attractive microstructure simulation tool with the potential to deliver predictive microstructure evolution dynamics with huge savings in computational costs. Taking 2D and 3D grain growth simulations as an example, we present a completely overhauled computational framework based on graph neural…
▽ More
Surrogate models driven by sizeable datasets and scientific machine-learning methods have emerged as an attractive microstructure simulation tool with the potential to deliver predictive microstructure evolution dynamics with huge savings in computational costs. Taking 2D and 3D grain growth simulations as an example, we present a completely overhauled computational framework based on graph neural networks with not only excellent agreement to both the ground truth phase-field methods and theoretical predictions, but enhanced accuracy and efficiency compared to previous works based on convolutional neural networks. These improvements can be attributed to the graph representation, both improved predictive power and a more flexible data structure amenable to adaptive mesh refinement. As the simulated microstructures coarsen, our method can adaptively adopt remeshed grids and larger timesteps to achieve further speedup. The data-to-model pipeline with training procedures together with the source codes are provided.
△ Less
Submitted 19 January, 2024; v1 submitted 23 October, 2023;
originally announced October 2023.
-
Emergent symmetries and Interactions: An isolated fixed point Vs a manifold of strongly interacting fixed points
Authors:
Fei Zhou
Abstract:
In this article, we study conditions of continuous emergent symmetries in gapless states, either as topological quantum critical points (TQCPs) or a stable phase with protecting symmetries and connections to smooth deformations of the gapped states around. We illustrate that for a wide class of gapless states that can be associated with fully-isolated scale invariant fixed points, there shall alwa…
▽ More
In this article, we study conditions of continuous emergent symmetries in gapless states, either as topological quantum critical points (TQCPs) or a stable phase with protecting symmetries and connections to smooth deformations of the gapped states around. We illustrate that for a wide class of gapless states that can be associated with fully-isolated scale invariant fixed points, there shall always be emergent continuous symmetries that are directly related to smooth deformations of gapped states with symmetries lower than the protecting ones $G_p$. For a 3D TQCP in DIII classes with $G_p=Z^T_2$, $U_{EM}=U(1)$ and $N_f=\frac{1}{2}$ fermions but without charge $U(1)$ symmetry, we explicitly construct a corresponding boundary representation based on a $4D$ topological state with lattice symmetry $H=Z^T_2 \ltimes U(1)$ and $N_f={1}$ fermions. Although emergent continuous symmetries appear to be robust at weakly interacting TQCPs, we further show the breakdown of such one-to-one correspondence between deformations of gapped states and emergent continuous symmetries when gapless states become strongly interacting. In a strongly interacting limit, gapless states can be represented by a smooth manifold of conformal-field-theory fixed points rather than a fully isolated one. A smooth manifold of strong coupling fixed points hinders emergence of a continuous emergent symmetry in the strongly interacting gapless limit, as deformations no longer leave a gapless state or a TQCP invariant, unlike in the more conventional weakly interacting case. This typically reduces continuous emergent symmetries to a discrete symmetry originating from duality transformations under the protection symmetry $G_p$.
△ Less
Submitted 25 May, 2024; v1 submitted 18 October, 2023;
originally announced October 2023.
-
OpenAgents: An Open Platform for Language Agents in the Wild
Authors:
Tianbao Xie,
Fan Zhou,
Zhoujun Cheng,
Peng Shi,
Luoxuan Weng,
Yitao Liu,
Toh Jing Hua,
Junning Zhao,
Qian Liu,
Che Liu,
Leo Z. Liu,
Yiheng Xu,
Hongjin Su,
Dongchan Shin,
Caiming Xiong,
Tao Yu
Abstract:
Language agents show potential in being capable of utilizing natural language for varied and intricate tasks in diverse environments, particularly when built upon large language models (LLMs). Current language agent frameworks aim to facilitate the construction of proof-of-concept language agents while neglecting the non-expert user access to agents and paying little attention to application-level…
▽ More
Language agents show potential in being capable of utilizing natural language for varied and intricate tasks in diverse environments, particularly when built upon large language models (LLMs). Current language agent frameworks aim to facilitate the construction of proof-of-concept language agents while neglecting the non-expert user access to agents and paying little attention to application-level designs. We present OpenAgents, an open platform for using and hosting language agents in the wild of everyday life. OpenAgents includes three agents: (1) Data Agent for data analysis with Python/SQL and data tools; (2) Plugins Agent with 200+ daily API tools; (3) Web Agent for autonomous web browsing. OpenAgents enables general users to interact with agent functionalities through a web user interface optimized for swift responses and common failures while offering developers and researchers a seamless deployment experience on local setups, providing a foundation for crafting innovative language agents and facilitating real-world evaluations. We elucidate the challenges and opportunities, aspiring to set a foundation for future research and development of real-world language agents.
△ Less
Submitted 16 October, 2023;
originally announced October 2023.
-
Revisiting Logistic-softmax Likelihood in Bayesian Meta-Learning for Few-Shot Classification
Authors:
Tianjun Ke,
Haoqun Cao,
Zenan Ling,
Feng Zhou
Abstract:
Meta-learning has demonstrated promising results in few-shot classification (FSC) by learning to solve new problems using prior knowledge. Bayesian methods are effective at characterizing uncertainty in FSC, which is crucial in high-risk fields. In this context, the logistic-softmax likelihood is often employed as an alternative to the softmax likelihood in multi-class Gaussian process classificat…
▽ More
Meta-learning has demonstrated promising results in few-shot classification (FSC) by learning to solve new problems using prior knowledge. Bayesian methods are effective at characterizing uncertainty in FSC, which is crucial in high-risk fields. In this context, the logistic-softmax likelihood is often employed as an alternative to the softmax likelihood in multi-class Gaussian process classification due to its conditional conjugacy property. However, the theoretical property of logistic-softmax is not clear and previous research indicated that the inherent uncertainty of logistic-softmax leads to suboptimal performance. To mitigate these issues, we revisit and redesign the logistic-softmax likelihood, which enables control of the \textit{a priori} confidence level through a temperature parameter. Furthermore, we theoretically and empirically show that softmax can be viewed as a special case of logistic-softmax and logistic-softmax induces a larger family of data distribution than softmax. Utilizing modified logistic-softmax, we integrate the data augmentation technique into the deep kernel based Gaussian process meta-learning framework, and derive an analytical mean-field approximation for task-specific updates. Our approach yields well-calibrated uncertainty estimates and achieves comparable or superior results on standard benchmark datasets. Code is publicly available at \url{https://github.com/keanson/revisit-logistic-softmax}.
△ Less
Submitted 16 October, 2023;
originally announced October 2023.
-
A Frustratingly Easy Plug-and-Play Detection-and-Reasoning Module for Chinese Spelling Check
Authors:
Haojing Huang,
Jingheng Ye,
Qingyu Zhou,
Yinghui Li,
Yangning Li,
Feng Zhou,
Hai-Tao Zheng
Abstract:
In recent years, Chinese Spelling Check (CSC) has been greatly improved by designing task-specific pre-training methods or introducing auxiliary tasks, which mostly solve this task in an end-to-end fashion. In this paper, we propose to decompose the CSC workflow into detection, reasoning, and searching subtasks so that the rich external knowledge about the Chinese language can be leveraged more di…
▽ More
In recent years, Chinese Spelling Check (CSC) has been greatly improved by designing task-specific pre-training methods or introducing auxiliary tasks, which mostly solve this task in an end-to-end fashion. In this paper, we propose to decompose the CSC workflow into detection, reasoning, and searching subtasks so that the rich external knowledge about the Chinese language can be leveraged more directly and efficiently. Specifically, we design a plug-and-play detection-and-reasoning module that is compatible with existing SOTA non-autoregressive CSC models to further boost their performance. We find that the detection-and-reasoning module trained for one model can also benefit other models. We also study the primary interpretability provided by the task decomposition. Extensive experiments and detailed analyses demonstrate the effectiveness and competitiveness of the proposed module.
△ Less
Submitted 13 October, 2023;
originally announced October 2023.
-
RT-SRTS: Angle-Agnostic Real-Time Simultaneous 3D Reconstruction and Tumor Segmentation from Single X-Ray Projection
Authors:
Miao Zhu,
Qiming Fu,
Bo Liu,
Mengxi Zhang,
Bojian Li,
Xiaoyan Luo,
Fugen Zhou
Abstract:
Radiotherapy is one of the primary treatment methods for tumors, but the organ movement caused by respiration limits its accuracy. Recently, 3D imaging from a single X-ray projection has received extensive attention as a promising approach to address this issue. However, current methods can only reconstruct 3D images without directly locating the tumor and are only validated for fixed-angle imagin…
▽ More
Radiotherapy is one of the primary treatment methods for tumors, but the organ movement caused by respiration limits its accuracy. Recently, 3D imaging from a single X-ray projection has received extensive attention as a promising approach to address this issue. However, current methods can only reconstruct 3D images without directly locating the tumor and are only validated for fixed-angle imaging, which fails to fully meet the requirements of motion control in radiotherapy. In this study, a novel imaging method RT-SRTS is proposed which integrates 3D imaging and tumor segmentation into one network based on multi-task learning (MTL) and achieves real-time simultaneous 3D reconstruction and tumor segmentation from a single X-ray projection at any angle. Furthermore, the attention enhanced calibrator (AEC) and uncertain-region elaboration (URE) modules have been proposed to aid feature extraction and improve segmentation accuracy. The proposed method was evaluated on fifteen patient cases and compared with three state-of-the-art methods. It not only delivers superior 3D reconstruction but also demonstrates commendable tumor segmentation results. Simultaneous reconstruction and segmentation can be completed in approximately 70 ms, significantly faster than the required time threshold for real-time tumor tracking. The efficacies of both AEC and URE have also been validated in ablation studies. The code of work is available at https://github.com/ZywooSimple/RT-SRTS.
△ Less
Submitted 28 March, 2024; v1 submitted 12 October, 2023;
originally announced October 2023.
-
Lemur: Harmonizing Natural Language and Code for Language Agents
Authors:
Yiheng Xu,
Hongjin Su,
Chen Xing,
Boyu Mi,
Qian Liu,
Weijia Shi,
Binyuan Hui,
Fan Zhou,
Yitao Liu,
Tianbao Xie,
Zhoujun Cheng,
Siheng Zhao,
Lingpeng Kong,
Bailin Wang,
Caiming Xiong,
Tao Yu
Abstract:
We introduce Lemur and Lemur-Chat, openly accessible language models optimized for both natural language and coding capabilities to serve as the backbone of versatile language agents. The evolution from language chat models to functional language agents demands that models not only master human interaction, reasoning, and planning but also ensure grounding in the relevant environments. This calls…
▽ More
We introduce Lemur and Lemur-Chat, openly accessible language models optimized for both natural language and coding capabilities to serve as the backbone of versatile language agents. The evolution from language chat models to functional language agents demands that models not only master human interaction, reasoning, and planning but also ensure grounding in the relevant environments. This calls for a harmonious blend of language and coding capabilities in the models. Lemur and Lemur-Chat are proposed to address this necessity, demonstrating balanced proficiencies in both domains, unlike existing open-source models that tend to specialize in either. Through meticulous pre-training using a code-intensive corpus and instruction fine-tuning on text and code data, our models achieve state-of-the-art averaged performance across diverse text and coding benchmarks among open-source models. Comprehensive experiments demonstrate Lemur's superiority over existing open-source models and its proficiency across various agent tasks involving human communication, tool usage, and interaction under fully- and partially- observable environments. The harmonization between natural and programming languages enables Lemur-Chat to significantly narrow the gap with proprietary models on agent abilities, providing key insights into developing advanced open-source agents adept at reasoning, planning, and operating seamlessly across environments. https://github.com/OpenLemur/Lemur
△ Less
Submitted 24 August, 2024; v1 submitted 10 October, 2023;
originally announced October 2023.
-
Integration-free Training for Spatio-temporal Multimodal Covariate Deep Kernel Point Processes
Authors:
Yixuan Zhang,
Quyu Kong,
Feng Zhou
Abstract:
In this study, we propose a novel deep spatio-temporal point process model, Deep Kernel Mixture Point Processes (DKMPP), that incorporates multimodal covariate information. DKMPP is an enhanced version of Deep Mixture Point Processes (DMPP), which uses a more flexible deep kernel to model complex relationships between events and covariate data, improving the model's expressiveness. To address the…
▽ More
In this study, we propose a novel deep spatio-temporal point process model, Deep Kernel Mixture Point Processes (DKMPP), that incorporates multimodal covariate information. DKMPP is an enhanced version of Deep Mixture Point Processes (DMPP), which uses a more flexible deep kernel to model complex relationships between events and covariate data, improving the model's expressiveness. To address the intractable training procedure of DKMPP due to the non-integrable deep kernel, we utilize an integration-free method based on score matching, and further improve efficiency by adopting a scalable denoising score matching method. Our experiments demonstrate that DKMPP and its corresponding score-based estimators outperform baseline models, showcasing the advantages of incorporating covariate information, utilizing a deep kernel, and employing score-based estimators.
△ Less
Submitted 9 October, 2023;
originally announced October 2023.
-
Deep Optimal Timing Strategies for Time Series
Authors:
Chen Pan,
Fan Zhou,
Xuanwei Hu,
Xinxin Zhu,
Wenxin Ning,
Zi Zhuang,
Siqiao Xue,
James Zhang,
Yunhua Hu
Abstract:
Deciding the best future execution time is a critical task in many business activities while evolving time series forecasting, and optimal timing strategy provides such a solution, which is driven by observed data. This solution has plenty of valuable applications to reduce the operation costs. In this paper, we propose a mechanism that combines a probabilistic time series forecasting task and an…
▽ More
Deciding the best future execution time is a critical task in many business activities while evolving time series forecasting, and optimal timing strategy provides such a solution, which is driven by observed data. This solution has plenty of valuable applications to reduce the operation costs. In this paper, we propose a mechanism that combines a probabilistic time series forecasting task and an optimal timing decision task as a first systematic attempt to tackle these practical problems with both solid theoretical foundation and real-world flexibility. Specifically, it generates the future paths of the underlying time series via probabilistic forecasting algorithms, which does not need a sophisticated mathematical dynamic model relying on strong prior knowledge as most other common practices. In order to find the optimal execution time, we formulate the decision task as an optimal stopping problem, and employ a recurrent neural network structure (RNN) to approximate the optimal times. Github repository: \url{github.com/ChenPopper/optimal_timing_TSF}.
△ Less
Submitted 9 October, 2023;
originally announced October 2023.
-
Continuous Invariance Learning
Authors:
Yong Lin,
Fan Zhou,
Lu Tan,
Lintao Ma,
Jiameng Liu,
Yansu He,
Yuan Yuan,
Yu Liu,
James Zhang,
Yujiu Yang,
Hao Wang
Abstract:
Invariance learning methods aim to learn invariant features in the hope that they generalize under distributional shifts. Although many tasks are naturally characterized by continuous domains, current invariance learning techniques generally assume categorically indexed domains. For example, auto-scaling in cloud computing often needs a CPU utilization prediction model that generalizes across diff…
▽ More
Invariance learning methods aim to learn invariant features in the hope that they generalize under distributional shifts. Although many tasks are naturally characterized by continuous domains, current invariance learning techniques generally assume categorically indexed domains. For example, auto-scaling in cloud computing often needs a CPU utilization prediction model that generalizes across different times (e.g., time of a day and date of a year), where `time' is a continuous domain index. In this paper, we start by theoretically showing that existing invariance learning methods can fail for continuous domain problems. Specifically, the naive solution of splitting continuous domains into discrete ones ignores the underlying relationship among domains, and therefore potentially leads to suboptimal performance. To address this challenge, we then propose Continuous Invariance Learning (CIL), which extracts invariant features across continuously indexed domains. CIL is a novel adversarial procedure that measures and controls the conditional independence between the labels and continuous domain indices given the extracted features. Our theoretical analysis demonstrates the superiority of CIL over existing invariance learning methods. Empirical results on both synthetic and real-world datasets (including data collected from production systems) show that CIL consistently outperforms strong baselines among all the tasks.
△ Less
Submitted 22 April, 2024; v1 submitted 8 October, 2023;
originally announced October 2023.
-
Synslator: An Interactive Machine Translation Tool with Online Learning
Authors:
Jiayi Wang,
Ke Wang,
Fengming Zhou,
Chengyu Wang,
Zhiyong Fu,
Zeyu Feng,
Yu Zhao,
Yuqi Zhang
Abstract:
Interactive machine translation (IMT) has emerged as a progression of the computer-aided translation paradigm, where the machine translation system and the human translator collaborate to produce high-quality translations. This paper introduces Synslator, a user-friendly computer-aided translation (CAT) tool that not only supports IMT, but is adept at online learning with real-time translation mem…
▽ More
Interactive machine translation (IMT) has emerged as a progression of the computer-aided translation paradigm, where the machine translation system and the human translator collaborate to produce high-quality translations. This paper introduces Synslator, a user-friendly computer-aided translation (CAT) tool that not only supports IMT, but is adept at online learning with real-time translation memories. To accommodate various deployment environments for CAT services, Synslator integrates two different neural translation models to handle translation memories for online learning. Additionally, the system employs a language model to enhance the fluency of translations in an interactive mode. In evaluation, we have confirmed the effectiveness of online learning through the translation models, and have observed a 13% increase in post-editing efficiency with the interactive functionalities of Synslator. A tutorial video is available at:https://youtu.be/K0vRsb2lTt8.
△ Less
Submitted 8 October, 2023;
originally announced October 2023.
-
Collaborative electric vehicle routing with meet points
Authors:
Fangting Zhou,
Ala Arvidsson,
Jiaming Wu,
Balazs Kulcsar
Abstract:
In this paper, we develop a profit-sharing-based optimal routing mechanism to incentivize horizontal collaboration among urban goods distributors. This paper investigates a collaborative routing problem for urban logistics, in which the exchange of goods at meet points is optimally planned en route. We show that collaboration does not only reduce the total cost but also increases the profit of eac…
▽ More
In this paper, we develop a profit-sharing-based optimal routing mechanism to incentivize horizontal collaboration among urban goods distributors. This paper investigates a collaborative routing problem for urban logistics, in which the exchange of goods at meet points is optimally planned en route. We show that collaboration does not only reduce the total cost but also increases the profit of each company by sharing some customers and the related profit. Hence, we focus on solving a collaborative electric vehicle routing problem under constraints such as customer-specific time windows, opportunity charging, vehicle capacity, and meet-point synchronization. The proposed Collaborative Electric Vehicle Routing Problem with Meet Point (CoEVRPMP) is modeled as a mixed-integer nonlinear programming problem. We first present an exact method for optimal benchmarks via decomposition. To handle real-world problems, we suggest using a metaheuristic method: adaptive large neighborhood search with linear programming. The viability and scalability of the collaborative method are demonstrated via numerical case studies: (i) a real-world case of two grocery stores in the city of Gothenburg, Sweden, and (ii) a large-scale experiment with 500 customers. The results underline the importance of horizontal collaboration among delivery companies. Collaboration helps to reduce the environmental footprint (total energy consumed) and to increase the individual company's profit at the same time.
△ Less
Submitted 4 October, 2023;
originally announced October 2023.
-
Score dynamics: scaling molecular dynamics with picoseconds timestep via conditional diffusion model
Authors:
Tim Hsu,
Babak Sadigh,
Vasily Bulatov,
Fei Zhou
Abstract:
We propose score dynamics (SD), a general framework for learning accelerated evolution operators with large timesteps from molecular-dynamics simulations. SD is centered around scores, or derivatives of the transition log-probability with respect to the dynamical degrees of freedom. The latter play the same role as force fields in MD but are used in denoising diffusion probability models to genera…
▽ More
We propose score dynamics (SD), a general framework for learning accelerated evolution operators with large timesteps from molecular-dynamics simulations. SD is centered around scores, or derivatives of the transition log-probability with respect to the dynamical degrees of freedom. The latter play the same role as force fields in MD but are used in denoising diffusion probability models to generate discrete transitions of the dynamical variables in an SD timestep, which can be orders of magnitude larger than a typical MD timestep. In this work, we construct graph neural network based score dynamics models of realistic molecular systems that are evolved with 10~ps timesteps. We demonstrate the efficacy of score dynamics with case studies of alanine dipeptide and short alkanes in aqueous solution. Both equilibrium predictions derived from the stationary distributions of the conditional probability and kinetic predictions for the transition rates and transition paths are in good agreement with MD. Our current SD implementation is about two orders of magnitude faster than the MD counterpart for the systems studied in this work. Open challenges and possible future remedies to improve score dynamics are also discussed.
△ Less
Submitted 6 March, 2024; v1 submitted 2 October, 2023;
originally announced October 2023.
-
LawBench: Benchmarking Legal Knowledge of Large Language Models
Authors:
Zhiwei Fei,
Xiaoyu Shen,
Dawei Zhu,
Fengzhe Zhou,
Zhuo Han,
Songyang Zhang,
Kai Chen,
Zongwen Shen,
Jidong Ge
Abstract:
Large language models (LLMs) have demonstrated strong capabilities in various aspects. However, when applying them to the highly specialized, safe-critical legal domain, it is unclear how much legal knowledge they possess and whether they can reliably perform legal-related tasks. To address this gap, we propose a comprehensive evaluation benchmark LawBench. LawBench has been meticulously crafted t…
▽ More
Large language models (LLMs) have demonstrated strong capabilities in various aspects. However, when applying them to the highly specialized, safe-critical legal domain, it is unclear how much legal knowledge they possess and whether they can reliably perform legal-related tasks. To address this gap, we propose a comprehensive evaluation benchmark LawBench. LawBench has been meticulously crafted to have precise assessment of the LLMs' legal capabilities from three cognitive levels: (1) Legal knowledge memorization: whether LLMs can memorize needed legal concepts, articles and facts; (2) Legal knowledge understanding: whether LLMs can comprehend entities, events and relationships within legal text; (3) Legal knowledge applying: whether LLMs can properly utilize their legal knowledge and make necessary reasoning steps to solve realistic legal tasks. LawBench contains 20 diverse tasks covering 5 task types: single-label classification (SLC), multi-label classification (MLC), regression, extraction and generation. We perform extensive evaluations of 51 LLMs on LawBench, including 20 multilingual LLMs, 22 Chinese-oriented LLMs and 9 legal specific LLMs. The results show that GPT-4 remains the best-performing LLM in the legal domain, surpassing the others by a significant margin. While fine-tuning LLMs on legal specific text brings certain improvements, we are still a long way from obtaining usable and reliable LLMs in legal tasks. All data, model predictions and evaluation code are released in https://github.com/open-compass/LawBench/. We hope this benchmark provides in-depth understanding of the LLMs' domain-specified capabilities and speed up the development of LLMs in the legal domain.
△ Less
Submitted 28 September, 2023;
originally announced September 2023.
-
Experimental Limits on Solar Reflected Dark Matter with a New Approach on Accelerated-Dark-Matter-Electron Analysis in Semiconductors
Authors:
Z. Y. Zhang,
L. T. Yang,
Q. Yue,
K. J. Kang,
Y. J. Li,
H. P. An,
Greeshma C.,
J. P. Chang,
Y. H. Chen,
J. P. Cheng,
W. H. Dai,
Z. Deng,
C. H. Fang,
X. P. Geng,
H. Gong,
Q. J. Guo,
T. Guo,
X. Y. Guo,
L. He,
S. M. He,
J. W. Hu,
H. X. Huang,
T. C. Huang,
L. Jiang,
S. Karmakar
, et al. (59 additional authors not shown)
Abstract:
Recently a dark matter-electron (DM-electron) paradigm has drawn much attention. Models beyond the standard halo model describing DM accelerated by high energy celestial bodies are under intense examination as well. In this Letter, a velocity components analysis (VCA) method dedicated to swift analysis of accelerated DM-electron interactions via semiconductor detectors is proposed and the first HP…
▽ More
Recently a dark matter-electron (DM-electron) paradigm has drawn much attention. Models beyond the standard halo model describing DM accelerated by high energy celestial bodies are under intense examination as well. In this Letter, a velocity components analysis (VCA) method dedicated to swift analysis of accelerated DM-electron interactions via semiconductor detectors is proposed and the first HPGe detector-based accelerated DM-electron analysis is realized. Utilizing the method, the first germanium based constraint on sub-GeV solar reflected DM-electron interaction is presented with the 205.4 kg$\cdot$day dataset from the CDEX-10 experiment. In the heavy mediator scenario, our result excels in the mass range of 5$-$15 keV/$c^2$, achieving a 3 orders of magnitude improvement comparing with previous semiconductor experiments. In the light mediator scenario, the strongest laboratory constraint for DM lighter than 0.1 MeV/$c^2$ is presented. The result proves the feasibility and demonstrates the vast potential of the VCA technique in future accelerated DM-electron analyses with semiconductor detectors.
△ Less
Submitted 24 April, 2024; v1 submitted 26 September, 2023;
originally announced September 2023.
-
Learning dislocation dynamics mobility laws from large-scale MD simulations
Authors:
Nicolas Bertin,
Vasily V. Bulatov,
Fei Zhou
Abstract:
The computational method of discrete dislocation dynamics (DDD), used as a coarse-grained model of true atomistic dynamics of lattice dislocations, has become of powerful tool to study metal plasticity arising from the collective behavior of dislocations. As a mesoscale approach, motion of dislocations in the DDD model is prescribed via the mobility law; a function which specifies how dislocation…
▽ More
The computational method of discrete dislocation dynamics (DDD), used as a coarse-grained model of true atomistic dynamics of lattice dislocations, has become of powerful tool to study metal plasticity arising from the collective behavior of dislocations. As a mesoscale approach, motion of dislocations in the DDD model is prescribed via the mobility law; a function which specifies how dislocation lines should respond to the driving force. However, the development of traditional hand-crafted mobility laws can be a cumbersome task and may involve detrimental simplifications. Here we introduce a machine-learning (ML) framework to streamline the development of data-driven mobility laws which are modeled as graph neural networks (GNN) trained on large-scale Molecular Dynamics (MD) simulations of crystal plasticity. We illustrate our approach on BCC tungsten and demonstrate that our GNN mobility implemented in large-scale DDD simulations accurately reproduces the challenging tension/compression asymmetry observed in ground-truth MD simulations while correctly predicting the flow stress at lower straining rate conditions unseen during training, thereby demonstrating the ability of our method to learn relevant dislocation physics. Our DDD+ML approach opens new promising avenues to improve fidelity of the DDD model and to incorporate more complex dislocation motion behaviors in an automated way, providing a faithful proxy for dislocation dynamics several orders of magnitude faster than ground-truth MD simulations.
△ Less
Submitted 25 September, 2023;
originally announced September 2023.
-
Cross-Modal Translation and Alignment for Survival Analysis
Authors:
Fengtao Zhou,
Hao Chen
Abstract:
With the rapid advances in high-throughput sequencing technologies, the focus of survival analysis has shifted from examining clinical indicators to incorporating genomic profiles with pathological images. However, existing methods either directly adopt a straightforward fusion of pathological features and genomic profiles for survival prediction, or take genomic profiles as guidance to integrate…
▽ More
With the rapid advances in high-throughput sequencing technologies, the focus of survival analysis has shifted from examining clinical indicators to incorporating genomic profiles with pathological images. However, existing methods either directly adopt a straightforward fusion of pathological features and genomic profiles for survival prediction, or take genomic profiles as guidance to integrate the features of pathological images. The former would overlook intrinsic cross-modal correlations. The latter would discard pathological information irrelevant to gene expression. To address these issues, we present a Cross-Modal Translation and Alignment (CMTA) framework to explore the intrinsic cross-modal correlations and transfer potential complementary information. Specifically, we construct two parallel encoder-decoder structures for multi-modal data to integrate intra-modal information and generate cross-modal representation. Taking the generated cross-modal representation to enhance and recalibrate intra-modal representation can significantly improve its discrimination for comprehensive survival analysis. To explore the intrinsic crossmodal correlations, we further design a cross-modal attention module as the information bridge between different modalities to perform cross-modal interactions and transfer complementary information. Our extensive experiments on five public TCGA datasets demonstrate that our proposed framework outperforms the state-of-the-art methods.
△ Less
Submitted 22 September, 2023;
originally announced September 2023.
-
Generator coordinate method for nuclear octupole excitations: status and perspectives
Authors:
E. F. Zhou,
J. M. Yao
Abstract:
Strong octupole correlations have been observed in the low-lying states of atomic nuclei across various mass regions. In this review, we provide an overview of Beyond Mean-Field (BMF) studies of nuclear octupole collective motions with Generator Coordinate Method (GCM) in combination with quantum-number projections that are implemented to restore the broken symmetries in nuclear mean-field states.…
▽ More
Strong octupole correlations have been observed in the low-lying states of atomic nuclei across various mass regions. In this review, we provide an overview of Beyond Mean-Field (BMF) studies of nuclear octupole collective motions with Generator Coordinate Method (GCM) in combination with quantum-number projections that are implemented to restore the broken symmetries in nuclear mean-field states. We highlight recent developments within this framework and their applications to excitation spectra and electromagnetic transition rates in octupole-shaped nuclei and hypernuclei. We discuss the novel phenomena of nucleon clustering in light nuclei. Additionally, we explore the phase transition from octupole vibrations to rotational motions as spin increases in heavy nuclei. Lastly, we examine the status and future prospects of studies on octupole deformation effects in nuclear Schiff moments. These studies, along with the upper limits of atomic Electric Dipole Moment (EDM), impose stringent constraints on beyond-standard-model time-reversal-violating nucleon-nucleon interactions.
△ Less
Submitted 12 October, 2023; v1 submitted 18 September, 2023;
originally announced September 2023.
-
RRCNN$^{+}$: An Enhanced Residual Recursive Convolutional Neural Network for Non-stationary Signal Decomposition
Authors:
Feng Zhou,
Antonio Cicone,
Haomin Zhou
Abstract:
Time-frequency analysis is an important and challenging task in many applications. Fourier and wavelet analysis are two classic methods that have achieved remarkable success in many fields. They also exhibit limitations when applied to nonlinear and non-stationary signals. To address this challenge, a series of nonlinear and adaptive methods, pioneered by the empirical mode decomposition method ha…
▽ More
Time-frequency analysis is an important and challenging task in many applications. Fourier and wavelet analysis are two classic methods that have achieved remarkable success in many fields. They also exhibit limitations when applied to nonlinear and non-stationary signals. To address this challenge, a series of nonlinear and adaptive methods, pioneered by the empirical mode decomposition method have been proposed. Their aim is to decompose a non-stationary signal into quasi-stationary components which reveal better features in the time-frequency analysis. Recently, inspired by deep learning, we proposed a novel method called residual recursive convolutional neural network (RRCNN). Not only RRCNN can achieve more stable decomposition than existing methods while batch processing large-scale signals with low computational cost, but also deep learning provides a unique perspective for non-stationary signal decomposition. In this study, we aim to further improve RRCNN with the help of several nimble techniques from deep learning and optimization to ameliorate the method and overcome some of the limitations of this technique.
△ Less
Submitted 9 September, 2023;
originally announced September 2023.
-
From Text to Mask: Localizing Entities Using the Attention of Text-to-Image Diffusion Models
Authors:
Changming Xiao,
Qi Yang,
Feng Zhou,
Changshui Zhang
Abstract:
Diffusion models have revolted the field of text-to-image generation recently. The unique way of fusing text and image information contributes to their remarkable capability of generating highly text-related images. From another perspective, these generative models imply clues about the precise correlation between words and pixels. In this work, a simple but effective method is proposed to utilize…
▽ More
Diffusion models have revolted the field of text-to-image generation recently. The unique way of fusing text and image information contributes to their remarkable capability of generating highly text-related images. From another perspective, these generative models imply clues about the precise correlation between words and pixels. In this work, a simple but effective method is proposed to utilize the attention mechanism in the denoising network of text-to-image diffusion models. Without re-training nor inference-time optimization, the semantic grounding of phrases can be attained directly. We evaluate our method on Pascal VOC 2012 and Microsoft COCO 2014 under weakly-supervised semantic segmentation setting and our method achieves superior performance to prior methods. In addition, the acquired word-pixel correlation is found to be generalizable for the learned text embedding of customized generation methods, requiring only a few modifications. To validate our discovery, we introduce a new practical task called "personalized referring image segmentation" with a new dataset. Experiments in various situations demonstrate the advantages of our method compared to strong baselines on this task. In summary, our work reveals a novel way to extract the rich multi-modal knowledge hidden in diffusion models for segmentation.
△ Less
Submitted 8 September, 2023;
originally announced September 2023.
-
Resource Management for IRS-assisted WP-MEC Networks with Practical Phase Shift Model
Authors:
Nana Li,
Wanming Hao,
Fuhui Zhou,
Zheng Chu,
Shouyi Yang,
Pei Xiao
Abstract:
Wireless powered mobile edge computing (WP-MEC) has been recognized as a promising solution to enhance the computational capability and sustainable energy supply for low-power wireless devices (WDs). However, when the communication links between the hybrid access point (HAP) and WDs are hostile, the energy transfer efficiency and task offloading rate are compromised. To tackle this problem, we pro…
▽ More
Wireless powered mobile edge computing (WP-MEC) has been recognized as a promising solution to enhance the computational capability and sustainable energy supply for low-power wireless devices (WDs). However, when the communication links between the hybrid access point (HAP) and WDs are hostile, the energy transfer efficiency and task offloading rate are compromised. To tackle this problem, we propose to employ multiple intelligent reflecting surfaces (IRSs) to WP-MEC networks. Based on the practical IRS phase shift model, we formulate a total computation rate maximization problem by jointly optimizing downlink/uplink IRSs passive beamforming, downlink energy beamforming and uplink multi-user detection (MUD) vector at HAPs, task offloading power and local computing frequency of WDs, and the time slot allocation. Specifically, we first derive the optimal time allocation for downlink wireless energy transmission (WET) to IRSs and the corresponding energy beamforming. Next, with fixed time allocation for the downlink WET to WDs, the original optimization problem can be divided into two independent subproblems. For the WD charging subproblem, the optimal IRSs passive beamforming is derived by utilizing the successive convex approximation (SCA) method and the penalty-based optimization technique, and for the offloading computing subproblem, we propose a joint optimization framework based on the fractional programming (FP) method. Finally, simulation results validate that our proposed optimization method based on the practical phase shift model can achieve a higher total computation rate compared to the baseline schemes.
△ Less
Submitted 7 September, 2023;
originally announced September 2023.