Zum Hauptinhalt springen

Showing 1–50 of 181 results for author: Zhai, J

.
  1. arXiv:2408.11313  [pdf, other

    cs.AI

    Unlocking Adversarial Suffix Optimization Without Affirmative Phrases: Efficient Black-box Jailbreaking via LLM as Optimizer

    Authors: Weipeng Jiang, Zhenting Wang, Juan Zhai, Shiqing Ma, Zhengyu Zhao, Chao Shen

    Abstract: Despite prior safety alignment efforts, mainstream LLMs can still generate harmful and unethical content when subjected to jailbreaking attacks. Existing jailbreaking methods fall into two main categories: template-based and optimization-based methods. The former requires significant manual effort and domain knowledge, while the latter, exemplified by Greedy Coordinate Gradient (GCG), which seeks… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

  2. arXiv:2408.01305  [pdf, ps, other

    math.PR

    Ergodicity of Stochastic two-phase Stefan problem driven by pure jump Lévy noise

    Authors: Xiaotian Ge, Shijie Shang, Jianliang Zhai, Tusheng Zhang

    Abstract: In this paper, we consider stochastic two-phase Stefan problem driven by general jump Lévy noise. We first obtain the existence and uniqueness of the strong solution and then establish the ergodicity of the stochastic Stefan problem. Moreover, we give a precise characterization of the support of the invariant measures which provides the regularities of the stationary solutions of the stochastic fr… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

  3. arXiv:2407.15462  [pdf, other

    cs.IR cs.DB cs.DS cs.LG

    Efficient Retrieval with Learned Similarities

    Authors: Bailu Ding, Jiaqi Zhai

    Abstract: Retrieval plays a fundamental role in recommendation systems, search, and natural language processing by efficiently finding relevant items from a large corpus given a query. Dot products have been widely used as the similarity function in such retrieval tasks, thanks to Maximum Inner Product Search (MIPS) that enabled efficient retrieval based on dot products. However, state-of-the-art retrieval… ▽ More

    Submitted 13 August, 2024; v1 submitted 22 July, 2024; originally announced July 2024.

  4. arXiv:2407.14118  [pdf, other

    cs.SE

    Beyond Code Generation: Assessing Code LLM Maturity with Postconditions

    Authors: Fusen He, Juan Zhai, Minxue Pan

    Abstract: Most existing code Large Language Model (LLM) benchmarks, e.g., EvalPlus, focus on the code generation tasks. Namely, they contain a natural language description of a problem and ask the LLM to write code to solve the problem. We argue that they do not capture all capabilities needed to assess the quality of a code LLM. In this paper, we propose a code LLM maturity model, based on the postconditio… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

  5. arXiv:2407.12319  [pdf, other

    cs.CV

    Serialized Point Mamba: A Serialized Point Cloud Mamba Segmentation Model

    Authors: Tao Wang, Wei Wen, Jingzhi Zhai, Kang Xu, Haoming Luo

    Abstract: Point cloud segmentation is crucial for robotic visual perception and environmental understanding, enabling applications such as robotic navigation and 3D reconstruction. However, handling the sparse and unordered nature of point cloud data presents challenges for efficient and accurate segmentation. Inspired by the Mamba model's success in natural language processing, we propose the Serialized Po… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

  6. arXiv:2407.02805  [pdf, other

    cs.SE cs.AI

    Efficient DNN-Powered Software with Fair Sparse Models

    Authors: Xuanqi Gao, Weipeng Jiang, Juan Zhai, Shiqing Ma, Xiaoyu Zhang, Chao Shen

    Abstract: With the emergence of the Software 3.0 era, there is a growing trend of compressing and integrating large models into software systems, with significant societal implications. Regrettably, in numerous instances, model compression techniques impact the fairness performance of these models and thus the ethical behavior of DNN-powered software. One of the most notable example is the Lottery Ticket Hy… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  7. arXiv:2407.00560  [pdf, other

    q-bio.BM math.OC

    DCI: An Accurate Quality Assessment Criteria for Protein Complex Structure Models

    Authors: Wenda Wang, Jiaqi Zhai, He Huang, Xinqi Gong

    Abstract: The structure of proteins is the basis for studying protein function and drug design. The emergence of AlphaFold 2 has greatly promoted the prediction of protein 3D structures, and it is of great significance to give an overall and accurate evaluation of the predicted models, especially the complex models. Among the existing methods for evaluating multimer structures, DockQ is the most commonly us… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  8. arXiv:2406.16262  [pdf, ps, other

    math.PR

    Large deviations for 2D Stochastic Chemotaxis-Navier-Stokes System

    Authors: Yunfeng Chen, Xuhui Peng, Jianliang Zhai

    Abstract: In this paper, we establish a large deviation principle for 2D stochastic Chemotaxis-Navier-Stokes equation perturbed by a small multiplicative noise. The main difficulties come from the lack of a suitable compact embedding into the space occupied by the solutions and the inherent complexity of equation. Finite dimensional projection arguments and introducing suitable stopping times play important… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  9. SmartAxe: Detecting Cross-Chain Vulnerabilities in Bridge Smart Contracts via Fine-Grained Static Analysis

    Authors: Zeqin Liao, Yuhong Nan, Henglong Liang, Sicheng Hao, Juan Zhai, Jiajing Wu, Zibin Zheng

    Abstract: With the increasing popularity of blockchain, different blockchain platforms coexist in the ecosystem (e.g., Ethereum, BNB, EOSIO, etc.), which prompts the high demand for cross-chain communication. Cross-chain bridge is a specific type of decentralized application for asset exchange across different blockchain platforms. Securing the smart contracts of cross-chain bridges is in urgent need, as th… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

    Journal ref: The ACM International Conference on the Foundations of Software Engineering 2024

  10. arXiv:2406.12196  [pdf, other

    cs.SE

    CITADEL: Context Similarity Based Deep Learning Framework Bug Finding

    Authors: Xiaoyu Zhang, Juan Zhai, Shiqing Ma, Shiwei Wang, Chao Shen

    Abstract: With deep learning (DL) technology becoming an integral part of the new intelligent software, tools of DL framework testing and bug-finding are in high demand. Existing DL framework testing tools have limited coverage on bug types. For example, they lack the capability of finding performance bugs, which are critical for DL model training and inference regarding performance, economics, and the envi… ▽ More

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: 12 pages, 10 figures

  11. Optimal Kernel Orchestration for Tensor Programs with Korch

    Authors: Muyan Hu, Ashwin Venkatram, Shreyashri Biswas, Balamurugan Marimuthu, Bohan Hou, Gabriele Oliaro, Haojie Wang, Liyan Zheng, Xupeng Miao, Jidong Zhai

    Abstract: Kernel orchestration is the task of mapping the computation defined in different operators of a deep neural network (DNN) to the execution of GPU kernels on modern hardware platforms. Prior approaches optimize kernel orchestration by greedily applying operator fusion, which fuses the computation of multiple operators into a single kernel, and miss a variety of optimization opportunities in kernel… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

    Comments: Fix some typos in the ASPLOS version

    Journal ref: Proceedings of the 29th ACM International Conference on Architectural Support for Programming Languages and Operating Systems 3 (2024) 755-769

  12. arXiv:2406.05715  [pdf

    physics.acc-ph hep-ex

    Error analysis of vertical test for CEPC 650 MHz superconducting radio-frequency cavity

    Authors: Lingxi Ye, Peng Sha, Zhenghui Mi, Feisi He, Jiyuan Zhai

    Abstract: Hundreds of 650 MHz superconducting radio-frequency (SRF) cavities with high intrinsic quality factor (Q0) and accelerating gradient (Eacc) will be adopted for Circular Electron Positron Collider (CEPC). The values of Q0 and Eacc are obtained during vertical test at 2.0 K. Hence, high accuracy of vertical test is essential for evaluating the performance of SRF cavity. The 650 MHz SRF cavities achi… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  13. arXiv:2406.00699  [pdf, other

    cs.CV

    Towards General Robustness Verification of MaxPool-based Convolutional Neural Networks via Tightening Linear Approximation

    Authors: Yuan Xiao, Shiqing Ma, Juan Zhai, Chunrong Fang, Jinyuan Jia, Zhenyu Chen

    Abstract: The robustness of convolutional neural networks (CNNs) is vital to modern AI-driven systems. It can be quantified by formal verification by providing a certified lower bound, within which any perturbation does not alter the original input's classification result. It is challenging due to nonlinear components, such as MaxPool. At present, many verification methods are sound but risk losing some pre… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: Accepted to CVPR2024. Project page: https://github.com/xiaoyuanpigo/maxlin

  14. arXiv:2406.00602  [pdf, other

    cs.SE cs.PL

    From Effectiveness to Efficiency: Comparative Evaluation of Code Generated by LCGMs for Bilingual Programming Questions

    Authors: Weipeng Jiang, Xuanqi Gao, Juan Zhai, Shiqing Ma, Xiaoyu Zhang, Chao Shen

    Abstract: Large Code Generation Models (LCGMs) have garnered significant attention and achieved promising results across various programming tasks. However, concerns arise regarding performance when using non-English prompts, as these models are primarily trained on English-centric corpora, and most programming language tokens resemble English. Existing benchmarks often rely on English programming questions… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: 10 and a quarter pages, 6 figures

  15. High-field magnetoelectric coupling and successive magnetic transitions in Mn-doped polar antiferromagnet Ni3TeO6

    Authors: J. H. Zhang, L. Lin, C. Dong, Y. T. Chang, J. F. Wang, C. L. Lu, P. Z. Chen, W. J. Zhai, G. Z. Zhou, L. Huang, Y. S. Tang, S. H. Zheng, M. F. Liu, X. H. Zhou, Z. B. Yan, J. -M. Liu

    Abstract: Among the 3d transition metal ions doped polar Ni3TeO6, Mn-doped Ni3TeO6 has stimulated great interest due to its high magnetic ordering temperature and complex magnetic phases, but the mechanism of magnetoelectric (ME) coupling is far from understood. Herein we report our systematic investigation of the chemical control of magnetism, metamagnetic transition, and ME properties of Ni3-xMnxTeO6 sing… ▽ More

    Submitted 29 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: 30 pages with 8 figures

    Journal ref: Phys. Rev. B 109, 184112 (2024)

  16. arXiv:2405.00414  [pdf, ps, other

    math.PR

    Ergodicity for 2D Navier-Stokes equations with a degenerate pure jump noise

    Authors: Xuhui Peng, Jianliang Zhai, Tusheng Zhang

    Abstract: In this paper, we establish the ergodicity for stochastic 2D Navier-Stokes equations driven by a highly degenerate pure jump Lévy noise. The noise could appear in as few as four directions. This gives an affirmative anwser to a longstanding problem. The case of Gaussian noise was treated in Hairer and Mattingly [\emph{Ann. of Math.}, 164(3):993--1032, 2006]. To obtain the uniqueness of invariant m… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  17. arXiv:2404.04242  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Physical Property Understanding from Language-Embedded Feature Fields

    Authors: Albert J. Zhai, Yuan Shen, Emily Y. Chen, Gloria X. Wang, Xinlei Wang, Sheng Wang, Kaiyu Guan, Shenlong Wang

    Abstract: Can computers perceive the physical properties of objects solely through vision? Research in cognitive science and vision science has shown that humans excel at identifying materials and estimating their physical properties based purely on visual appearance. In this paper, we present a novel approach for dense prediction of the physical properties of objects using a collection of images. Inspired… ▽ More

    Submitted 5 April, 2024; originally announced April 2024.

    Comments: CVPR 2024. Project page (with code): https://ajzhai.github.io/NeRF2Physics/

  18. arXiv:2403.11421  [pdf, other

    cs.DC cs.PF

    FastDecode: High-Throughput GPU-Efficient LLM Serving using Heterogeneous Pipelines

    Authors: Jiaao He, Jidong Zhai

    Abstract: Cost of serving large language models (LLM) is high, but the expensive and scarce GPUs are poorly efficient when generating tokens sequentially, unless the batch of sequences is enlarged. However, the batch size is limited by some constantly reused intermediate results, namely KV-Cache. They occupy too much memory to fit more sequences into a GPU simultaneously. While they could be offloaded to ho… ▽ More

    Submitted 17 March, 2024; originally announced March 2024.

    Comments: 15 pages, 15 figures

    ACM Class: C.4

  19. arXiv:2403.01388  [pdf, ps, other

    math.PR

    Wong-Zakai approximations and support theorems for SDEs under Lyapunov conditions

    Authors: Qi Li, Jianliang Zhai, Tusheng Zhang

    Abstract: In this paper, we establish the Stroock-Varadhan type support theorems for stochastic differential equations (SDEs) under Lyapunov conditions, which significantly improve the existing results in the literature where the coefficients of the SDEs are required to be globally Lipschitz and of linear growth. Our conditions are very mild to include many important models, e.g. Threshold Ornstein-Ulenbeck… ▽ More

    Submitted 2 March, 2024; originally announced March 2024.

  20. arXiv:2402.19274  [pdf, other

    cond-mat.mtrl-sci

    Mixed-halide perovskite alloys $\text{CsPb}(\text{I}_{1-x}^{}\text{Br}_x^{})_3^{}$ and $\text{CsPb}(\text{Br}_{1-x}^{}\text{Cl}_x^{})_3^{}$: New insight of configuration entropy effect from first principles and phase diagrams

    Authors: Fang Pan, Junni Zhai, Jinyu Chen, Lin Yang, Hua Dong, Fang Yuan, Zhuangde Jiang, Wei Ren, Zuo-Guang Ye, Guo-Xu Zhang, Jingrui Li

    Abstract: Stability is one of the key issues in mixed-halide perovskite alloys which are promising in emergent optoelectronics. Previous density-functional-theory (DFT) and machine learning studies indicate that the formation-energy convex hulls of these materials are very shallow, and stable alloy compositions are rare. In this work, we revisit this problem using DFT with special focus on the effects of co… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  21. arXiv:2402.17152  [pdf, other

    cs.LG cs.IR

    Actions Speak Louder than Words: Trillion-Parameter Sequential Transducers for Generative Recommendations

    Authors: Jiaqi Zhai, Lucy Liao, Xing Liu, Yueming Wang, Rui Li, Xuan Cao, Leon Gao, Zhaojie Gong, Fangda Gu, Michael He, Yinghai Lu, Yu Shi

    Abstract: Large-scale recommendation systems are characterized by their reliance on high cardinality, heterogeneous features and the need to handle tens of billions of user actions on a daily basis. Despite being trained on huge volume of data with thousands of features, most Deep Learning Recommendation Models (DLRMs) in industry fail to scale with compute. Inspired by success achieved by Transformers in… ▽ More

    Submitted 5 May, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: 26 pages, 13 figures. ICML'24. Code available at https://github.com/facebookresearch/generative-recommenders

  22. arXiv:2402.16522  [pdf, other

    math.PR

    Uniform large deviations and metastability of random dynamical systems

    Authors: Jifa Jiang, Jian Wang, Jianliang Zhai, Tusheng Zhang

    Abstract: In this paper, we first provide a criterion on uniform large deviation principles (ULDP) of stochastic differential equations under Lyapunov conditions on the coefficients, which can be applied to stochastic systems with coefficients of polynomial growth and possible degenerate driving noises. In the second part, using the ULDP criterion we preclude the concentration of limiting measures of invari… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    MSC Class: 60B10; 60F10; 60H10; 37A50; 37C70

  23. arXiv:2402.12640  [pdf, ps, other

    math.DG

    Invertibility of local geodesic transverse and mixed ray transforms II: higher order tensors

    Authors: Gunther Uhlmann, Jian Zhai

    Abstract: Consider a compact Riemannian manifold in dimension $n$ with strictly convex boundary. We show the local invertibility near a boundary point of the transverse ray transform of $2$ tensors for $n\geq 3$ and the mixed ray transform of $2+2$ tensors for $n=3$. When the manifold admits a strictly convex function, this local invertibility result leads to global invertibility.

    Submitted 19 February, 2024; originally announced February 2024.

  24. arXiv:2402.11283  [pdf, other

    math.NA stat.ML

    Deep adaptive sampling for surrogate modeling without labeled data

    Authors: Xili Wang, Kejun Tang, Jiayu Zhai, Xiaoliang Wan, Chao Yang

    Abstract: Surrogate modeling is of great practical significance for parametric differential equation systems. In contrast to classical numerical methods, using physics-informed deep learning methods to construct simulators for such systems is a promising direction due to its potential to handle high dimensionality, which requires minimizing a loss over a training set of random samples. However, the random s… ▽ More

    Submitted 17 February, 2024; originally announced February 2024.

  25. arXiv:2402.03791  [pdf, other

    cs.DC

    ZeroPP: Unleashing Exceptional Parallelism Efficiency through Tensor-Parallelism-Free Methodology

    Authors: Ding Tang, Lijuan Jiang, Jiecheng Zhou, Minxi Jin, Hengjie Li, Xingcheng Zhang, Zhilin Pei, Jidong Zhai

    Abstract: Large-scale models rely heavily on 3D parallelism for distributed training, which utilizes tensor parallelism (TP) as the intra-operator parallelism to partition model states across GPUs. However, TP introduces significant communication overheads and complexity in modifying single-GPU code. In this paper, we propose a TP-free distributed framework ZeroPP, which leverages the hybrid of scalable int… ▽ More

    Submitted 24 May, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  26. arXiv:2401.11385  [pdf, ps, other

    math.PR

    Large deviations for locally monotone stochastic partial differential equations driven by Lévy noise

    Authors: Weina Wu, Jianliang Zhai, Jiahui Zhu

    Abstract: We establish a Freidlin-Wentzell type large deviation principle (LDP) for a class of stochastic partial differential equations with locally monotone coefficients driven by Lévy noise. Our results essentially improve a recent work on this topic (Bernoulli, 2018) by the second named author of this paper and his collaborator, because we drop the compactness embedding assumptions, and we also make the… ▽ More

    Submitted 20 January, 2024; originally announced January 2024.

    Comments: 23 pages

  27. arXiv:2401.09017  [pdf, ps, other

    math.DG

    Invertibility of local geodesic transverse and mixed ray transforms I: basic cases

    Authors: Gunther Uhlmann, Jian Zhai

    Abstract: Consider a compact Riemannian manifold in dimension $n\geq 3$ with strictly convex boundary. We show that the transverse ray transform of $1$ tensors and the mixed ray transform of $1+1$ tensors are invertible, up to natural obstructions, near a boundary point. When the manifold admits a strictly convex function, this local invertibility result leads to a global result by a layer stripping argumen… ▽ More

    Submitted 17 January, 2024; originally announced January 2024.

    MSC Class: 53C22; 53C65

  28. arXiv:2401.00751  [pdf, other

    cs.CL cs.SE

    Machine Translation Testing via Syntactic Tree Pruning

    Authors: Quanjun Zhang, Juan Zhai, Chunrong Fang, Jiawei Liu, Weisong Sun, Haichuan Hu, Qingyu Wang

    Abstract: Machine translation systems have been widely adopted in our daily life, making life easier and more convenient. Unfortunately, erroneous translations may result in severe consequences, such as financial losses. This requires to improve the accuracy and the reliability of machine translation systems. However, it is challenging to test machine translation systems because of the complexity and intrac… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

    Comments: Accepted to ACM Transactions on Software Engineering and Methodology 2024 (TOSEM'24)

  29. arXiv:2401.00379  [pdf, other

    cs.SE cs.AI

    DREAM: Debugging and Repairing AutoML Pipelines

    Authors: Xiaoyu Zhang, Juan Zhai, Shiqing Ma, Chao Shen

    Abstract: Deep Learning models have become an integrated component of modern software systems. In response to the challenge of model design, researchers proposed Automated Machine Learning (AutoML) systems, which automatically search for model architecture and hyperparameters for a given task. Like other software systems, existing AutoML systems suffer from bugs. We identify two common and severe bugs in Au… ▽ More

    Submitted 30 December, 2023; originally announced January 2024.

    Comments: 12 pages, 10 figures

  30. arXiv:2312.12722  [pdf, other

    cs.CV

    Fine-Grained Knowledge Selection and Restoration for Non-Exemplar Class Incremental Learning

    Authors: Jiang-Tian Zhai, Xialei Liu, Lu Yu, Ming-Ming Cheng

    Abstract: Non-exemplar class incremental learning aims to learn both the new and old tasks without accessing any training data from the past. This strict restriction enlarges the difficulty of alleviating catastrophic forgetting since all techniques can only be applied to current task data. Considering this challenge, we propose a novel framework of fine-grained knowledge selection and restoration. The conv… ▽ More

    Submitted 19 December, 2023; originally announced December 2023.

    Comments: to appear at AAAI 2024

  31. Large nonlinear Hall effect and Berry curvature in KTaO3 based two-dimensional electron gas

    Authors: Jinfeng Zhai, Mattia Trama, Hao Liu, Zhifei Zhu, Yinyan Zhu, Carmine Antonio Perroni, Roberta Citro, Pan He, Jian Shen

    Abstract: The two-dimensional electron gas (2DEG) at oxide interfaces exhibits various exotic properties stemming from interfacial inversion symmetry breaking. In this work, we report the emergence of large nonlinear Hall effects (NHE) in the LaAlO3/KTaO3(111) interface 2DEG under zero magnetic field. Skew scattering was identified as the dominant origin based on the cubic scaling of nonlinear Hall conducti… ▽ More

    Submitted 9 December, 2023; originally announced December 2023.

    Journal ref: Nano Letters 2023

  32. arXiv:2312.02441  [pdf, other

    cs.CL

    MedDM:LLM-executable clinical guidance tree for clinical decision-making

    Authors: Binbin Li, Tianxin Meng, Xiaoming Shi, Jie Zhai, Tong Ruan

    Abstract: It is becoming increasingly emphasis on the importance of LLM participating in clinical diagnosis decision-making. However, the low specialization refers to that current medical LLMs can not provide specific medical advice, which are more like a medical Q\&A. And there is no suitable clinical guidance tree data set that can be used directly with LLM. To address this issue, we first propose LLM-exe… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  33. arXiv:2312.01175  [pdf

    physics.acc-ph

    High Q and high gradient performance of the first medium-temperature baking 1.3 GHz cryomodule

    Authors: Jiyuan Zhai, Weimin Pan, Feisi He, Rui Ge, Zhenghui Mi, Peng Sha, Song Jin, Ruixiong Han, Qunyao Wang, Haiying Lin, Guangwei Wang, Mei Li, Minjing Sang, Liangrui Sun, Rui Ye, Tongxian Zhao, Shaopeng Li, Keyu Zhu, Baiqi Liu, Xiaolong Wang, Xiangchen Yang, Xiaojuan Bian, Xiangzhen Zhang, Huizhou Ma, Xuwen Dai , et al. (14 additional authors not shown)

    Abstract: World's first 1.3 GHz cryomodule containing eight 9-cell superconducting radio-frequency (RF) cavities treated by medium-temperature furnace baking (mid-T bake) was developed, assembled and tested at IHEP for the Dalian Advanced Light Source (DALS) and CEPC R&D. The 9-cell cavities in the cryomodule achieved an unprecedented highest average Q0 of 3.8E10 at 16 MV/m and 3.6E10 at 21 MV/m in the hori… ▽ More

    Submitted 2 December, 2023; originally announced December 2023.

    Comments: 5 pages, 6 figures

  34. arXiv:2312.00324  [pdf, other

    cs.SE

    Machine Learning for Actionable Warning Identification: A Comprehensive Survey

    Authors: Xiuting Ge, Chunrong Fang, Xuanye Li, Weisong Sun, Daoyuan Wu, Juan Zhai, Shangwei Lin, Zhihong Zhao, Yang Liu, Zhenyu Chen

    Abstract: Actionable Warning Identification (AWI) plays a crucial role in improving the usability of static code analyzers. With recent advances in Machine Learning (ML), various approaches have been proposed to incorporate ML techniques into AWI. These ML-based AWI approaches, benefiting from ML's strong ability to learn subtle and previously unseen patterns from historical data, have demonstrated superior… ▽ More

    Submitted 30 November, 2023; originally announced December 2023.

  35. arXiv:2311.17822  [pdf, other

    cs.AI

    Anomalous Behavior Detection in Trajectory Data of Older Drivers

    Authors: Seyedeh Gol Ara Ghoreishi, Sonia Moshfeghi, Muhammad Tanveer Jan, Joshua Conniff, KwangSoo Yang, Jinwoo Jang, Borko Furht, Ruth Tappen, David Newman, Monica Rosselli, Jiannan Zhai

    Abstract: Given a road network and a set of trajectory data, the anomalous behavior detection (ABD) problem is to identify drivers that show significant directional deviations, hardbrakings, and accelerations in their trips. The ABD problem is important in many societal applications, including Mild Cognitive Impairment (MCI) detection and safe route recommendations for older drivers. The ABD problem is comp… ▽ More

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: IEEE HONET 2023

  36. arXiv:2311.09264  [pdf, other

    cs.LG cs.AI q-bio.QM

    Cross-domain feature disentanglement for interpretable modeling of tumor microenvironment impact on drug response

    Authors: Jia Zhai, Hui Liu

    Abstract: High-throughput screening technology has facilitated the generation of large-scale drug responses across hundreds of cancer cell lines. However, there exists significant discrepancy between in vitro cell lines and actual tumors in vivo in terms of their response to drug treatments, because of tumors comprise of complex cellular compositions and histopathology structure, known as tumor microenviron… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  37. arXiv:2310.08879  [pdf, other

    cs.SE

    A Critical Review of Large Language Model on Software Engineering: An Example from ChatGPT and Automated Program Repair

    Authors: Quanjun Zhang, Tongke Zhang, Juan Zhai, Chunrong Fang, Bowen Yu, Weisong Sun, Zhenyu Chen

    Abstract: Large Language Models (LLMs) have been gaining increasing attention and demonstrated promising performance across a variety of Software Engineering (SE) tasks, such as Automated Program Repair (APR), code summarization, and code completion. For example, ChatGPT, the latest black-box LLM, has been investigated by numerous recent research studies and has shown impressive performance in various tasks… ▽ More

    Submitted 17 April, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: add EvalGPTFix URL

  38. arXiv:2309.06645  [pdf, other

    cs.LG

    Bregman Graph Neural Network

    Authors: Jiayu Zhai, Lequan Lin, Dai Shi, Junbin Gao

    Abstract: Numerous recent research on graph neural networks (GNNs) has focused on formulating GNN architectures as an optimization problem with the smoothness assumption. However, in node classification tasks, the smoothing effect induced by GNNs tends to assimilate representations and over-homogenize labels of connected nodes, leading to adverse effects such as over-smoothing and misclassification. In this… ▽ More

    Submitted 12 September, 2023; originally announced September 2023.

  39. arXiv:2308.13282  [pdf, other

    eess.SY

    Advancing Distributed AC Optimal Power Flow for Integrated Transmission-Distribution Systems

    Authors: Xinliang Dai, Junyi Zhai, Yuning Jiang, Yi Guo, Colin N. Jones, Veit Hagenmeyer

    Abstract: This paper introduces a distributed operational solution for coordinating integrated transmission-distribution (ITD) systems regarding data privacy. To tackle the nonconvex challenges of AC optimal power flow (OPF) problems, our research proposes an enhanced version of the Augmented Lagrangian based Alternating Direction Inexact Newton method (ALADIN). This proposed framework incorporates a second… ▽ More

    Submitted 30 January, 2024; v1 submitted 25 August, 2023; originally announced August 2023.

  40. arXiv:2308.12510  [pdf, other

    cs.CV cs.AI cs.LG

    Masked Autoencoders are Efficient Class Incremental Learners

    Authors: Jiang-Tian Zhai, Xialei Liu, Andrew D. Bagdanov, Ke Li, Ming-Ming Cheng

    Abstract: Class Incremental Learning (CIL) aims to sequentially learn new classes while avoiding catastrophic forgetting of previous knowledge. We propose to use Masked Autoencoders (MAEs) as efficient learners for CIL. MAEs were originally designed to learn useful representations through reconstructive unsupervised learning, and they can be easily integrated with a supervised loss for classification. Moreo… ▽ More

    Submitted 23 August, 2023; originally announced August 2023.

    Comments: Accepted at ICCV 2023

  41. arXiv:2308.12475  [pdf, ps, other

    math.AP

    Determination of the density in a nonlinear elastic wave equation

    Authors: Gunther Uhlmann, Jian Zhai

    Abstract: This is a continuation of our study [Uhlmann-Zhai, JMPA, 2021] on an inverse boundary value problem for a nonlinear elastic wave equation. We prove that all the linear and nonlinear coefficients can be recovered from the displacement-to-traction map, including the density, under some natural geometric conditions on the wavespeeds.

    Submitted 24 January, 2024; v1 submitted 23 August, 2023; originally announced August 2023.

  42. arXiv:2308.08459  [pdf, other

    cs.IR cs.AI

    Knowledge Prompt-tuning for Sequential Recommendation

    Authors: Jianyang Zhai, Xiawu Zheng, Chang-Dong Wang, Hui Li, Yonghong Tian

    Abstract: Pre-trained language models (PLMs) have demonstrated strong performance in sequential recommendation (SR), which are utilized to extract general knowledge. However, existing methods still lack domain knowledge and struggle to capture users' fine-grained preferences. Meanwhile, many traditional SR methods improve this issue by integrating side information while suffering from information loss. To s… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

  43. arXiv:2307.14554  [pdf, ps, other

    math.PR

    Large deviation principle for stochastic reaction-diffusion equations with super-linear drift on $\mathbb{R}$ driven by space-time white noise

    Authors: Yue Li, Shijie Shang, Jianliang Zhai

    Abstract: In this paper, we consider stochastic reaction-diffusion equations with super-linear drift on the real line $\mathbb{R}$ driven by space-time white noise. A Freidlin-Wentzell large deviation principle is established by a modified weak convergence method on the space $C([0,T], C_{tem}(\mathbb{R}))$. Obtaining the main result in this paper is challenging due to the setting of unbounded domain, the s… ▽ More

    Submitted 26 July, 2023; originally announced July 2023.

    MSC Class: 60H15; 60F10

  44. arXiv:2307.04995  [pdf, other

    cs.LG cs.PL

    PowerFusion: A Tensor Compiler with Explicit Data Movement Description and Instruction-level Graph IR

    Authors: Zixuan Ma, Haojie Wang, Jingze Xing, Liyan Zheng, Chen Zhang, Huanqi Cao, Kezhao Huang, Shizhi Tang, Penghan Wang, Jidong Zhai

    Abstract: Deep neural networks (DNNs) are of critical use in different domains. To accelerate DNN computation, tensor compilers are proposed to generate efficient code on different domain-specific accelerators. Existing tensor compilers mainly focus on optimizing computation efficiency. However, memory access is becoming a key performance bottleneck because the computational performance of accelerators is i… ▽ More

    Submitted 10 July, 2023; originally announced July 2023.

    Comments: 12 pages, 14 figures

  45. arXiv:2306.10211  [pdf, ps, other

    math.AP

    Increasing stability estimates for the inverse potential scattering problems

    Authors: Jian Zhai, Yue Zhao

    Abstract: This paper is mainly concerned with the inverse scattering problem of determining the unknown potential for the classical Schrödinger equation in two and three dimensions. We establish the increasing stability of the inverse scattering problem from either multi-frequency near-field data or multi-frequency far-field pattern. The stability estimate consists of the Lipschitz type data discrepancy and… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

    MSC Class: 35R30; 78A46

  46. Revisiting Neural Retrieval on Accelerators

    Authors: Jiaqi Zhai, Zhaojie Gong, Yueming Wang, Xiao Sun, Zheng Yan, Fu Li, Xing Liu

    Abstract: Retrieval finds a small number of relevant candidates from a large corpus for information retrieval and recommendation applications. A key component of retrieval is to model (user, item) similarity, which is commonly represented as the dot product of two learned embeddings. This formulation permits efficient inference, commonly known as Maximum Inner Product Search (MIPS). Despite its popularity,… ▽ More

    Submitted 6 June, 2023; originally announced June 2023.

    Comments: To appear in the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD 2023)

  47. arXiv:2305.18702  [pdf, other

    stat.ML cs.LG math.NA

    Adversarial Adaptive Sampling: Unify PINN and Optimal Transport for the Approximation of PDEs

    Authors: Kejun Tang, Jiayu Zhai, Xiaoliang Wan, Chao Yang

    Abstract: Solving partial differential equations (PDEs) is a central task in scientific computing. Recently, neural network approximation of PDEs has received increasing attention due to its flexible meshless discretization and its potential for high-dimensional problems. One fundamental numerical difficulty is that random samples in the training set introduce statistical errors into the discretization of l… ▽ More

    Submitted 14 March, 2024; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: ICLR, 2024

  48. arXiv:2305.10430  [pdf, other

    cs.CV

    Rethinking the Open-Loop Evaluation of End-to-End Autonomous Driving in nuScenes

    Authors: Jiang-Tian Zhai, Ze Feng, Jinhao Du, Yongqiang Mao, Jiang-Jiang Liu, Zichang Tan, Yifu Zhang, Xiaoqing Ye, Jingdong Wang

    Abstract: Modern autonomous driving systems are typically divided into three main tasks: perception, prediction, and planning. The planning task involves predicting the trajectory of the ego vehicle based on inputs from both internal intention and the external environment, and manipulating the vehicle accordingly. Most existing works evaluate their performance on the nuScenes dataset using the L2 error and… ▽ More

    Submitted 21 October, 2023; v1 submitted 17 May, 2023; originally announced May 2023.

    Comments: Technical report. Code is available

  49. arXiv:2305.07220  [pdf, other

    eess.SP

    Physical-layer Adversarial Robustness for Deep Learning-based Semantic Communications

    Authors: Guoshun Nan, Zhichun Li, Jinli Zhai, Qimei Cui, Gong Chen, Xin Du, Xuefei Zhang, Xiaofeng Tao, Zhu Han, Tony Q. S. Quek

    Abstract: End-to-end semantic communications (ESC) rely on deep neural networks (DNN) to boost communication efficiency by only transmitting the semantics of data, showing great potential for high-demand mobile applications. We argue that central to the success of ESC is the robust interpretation of conveyed semantics at the receiver side, especially for security-critical applications such as automatic driv… ▽ More

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: 17 pages, 28 figures, accepted by IEEE jsac

  50. arXiv:2305.05234  [pdf, ps, other

    math.PR math-ph

    Large deviation principles for stochastic nonlinear Schrodinger equations driven by Levy noise

    Authors: Jiahui Zhu, Wei Liu, Jianliang Zhai

    Abstract: In this work we establish a Freidlin-Wentzell type large deviation principle for stochastic nonlinear Schrödinger equation, with either focusing or defocusing nonlinearity, driven by nonlinear multiplicative Lévy noise in the Marcus canonical form. This task is challenging in the current setting due to the presence of the power-type nonlinear term, the lack of regularization effect of the Schrödin… ▽ More

    Submitted 16 August, 2024; v1 submitted 9 May, 2023; originally announced May 2023.