Showing 1–50 of 262 results for author: Du, Q
  1. arXiv:2408.14255  [pdf, other]

    eess.IV

    MSFMamba: Multi-Scale Feature Fusion State Space Model for Multi-Source Remote Sensing Image Classification

    Authors: Feng Gao, Xuepeng Jin, Xiaowei Zhou, Junyu Dong, Qian Du

    Abstract: In the field of multi-source remote sensing image classification, remarkable progress has been made by convolutional neural networks and Transformers. However, existing methods are still limited due to the inherent local reductive bias. Recently, Mamba-based methods built upon the State Space Model have shown great potential for long-range dependency modeling with linear complexity, but it has rarely been…

    Submitted 26 August, 2024; originally announced August 2024.

  2. arXiv:2408.14158  [pdf, other]

    cs.DC cs.AI

    Fire-Flyer AI-HPC: A Cost-Effective Software-Hardware Co-Design for Deep Learning

    Authors: Wei An, Xiao Bi, Guanting Chen, Shanhuang Chen, Chengqi Deng, Honghui Ding, Kai Dong, Qiushi Du, Wenjun Gao, Kang Guan, Jianzhong Guo, Yongqiang Guo, Zhe Fu, Ying He, Panpan Huang, Jiashi Li, Wenfeng Liang, Xiaodong Liu, Xin Liu, Yiyuan Liu, Yuxuan Liu, Shanghao Lu, Xuan Lu, Xiaotao Nie, Tian Pei, et al. (27 additional authors not shown)

    Abstract: The rapid progress in Deep Learning (DL) and Large Language Models (LLMs) has exponentially increased demands of computational power and bandwidth. This, combined with the high costs of faster computing chips and interconnects, has significantly inflated High Performance Computing (HPC) construction costs. To address these challenges, we introduce the Fire-Flyer AI-HPC architecture, a synergistic…

    Submitted 31 August, 2024; v1 submitted 26 August, 2024; originally announced August 2024.

    Comments: This is the preprint version of the paper accepted for presentation at the 2024 International Conference for High Performance Computing, Networking, Storage, and Analysis (SC'24). © 2024 IEEE. Personal use of this material is permitted. For other uses, permission from IEEE must be obtained. Please refer to IEEE Xplore for the final published version

  3. arXiv:2408.12232  [pdf, other]

    cs.CV

    BihoT: A Large-Scale Dataset and Benchmark for Hyperspectral Camouflaged Object Tracking

    Authors: Hanzheng Wang, Wei Li, Xiang-Gen Xia, Qian Du

    Abstract: Hyperspectral object tracking (HOT) has exhibited potential in various applications, particularly in scenes where objects are camouflaged. Existing trackers can effectively retrieve objects via band regrouping because of the bias in existing HOT datasets, where most objects tend to have distinguishing visual appearances rather than spectral characteristics. This bias allows the tracker to directly…

    Submitted 22 August, 2024; originally announced August 2024.

  4. arXiv:2408.12109  [pdf, other]

    cs.CV cs.CL

    RoVRM: A Robust Visual Reward Model Optimized via Auxiliary Textual Preference Data

    Authors: Chenglong Wang, Yang Gan, Yifu Huo, Yongyu Mu, Murun Yang, Qiaozhi He, Tong Xiao, Chunliang Zhang, Tongran Liu, Quan Du, Di Yang, Jingbo Zhu

    Abstract: Large vision-language models (LVLMs) often fail to align with human preferences, leading to issues like generating misleading content without proper visual context (also known as hallucination). A promising solution to this problem is using human-preference alignment techniques, such as best-of-n sampling and reinforcement learning. However, these techniques face the difficulty arising from the sc…

    Submitted 21 August, 2024; originally announced August 2024.

  5. arXiv:2408.08152  [pdf, other]

    cs.CL cs.AI cs.LG cs.LO

    DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

    Authors: Huajian Xin, Z. Z. Ren, Junxiao Song, Zhihong Shao, Wanjia Zhao, Haocheng Wang, Bo Liu, Liyue Zhang, Xuan Lu, Qiushi Du, Wenjun Gao, Qihao Zhu, Dejian Yang, Zhibin Gou, Z. F. Wu, Fuli Luo, Chong Ruan

    Abstract: We introduce DeepSeek-Prover-V1.5, an open-source language model designed for theorem proving in Lean 4, which enhances DeepSeek-Prover-V1 by optimizing both training and inference processes. Pre-trained on DeepSeekMath-Base with specialization in formal mathematical languages, the model undergoes supervised fine-tuning using an enhanced formal theorem proving dataset derived from DeepSeek-Prover-…

    Submitted 15 August, 2024; originally announced August 2024.

  6. arXiv:2408.07261  [pdf, other]

    math.NA

    Numerical analysis of a class of penalty discontinuous Galerkin methods for nonlocal diffusion problems

    Authors: Qiang Du, Lili Ju, Jianfang Lu, Xiaochuan Tian

    Abstract: In this paper, we consider a class of discontinuous Galerkin (DG) methods for one-dimensional nonlocal diffusion (ND) problems. The nonlocal models, which are integral equations, are widely used in describing many physical phenomena with long-range interactions. The ND problem is the nonlocal analog of the classic diffusion problem, and as the interaction radius (horizon) vanishes, then the nonloc…

    Submitted 13 August, 2024; originally announced August 2024.

    MSC Class: 65M60; 65R20; 45A05

  7. arXiv:2408.04177  [pdf, ps, other]

    quant-ph cond-mat.stat-mech

    Information Thermodynamics of Non-Hermitian Quantum Systems

    Authors: Kui Cao, Qian Du, Su-Peng Kou

    Abstract: In this study, we uncover the intrinsic information processes in non-Hermitian quantum systems and their thermodynamic effects. We demonstrate that these systems can exhibit negative entropy production, making them potential candidates for information engines. We also identify a key informational quantity that can characterize phase transitions beyond the reach of traditional partition functions.…

    Submitted 7 August, 2024; originally announced August 2024.

  8. arXiv:2408.00422  [pdf, ps, other]

    math.FA cs.SI math.CO math.PR

    Ginzburg--Landau Functionals in the Large-Graph Limit

    Authors: Edith Zhang, James Scott, Qiang Du, Mason A. Porter

    Abstract: Ginzburg--Landau (GL) functionals on graphs, which are relaxations of graph-cut functionals on graphs, have yielded a variety of insights in image segmentation and graph clustering. In this paper, we study large-graph limits of GL functionals by taking a functional-analytic view of graphs as nonlocal kernels. For a graph $W_n$ with $n$ nodes, the corresponding graph GL functional $\GL^{W_n}_\ep$ i…

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: 37 pages

  9. arXiv:2407.19964  [pdf, other]

    math.PR

    A Markov representation of Perron-Frobenius eigenvector for infinite non-negative matrix and Metzler-matrix

    Authors: Qian Du, Yong-Hua Mao

    Abstract: We will represent the so-called Perron-Frobenius eigenvector (if it exists) for infinite non-negative matrix $A$ and Metzler matrix by using its corresponding Markov chain with probability transition function.

    Submitted 29 July, 2024; originally announced July 2024.

  10. arXiv:2407.19803  [pdf, ps, other]

    math.PR

    Quasi-stationary distributions for continuous-time $λ$-recurrent jump processes

    Authors: Qian Du, Yong-Hua Mao

    Abstract: For the continuous-time $λ$-recurrent jump process, the $λ$-recurrence assures the existence of quasi-stationary distribution when it has finite exit states (the states that have positive killing rates). And we give an explicit representation for this quasi-stationary distribution through $Q$-matrix, where the components of the quasi-stationary distribution outside the set $H$ of exit states can b…

    Submitted 29 July, 2024; originally announced July 2024.

  11. arXiv:2407.15156  [pdf, other]

    math.NA

    Computational and analytical studies of a new nonlocal phase-field crystal model in two dimensions

    Authors: Qiang Du, Kai Wang, Jiang Yang

    Abstract: A nonlocal phase-field crystal (NPFC) model is presented as a nonlocal counterpart of the local phase-field crystal (LPFC) model and a special case of the structural PFC (XPFC) derived from classical field theory for crystal growth and phase transition. The NPFC incorporates a finite range of spatial nonlocal interactions that can account for both repulsive and attractive effects. The specific for…

    Submitted 21 July, 2024; originally announced July 2024.

  12. arXiv:2406.16087  [pdf, other]

    cs.RO cs.AI cs.CV cs.LG

    Imperative Learning: A Self-supervised Neural-Symbolic Learning Framework for Robot Autonomy

    Authors: Chen Wang, Kaiyi Ji, Junyi Geng, Zhongqiang Ren, Taimeng Fu, Fan Yang, Yifan Guo, Haonan He, Xiangyu Chen, Zitong Zhan, Qiwei Du, Shaoshu Su, Bowen Li, Yuheng Qiu, Yi Du, Qihang Li, Yifan Yang, Xiao Lin, Zhipeng Zhao

    Abstract: Data-driven methods such as reinforcement and imitation learning have achieved remarkable success in robot autonomy. However, their data-centric nature still hinders them from generalizing well to ever-changing environments. Moreover, collecting large datasets for robotic tasks is often impractical and expensive. To overcome these challenges, we introduce a new self-supervised neural-symbolic (NeS…

    Submitted 6 August, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

  13. arXiv:2406.11931  [pdf, other]

    cs.SE cs.AI cs.LG

    DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

    Authors: DeepSeek-AI, Qihao Zhu, Daya Guo, Zhihong Shao, Dejian Yang, Peiyi Wang, Runxin Xu, Y. Wu, Yukun Li, Huazuo Gao, Shirong Ma, Wangding Zeng, Xiao Bi, Zihui Gu, Hanwei Xu, Damai Dai, Kai Dong, Liyue Zhang, Yishi Piao, Zhibin Gou, Zhenda Xie, Zhewen Hao, Bingxuan Wang, Junxiao Song, Deli Chen, et al. (15 additional authors not shown)

    Abstract: We present DeepSeek-Coder-V2, an open-source Mixture-of-Experts (MoE) code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks. Specifically, DeepSeek-Coder-V2 is further pre-trained from an intermediate checkpoint of DeepSeek-V2 with additional 6 trillion tokens. Through this continued pre-training, DeepSeek-Coder-V2 substantially enhances the coding and mathe…

    Submitted 17 June, 2024; originally announced June 2024.

  14. arXiv:2406.10469  [pdf, other]

    eess.IV cs.CV cs.MM

    Object-Attribute-Relation Representation based Video Semantic Communication

    Authors: Qiyuan Du, Yiping Duan, Qianqian Yang, Xiaoming Tao, Mérouane Debbah

    Abstract: With the rapid growth of multimedia data volume, there is an increasing need for efficient video transmission in applications such as virtual reality and future video streaming services. Semantic communication is emerging as a vital technique for ensuring efficient and reliable transmission in low-bandwidth, high-noise settings. However, most current approaches focus on joint source-channel coding…

    Submitted 14 June, 2024; originally announced June 2024.

  15. arXiv:2406.04705  [pdf]

    cs.CR

    EAIA: An Efficient and Anonymous Identity Authentication Scheme in 5G-V2V

    Authors: Qianmin Du, Jianhong Zhou, Maode Ma

    Abstract: Vehicle Ad-hoc Networks (VANETs) have experienced significant development in recent years, playing a crucial role in enhancing the driving experience by enabling safer and more efficient inter-vehicle interactions through information exchange. Vehicle-to-vehicle (V2V) communication is particularly vital as it not only helps to prevent collisions and improve traffic efficiency but also provides ess…

    Submitted 7 June, 2024; originally announced June 2024.

  16. arXiv:2406.02190  [pdf, ps, other]

    eess.SY

    Age of Trust (AoT): A Continuous Verification Framework for Wireless Networks

    Authors: Yuquan Xiao, Qinghe Du, Wenchi Cheng, Panagiotis D. Diamantoulakis, George K. Karagiannidis

    Abstract: Zero Trust is a new security vision for 6G networks that emphasises the philosophy of never trust and always verify. However, there is a fundamental trade-off between the wireless transmission efficiency and the trust level, which is reflected by the verification interval and its adaptation strategy. More importantly, the mathematical framework to characterise the trust level of the adaptive verif…

    Submitted 4 June, 2024; originally announced June 2024.

  17. arXiv:2406.02139  [pdf, other]

    eess.SY

    Statistical Age of Information: A Risk-Aware Metric and Its Applications in Status Updates

    Authors: Yuquan Xiao, Qinghe Du, George K. Karagiannidis

    Abstract: Age of information (AoI) is an effective measure to quantify the information freshness in wireless status update systems. It has been further validated that the peak AoI has the potential to capture the core characteristics of the aging process, and thus the average peak AoI is widely used to evaluate the long-term performance of information freshness. However, the average peak AoI is a risk-insen…

    Submitted 4 June, 2024; originally announced June 2024.

  18. arXiv:2405.13860  [pdf, other]

    cs.CV

    MAGIC: Map-Guided Few-Shot Audio-Visual Acoustics Modeling

    Authors: Diwei Huang, Kunyang Lin, Peihao Chen, Qing Du, Mingkui Tan

    Abstract: Few-shot audio-visual acoustics modeling seeks to synthesize the room impulse response in arbitrary locations with few-shot observations. To sufficiently exploit the provided few-shot data for accurate acoustic modeling, we present a *map-guided* framework by constructing acoustic-related visual semantic feature maps of the scenes. Visual features preserve semantic details related to sound and map…

    Submitted 22 May, 2024; originally announced May 2024.

    Comments: 17 pages, 12 pages for main paper, 5 pages for supplementary

  19. arXiv:2405.04434  [pdf, other]

    cs.CL cs.AI

    DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

    Authors: DeepSeek-AI, Aixin Liu, Bei Feng, Bin Wang, Bingxuan Wang, Bo Liu, Chenggang Zhao, Chengqi Deng, Chong Ruan, Damai Dai, Daya Guo, Dejian Yang, Deli Chen, Dongjie Ji, Erhang Li, Fangyun Lin, Fuli Luo, Guangbo Hao, Guanting Chen, Guowei Li, H. Zhang, Hanwei Xu, Hao Yang, Haowei Zhang, Honghui Ding, et al. (132 additional authors not shown)

    Abstract: We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128K tokens. DeepSeek-V2 adopts innovative architectures including Multi-head Latent Attention (MLA) and DeepSeekMoE. MLA guarantees efficient inference…

    Submitted 19 June, 2024; v1 submitted 7 May, 2024; originally announced May 2024.

  20. Social Force Embedded Mixed Graph Convolutional Network for Multi-class Trajectory Prediction

    Authors: Quancheng Du, Xiao Wang, Shouguo Yin, Lingxi Li, Huansheng Ning

    Abstract: Accurate prediction of agent motion trajectories is crucial for autonomous driving, contributing to the reduction of collision risks in human-vehicle interactions and ensuring ample response time for other traffic participants. Current research predominantly focuses on traditional deep learning methods, including convolutional neural networks (CNNs) and recurrent neural networks (RNNs). These meth…

    Submitted 20 April, 2024; originally announced April 2024.

    Comments: 11 pages, 3 figures, published in IEEE Transactions on Intelligent Vehicles

  21. S4TP: Social-Suitable and Safety-Sensitive Trajectory Planning for Autonomous Vehicles

    Authors: Xiao Wang, Ke Tang, Xingyuan Dai, Jintao Xu, Quancheng Du, Rui Ai, Yuxiao Wang, Weihao Gu

    Abstract: On public roads, autonomous vehicles (AVs) face the challenge of frequent interactions with human-driven vehicles (HDVs), which render uncertain driving behavior due to varying social characteristics among humans. To effectively assess the risks prevailing in the vicinity of AVs in social interactive traffic scenarios and achieve safe autonomous driving, this article proposes a social-suitable and…

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 12 pages, 4 figures, published in IEEE Transactions on Intelligent Vehicles

  22. arXiv:2404.11326  [pdf, other]

    cs.CV

    Single-temporal Supervised Remote Change Detection for Domain Generalization

    Authors: Qiangang Du, Jinlong Peng, Xu Chen, Qingdong He, Liren He, Qiang Nie, Wenbing Zhu, Mingmin Chi, Yabiao Wang, Chengjie Wang

    Abstract: Change detection is widely applied in remote sensing image analysis. Existing methods require training models separately for each dataset, which leads to poor domain generalization. Moreover, these methods rely heavily on large amounts of high-quality pair-labelled data for training, which is expensive and impractical. In this paper, we propose a multimodal contrastive learning (ChangeCLIP) based…

    Submitted 23 April, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

  23. arXiv:2404.11318  [pdf, other]

    cs.CV

    Leveraging Fine-Grained Information and Noise Decoupling for Remote Sensing Change Detection

    Authors: Qiangang Du, Jinlong Peng, Changan Wang, Xu Chen, Qingdong He, Wenbing Zhu, Mingmin Chi, Yabiao Wang, Chengjie Wang

    Abstract: Change detection aims to identify remote sense object changes by analyzing data between bitemporal image pairs. Due to the large temporal and spatial span of data collection in change detection image pairs, there is often a significant amount of task-specific and task-agnostic noise. Previous effort has focused excessively on denoising, with this goes a great deal of loss of fine-grained informat…

    Submitted 21 June, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

  24. arXiv:2403.12582  [pdf, other]

    cs.CL

    AlphaFin: Benchmarking Financial Analysis with Retrieval-Augmented Stock-Chain Framework

    Authors: Xiang Li, Zhenyu Li, Chen Shi, Yong Xu, Qing Du, Mingkui Tan, Jun Huang, Wei Lin

    Abstract: The task of financial analysis primarily encompasses two key areas: stock trend prediction and the corresponding financial question answering. Currently, machine learning and deep learning algorithms (ML&DL) have been widely applied for stock trend predictions, leading to significant progress. However, these methods fail to provide reasons for predictions, lacking interpretability and reasoning pr…

    Submitted 19 March, 2024; originally announced March 2024.

    Comments: COLING 2024. The first three authors contributed equally. Project website: https://github.com/AlphaFin-proj/AlphaFin

  25. arXiv:2403.11561  [pdf, other]

    cs.CV

    Learning Unified Reference Representation for Unsupervised Multi-class Anomaly Detection

    Authors: Liren He, Zhengkai Jiang, Jinlong Peng, Liang Liu, Qiangang Du, Xiaobin Hu, Wenbing Zhu, Mingmin Chi, Yabiao Wang, Chengjie Wang

    Abstract: In the field of multi-class anomaly detection, reconstruction-based methods derived from single-class anomaly detection face the well-known challenge of "learning shortcuts", wherein the model fails to learn the patterns of normal samples as it should, opting instead for shortcuts such as identity mapping or artificial noise elimination. Consequently, the model becomes unable to reconstruct genuin…

    Submitted 16 July, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: Accepted by ECCV 2024

  26. arXiv:2403.10067  [pdf, other]

    eess.IV cs.CV

    Hybrid Convolutional and Attention Network for Hyperspectral Image Denoising

    Authors: Shuai Hu, Feng Gao, Xiaowei Zhou, Junyu Dong, Qian Du

    Abstract: Hyperspectral image (HSI) denoising is critical for the effective analysis and interpretation of hyperspectral data. However, simultaneously modeling global and local features is rarely explored to enhance HSI denoising. In this letter, we propose a hybrid convolution and attention network (HCANet), which leverages both the strengths of convolution neural networks (CNNs) and Transformers. To enhan…

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: IEEE GRSL 2024

  27. arXiv:2403.07712  [pdf, ps, other]

    math.AP math.NA

    Nonlocal Stokes equation with relaxation on the divergence free equation

    Authors: Yajie Zhang, Qiang Du, Zuoqiang Shi

    Abstract: In this paper, we consider a new nonlocal approximation to the linear Stokes system with periodic boundary conditions in two- and three-dimensional spaces. A relaxation term is added to the equation of nonlocal divergence free equation, which is reminiscent of the relaxation of local Stokes equation with small artificial compressibility. Our analysis shows that the well-posedness of the nonlocal s…

    Submitted 12 March, 2024; originally announced March 2024.

  28. arXiv:2403.05852  [pdf, other]

    cs.CV

    SSF-Net: Spatial-Spectral Fusion Network with Spectral Angle Awareness for Hyperspectral Object Tracking

    Authors: Hanzheng Wang, Wei Li, Xiang-Gen Xia, Qian Du, Jing Tian

    Abstract: Hyperspectral video (HSV) offers valuable spatial, spectral, and temporal information simultaneously, making it highly suitable for handling challenges such as background clutter and visual similarity in object tracking. However, existing methods primarily focus on band regrouping and rely on RGB trackers for feature extraction, resulting in limited exploration of spectral information and difficul…

    Submitted 9 March, 2024; originally announced March 2024.

  29. LightSword: A Customized Virtual Reality Exergame for Long-Term Cognitive Inhibition Training in Older Adults

    Authors: Qiuxin Du, Zhen Song, Haiyan Jiang, Xiaoying Wei, Dongdong Weng, Mingming Fan

    Abstract: The decline of cognitive inhibition significantly impacts older adults' quality of life and well-being, making it a vital public health problem in today's aging society. Previous research has demonstrated that virtual reality (VR) exergames have great potential to enhance cognitive inhibition among older adults. However, existing commercial VR exergames were unsuitable for older adults' long-term…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 23 pages

    Journal ref: Proceedings of the CHI Conference on Human Factors in Computing Systems 2024 (CHI '24)

  30. Collisional energy loss of a heavy quark in a semiquark-gluon plasma

    Authors: Qianqian Du, Mudong Du, Yun Guo

    Abstract: By utilizing a background field effective theory, we compute the collisional energy loss of a heavy quark moving through a semiquark-gluon plasma characterized by nontrivial holonomy for Polyakov loops. We consider the elastic scatterings between the incident heavy quark and the thermal partons with both hard and soft momentum transfers. As compared to the energy loss obtained from the perturbatio…

    Submitted 14 August, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: final version, published in PRD

  31. arXiv:2402.07749  [pdf, ps, other]

    math.NA

    Asymptotically compatible schemes for nonlinear variational models via Gamma-convergence and applications to nonlocal problems

    Authors: Qiang Du, James M. Scott, Xiaochuan Tian

    Abstract: We present a study on asymptotically compatible Galerkin discretizations for a class of parametrized nonlinear variational problems. The abstract analytical framework is based on variational convergence, or Gamma-convergence. We demonstrate the broad applicability of the theoretical framework by developing asymptotically compatible finite element discretizations of some representative nonlinear no…

    Submitted 12 February, 2024; originally announced February 2024.

  32. arXiv:2402.05123  [pdf, ps, other]

    cs.CL

    A Survey on Data Selection for LLM Instruction Tuning

    Authors: Jiahao Wang, Bolin Zhang, Qianlong Du, Jiajun Zhang, Dianhui Chu

    Abstract: Instruction tuning is a vital step of training large language models (LLM), so how to enhance the effect of instruction tuning has received increased attention. Existing works indicate that the quality of the dataset is more crucial than the quantity during instruction tuning of LLM. Therefore, recently a lot of studies focus on exploring the methods of selecting high-quality subset from instructi…

    Submitted 4 February, 2024; originally announced February 2024.

  33. arXiv:2401.05679  [pdf, other]

    math.OC cond-mat.soft

    Ohta-Kawasaki energy for amphiphiles: asymptotics and phase-field simulations

    Authors: Qiang Du, James M. Scott, Zirui Xu

    Abstract: We study the minimizers of a degenerate case of the Ohta-Kawasaki energy, defined as the sum of the perimeter and a Coulombic nonlocal term. We start by investigating radially symmetric candidates which give us insights into the asymptotic behaviors of energy minimizers in the large mass limit. In order to numerically study the problems that are analytically challenging, we propose a phase-field r…

    Submitted 23 January, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: 51 pages; 14 figures; submitted for publication; minor typos corrected

    MSC Class: 49Q20 (Primary) 35Q92; 35B36 (Secondary)

  34. arXiv:2401.02954  [pdf, other]

    cs.CL cs.AI cs.LG

    DeepSeek LLM: Scaling Open-Source Language Models with Longtermism

    Authors: DeepSeek-AI, Xiao Bi, Deli Chen, Guanting Chen, Shanhuang Chen, Damai Dai, Chengqi Deng, Honghui Ding, Kai Dong, Qiushi Du, Zhe Fu, Huazuo Gao, Kaige Gao, Wenjun Gao, Ruiqi Ge, Kang Guan, Daya Guo, Jianzhong Guo, Guangbo Hao, Zhewen Hao, Ying He, Wenjie Hu, Panpan Huang, Erhang Li, et al. (63 additional authors not shown)

    Abstract: The rapid development of open-source large language models (LLMs) has been truly remarkable. However, the scaling law described in previous literature presents varying conclusions, which casts a dark cloud over scaling LLMs. We delve into the study of scaling laws and present our distinctive findings that facilitate scaling of large scale models in two commonly used open-source configurations, 7B…

    Submitted 5 January, 2024; originally announced January 2024.

  35. arXiv:2312.08926   

    cs.AI cs.CL

    Modeling Complex Mathematical Reasoning via Large Language Model based MathAgent

    Authors: Haoran Liao, Qinyi Du, Shaohua Hu, Hao He, Yanyan Xu, Jidong Tian, Yaohui Jin

    Abstract: Large language models (LLMs) face challenges in solving complex mathematical problems that require comprehensive capacities to parse the statements, associate domain knowledge, perform compound logical reasoning, and integrate the intermediate rationales. Tackling all these problems once could be arduous for LLMs, thus leading to confusion in generation. In this work, we explore the potential of e…

    Submitted 16 December, 2023; v1 submitted 14 December, 2023; originally announced December 2023.

    Comments: There are unfair comparisons on miniF2F. This will be fixed in the future

  36. arXiv:2311.15653  [pdf, other]

    cs.CL

    MoDS: Model-oriented Data Selection for Instruction Tuning

    Authors: Qianlong Du, Chengqing Zong, Jiajun Zhang

    Abstract: Instruction tuning has become the de facto method to equip large language models (LLMs) with the ability of following user instructions. Usually, hundreds of thousands or millions of instruction-following pairs are employed to fine-tune the foundation LLMs. Recently, some studies show that a small number of high-quality instruction data is enough. However, how to select appropriate instruction dat…

    Submitted 27 November, 2023; originally announced November 2023.

  37. arXiv:2311.15173  [pdf, other]

    cond-mat.mtrl-sci

    Stretched Non-negative Matrix Factorization

    Authors: Ran Gu, Yevgeny Rakita, Ling Lan, Zach Thatcher, Gabrielle E. Kamm, Daniel O'Nolan, Brennan Mcbride, Allison Wustrow, James R. Neilson, Karena W. Chapman, Qiang Du, Simon J. L. Billinge

    Abstract: An algorithm is described and tested that carries out a non-negative matrix factorization (NMF) ignoring any stretching of the signal along the axis of the independent variable. This extended NMF model is called StretchedNMF. Variability in a set of signals due to this stretching is then ignored in the decomposition. This can be used, for example, to study sets of powder diffraction data collected…

    Submitted 25 November, 2023; originally announced November 2023.

    Comments: 39 pages, 16 figures

  38. arXiv:2311.04442  [pdf, other]

    eess.IV cs.CV

    SS-MAE: Spatial-Spectral Masked Auto-Encoder for Multi-Source Remote Sensing Image Classification

    Authors: Junyan Lin, Feng Gao, Xiaocheng Shi, Junyu Dong, Qian Du

    Abstract: Masked image modeling (MIM) is a highly popular and effective self-supervised learning method for image understanding. Existing MIM-based methods mostly focus on spatial feature modeling, neglecting spectral feature modeling. Meanwhile, existing MIM-based methods use Transformer for feature extraction, some local or high-frequency information may get lost. To this end, we propose a spatial-spectra…

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: IEEE TGRS 2023

  39. arXiv:2311.01149  [pdf, other]

    cs.CL

    ChineseWebText: Large-scale High-quality Chinese Web Text Extracted with Effective Evaluation Model

    Authors: Jianghao Chen, Pu Jian, Tengxiao Xi, Dongyi Yi, Qianlong Du, Chenglin Ding, Guibo Zhu, Chengqing Zong, Jinqiao Wang, Jiajun Zhang

    Abstract: During the development of large language models (LLMs), the scale and quality of the pre-training data play a crucial role in shaping LLMs' capabilities. To accelerate the research of LLMs, several large-scale datasets, such as C4 [1], Pile [2], RefinedWeb [3] and WanJuan [4], have been released to the public. However, most of the released corpus focus mainly on English, and there is still lack of…

    Submitted 10 November, 2023; v1 submitted 2 November, 2023; originally announced November 2023.

  40. arXiv:2310.16051  [pdf]

    cond-mat.supr-con

    Sketched Nanoscale KTaO3-Based Superconducting Quantum Interference Device

    Authors: Muqing Yu, Nicholas Hougland, Qianheng Du, Junyi Yang, Sayanwita Biswas, Ranjani Ramachandran, Dengyu Yang, Anand Bhattacharya, David Pekker, Patrick Irvin, Jeremy Levy

    Abstract: The discovery of two-dimensional superconductivity in LaAlO3/KTaO3 (111) and (110) interfaces has raised significant interest in this system. In this manuscript we report the first successful fabrication of a superconducting quantum interference device (DC-SQUID) in the KTO system. The key device elements, superconducting weak links, are created by conductive atomic force microscope (c-AFM) lithog…

    Submitted 5 February, 2024; v1 submitted 24 October, 2023; originally announced October 2023.

    Comments: 40 pages, 16 figures

  41. arXiv:2310.15509  [pdf, other]

    physics.acc-ph

    Dual frequency master oscillator generation and distribution for ALS and ALS-U

    Authors: Shreeharshini Dharanesh Murthy, Angel Jurado, Michael Betz, Qiang Du, Benjamin Flugstad

    Abstract: The ongoing work to upgrade ALS to ALS-U demands strict RF requirements such as low jitter and low spurs frequency reference to meet its accelerator and science goals. A low phase noise dual frequency Master Oscillator (MO), where the two frequencies are related by a fractional ratio of 608/609 and flexible divide by four frequency outputs has been consolidated into a single chassis. Optical fiber…

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Poster presented at LLRF Workshop 2023 (LLRF2023, arXiv: 2310.03199)

    Report number: LLRF2023/15

  42. arXiv:2310.06144  [pdf]

    cond-mat.mtrl-sci physics.app-ph physics.optics

    Ions-induced Epitaxial Growth of Perovskite Nanocomposites for Highly Efficient Light-Emitting Diodes with EQE Exceeding 30%

    Authors: Zhaohui Xing, Qing Du, Peiyuan Pang, Guangrong Jin, Tanghao Liu, Yang Shen, Dengliang Zhang, Bufan Yu, Yue Liang, Jianxin Tang, Lei Wang, Guichuang Xing, Jiangshan Chen, Dongge Ma

    Abstract: Metal halide perovskites, a class of cost-effective semiconductor materials, are of great interest for modern and upcoming display technologies that prioritize light-emitting diodes (LEDs) with high efficiency and excellent color purity. The prevailing approach to achieving efficient luminescence from perovskites is enhancing the exciton binding effect and confining carriers by reducing their dime…

    Submitted 2 March, 2024; v1 submitted 9 October, 2023; originally announced October 2023.

  43. arXiv:2309.12010  [pdf, other]

    eess.IV cs.CV

    Convolution and Attention Mixer for Synthetic Aperture Radar Image Change Detection

    Authors: Haopeng Zhang, Zijing Lin, Feng Gao, Junyu Dong, Qian Du, Heng-Chao Li

    Abstract: Synthetic aperture radar (SAR) image change detection is a critical task and has received increasing attention in the remote sensing community. However, existing SAR change detection methods are mainly based on convolutional neural networks (CNNs), with limited consideration of global attention mechanisms. In this letter, we explore a Transformer-like architecture for SAR change detection to incorpo…

    Submitted 21 September, 2023; originally announced September 2023.

    Comments: Accepted by IEEE GRSL

  44. arXiv:2309.10352  [pdf, ps, other]

    math.AP math.NA

    $Γ$-convergence of Nonlocal Dirichlet Energies With Penalty Formulations of Dirichlet Boundary Data

    Authors: Weiye Gan, Qiang Du, Zuoqiang Shi

    Abstract: We study nonlocal Dirichlet energies associated with a class of nonlocal diffusion models on a bounded domain subject to the conventional local Dirichlet boundary condition. The Dirichlet boundary condition is imposed through a specifically designed penalty formulation. We prove that the nonlocal Dirichlet energies with the penalty terms converge to local Dirichlet energies with Dirichlet boundary…

    Submitted 19 September, 2023; originally announced September 2023.

  45. arXiv:2308.13906  [pdf, other]

    eess.SP cs.LG

    A Two-Dimensional Deep Network for RF-based Drone Detection and Identification Towards Secure Coverage Extension

    Authors: Zixiao Zhao, Qinghe Du, Xiang Yao, Lei Lu, Shijiao Zhang

    Abstract: As drones become increasingly prevalent in human life, they also raise security concerns such as unauthorized access and control, as well as collisions and interference with manned aircraft. Therefore, the ability to accurately detect and identify different drones holds significant implications for coverage extension. Assisted by machine learning, radio frequency (RF) detection c…

    Submitted 26 August, 2023; originally announced August 2023.

  46. arXiv:2308.05180  [pdf, other]

    math.AP

    Nonlocal problems with local boundary conditions II: Green's identities and regularity of solutions

    Authors: James M. Scott, Qiang Du

    Abstract: We study nonlocal integral equations on bounded domains with finite-range nonlocal interactions that are localized at the boundary. We establish a Green's identity for the nonlocal operator that recovers the classical boundary integral, which, along with the variational analysis established previously, leads to the well-posedness of these nonlocal problems with various types of classical local bou…

    Submitted 9 August, 2023; originally announced August 2023.

    MSC Class: 45K05; 35J20; 46E35

  47. arXiv:2308.04386  [pdf, other]

    cs.CL

    Learning Evaluation Models from Large Language Models for Sequence Generation

    Authors: Chenglong Wang, Hang Zhou, Kaiyan Chang, Tongran Liu, Chunliang Zhang, Quan Du, Tong Xiao, Jingbo Zhu

    Abstract: Large language models achieve state-of-the-art performance on sequence generation evaluation, but typically have a large number of parameters. This poses a computational challenge when applying their evaluation capability at scale. To overcome the challenge, in this paper, we propose ECT, an evaluation capability transfer method, to transfer the evaluati…

    Submitted 8 August, 2023; originally announced August 2023.

  48. arXiv:2308.03220  [pdf]

    cond-mat.mtrl-sci

    A15 Phase Ta3Sb Thin Films: Direct Synthesis and Giant Spin-Orbit Effects

    Authors: J. S. Jiang, Qianheng Du, Ulrich Welp, Ramakanta Chapai, Hanu Arava, Yuzi Liu, Yue Li, John Pearson, Anand Bhattacharya, Hyowon Park

    Abstract: We use co-sputtering to directly synthesize thin films of the A15 phase intermetallic compound Ta3Sb, which has been predicted to have a giant spin Hall conductivity. We identify a large window of Ta:Sb flux ratio that stabilizes single-phase A15 Ta3Sb. Composition analyses of these films show a Ta:Sb atomic ratio of 4:1, which is consistent with the known Ta-Sb phase diagram. The spin Hall conduc…

    Submitted 6 August, 2023; originally announced August 2023.

  49. arXiv:2308.01969  [pdf]

    physics.app-ph

    Complete mode conversion for elastic waves reflected by elastic metamaterial slab with double hexapole resonances

    Authors: Di Liu, Wenjie Yu, Qiujiao Du, Fengming Liu, Pai Peng

    Abstract: In this study, we investigate the phenomenon of mode conversion in elastic bulk waves using coupled hexapole resonances. A metamaterial slab is proposed enabling the complete conversion between longitudinal and transverse modes. Each unit of the elastic metamaterial slab comprises a pair of scatterers, and their relative direction is oriented at an oblique angle. The interaction between the couple…

    Submitted 3 August, 2023; originally announced August 2023.

  50. arXiv:2307.14340  [pdf, ps, other]

    cond-mat.mes-hall quant-ph

    Non-Hermitian tearing by dissipation

    Authors: Qian Du, Xin-Ran Ma, Su-Peng Kou

    Abstract: In this paper, we study non-Hermitian systems under dissipation and derive the effective 2×2 Hamiltonian in k-space by reducing the N×N Hamiltonian in real space. It is discovered that the energy band shows an imaginary line gap. To describe these phenomena, we propose the theory of "non-Hermitian tearing", in which the tearability we define reveals a continuous phase transition a…

    Submitted 24 June, 2024; v1 submitted 26 July, 2023; originally announced July 2023.