Zum Hauptinhalt springen

Showing 1–50 of 784 results for author: Gao, W

.
  1. arXiv:2408.14803  [pdf, other

    math.NA

    Spherical quasi-interpolation using scaled zonal kernels

    Authors: Zhengjie Sun, Wenwu Gao, Xingping Sun

    Abstract: We propose and study a new quasi-interpolation method on spheres featuring the following two-phase construction and analysis. In Phase I, we analyze and characterize a large family of zonal kernels (e.g., the spherical version of Poisson kernel, Gaussian, compactly-supported radial kernels), so that the underlying spherical convolution operators (upon the introduction of a scaling parameter) attai… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

    MSC Class: 43A90; 41A25; 41A55; 65D12; 65D32

  2. arXiv:2408.14158  [pdf, other

    cs.DC cs.AI

    Fire-Flyer AI-HPC: A Cost-Effective Software-Hardware Co-Design for Deep Learning

    Authors: Wei An, Xiao Bi, Guanting Chen, Shanhuang Chen, Chengqi Deng, Honghui Ding, Kai Dong, Qiushi Du, Wenjun Gao, Kang Guan, Jianzhong Guo, Yongqiang Guo, Zhe Fu, Ying He, Panpan Huang, Jiashi Li, Wenfeng Liang, Xiaodong Liu, Xin Liu, Yiyuan Liu, Yuxuan Liu, Shanghao Lu, Xuan Lu, Xiaotao Nie, Tian Pei , et al. (27 additional authors not shown)

    Abstract: The rapid progress in Deep Learning (DL) and Large Language Models (LLMs) has exponentially increased demands of computational power and bandwidth. This, combined with the high costs of faster computing chips and interconnects, has significantly inflated High Performance Computing (HPC) construction costs. To address these challenges, we introduce the Fire-Flyer AI-HPC architecture, a synergistic… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: This is the preprint version of the paper accepted for presentation at the 2024 International Conference for High Performance Computing, Networking, Storage, and Analysis (SC'24). \c{opyright} 2024 IEEE. Personal use of this material is permitted. For other uses, permission from IEEE must be obtained. Please refer to IEEE Xplore for the final published version

  3. arXiv:2408.14110  [pdf

    cond-mat.mtrl-sci physics.optics

    Room-temperature Optically Detected Magnetic Resonance of Telecom Single Photon Emitters in GaN

    Authors: John J. H. Eng, Zhengzhi Jiang, Max Meunier, Abdullah Rasmita, Haoran Zhang, Yuzhe Yang, Feifei Zhou, Hongbing Cai, Zhaogang Dong, Jesús Zúñiga Pérez, Weibo Gao

    Abstract: Solid-state defects susceptible of spin manipulation hold great promise for scalable quantum technology. To broaden their utility, operating at room temperature and emitting in the telecom wavelength range are desired, eliminating cryogenic requirements and leveraging existing optical fiber infrastructure for transmitting the quantum information. To that end, we report that telecom single photon e… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

  4. arXiv:2408.12158  [pdf, other

    cs.CE cs.CY

    Could Bibliometrics Reveal Top Science and Technology Achievements and Researchers? The Case for Evaluatology-based Science and Technology Evaluation

    Authors: Guoxin Kang, Wanling Gao, Lei Wang, Chunjie Luo, Hainan Ye, Qian He, Shaopeng Dai, Jianfeng Zhan

    Abstract: By utilizing statistical methods to analyze bibliographic data, bibliometrics faces inherent limitations in identifying the most significant science and technology achievements and researchers. To overcome this challenge, we present an evaluatology-based science and technology evaluation methodology. At the heart of this approach lies the concept of an extended evaluation condition, encompassing e… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: 18 pages, 8 figures, and 2 tables

  5. arXiv:2408.12077  [pdf, other

    eess.SP cs.CV cs.LG

    Through-the-Wall Radar Human Activity Micro-Doppler Signature Representation Method Based on Joint Boulic-Sinusoidal Pendulum Model

    Authors: Xiaopeng Yang, Weicheng Gao, Xiaodong Qu, Zeyu Ma, Hao Zhang

    Abstract: With the help of micro-Doppler signature, ultra-wideband (UWB) through-the-wall radar (TWR) enables the reconstruction of range and velocity information of limb nodes to accurately identify indoor human activities. However, existing methods are usually trained and validated directly using range-time maps (RTM) and Doppler-time maps (DTM), which have high feature redundancy and poor generalization… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: 17 pages, 14 figures, 7 tables, in IEEE Transactions on Microwave Theory and Techniques, 2024

    MSC Class: 94 ACM Class: I.5.1

  6. arXiv:2408.12063  [pdf, other

    stat.ML cs.AI cs.LG physics.ao-ph

    A Deconfounding Approach to Climate Model Bias Correction

    Authors: Wentao Gao, Jiuyong Li, Debo Cheng, Lin Liu, Jixue Liu, Thuc Duy Le, Xiaojing Du, Xiongren Chen, Yanchang Zhao, Yun Chen

    Abstract: Global Climate Models (GCMs) are crucial for predicting future climate changes by simulating the Earth systems. However, GCM outputs exhibit systematic biases due to model uncertainties, parameterization simplifications, and inadequate representation of complex climate phenomena. Traditional bias correction methods, which rely on historical observation data and statistical techniques, often neglec… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

  7. arXiv:2408.11492  [pdf, other

    cs.AI

    Estimating Peer Direct and Indirect Effects in Observational Network Data

    Authors: Xiaojing Du, Jiuyong Li, Debo Cheng, Lin Liu, Wentao Gao, Xiongren Chen

    Abstract: Estimating causal effects is crucial for decision-makers in many applications, but it is particularly challenging with observational network data due to peer interactions. Many algorithms have been proposed to estimate causal effects involving network data, particularly peer effects, but they often overlook the variety of peer effects. To address this issue, we propose a general setting which cons… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: AAAI

  8. arXiv:2408.11481  [pdf, other

    cs.CV

    E-Bench: Subjective-Aligned Benchmark Suite for Text-Driven Video Editing Quality Assessment

    Authors: Shangkun Sun, Xiaoyu Liang, Songlin Fan, Wenxu Gao, Wei Gao

    Abstract: Text-driven video editing has recently experienced rapid development. Despite this, evaluating edited videos remains a considerable challenge. Current metrics tend to fail to align with human perceptions, and effective quantitative metrics for video editing are still notably absent. To address this, we introduce E-Bench, a benchmark suite tailored to the assessment of text-driven video editing. Th… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

  9. arXiv:2408.10847  [pdf, other

    math.CO

    Inverse model for network construction: (δ(G), I' (G)) -> G

    Authors: Wei Gao, Yaojun Chen, Hainan Zhang

    Abstract: The isolated toughness variant is a salient parameter for measuring the vulnerability of networks, which is inherently related to fractional factors (used to characterize the feasibility of data transmission). The combination of minimum degree and the corresponding tight bound of isolated toughness variant for fractional factors provide reference standards for network construction. However, previo… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 22 pages, 7 figures

  10. arXiv:2408.09676  [pdf, other

    cs.CV

    Image-based Freeform Handwriting Authentication with Energy-oriented Self-Supervised Learning

    Authors: Jingyao Wang, Luntian Mou, Changwen Zheng, Wen Gao

    Abstract: Freeform handwriting authentication verifies a person's identity from their writing style and habits in messy handwriting data. This technique has gained widespread attention in recent years as a valuable tool for various fields, e.g., fraud prevention and cultural heritage protection. However, it still remains a challenging task in reality due to three reasons: (i) severe damage, (ii) complex hig… ▽ More

    Submitted 18 August, 2024; originally announced August 2024.

    Comments: Accepted by TMM

  11. arXiv:2408.08682  [pdf, other

    cs.AI cs.CL cs.CV

    LLM-PCGC: Large Language Model-based Point Cloud Geometry Compression

    Authors: Yuqi Ye, Wei Gao

    Abstract: The key to effective point cloud compression is to obtain a robust context model consistent with complex 3D data structures. Recently, the advancement of large language models (LLMs) has highlighted their capabilities not only as powerful generators for in-context learning and generation but also as effective compressors. These dual attributes of LLMs make them particularly well-suited to meet the… ▽ More

    Submitted 16 August, 2024; originally announced August 2024.

  12. arXiv:2408.08152  [pdf, other

    cs.CL cs.AI cs.LG cs.LO

    DeepSeek-Prover-V1.5: Harnessing Proof Assistant Feedback for Reinforcement Learning and Monte-Carlo Tree Search

    Authors: Huajian Xin, Z. Z. Ren, Junxiao Song, Zhihong Shao, Wanjia Zhao, Haocheng Wang, Bo Liu, Liyue Zhang, Xuan Lu, Qiushi Du, Wenjun Gao, Qihao Zhu, Dejian Yang, Zhibin Gou, Z. F. Wu, Fuli Luo, Chong Ruan

    Abstract: We introduce DeepSeek-Prover-V1.5, an open-source language model designed for theorem proving in Lean 4, which enhances DeepSeek-Prover-V1 by optimizing both training and inference processes. Pre-trained on DeepSeekMath-Base with specialization in formal mathematical languages, the model undergoes supervised fine-tuning using an enhanced formal theorem proving dataset derived from DeepSeek-Prover-… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

  13. arXiv:2408.05730  [pdf, other

    quant-ph

    Optimal overlapping tomography

    Authors: Kiara Hansenne, Rui Qu, Lisa T. Weinbrenner, Carlos de Gois, Haifei Wang, Yang Ming, Zhengning Yang, Paweł Horodecki, Weibo Gao, Otfried Gühne

    Abstract: Characterising large scale quantum systems is central for fundamental physics as well as for applications of quantum technologies. While a full characterisation requires exponentially increasing effort, focusing on application-relevant information can often lead to significantly simplified analysis. Overlapping tomography is such a scheme, which allows to obtain all the information contained in sp… ▽ More

    Submitted 11 August, 2024; originally announced August 2024.

  14. arXiv:2408.01404  [pdf, other

    physics.optics physics.app-ph

    Digitized Phase Change Material Heterostack for Diffractive Optical Neural Network

    Authors: Ruiyang Chen, Cunxi Yu, Weilu Gao

    Abstract: All-optical and fully reconfigurable diffractive optical neural network (DONN) architectures are promising for high-throughput and energy-efficient machine learning (ML) hardware accelerators for broad applications. However, current device and system implementations have limited performance. This work demonstrates a novel diffractive device architecture, which is named digitized heterostack and co… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

  15. arXiv:2408.00275  [pdf, other

    cs.RO

    A Reinforcement Learning Based Motion Planner for Quadrotor Autonomous Flight in Dense Environment

    Authors: Zhaohong Liu, Wenxuan Gao, Yinshuai Sun, Peng Dong

    Abstract: Quadrotor motion planning is critical for autonomous flight in complex environments, such as rescue operations. Traditional methods often employ trajectory generation optimization and passive time allocation strategies, which can limit the exploitation of the quadrotor's dynamic capabilities and introduce delays and inaccuracies. To address these challenges, we propose a novel motion planning fram… ▽ More

    Submitted 5 August, 2024; v1 submitted 1 August, 2024; originally announced August 2024.

  16. arXiv:2407.21283  [pdf, ps, other

    math.NA

    High-order quasi-interpolation with generalized Gaussian kernels restricted over tori

    Authors: Wenwu Gao, Zhengjie Sun, Changwei Wang

    Abstract: The paper proposes a novel and efficient quasi-interpolation scheme with high approximation order for periodic function approximation over tori. The resulting quasi-interpolation takes the form of Schoenberg's tensor-product generalized Gaussian kernels restricted over circles. Notably, theoretical analysis shows that it achieves the highest approximation order equal to the order of the generalize… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

    MSC Class: 41A30; 41A25; 42B05; 65D15

  17. arXiv:2407.20738  [pdf, other

    stat.ME

    A Local Modal Outer-Product-Gradient Estimator for Dimension Reduction

    Authors: Zheng Li, Chong Ding, Wei Gao

    Abstract: Sufficient dimension reduction (SDR) is a valuable approach for handling high-dimensional data. Outer Product Gradient (OPG) is an popular approach. However, because of focusing the mean regression function, OPG may ignore some directions of central subspace (CS) when the distribution of errors is symmetric about zero. The mode of a distribution can provide an important summary of data. A Local Mo… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

  18. arXiv:2407.20573  [pdf, other

    cs.DC

    Federated Learning as a Service for Hierarchical Edge Networks with Heterogeneous Models

    Authors: Wentao Gao, Omid Tavallaie, Shuaijun Chen, Albert Zomaya

    Abstract: Federated learning (FL) is a distributed Machine Learning (ML) framework that is capable of training a new global model by aggregating clients' locally trained models without sharing users' original data. Federated learning as a service (FLaaS) offers a privacy-preserving approach for training machine learning models on devices with various computational resources. Most proposed FL-based methods t… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

  19. arXiv:2407.19633  [pdf, other

    cs.AI

    OptiMUS-0.3: Using Large Language Models to Model and Solve Optimization Problems at Scale

    Authors: Ali AhmadiTeshnizi, Wenzhi Gao, Herman Brunborg, Shayan Talaei, Madeleine Udell

    Abstract: Optimization problems are pervasive in sectors from manufacturing and distribution to healthcare. However, most such problems are still solved heuristically by hand rather than optimally by state-of-the art solvers because the expertise required to formulate and solve these problems limits the widespread adoption of optimization tools and techniques. We introduce a Large Language Model (LLM)-based… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

    Comments: This paper documents OptiMUS-0.3, improving on OptiMUS-0.1 (arXiv:2310.06116) and OptiMUS-0.2 (arXiv:2402.10172). arXiv admin note: text overlap with arXiv:2402.10172

  20. arXiv:2407.17867  [pdf, other

    cond-mat.mes-hall

    Intrinsic Nonlinear Spin Hall Effect and Manipulation of Perpendicular Magnetization

    Authors: Hui Wang, Huiying Liu, Xukun Feng, Jin Cao, Weikang Wu, Shen Lai, Weibo Gao, Cong Xiao, Shengyuan A. Yang

    Abstract: We propose an intrinsic nonlinear spin Hall effect, which enables the generation of collinearly-polarized spin current in a large class of nonmagnetic materials with the corresponding linear response being symmetry-forbidden. This opens a new avenue for field-free switching of perpendicular magnetization, which is required for the next-generation information storage technology. We develop the micr… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

  21. arXiv:2407.17078  [pdf, other

    cs.RO

    Active Loop Closure for OSM-guided Robotic Mapping in Large-Scale Urban Environments

    Authors: Wei Gao, Zezhou Sun, Mingle Zhao, Cheng-Zhong Xu, Hui Kong

    Abstract: The autonomous mapping of large-scale urban scenes presents significant challenges for autonomous robots. To mitigate the challenges, global planning, such as utilizing prior GPS trajectories from OpenStreetMap (OSM), is often used to guide the autonomous navigation of robots for mapping. However, due to factors like complex terrain, unexpected body movement, and sensor noise, the uncertainty of t… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

  22. arXiv:2407.16131  [pdf, other

    cond-mat.mtrl-sci cs.LG physics.comp-ph

    Crystals with Transformers on Graphs, for Prediction of Unconventional Crystal Material Properties and the Benchmark

    Authors: Hongyi Wang, Ji Sun, Jinzhe Liang, Li Zhai, Zitian Tang, Zijian Li, Wei Zhai, Xusheng Wang, Weihao Gao, Sheng Gong, Bolong Huang, Hua Zhang

    Abstract: The ionic bonding across the lattice and ordered microscopic structures endow crystals with unique symmetry and determine their macroscopic properties. Unconventional crystals, in particular, exhibit non-traditional lattice structures or possess exotic physical properties, making them intriguing subjects for investigation. Therefore, to accurately predict the physical and chemical properties of cr… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

  23. arXiv:2407.15138  [pdf, other

    cs.CV

    D$^4$M: Dataset Distillation via Disentangled Diffusion Model

    Authors: Duo Su, Junjie Hou, Weizhi Gao, Yingjie Tian, Bowen Tang

    Abstract: Dataset distillation offers a lightweight synthetic dataset for fast network training with promising test accuracy. To imitate the performance of the original dataset, most approaches employ bi-level optimization and the distillation space relies on the matching architecture. Nevertheless, these approaches either suffer significant computational costs on large-scale datasets or experience performa… ▽ More

    Submitted 21 July, 2024; originally announced July 2024.

    Comments: Accepted to CVPR 2024

  24. arXiv:2407.14774  [pdf, other

    cs.CV cs.AI cs.GR

    Intelligent Artistic Typography: A Comprehensive Review of Artistic Text Design and Generation

    Authors: Yuhang Bai, Zichuan Huang, Wenshuo Gao, Shuai Yang, Jiaying Liu

    Abstract: Artistic text generation aims to amplify the aesthetic qualities of text while maintaining readability. It can make the text more attractive and better convey its expression, thus enjoying a wide range of application scenarios such as social media display, consumer electronics, fashion, and graphic design. Artistic text generation includes artistic text stylization and semantic typography. Artisti… ▽ More

    Submitted 20 July, 2024; originally announced July 2024.

    Comments: GitHub Page: https://github.com/williamyang1991/Awesome-Artistic-Typography/

  25. arXiv:2407.13985  [pdf

    cond-mat.mtrl-sci physics.comp-ph

    Cluster Sliding Ferroelectricity in Trilayer Quasi-Hexagonal C60

    Authors: Xuefei Wang, Yanhan Ren, Shi Qiu, Fan Zhang, Xueao Li, Junfeng Gao, Weiwei Gao, Jijun Zhao

    Abstract: Electric polarization typically originates from non-centrosymmetric charge distributions. Since chemical bonds between atoms of the same elements favor centrosymmetric crystal structures and symmetrically distributed electron charges, elemental ferroelectrics are extremely rare. In comparison to atoms, elemental clusters are less symmetric and typically have various preferred orientations in cryst… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: 5 figures

  26. arXiv:2407.13674  [pdf

    cond-mat.mes-hall

    Observation of Ferromagnetic Phase in the Second Moiré Band of Twisted MoTe2

    Authors: Liheng An, Haiyang Pan, Wen-Xuan Qiu, Naizhou Wang, Shihao Ru, Qinghai Tan, Xuran Dai, Xiangbin Cai, Qiuyu Shang, Xiufang Lu, Hao Jiang, Xiaodan Lyu, Kenji Watanabe, Takashi Taniguchi, Fengcheng Wu, Wei-bo Gao

    Abstract: Flat bands and electron correlation in moiré lattices give rise to many exotic phases, including Mott insulators, superconductivity, and topological states. Within the first moiré band, integer and fractional quantum anomalous Hall effects have been observed in twisted bilayer MoTe2 (tMoTe2) at one hole doping and fractional doping per moiré unit cell, respectively. When the second moiré band is f… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: Main text: 13 pages, 5 figures. Supplementary: 11 pages, 15 figures

  27. arXiv:2407.10975  [pdf

    cs.OH cs.AI cs.CL

    Stream State-tying for Sign Language Recognition

    Authors: Jiyong Ma, Wen Gao, Chunli Wang

    Abstract: In this paper, a novel approach to sign language recognition based on state tying in each of data streams is presented. In this framework, it is assumed that hand gesture signal is represented in terms of six synchronous data streams, i.e., the left/right hand position, left/right hand orientation and left/right handshape. This approach offers a very accurate representation of the sign space and k… ▽ More

    Submitted 21 April, 2024; originally announced July 2024.

  28. arXiv:2407.10157  [pdf, other

    eess.IV cs.CV

    SACNet: A Spatially Adaptive Convolution Network for 2D Multi-organ Medical Segmentation

    Authors: Lin Zhang, Wenbo Gao, Jie Yi, Yunyun Yang

    Abstract: Multi-organ segmentation in medical image analysis is crucial for diagnosis and treatment planning. However, many factors complicate the task, including variability in different target categories and interference from complex backgrounds. In this paper, we utilize the knowledge of Deformable Convolution V3 (DCNv3) and multi-object segmentation to optimize our Spatially Adaptive Convolution Network… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

  29. arXiv:2407.09315  [pdf, other

    physics.comp-ph math-ph

    RBMD: A molecular dynamics package enabling to simulate 10 million all-atom particles in a single graphics processing unit

    Authors: Weihang Gao, Teng Zhao, Yongfa Guo, Jiuyang Liang, Huan Liu, Maoying Luo, Zedong Luo, Wei Qin, Yichao Wang, Qi Zhou, Shi Jin, Zhenli Xu

    Abstract: This paper introduces a random-batch molecular dynamics (RBMD) package for fast simulations of particle systems at the nano/micro scale. Different from existing packages, the RBMD uses random batch methods for nonbonded interactions of particle systems. The long-range part of Coulomb interactions is calculated in Fourier space by the random batch Ewald algorithm, which achieves linear complexity a… ▽ More

    Submitted 22 August, 2024; v1 submitted 12 July, 2024; originally announced July 2024.

    Comments: 26 pages, 8 figures

  30. arXiv:2407.09254  [pdf, other

    eess.SP

    Power Optimization and Deep Learning for Channel Estimation of Active IRS-Aided IoT

    Authors: Yan Wang, Feng Shu, Rongen Dong, Wei Gao, Qi Zhang, Jiajia Liu

    Abstract: In this paper, channel estimation of an active intelligent reflecting surface (IRS) aided uplink Internet of Things (IoT) network is investigated. Firstly, the least square (LS) estimators for the direct channel and the cascaded channel are presented, respectively. The corresponding mean square errors (MSE) of channel estimators are derived. Subsequently, in order to evaluate the influence of adju… ▽ More

    Submitted 15 July, 2024; v1 submitted 12 July, 2024; originally announced July 2024.

  31. arXiv:2407.08744  [pdf, ps, other

    cs.NE cs.AI cs.LG

    Toward Efficient Deep Spiking Neuron Networks:A Survey On Compression

    Authors: Hui Xie, Ge Yang, Wenjuan Gao

    Abstract: With the rapid development of deep learning, Deep Spiking Neural Networks (DSNNs) have emerged as promising due to their unique spike event processing and asynchronous computation. When deployed on neuromorphic chips, DSNNs offer significant power advantages over Deep Artificial Neural Networks (DANNs) and eliminate time and energy consuming multiplications due to the binary nature of spikes (0 or… ▽ More

    Submitted 3 June, 2024; originally announced July 2024.

  32. arXiv:2407.08554  [pdf, other

    cs.AI cs.HC

    Establishing Rigorous and Cost-effective Clinical Trials for Artificial Intelligence Models

    Authors: Wanling Gao, Yunyou Huang, Dandan Cui, Zhuoming Yu, Wenjing Liu, Xiaoshuang Liang, Jiahui Zhao, Jiyue Xie, Hao Li, Li Ma, Ning Ye, Yumiao Kang, Dingfeng Luo, Peng Pan, Wei Huang, Zhongmou Liu, Jizhong Hu, Gangyuan Zhao, Chongrong Jiang, Fan Huang, Tianyi Wei, Suqin Tang, Bingjie Xia, Zhifei Zhang, Jianfeng Zhan

    Abstract: A profound gap persists between artificial intelligence (AI) and clinical practice in medicine, primarily due to the lack of rigorous and cost-effective evaluation methodologies. State-of-the-art and state-of-the-practice AI model evaluations are limited to laboratory studies on medical datasets or direct clinical trials with no or solely patient-centered controls. Moreover, the crucial role of cl… ▽ More

    Submitted 28 July, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

    Comments: 24 pages

  33. arXiv:2407.07723  [pdf, other

    cs.IT cs.AI

    Understanding is Compression

    Authors: Ziguang Li, Chao Huang, Xuliang Wang, Haibo Hu, Cole Wyeth, Dongbo Bu, Quan Yu, Wen Gao, Xingwu Liu, Ming Li

    Abstract: Modern data compression methods are slowly reaching their limits after 80 years of research, millions of papers, and wide range of applications. Yet, the extravagant 6G communication speed requirement raises a major open question for revolutionary new ideas of data compression. We have previously shown all understanding or learning are compression, under reasonable assumptions. Large language mo… ▽ More

    Submitted 20 August, 2024; v1 submitted 23 June, 2024; originally announced July 2024.

  34. arXiv:2407.06886  [pdf, other

    cs.CV cs.AI cs.LG cs.MA cs.RO

    Aligning Cyber Space with Physical World: A Comprehensive Survey on Embodied AI

    Authors: Yang Liu, Weixing Chen, Yongjie Bai, Xiaodan Liang, Guanbin Li, Wen Gao, Liang Lin

    Abstract: Embodied Artificial Intelligence (Embodied AI) is crucial for achieving Artificial General Intelligence (AGI) and serves as a foundation for various applications that bridge cyberspace and the physical world. Recently, the emergence of Multi-modal Large Models (MLMs) and World Models (WMs) have attracted significant attention due to their remarkable perception, interaction, and reasoning capabilit… ▽ More

    Submitted 25 August, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

    Comments: The first comprehensive review of Embodied AI in the era of MLMs, 39 pages. We also provide the paper list for Embodied AI: https://github.com/HCPLab-SYSU/Embodied_AI_Paper_List

  35. arXiv:2407.06334  [pdf, other

    cs.AI q-bio.QM

    Double-Ended Synthesis Planning with Goal-Constrained Bidirectional Search

    Authors: Kevin Yu, Jihye Roh, Ziang Li, Wenhao Gao, Runzhong Wang, Connor W. Coley

    Abstract: Computer-aided synthesis planning (CASP) algorithms have demonstrated expert-level abilities in planning retrosynthetic routes to molecules of low to moderate complexity. However, current search methods assume the sufficiency of reaching arbitrary building blocks, failing to address the common real-world constraint where using specific molecules is desired. To this end, we present a formulation of… ▽ More

    Submitted 8 July, 2024; originally announced July 2024.

    Comments: 10 pages main, 4 figures

  36. arXiv:2407.05677  [pdf, other

    eess.IV

    PCAC-GAN: A Sparse-Tensor-Based Generative Adversarial Network for 3D Point Cloud Attribute Compression

    Authors: Xiaolong Mao, Hui Yuan, Xin Lu, Raouf Hamzaoui, Wei Gao

    Abstract: Learning-based methods have proven successful in compressing geometric information for point clouds. For attribute compression, however, they still lag behind non-learning-based methods such as the MPEG G-PCC standard. To bridge this gap, we propose a novel deep learning-based point cloud attribute compression method that uses a generative adversarial network (GAN) with sparse convolution layers.… ▽ More

    Submitted 19 July, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

    Comments: 14 pages, 5 figures, Accepted by Computational Visual Media

    MSC Class: 94J20 ACM Class: I.4.2

    Journal ref: Computational Visual Media, 2024

  37. arXiv:2407.05458  [pdf, other

    cs.AI

    A Survey of Models for Cognitive Diagnosis: New Developments and Future Directions

    Authors: Fei Wang, Weibo Gao, Qi Liu, Jiatong Li, Guanhao Zhao, Zheng Zhang, Zhenya Huang, Mengxiao Zhu, Shijin Wang, Wei Tong, Enhong Chen

    Abstract: Cognitive diagnosis has been developed for decades as an effective measurement tool to evaluate human cognitive status such as ability level and knowledge mastery. It has been applied to a wide range of fields including education, sport, psychological diagnosis, etc. By providing better awareness of cognitive status, it can serve as the basis for personalized services such as well-designed medical… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  38. arXiv:2407.04514  [pdf, other

    physics.app-ph cond-mat.mtrl-sci

    Giant Second Harmonic Generation from Wafer-Scale Aligned Chiral Carbon Nanotubes

    Authors: Rui Xu, Jacques Doumani, Viktor Labuntsov, Nina Hong, Anna-Christina Samaha, Weiran Tu, Fuyang Tay, Elizabeth Blackert, Jiaming Luo, Mario El Tahchi, Weilu Gao, Jun Lou, Yohei Yomogida, Kazuhiro Yanagi, Riichiro Saito, Vasili Perebeinos, Andrey Baydin, Junichiro Kono, Hanyu Zhu

    Abstract: Chiral carbon nanotubes (CNTs) are direct-gap semiconductors with optical properties governed by one-dimensional excitons with enormous oscillator strengths. Each species of chiral CNTs has an enantiomeric pair of left- and right-handed CNTs with nearly identical properties, but enantiomer-dependent phenomena can emerge, especially in nonlinear optical processes. Theoretical studies have predicted… ▽ More

    Submitted 5 July, 2024; originally announced July 2024.

  39. arXiv:2407.03978  [pdf, other

    cs.CL cs.AI

    Benchmarking Complex Instruction-Following with Multiple Constraints Composition

    Authors: Bosi Wen, Pei Ke, Xiaotao Gu, Lindong Wu, Hao Huang, Jinfeng Zhou, Wenchuang Li, Binxin Hu, Wendy Gao, Jiaxin Xu, Yiming Liu, Jie Tang, Hongning Wang, Minlie Huang

    Abstract: Instruction following is one of the fundamental capabilities of large language models (LLMs). As the ability of LLMs is constantly improving, they have been increasingly applied to deal with complex human instructions in real-world scenarios. Therefore, how to evaluate the ability of complex instruction-following of LLMs has become a critical research problem. Existing benchmarks mainly focus on m… ▽ More

    Submitted 11 July, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

    Comments: 20 pages, 7 figures

  40. arXiv:2407.03716  [pdf, other

    eess.SY

    Prediction-Free Coordinated Dispatch of Microgrid: A Data-Driven Online Optimization Approach

    Authors: Kaidi Huang, Lin Cheng, Ning Qi, David Wenzhong Gao

    Abstract: The integration of renewable energy sources (RES) into microgrids poses challenges to reliable and economic operation due to the inherent uncertainty and volatility of RES. Contrary to the previous dispatch methods that require precise predictions of RES, this paper proposes a novel prediction-free and data-driven coordinated dispatch framework for reliable microgrid operations. In the offline sta… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  41. arXiv:2407.03122  [pdf, other

    cs.RO

    IntentionNet: Map-Lite Visual Navigation at the Kilometre Scale

    Authors: Wei Gao, Bo Ai, Joel Loo, Vinay, David Hsu

    Abstract: This work explores the challenges of creating a scalable and robust robot navigation system that can traverse both indoor and outdoor environments to reach distant goals. We propose a navigation system architecture called IntentionNet that employs a monolithic neural network as the low-level planner/controller, and uses a general interface that we call intentions to steer the controller. The paper… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  42. arXiv:2407.03014  [pdf

    physics.optics physics.app-ph quant-ph

    Dielectric Fano Nanoantennas for Enabling Sub-Nanosecond Lifetimes in NV-based Single Photon Emitters

    Authors: Shu An, Dmitry Kalashnikov, Wenqiao Shi, Zackaria Mahfoud, Ah Bian Chew, Yan Liu, Jing Wu, Di Zhu, Weibo Gao, Cheng-Wei Qiu, Victor Leong, Zhaogang Dong

    Abstract: Solid-state quantum emitters are essential sources of single photons, and enhancing their emission rates is of paramount importance for applications in quantum communications, computing, and metrology. One approach is to couple quantum emitters with resonant photonic nanostructures, where the emission rate is enhanced due to the Purcell effect. Dielectric nanoantennas are promising as they provide… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 20 pages, 4 figures

  43. arXiv:2407.00905  [pdf, other

    cs.CV

    Learning Robust 3D Representation from CLIP via Dual Denoising

    Authors: Shuqing Luo, Bowen Qu, Wei Gao

    Abstract: In this paper, we explore a critical yet under-investigated issue: how to learn robust and well-generalized 3D representation from pre-trained vision language models such as CLIP. Previous works have demonstrated that cross-modal distillation can provide rich and useful knowledge for 3D data. However, like most deep learning models, the resultant 3D learning network is still vulnerable to adversar… ▽ More

    Submitted 30 June, 2024; originally announced July 2024.

  44. arXiv:2406.16976  [pdf, other

    cs.NE cs.AI cs.LG physics.chem-ph

    Efficient Evolutionary Search Over Chemical Space with Large Language Models

    Authors: Haorui Wang, Marta Skreta, Cher-Tian Ser, Wenhao Gao, Lingkai Kong, Felix Strieth-Kalthoff, Chenru Duan, Yuchen Zhuang, Yue Yu, Yanqiao Zhu, Yuanqi Du, Alán Aspuru-Guzik, Kirill Neklyudov, Chao Zhang

    Abstract: Molecular discovery, when formulated as an optimization problem, presents significant computational challenges because optimization objectives can be non-differentiable. Evolutionary Algorithms (EAs), often used to optimize black-box objectives in molecular discovery, traverse chemical space by performing random mutations and crossovers, leading to a large number of expensive objective evaluations… ▽ More

    Submitted 2 July, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

  45. arXiv:2406.15132  [pdf, other

    cs.LG cs.AI

    Younger: The First Dataset for Artificial Intelligence-Generated Neural Network Architecture

    Authors: Zhengxin Yang, Wanling Gao, Luzhou Peng, Yunyou Huang, Fei Tang, Jianfeng Zhan

    Abstract: Designing and optimizing neural network architectures typically requires extensive expertise, starting with handcrafted designs and then manual or automated refinement. This dependency presents a significant barrier to rapid innovation. Recognizing the complexity of automatically generating neural network architecture from scratch, we introduce Younger, a pioneering dataset to advance this ambitio… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 31 pages, 29 figures, 11 tables

  46. arXiv:2406.14194  [pdf, other

    cs.CV cs.AI

    VLBiasBench: A Comprehensive Benchmark for Evaluating Bias in Large Vision-Language Model

    Authors: Jie Zhang, Sibo Wang, Xiangkui Cao, Zheng Yuan, Shiguang Shan, Xilin Chen, Wen Gao

    Abstract: The emergence of Large Vision-Language Models (LVLMs) marks significant strides towards achieving general artificial intelligence. However, these advancements are tempered by the outputs that often reflect biases, a concern not yet extensively investigated. Existing benchmarks are not sufficiently comprehensive in evaluating biases due to their limited data scale, single questioning format and nar… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  47. arXiv:2406.13190  [pdf, other

    physics.optics cond-mat.mtrl-sci

    A programmable wafer-scale chiroptical heterostructure of twisted aligned carbon nanotubes and phase change materials

    Authors: Jichao Fan, Ruiyang Chen, Minhan Lou, Haoyu Xie, Nina Hong, Yingheng Tang, Weilu Gao

    Abstract: The ability to design and dynamically control chiroptical responses in solid-state matter at wafer scale enables new opportunities in various areas. Here we present a full stack of computer-aided designs and experimental implementations of a dynamically programmable, unified, scalable chiroptical heterostructure containing twisted aligned one-dimensional (1D) carbon nanotubes (CNTs) and non-volati… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  48. arXiv:2406.12613  [pdf

    cond-mat.mtrl-sci

    Understanding the intrinsic framework of the Hall-Petch relationship of metals from the view of the electronic-structure level

    Authors: Xin Li, Wang Gao, Qing Jiang

    Abstract: The relationship between grain size and yield strength of metals follows the Hall-Petch relationship σ = σ0 + kd^-0.5; however, the specific physical factors that affect the coefficients σ0 and k of this relationship remain unclear. Here we propose the intrinsic descriptors to determine the Hall-Petch relation across different metals and alloys. Inspired by the tight-binding theory, we find that σ… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  49. arXiv:2406.11931  [pdf, other

    cs.SE cs.AI cs.LG

    DeepSeek-Coder-V2: Breaking the Barrier of Closed-Source Models in Code Intelligence

    Authors: DeepSeek-AI, Qihao Zhu, Daya Guo, Zhihong Shao, Dejian Yang, Peiyi Wang, Runxin Xu, Y. Wu, Yukun Li, Huazuo Gao, Shirong Ma, Wangding Zeng, Xiao Bi, Zihui Gu, Hanwei Xu, Damai Dai, Kai Dong, Liyue Zhang, Yishi Piao, Zhibin Gou, Zhenda Xie, Zhewen Hao, Bingxuan Wang, Junxiao Song, Deli Chen , et al. (15 additional authors not shown)

    Abstract: We present DeepSeek-Coder-V2, an open-source Mixture-of-Experts (MoE) code language model that achieves performance comparable to GPT4-Turbo in code-specific tasks. Specifically, DeepSeek-Coder-V2 is further pre-trained from an intermediate checkpoint of DeepSeek-V2 with additional 6 trillion tokens. Through this continued pre-training, DeepSeek-Coder-V2 substantially enhances the coding and mathe… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  50. arXiv:2406.09192  [pdf, other

    eess.SP

    Joint Power Allocation and Beamforming Design for Active IRS-Aided Directional Modulation Secure Systems

    Authors: Yifan Zhao, Xiaoyu Wang, Kaibo Zhou, Xuehui Wang, Yan Wang, Wei Gao, Ruiqi Liu, Feng Shu

    Abstract: Since the secrecy rate (SR) performance improvement obtained by secure directional modulation (DM) network is limited, an active intelligent reflective surface (IRS)-assisted DM network is considered to attain a high SR. To address the SR maximization problem, a novel method based on Lagrangian dual transform and closed-form fractional programming algorithm (LDT-CFFP) is proposed, where the soluti… ▽ More

    Submitted 25 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: Directional modulation, active intelligent reflective surface, Lagrangian dual transformation, fractional programming, power allocation