Zum Hauptinhalt springen

Showing 1–50 of 387 results for author: Zeng, L

.
  1. arXiv:2408.12164  [pdf

    cond-mat.mtrl-sci

    Excellent and CO$_2$$_{0.85}$Nd$_{0.1}$Cu$_{0.05}$O$_{2-δ}$-Nd$_x$Sr$_{1-x}$Fe$_{1-y}$Cu$_y$O$_{3-δ}$ dual-phase oxygen transport membranes

    Authors: Chao Zhang, Yue Zhu, Xiaopeng Wang, Yanhao Huang, Lingyong Zeng, Kuan Li, Peifeng Yu, Kangwang Wang, Longfu Li, Zaichen Xiang, Rui Chen, Xuefeng Zhu, Huixia Luo

    Abstract: Oxygen transport membranes(OTMs)have provided great opportunities in the last decades but are suffering from the trade-off effect between stability and oxygen permeability. Here, we report a group of new planar dual-phase mixed ionic-electronic conducting (MIEC) OTMs consisting of CO$_2$$_{0.85}$Nd$_{0.1}$Cu$_{0.05}$O$_2$ (CNCO) and Nd$_x$Sr$_{1-x}$Fe$_{1-y}$Cu$_y$O$_3$(NSFCO; $x = 0.4, 0.6$;… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: 36 pages, 6 figures

    Journal ref: Journal of Membrane Science, 2024,696,122485

  2. arXiv:2408.12160  [pdf

    cond-mat.mtrl-sci cond-mat.supr-con

    Mapping Hydrogen Evolution Activity Trends of V-based A15 Superconducting Alloys

    Authors: Peifeng Yu, Jie Zhan, Xiaobing Zhang, Kangwang Wang, Lingyong Zeng, Kuan Li, Chao Zhang, Longfu Li, Ying Liang, Kai Yan, Yan Sun, Huixia Luo

    Abstract: Exploring high-efficiency and low-cost electrocatalysts is valuable for water-splitting technologies. Recently, Si-group compounds have attracted increasing attention in electrocatalysis, considering the abundant Si-group elements on Earth. However, Si-group compounds for HER electrocatalysis have not been systematically studied. In this study, we unveil the activity trends of non-noble metal cata… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

    Comments: 25 pages, 5 figures

    Journal ref: Chemical Engineering Journal,2024, 488, 150961

  3. arXiv:2408.11387  [pdf

    cond-mat.supr-con

    Structural and Superconducting Properties in the Te-doped Spinel CuRh2Se4

    Authors: Kuan Li, Lingyong Zeng, Longfu Li, Rui Chen, Peifeng Yu, Kangwang Wang, Chao Zhang, Zaichen Xiang, Huixia Luo

    Abstract: In this paper, we discuss the impact of tellurium (Te) doping on the spinel superconductor CuRh2Se4. We conducted a comprehensive evaluation of the structural and superconducting properties of the system using various techniques, including X-ray diffraction (XRD), resistivity, magnetization, and specific heat measurements. Based on our XRD analysis, we found that the spinel superconductor CuRh2Se4… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: 25 pages, 6 figures, 1 table

    Journal ref: Journal of Alloys and Compounds, 2024, 995, 174756

  4. arXiv:2408.11373  [pdf

    cond-mat.mtrl-sci cond-mat.other

    Revealing the nontrivial topological surface states of catalysts for effective photochemical carbon dioxide conversion

    Authors: Kangwang Wang, Longfu Li, Peifeng Yu, Nannan Tang, Lingyong Zeng, Kuan Li, Chao Zhang, Rui Chen, Zaichen Xiang, Huichao Wang, Yongqing Cai, Kai Yan, Huixia Luo

    Abstract: Topological semimetals with protected surface states mark a new paradigm of research beyond the early landmarks of band-structure engineering, allowing fabrication of efficient catalyst to harness the rich metallic surface states to activate specific chemical processes. Herein, we demonstrate a facile solid-phase method for in-situ doping of Ir at the Os sites in the Os3Sn7, an alloy with topologi… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: 33 Pages, 6 Figures, 1 Table

    Journal ref: Applied Catalysis B: Environment and Energy,2024,358,124428

  5. arXiv:2408.11369  [pdf

    cond-mat.mtrl-sci cond-mat.other

    Non-trivial Topological Surface States Regulation of 1T-OsCoTe$_2$ Enables Selective C-C Coupling for Highly Efficient Photochemical CO$_2$ Reduction Toward C$_{2+}$ hydrocarbons

    Authors: Kangwang Wang, Mingjie Wu, Peifeng Yu, Hector F. Garces, Ying Liang, Longfu Li, Lingyong Zeng, Kuan Li, Chao Zhang, Kai Yan, Huixia Luo

    Abstract: Despite ongoing research, the rational design of nontrivial topological semimetal surface states for the selective photocatalytic CO$_2$ conversion into valuable products remains full of challenges. Herein, we present the synthesis of 1T-OsCoTe$_2$ for the photoreduction upgrading of CO$_2$ to tricarbon alkane C$_3$H$_8$,by the integration of experimental work and theory calculation. Experimental… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: 31 pages, 6 Figures

    Journal ref: Applied Catalysis B: Environment and Energy,2024,352,124058

  6. arXiv:2408.10746  [pdf, other

    cs.DC cs.AI cs.LG cs.NI

    Pluto and Charon: A Time and Memory Efficient Collaborative Edge AI Framework for Personal LLMs Fine-Tuning

    Authors: Bei Ouyang, Shengyuan Ye, Liekang Zeng, Tianyi Qian, Jingyi Li, Xu Chen

    Abstract: Large language models (LLMs) have unlocked a plethora of powerful applications at the network edge, such as intelligent personal assistants. Data privacy and security concerns have prompted a shift towards edge-based fine-tuning of personal LLMs, away from cloud reliance. However, this raises issues of computational intensity and resource scarcity, hindering training efficiency and feasibility. Wh… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: Accepted by The 53rd International Conference on Parallel Processing (ICPP'24)

  7. arXiv:2408.08015  [pdf, other

    cs.DC cs.AI cs.CV cs.LG cs.NI

    Asteroid: Resource-Efficient Hybrid Pipeline Parallelism for Collaborative DNN Training on Heterogeneous Edge Devices

    Authors: Shengyuan Ye, Liekang Zeng, Xiaowen Chu, Guoliang Xing, Xu Chen

    Abstract: On-device Deep Neural Network (DNN) training has been recognized as crucial for privacy-preserving machine learning at the edge. However, the intensive training workload and limited onboard computing resources pose significant challenges to the availability and efficiency of model training. While existing works address these challenges through native resource management optimization, we instead le… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: Accepted by The 30th Annual International Conference on Mobile Computing and Networking (MobiCom'24)

  8. arXiv:2408.05966  [pdf, other

    cs.CV cs.AI cs.GR cs.MM

    Freehand Sketch Generation from Mechanical Components

    Authors: Zhichao Liao, Di Huang, Heming Fang, Yue Ma, Fengyuan Piao, Xinghui Li, Long Zeng, Pingfa Feng

    Abstract: Drawing freehand sketches of mechanical components on multimedia devices for AI-based engineering modeling has become a new trend. However, its development is being impeded because existing works cannot produce suitable sketches for data-driven research. These works either generate sketches lacking a freehand style or utilize generative models not originally designed for this task resulting in poo… ▽ More

    Submitted 21 August, 2024; v1 submitted 12 August, 2024; originally announced August 2024.

    Comments: Published at ACM Multimedia (ACM MM) 2024

  9. arXiv:2408.03272  [pdf, other

    physics.plasm-ph

    Suppression of Edge Localized Modes in ITER Baseline Scenario in EAST using Edge Localized Magnetic Perturbations

    Authors: P. Xie, Y. Sun, M. Jia, A. Loarte, Y. Q. Liu, C. Ye, S. Gu, H. Sheng, Y. Liang, Q. Ma, H. Yang, C. A. Paz-Soldan, G. Deng, S. Fu, G. Chen, K. He, T. Jia, D. Lu, B. Lv, J. Qian, H. H. Wang, S. Wang, D. Weisberg, X. Wu, W. Xu , et al. (9 additional authors not shown)

    Abstract: We report the suppression of Type-I Edge Localized Modes (ELMs) in the EAST tokamak under ITER baseline conditions using $n = 4$ Resonant Magnetic Perturbations (RMPs), while maintaining energy confinement. Achieving RMP-ELM suppression requires a normalized plasma beta ($β_N$) exceeding 1.8 in a target plasma with $q_{95}\approx 3.1$ and tungsten divertors. Quasi-linear modeling shows high plasma… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

    Comments: 6 pages, 4 figures

  10. arXiv:2408.02839  [pdf, other

    stat.ML cs.LG

    Optimizing Cox Models with Stochastic Gradient Descent: Theoretical Foundations and Practical Guidances

    Authors: Lang Zeng, Weijing Tang, Zhao Ren, Ying Ding

    Abstract: Optimizing Cox regression and its neural network variants poses substantial computational challenges in large-scale studies. Stochastic gradient descent (SGD), known for its scalability in model optimization, has recently been adapted to optimize Cox models. Unlike its conventional application, which typically targets a sum of independent individual loss, SGD for Cox models updates parameters base… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

  11. Multibeam Hybrid Transmitarray Based on Polarization Rotating Metasurface With Reconfigurable Bidirectional Radiation

    Authors: Fan Qin, Yifei Liu, Chao Gu, Linfeng Zeng, Wenchi Cheng, Hailin Zhang, Steven Gao

    Abstract: This paper proposes a bidirectional multibeam hybrid transmitarray (HTA) employing a transmission polarization-rotating metasurface (TPRM). A novel configuration is introduced to facilitate bidirectional beam scanning by combining the transmitarray (TA) and folded-transmitarray (FTA). To accomplish the reconfiguration of both unidirectional and bidirectional radiation states in the +z, -z, and +/-… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

    Comments: 12 pages, 26 figures, published to TAP

  12. arXiv:2407.20898  [pdf, other

    cs.SE

    ThinkRepair: Self-Directed Automated Program Repair

    Authors: Xin Yin, Chao Ni, Shaohua Wang, Zhenhao Li, Limin Zeng, Xiaohu Yang

    Abstract: Though many approaches have been proposed for Automated Program Repair (APR) and indeed achieved remarkable performance, they still have limitations in fixing bugs that require analyzing and reasoning about the logic of the buggy program. Recently, large language models (LLMs) instructed by prompt engineering have attracted much attention for their powerful ability to address many kinds of tasks i… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: Accepted By ISSTA'24

  13. arXiv:2407.19694  [pdf

    cs.CV

    Structural damage detection via hierarchical damage information with volumetric assessment

    Authors: Isaac Osei Agyemang, Jianwen Chen, Liaoyuan Zeng, Isaac Adjei-Mensah, Daniel Acheampong, Gordon Owusu Boateng, Adu Asare Baffour

    Abstract: Image environments and noisy labels hinder deep learning-based inference models in structural damage detection. Post-detection, there is the challenge of reliance on manual assessments of detected damages. As a result, Guided-DetNet, characterized by Generative Attention Module (GAM), Hierarchical Elimination Algorithm (HEA), and Volumetric Contour Visual Assessment (VCVA), is proposed to mitigate… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

  14. arXiv:2407.18054  [pdf, other

    eess.IV cs.CV

    LKCell: Efficient Cell Nuclei Instance Segmentation with Large Convolution Kernels

    Authors: Ziwei Cui, Jingfeng Yao, Lunbin Zeng, Juan Yang, Wenyu Liu, Xinggang Wang

    Abstract: The segmentation of cell nuclei in tissue images stained with the blood dye hematoxylin and eosin (H$\&$E) is essential for various clinical applications and analyses. Due to the complex characteristics of cellular morphology, a large receptive field is considered crucial for generating high-quality segmentation. However, previous methods face challenges in achieving a balance between the receptiv… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

  15. arXiv:2407.17666  [pdf, other

    stat.ME

    Causal estimands and identification of time-varying effects in non-stationary time series from N-of-1 mobile device data

    Authors: Xiaoxuan Cai, Li Zeng, Charlotte Fowler, Lisa Dixon, Dost Ongur, Justin T. Baker, Jukka-Pekka Onnela, Linda Valeri

    Abstract: Mobile technology (mobile phones and wearable devices) generates continuous data streams encompassing outcomes, exposures and covariates, presented as intensive longitudinal or multivariate time series data. The high frequency of measurements enables granular and dynamic evaluation of treatment effect, revealing their persistence and accumulation over time. Existing methods predominantly focus on… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

  16. arXiv:2407.15342  [pdf, ps, other

    math.GR

    The finite basis problem for additively idempotent semirings of order four, I

    Authors: Miaomiao Ren, Junyang Liu, Lingli Zeng, Menglong Chen

    Abstract: We study the finite basis problem for 4-element additively idempotent semirings whose additive reducts are semilattices of height 1. Up to isomorphism, there are 58 such algebras. We show that 49 of them are finitely based and the remaining ones are nonfinitely based.

    Submitted 21 July, 2024; originally announced July 2024.

    Comments: 30pages

    MSC Class: 16Y60; 03C05; 08B05

  17. arXiv:2407.15320  [pdf, other

    cs.DC cs.AI cs.LG cs.NI

    Edge Graph Intelligence: Reciprocally Empowering Edge Networks with Graph Intelligence

    Authors: Liekang Zeng, Shengyuan Ye, Xu Chen, Xiaoxi Zhang, Ju Ren, Jian Tang, Yang Yang, Xuemin, Shen

    Abstract: Recent years have witnessed a thriving growth of computing facilities connected at the network edge, cultivating edge computing networks as a fundamental infrastructure for supporting miscellaneous intelligent services. Meanwhile, Artificial Intelligence frontiers have extrapolated Machine Learning to the graph domain and promoted Graph Intelligence (GI), which unlocks unprecedented ability in lea… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: 38 pages, 14 figures

  18. arXiv:2407.12857  [pdf, other

    cs.CL cs.DL cs.IR

    Automated Peer Reviewing in Paper SEA: Standardization, Evaluation, and Analysis

    Authors: Jianxiang Yu, Zichen Ding, Jiaqi Tan, Kangyang Luo, Zhenmin Weng, Chenghua Gong, Long Zeng, Renjing Cui, Chengcheng Han, Qiushi Sun, Zhiyong Wu, Yunshi Lan, Xiang Li

    Abstract: In recent years, the rapid increase in scientific papers has overwhelmed traditional review mechanisms, resulting in varying quality of publications. Although existing methods have explored the capabilities of Large Language Models (LLMs) for automated scientific reviewing, their generated contents are often generic or partial. To address the issues above, we introduce an automated paper reviewing… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  19. Numerical Analysis on the Spatiotemporal Characteristics of the Portevin-Le Chatelier Effect in Ti-12Mo Alloy

    Authors: Shiyuan Luo, Yongxin Jiang, Sandrine Thuillier, Philippe Castany, Liangcai Zeng

    Abstract: A simplified 3D FE model based on McCormick's model is developed to numerically predict the spatiotemporal behaviors of the PLC effect in Ti-12Mo alloy tensile tests at 350 degrees C with strain rates from the order of $10^{-4}$ s$^{-1}$ to $10^{-2}$ s$^{-1}$. The material parameter identification procedure is firstly presented in details, and the simulated results are highly consistent with exper… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Journal ref: Metals and Materials International, 2023, 29 (2), pp.269-279

  20. arXiv:2407.08348  [pdf, other

    cs.AI cs.CL cs.LG

    Skywork-Math: Data Scaling Laws for Mathematical Reasoning in Large Language Models -- The Story Goes On

    Authors: Liang Zeng, Liangjun Zhong, Liang Zhao, Tianwen Wei, Liu Yang, Jujie He, Cheng Cheng, Rui Hu, Yang Liu, Shuicheng Yan, Han Fang, Yahui Zhou

    Abstract: In this paper, we investigate the underlying factors that potentially enhance the mathematical reasoning capabilities of large language models (LLMs). We argue that the data scaling law for math reasoning capabilities in modern LLMs is far from being saturated, highlighting how the model's quality improves with increases in data quantity. To support this claim, we introduce the Skywork-Math model… ▽ More

    Submitted 17 July, 2024; v1 submitted 11 July, 2024; originally announced July 2024.

  21. arXiv:2406.19791  [pdf, other

    cs.RO

    Mobile Robot Oriented Large-Scale Indoor Dataset for Dynamic Scene Understanding

    Authors: Yifan Tang, Cong Tai, Fangxing Chen, Wanting Zhang, Tao Zhang, Xueping Liu, Yongjin Liu, Long Zeng

    Abstract: Most existing robotic datasets capture static scene data and thus are limited in evaluating robots' dynamic performance. To address this, we present a mobile robot oriented large-scale indoor dataset, denoted as THUD (Tsinghua University Dynamic) robotic dataset, for training and evaluating their dynamic scene understanding algorithms. Specifically, the THUD dataset construction is first detailed,… ▽ More

    Submitted 30 June, 2024; v1 submitted 28 June, 2024; originally announced June 2024.

    Comments: This version has been accepted by ICRA2024 and the dataset has been published, where the link can be found in the paper

    Journal ref: IEEE International Conference on Robotics & Automation,2024

  22. arXiv:2406.19613  [pdf, other

    cs.DC

    Online Optimization of DNN Inference Network Utility in Collaborative Edge Computing

    Authors: Rui Li, Tao Ouyang, Liekang Zeng, Guocheng Liao, Zhi Zhou, Xu Chen

    Abstract: Collaborative Edge Computing (CEC) is an emerging paradigm that collaborates heterogeneous edge devices as a resource pool to compute DNN inference tasks in proximity such as edge video analytics. Nevertheless, as the key knob to improve network utility in CEC, existing works mainly focus on the workload routing strategies among edge devices with the aim of minimizing the routing cost, remaining a… ▽ More

    Submitted 27 June, 2024; originally announced June 2024.

    Comments: Accepted by IEEE/ACM TRANSACTIONS ON NETWORKING (ToN)

  23. arXiv:2406.17192  [pdf, other

    astro-ph.IM

    Upgrading the Submillimeter Array: wSMA and beyond

    Authors: Paul K. Grimes, Garrett K. Keating, Raymond Blundell, Robert D. Christensen, Mark Gurwell, Attila Kovacs, Timothy Norton, Scott N. Paine, Ramprasad Rao, Edward C. -Y. Tong, Jonathan Weintroub, David Wilner, Robert W. Wilson, Lingzhen Zeng, Qizhou Zhang

    Abstract: The Submillimeter Array (SMA) is an array of 8 antennas operating at millimeter and submillimeter wavelengths on Maunakea, Hawaii, operated by the Smithsonian Astrophysical Observatory and Academia Sinica Institute of Astronomy and Astrophysics, Taiwan. Over the past several years, we have been preparing a major upgrade to the SMA that will replace the aging original receiver cryostats and receive… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: To be published in the proceedings of SPIE Astronomical Telescopes + Instrumentation 2024, paper number 13096-122

  24. arXiv:2406.14843  [pdf, other

    physics.acc-ph

    Synthesis of Electron Microbunching Rotation for Generating Isolated Attosecond Soft X-ray Free-electron Laser Pulses

    Authors: Hao Sun, Xiaofan Wang, Li Zeng, Weiqing Zhang

    Abstract: Attosecond x-ray pulses play a crucial role in the study of ultrafast phenomena occurring within inner and valence electrons. Especially isolated attosecond pulses with high photon energy and high peak power are of great significance in single-shot imaging in the soft x-ray region, life sciences, and attosecond pump-probe experiments. In modern accelerators, laser manipulation of electrons can be… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

  25. arXiv:2406.14283  [pdf, other

    cs.AI

    Q*: Improving Multi-step Reasoning for LLMs with Deliberative Planning

    Authors: Chaojie Wang, Yanchen Deng, Zhiyi Lyu, Liang Zeng, Jujie He, Shuicheng Yan, Bo An

    Abstract: Large Language Models (LLMs) have demonstrated impressive capability in many natural language tasks. However, the auto-regressive generation process makes LLMs prone to produce errors, hallucinations and inconsistent statements when performing multi-step reasoning. In this paper, by casting multi-step reasoning of LLMs as a heuristic search problem, we aim to alleviate the pathology by introducing… ▽ More

    Submitted 22 July, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

  26. arXiv:2406.11800  [pdf, other

    astro-ph.GA

    Magnetic field in mini starburst complex Sgr B2

    Authors: Xing Pan, Qizhou Zhang, Keping Qiu, Ramprasad Rao, Lingzhen Zeng, Xing Lu, Junhao Liu

    Abstract: We report the first arcsecond-resolution observations of the magnetic field in the mini starburst complex Sgr B2. SMA polarization observations revealed magnetic field morphology in three dense cores of Sgr B2 N(orth), M(ain), and S(outh). The total plane-of-sky magnetic field strengths in these cores are estimated to be 4.3-10.0 mG, 6.2-14.7 mG, and 1.9-4.5 mG derived from the angular dispersion… ▽ More

    Submitted 18 June, 2024; v1 submitted 17 June, 2024; originally announced June 2024.

    Comments: 17 pages, 4 figures, accepted for publication in ApJ

  27. arXiv:2406.08877  [pdf, other

    cs.CV cs.AI

    EgoExo-Fitness: Towards Egocentric and Exocentric Full-Body Action Understanding

    Authors: Yuan-Ming Li, Wei-Jin Huang, An-Lan Wang, Ling-An Zeng, Jing-Ke Meng, Wei-Shi Zheng

    Abstract: We present EgoExo-Fitness, a new full-body action understanding dataset, featuring fitness sequence videos recorded from synchronized egocentric and fixed exocentric (third-person) cameras. Compared with existing full-body action understanding datasets, EgoExo-Fitness not only contains videos from first-person perspectives, but also provides rich annotations. Specifically, two-level temporal bound… ▽ More

    Submitted 16 July, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: Accepted by ECCV2024

  28. arXiv:2406.06563  [pdf, other

    cs.CL cs.AI

    Skywork-MoE: A Deep Dive into Training Techniques for Mixture-of-Experts Language Models

    Authors: Tianwen Wei, Bo Zhu, Liang Zhao, Cheng Cheng, Biye Li, Weiwei Lü, Peng Cheng, Jianhao Zhang, Xiaoyu Zhang, Liang Zeng, Xiaokun Wang, Yutuan Ma, Rui Hu, Shuicheng Yan, Han Fang, Yahui Zhou

    Abstract: In this technical report, we introduce the training methodologies implemented in the development of Skywork-MoE, a high-performance mixture-of-experts (MoE) large language model (LLM) with 146 billion parameters and 16 experts. It is initialized from the pre-existing dense checkpoints of our Skywork-13B model. We explore the comparative effectiveness of upcycling versus training from scratch initi… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  29. arXiv:2406.04603  [pdf, ps, other

    cs.CV

    Simplify Implant Depth Prediction as Video Grounding: A Texture Perceive Implant Depth Prediction Network

    Authors: Xinquan Yang, Xuguang Li, Xiaoling Luo, Leilei Zeng, Yudi Zhang, Linlin Shen, Yongqiang Deng

    Abstract: Surgical guide plate is an important tool for the dental implant surgery. However, the design process heavily relies on the dentist to manually simulate the implant angle and depth. When deep neural networks have been applied to assist the dentist quickly locates the implant position, most of them are not able to determine the implant depth. Inspired by the video grounding task which localizes the… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Journal ref: MICCAI'2024

  30. arXiv:2406.00605  [pdf, other

    cs.CL cs.AI

    LongSkywork: A Training Recipe for Efficiently Extending Context Length in Large Language Models

    Authors: Liang Zhao, Tianwen Wei, Liang Zeng, Cheng Cheng, Liu Yang, Peng Cheng, Lijie Wang, Chenxia Li, Xuejie Wu, Bo Zhu, Yimeng Gan, Rui Hu, Shuicheng Yan, Han Fang, Yahui Zhou

    Abstract: We introduce LongSkywork, a long-context Large Language Model (LLM) capable of processing up to 200,000 tokens. We provide a training recipe for efficiently extending context length of LLMs. We identify that the critical element in enhancing long-context processing capability is to incorporate a long-context SFT stage following the standard SFT stage. A mere 200 iterations can convert the standard… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

  31. arXiv:2406.00023  [pdf, other

    cs.CL

    Expert-Token Resonance: Redefining MoE Routing through Affinity-Driven Active Selection

    Authors: Jing Li, Zhijie Sun, Dachao Lin, Xuan He, Yi Lin, Binfan Zheng, Li Zeng, Rongqian Zhao, Xin Chen

    Abstract: Mixture-of-Experts (MoE) architectures have emerged as a paradigm-shifting approach for large language models (LLMs), offering unprecedented computational efficiency. However, these architectures grapple with challenges of token distribution imbalance and expert homogenization, impeding optimal semantic generalization. We introduce a novel framework that redefines MoE routing through affinity-driv… ▽ More

    Submitted 30 August, 2024; v1 submitted 23 May, 2024; originally announced June 2024.

  32. GaussianPrediction: Dynamic 3D Gaussian Prediction for Motion Extrapolation and Free View Synthesis

    Authors: Boming Zhao, Yuan Li, Ziyu Sun, Lin Zeng, Yujun Shen, Rui Ma, Yinda Zhang, Hujun Bao, Zhaopeng Cui

    Abstract: Forecasting future scenarios in dynamic environments is essential for intelligent decision-making and navigation, a challenge yet to be fully realized in computer vision and robotics. Traditional approaches like video prediction and novel-view synthesis either lack the ability to forecast from arbitrary viewpoints or to predict temporal dynamics. In this paper, we introduce GaussianPrediction, a n… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Accepted to SIGGRAPH 2024 Conference. Project Page: https://zju3dv.github.io/gaussian-prediction/

  33. arXiv:2405.19469  [pdf, other

    astro-ph.CO

    Constraining Inflation with the BICEP/Keck CMB Polarization Experiments

    Authors: The BICEP/Keck Collaboration, :, P. A. R. Ade, Z. Ahmed, M. Amiri, D. Barkats, R. Basu Thakur, C. A. Bischoff, D. Beck, J. J. Bock, H. Boenish, V. Buza, J. R. Cheshire IV, J. Connors, J. Cornelison, M. Crumrine, A. Cukierman, E. V. Denison, M. Dierickx, L. Duband, M. Eiben, B. Elwood, S. Fatigoni, J. P. Filippini, M. Gao , et al. (63 additional authors not shown)

    Abstract: The BICEP/$\textit{Keck}$ (BK) series of cosmic microwave background (CMB) polarization experiments has, over the past decade and a half, produced a series of field-leading constraints on cosmic inflation via measurements of the "B-mode" polarization of the CMB. Primordial B modes are directly tied to the amplitude of primordial gravitational waves (PGW), their strength parameterized by the tensor… ▽ More

    Submitted 11 July, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: 9 pages, 5 figures. Contribution to the 2024 Cosmology session of the 58th Rencontres de Moriond

  34. arXiv:2405.18739  [pdf, other

    cs.NI eess.SP

    FlocOff: Data Heterogeneity Resilient Federated Learning with Communication-Efficient Edge Offloading

    Authors: Mulei Ma, Chenyu Gong, Liekang Zeng, Yang Yang, Liantao Wu

    Abstract: Federated Learning (FL) has emerged as a fundamental learning paradigm to harness massive data scattered at geo-distributed edge devices in a privacy-preserving way. Given the heterogeneous deployment of edge devices, however, their data are usually Non-IID, introducing significant challenges to FL including degraded training accuracy, intensive communication costs, and high computing complexity.… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  35. arXiv:2405.18435  [pdf, other

    eess.IV cs.CV

    QUBIQ: Uncertainty Quantification for Biomedical Image Segmentation Challenge

    Authors: Hongwei Bran Li, Fernando Navarro, Ivan Ezhov, Amirhossein Bayat, Dhritiman Das, Florian Kofler, Suprosanna Shit, Diana Waldmannstetter, Johannes C. Paetzold, Xiaobin Hu, Benedikt Wiestler, Lucas Zimmer, Tamaz Amiranashvili, Chinmay Prabhakar, Christoph Berger, Jonas Weidner, Michelle Alonso-Basant, Arif Rashid, Ujjwal Baid, Wesam Adel, Deniz Ali, Bhakti Baheti, Yingbin Bai, Ishaan Bhatt, Sabri Can Cetindag , et al. (55 additional authors not shown)

    Abstract: Uncertainty in medical image segmentation tasks, especially inter-rater variability, arising from differences in interpretations and annotations by various experts, presents a significant challenge in achieving consistent and reliable image segmentation. This variability not only reflects the inherent complexity and subjective nature of medical image interpretation but also directly impacts the de… ▽ More

    Submitted 24 June, 2024; v1 submitted 19 March, 2024; originally announced May 2024.

    Comments: initial technical report

  36. arXiv:2405.17245  [pdf, other

    cs.DC cs.AI cs.LG cs.NI

    Galaxy: A Resource-Efficient Collaborative Edge AI System for In-situ Transformer Inference

    Authors: Shengyuan Ye, Jiangsu Du, Liekang Zeng, Wenzhong Ou, Xiaowen Chu, Yutong Lu, Xu Chen

    Abstract: Transformer-based models have unlocked a plethora of powerful intelligent applications at the edge, such as voice assistant in smart home. Traditional deployment approaches offload the inference workloads to the remote cloud server, which would induce substantial pressure on the backbone network as well as raise users' privacy concerns. To address that, in-situ inference has been recently recogniz… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: Accepted by IEEE International Conference on Computer Communications 2024

  37. arXiv:2405.16773  [pdf, other

    astro-ph.GA astro-ph.SR

    On the origin of infrared bands attributed to tryptophan in Spitzer observations of IC 348

    Authors: Aditya Dhariwal, Thomas H. Speak, Linshan Zeng, Amirhossein Rashidi, Brendan Moore, Olivier Berné, Anthony J. Remijan, Ilane Schroetter, Brett A. McGuire, Víctor M. Rivilla, Arnaud Belloche, Jes K. Jørgensen, Pavle Djuricanin, Takamasa Momose, Ilsa R. Cooke

    Abstract: Infrared emission features toward interstellar gas of the IC 348 star cluster in Perseus have been recently proposed to originate from the amino acid tryptophan. The assignment was based on laboratory infrared spectra of tryptophan pressed into pellets, a method which is known to cause large frequency shifts compared to the gas phase. We assess the validity of the assignment based on the original… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  38. arXiv:2405.11323  [pdf

    cond-mat.mes-hall

    High-yield fabrication of bubble-free magic-angle twisted bilayer graphene devices with high twist-angle homogeneity

    Authors: J. Diez-Merida, I. Das, G. Di Battista, A. Diez-Carlon, M. Lee, L. Zeng, K. Watanabe, T. Taniguchi, E. Olsson, D. K. Efetov

    Abstract: Magic-angle twisted bilayer graphene (MATBG) stands as one of the most versatile materials in condensed-matter physics due to its hosting of a wide variety of exotic phases while also offering convenient tunability. However, the fabrication of MATBG is still manual, and remains to be a challenging and inefficient process, with devices being highly dependent on specific fabrication methods, that of… ▽ More

    Submitted 18 May, 2024; originally announced May 2024.

  39. arXiv:2405.03924  [pdf, other

    cs.DB cs.AI cs.LG

    NeurDB: An AI-powered Autonomous Data System

    Authors: Beng Chin Ooi, Shaofeng Cai, Gang Chen, Yanyan Shen, Kian-Lee Tan, Yuncheng Wu, Xiaokui Xiao, Naili Xing, Cong Yue, Lingze Zeng, Meihui Zhang, Zhanhao Zhao

    Abstract: In the wake of rapid advancements in artificial intelligence (AI), we stand on the brink of a transformative leap in data systems. The imminent fusion of AI and DB (AIxDB) promises a new generation of data systems, which will relieve the burden on end-users across all industry sectors by featuring AI-enhanced functionalities, such as personalized and automated in-database AI-powered analytics, sel… ▽ More

    Submitted 4 July, 2024; v1 submitted 6 May, 2024; originally announced May 2024.

  40. arXiv:2405.00568  [pdf, other

    cs.DB cs.AI

    Powering In-Database Dynamic Model Slicing for Structured Data Analytics

    Authors: Lingze Zeng, Naili Xing, Shaofeng Cai, Gang Chen, Beng Chin Ooi, Jian Pei, Yuncheng Wu

    Abstract: Relational database management systems (RDBMS) are widely used for the storage and retrieval of structured data. To derive insights beyond statistical aggregation, we typically have to extract specific subdatasets from the database using conventional database operations, and then apply deep neural networks (DNN) training and inference on these respective subdatasets in a separate machine learning… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

  41. arXiv:2404.17766  [pdf, other

    cs.LG cs.AI cs.DC cs.NI

    Implementation of Big AI Models for Wireless Networks with Collaborative Edge Computing

    Authors: Liekang Zeng, Shengyuan Ye, Xu Chen, Yang Yang

    Abstract: Big Artificial Intelligence (AI) models have emerged as a crucial element in various intelligent applications at the edge, such as voice assistants in smart homes and autonomous robotics in smart factories. Training big AI models, e.g., for personalized fine-tuning and continual model refinement, poses significant challenges to edge devices due to the inherent conflict between limited computing re… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

  42. arXiv:2404.13692  [pdf, other

    cs.CV

    A sustainable development perspective on urban-scale roof greening priorities and benefits

    Authors: Jie Shao, Wei Yao, Lei Luo, Linzhou Zeng, Zhiyi He, Puzuo Wang, Huadong Guo

    Abstract: Greenspaces are tightly linked to human well-being. Yet, rapid urbanization has exacerbated greenspace exposure inequality and declining human life quality. Roof greening has been recognized as an effective strategy to mitigate these negative impacts. Understanding priorities and benefits is crucial to promoting green roofs. Here, using geospatial big data, we conduct an urban-scale assessment of… ▽ More

    Submitted 21 April, 2024; originally announced April 2024.

  43. arXiv:2404.12036  [pdf, other

    physics.comp-ph cond-mat.soft

    Exploring the Premelting Transition through Molecular Simulations Powered by Neural Network Potentials

    Authors: Limin Zeng, Ang Gao

    Abstract: The system has addressed the error of "Bad character(s) in field Abstract" for no reason. Please refer to manuscript for the full abstract.

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 10 pages, 6 figures

  44. arXiv:2404.08681  [pdf, other

    cs.CL

    EFSA: Towards Event-Level Financial Sentiment Analysis

    Authors: Tianyu Chen, Yiming Zhang, Guoxin Yu, Dapeng Zhang, Li Zeng, Qing He, Xiang Ao

    Abstract: In this paper, we extend financial sentiment analysis~(FSA) to event-level since events usually serve as the subject of the sentiment in financial text. Though extracting events from the financial text may be conducive to accurate sentiment predictions, it has specialized challenges due to the lengthy and discontinuity of events in a financial text. To this end, we reconceptualize the event extrac… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

  45. arXiv:2404.04661  [pdf, other

    cs.LG cs.AI

    Transform then Explore: a Simple and Effective Technique for Exploratory Combinatorial Optimization with Reinforcement Learning

    Authors: Tianle Pu, Changjun Fan, Mutian Shen, Yizhou Lu, Li Zeng, Zohar Nussinov, Chao Chen, Zhong Liu

    Abstract: Many complex problems encountered in both production and daily life can be conceptualized as combinatorial optimization problems (COPs) over graphs. Recent years, reinforcement learning (RL) based models have emerged as a promising direction, which treat the COPs solving as a heuristic learning problem. However, current finite-horizon-MDP based RL models have inherent limitations. They are not all… ▽ More

    Submitted 6 April, 2024; originally announced April 2024.

  46. arXiv:2404.01875  [pdf, other

    eess.SP cs.DC cs.IT cs.LG

    Satellite Federated Edge Learning: Architecture Design and Convergence Analysis

    Authors: Yuanming Shi, Li Zeng, Jingyang Zhu, Yong Zhou, Chunxiao Jiang, Khaled B. Letaief

    Abstract: The proliferation of low-earth-orbit (LEO) satellite networks leads to the generation of vast volumes of remote sensing data which is traditionally transferred to the ground server for centralized processing, raising privacy and bandwidth concerns. Federated edge learning (FEEL), as a distributed machine learning approach, has the potential to address these challenges by sharing only model paramet… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 16 pages, 15 figures

  47. arXiv:2404.01050  [pdf, other

    cs.CV cs.GR cs.HC cs.LG

    Drag Your Noise: Interactive Point-based Editing via Diffusion Semantic Propagation

    Authors: Haofeng Liu, Chenshu Xu, Yifei Yang, Lihua Zeng, Shengfeng He

    Abstract: Point-based interactive editing serves as an essential tool to complement the controllability of existing generative models. A concurrent work, DragDiffusion, updates the diffusion latent map in response to user inputs, causing global latent map alterations. This results in imprecise preservation of the original content and unsuccessful editing due to gradient vanishing. In contrast, we present Dr… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Accepted by CVPR 2024

  48. arXiv:2403.16283  [pdf, other

    stat.ME

    Sample Empirical Likelihood Methods for Causal Inference

    Authors: Jingyue Huang, Changbao Wu, Leilei Zeng

    Abstract: Causal inference is crucial for understanding the true impact of interventions, policies, or actions, enabling informed decision-making and providing insights into the underlying mechanisms that shape our world. In this paper, we establish a framework for the estimation and inference of average treatment effects using a two-sample empirical likelihood function. Two different approaches to incorpor… ▽ More

    Submitted 24 March, 2024; originally announced March 2024.

  49. arXiv:2403.10003  [pdf, ps, other

    hep-ph

    An improved light-cone harmonic oscillator model for the $φ$-meson longitudinal leading-twist light-cone distribution amplitude

    Authors: Dan-Dan Hu, Xing-Gang Wu, Long Zeng, Hai-Bing Fu, Tao Zhong

    Abstract: In the present paper, we study the properties of $φ$-meson longitudinal leading-twist light-cone distribution amplitude $φ_{2;φ}^{\|}(x,μ)$ by starting from a light-cone harmonic oscillator model for its wavefunction. To fix the input parameters, we derive the first ten $ξ$-moments of $φ_{2;φ}^{\|}(x,μ)$ by using the QCD sum rules approach under the background field theory. The shape of… ▽ More

    Submitted 15 March, 2024; originally announced March 2024.

    Comments: 32 pages, 7 figures, comments welcome

  50. arXiv:2403.09317  [pdf, other

    cs.CV cs.AI

    SD-Net: Symmetric-Aware Keypoint Prediction and Domain Adaptation for 6D Pose Estimation In Bin-picking Scenarios

    Authors: Ding-Tao Huang, En-Te Lin, Lipeng Chen, Li-Fu Liu, Long Zeng

    Abstract: Despite the success in 6D pose estimation in bin-picking scenarios, existing methods still struggle to produce accurate prediction results for symmetry objects and real world scenarios. The primary bottlenecks include 1) the ambiguity keypoints caused by object symmetries; 2) the domain gap between real and synthetic data. To circumvent these problem, we propose a new 6D pose estimation network wi… ▽ More

    Submitted 14 March, 2024; originally announced March 2024.