Zum Hauptinhalt springen

Showing 1–50 of 891 results for author: Dai, Y

.
  1. arXiv:2408.16927  [pdf, ps, other

    math.OC

    Serial and Parallel Two-Column Probing for Mixed-Integer Programming

    Authors: Yongzheng Dai, Chen Chen

    Abstract: Probing in mixed-integer programming (MIP) is a technique of temporarily fixing variables to discover implications that are useful to branch-and-cut solvers. Such fixing is typically performed one variable at a time -- this paper develops instead a two-column probing scheme that instead fixes a pair of variables per iteration. Although the scheme involves more work per iteration compared to the on… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: 22 pages, 2 figures, 3 charts

    MSC Class: 90C10

  2. arXiv:2408.16215  [pdf, ps, other

    math.OC cs.LG cs.PF eess.SY

    Adversarial Network Optimization under Bandit Feedback: Maximizing Utility in Non-Stationary Multi-Hop Networks

    Authors: Yan Dai, Longbo Huang

    Abstract: Stochastic Network Optimization (SNO) concerns scheduling in stochastic queueing systems. It has been widely studied in network theory. Classical SNO algorithms require network conditions to be stationary with time, which fails to capture the non-stationary components in many real-world scenarios. Many existing algorithms also assume knowledge of network conditions before decision, which rules out… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

  3. arXiv:2408.11878  [pdf, other

    cs.CL cs.CE q-fin.CP

    Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

    Authors: Qianqian Xie, Dong Li, Mengxi Xiao, Zihao Jiang, Ruoyu Xiang, Xiao Zhang, Zhengyu Chen, Yueru He, Weiguang Han, Yuzhe Yang, Shunian Chen, Yifei Zhang, Lihang Shen, Daniel Kim, Zhiwei Liu, Zheheng Luo, Yangyang Yu, Yupeng Cao, Zhiyang Deng, Zhiyuan Yao, Haohang Li, Duanyu Feng, Yongfu Dai, VijayaSai Somasundaram, Peng Lu , et al. (14 additional authors not shown)

    Abstract: Large language models (LLMs) have advanced financial applications, yet they often lack sufficient financial knowledge and struggle with tasks involving multi-modal inputs like tables and time series data. To address these limitations, we introduce \textit{Open-FinLLMs}, a series of Financial LLMs. We begin with FinLLaMA, pre-trained on a 52 billion token financial corpus, incorporating text, table… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 33 pages, 13 figures

  4. arXiv:2408.11138  [pdf, other

    cs.RO cs.CV

    Target-Oriented Object Grasping via Multimodal Human Guidance

    Authors: Pengwei Xie, Siang Chen, Dingchang Hu, Yixiang Dai, Kaiqin Yang, Guijin Wang

    Abstract: In the context of human-robot interaction and collaboration scenarios, robotic grasping still encounters numerous challenges. Traditional grasp detection methods generally analyze the entire scene to predict grasps, leading to redundancy and inefficiency. In this work, we reconsider 6-DoF grasp detection from a target-referenced perspective and propose a Target-Oriented Grasp Network (TOGNet). TOG… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: Accepted by ECCV 2024 Workshop on Assistive Computer Vision and Robotics (ACVR 2024)

  5. arXiv:2408.09856  [pdf, other

    cs.CL cs.AI

    TeamLoRA: Boosting Low-Rank Adaptation with Expert Collaboration and Competition

    Authors: Tianwei Lin, Jiang Liu, Wenqiao Zhang, Zhaocheng Li, Yang Dai, Haoyuan Li, Zhelun Yu, Wanggui He, Juncheng Li, Hao Jiang, Siliang Tang, Yueting Zhuang

    Abstract: While Parameter-Efficient Fine-Tuning (PEFT) methods like LoRA have effectively addressed GPU memory constraints during fine-tuning, their performance often falls short, especially in multidimensional task scenarios. To address this issue, one straightforward solution is to introduce task-specific LoRA modules as domain experts, leveraging the modeling of multiple experts' capabilities and thus en… ▽ More

    Submitted 19 August, 2024; originally announced August 2024.

  6. arXiv:2408.09661  [pdf, ps, other

    math.OC

    Enhanced Barrier-Smoothing Technique for Bilevel Optimization with Nonsmooth Mappings

    Authors: Mengwei Xu, Yu-Hong Dai, Xin-Wei Liu, Bo Wang

    Abstract: Bilevel optimization problems, encountered in fields such as economics, engineering, and machine learning, pose significant computational challenges due to their hierarchical structure and constraints at both upper and lower levels. Traditional gradient-based methods are effective for unconstrained bilevel programs with unique lower level solutions, but struggle with constrained bilevel problems d… ▽ More

    Submitted 20 August, 2024; v1 submitted 18 August, 2024; originally announced August 2024.

  7. arXiv:2408.08872  [pdf, other

    cs.CV cs.AI cs.CL

    xGen-MM (BLIP-3): A Family of Open Large Multimodal Models

    Authors: Le Xue, Manli Shu, Anas Awadalla, Jun Wang, An Yan, Senthil Purushwalkam, Honglu Zhou, Viraj Prabhu, Yutong Dai, Michael S Ryoo, Shrikant Kendre, Jieyu Zhang, Can Qin, Shu Zhang, Chia-Chih Chen, Ning Yu, Juntao Tan, Tulika Manoj Awalgaonkar, Shelby Heinecke, Huan Wang, Yejin Choi, Ludwig Schmidt, Zeyuan Chen, Silvio Savarese, Juan Carlos Niebles , et al. (2 additional authors not shown)

    Abstract: This report introduces xGen-MM (also known as BLIP-3), a framework for developing Large Multimodal Models (LMMs). The framework comprises meticulously curated datasets, a training recipe, model architectures, and a resulting suite of LMMs. xGen-MM, short for xGen-MultiModal, expands the Salesforce xGen initiative on foundation AI models. Our models undergo rigorous evaluation across a range of tas… ▽ More

    Submitted 28 August, 2024; v1 submitted 16 August, 2024; originally announced August 2024.

  8. MSG-Chart: Multimodal Scene Graph for ChartQA

    Authors: Yue Dai, Soyeon Caren Han, Wei Liu

    Abstract: Automatic Chart Question Answering (ChartQA) is challenging due to the complex distribution of chart elements with patterns of the underlying data not explicitly displayed in charts. To address this challenge, we design a joint multimodal scene graph for charts to explicitly represent the relationships between chart elements and their patterns. Our proposed multimodal scene graph includes a visual… ▽ More

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: Accpeted by CIKM Short 2024

  9. arXiv:2408.04203  [pdf, other

    cs.AI

    MMRole: A Comprehensive Framework for Developing and Evaluating Multimodal Role-Playing Agents

    Authors: Yanqi Dai, Huanran Hu, Lei Wang, Shengjie Jin, Xu Chen, Zhiwu Lu

    Abstract: Recently, Role-Playing Agents (RPAs) have garnered increasing attention for their potential to deliver emotional value and facilitate sociological research. However, existing studies are primarily confined to the textual modality, unable to simulate humans' multimodal perceptual capabilities. To bridge this gap, we introduce the concept of Multimodal Role-Playing Agents (MRPAs), and propose a comp… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

  10. arXiv:2408.03717  [pdf, other

    cs.CV

    Pick of the Bunch: Detecting Infrared Small Targets Beyond Hit-Miss Trade-Offs via Selective Rank-Aware Attention

    Authors: Yimian Dai, Peiwen Pan, Yulei Qian, Yuxuan Li, Xiang Li, Jian Yang, Huan Wan

    Abstract: Infrared small target detection faces the inherent challenge of precisely localizing dim targets amidst complex background clutter. Traditional approaches struggle to balance detection precision and false alarm rates. To break this dilemma, we propose SeRankDet, a deep network that achieves high accuracy beyond the conventional hit-miss trade-off, by following the ``Pick of the Bunch'' principle.… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

  11. arXiv:2408.02927  [pdf, other

    cs.LG cs.AI cs.CL cs.CR

    HARMONIC: Harnessing LLMs for Tabular Data Synthesis and Privacy Protection

    Authors: Yuxin Wang, Duanyu Feng, Yongfu Dai, Zhengyu Chen, Jimin Huang, Sophia Ananiadou, Qianqian Xie, Hao Wang

    Abstract: Data serves as the fundamental foundation for advancing deep learning, particularly tabular data presented in a structured format, which is highly conducive to modeling. However, even in the era of LLM, obtaining tabular data from sensitive domains remains a challenge due to privacy or copyright concerns. Hence, exploring how to effectively use models like LLMs to generate realistic and privacy-pr… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

  12. The most distant HI galaxies discovered by the 500 m dish FAST

    Authors: Hongwei Xi, Bo Peng, Lister Staveley-Smith, Bi-Qing For, Bin Liu, Ru-Rong Chen, Lei Yu, Dejian Ding, Wei-Jian Guo, Hu Zou, Suijian Xue, Jing Wang, Thomas G. Brink, WeiKang Zheng, Alexei V. Filippenko, Yi Yang, Jianyan Wei, Y. Sophia Dai, Zi-Jian Li, Zizhao He, Chengzi Jiang, Alexei Moiseev, Sergey Kotov

    Abstract: Neutral hydrogen (HI) is the primary component of the cool interstellar medium (ISM) and is the reservoir of fuel for star formation. Owing to the sensitivity of existing radio telescopes, our understanding of the evolution of the ISM in galaxies remains limited, as it is based on only a few hundred galaxies detected in HI beyond the local Universe. With the high sensitivity of the Five-hundred-me… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: 14 pages, 6 figures, 3 tables

    Journal ref: ApJL, 966(2024), L36

  13. arXiv:2407.21467  [pdf

    cs.CV cs.AI

    Deep Learning-Based Longitudinal Prediction of Childhood Myopia Progression Using Fundus Image Sequences and Baseline Refraction Data

    Authors: Mengtian Kang, Yansong Hu, Shuo Gao, Yuanyuan Liu, Hongbei Meng, Xuemeng Li, Xuhang Chen, Hubin Zhao, Jing Fu, Guohua Hu, Wei Wang, Yanning Dai, Arokia Nathan, Peter Smielewski, Ningli Wang, Shiming Li

    Abstract: Childhood myopia constitutes a significant global health concern. It exhibits an escalating prevalence and has the potential to evolve into severe, irreversible conditions that detrimentally impact familial well-being and create substantial economic costs. Contemporary research underscores the importance of precisely predicting myopia progression to enable timely and effective interventions, there… ▽ More

    Submitted 31 July, 2024; originally announced July 2024.

  14. arXiv:2407.21289  [pdf, other

    cs.CV cs.GR

    Fine-grained Metrics for Point Cloud Semantic Segmentation

    Authors: Zhuheng Lu, Ting Wu, Yuewei Dai, Weiqing Li, Zhiyong Su

    Abstract: Two forms of imbalances are commonly observed in point cloud semantic segmentation datasets: (1) category imbalances, where certain objects are more prevalent than others; and (2) size imbalances, where certain objects occupy more points than others. Because of this, the majority of categories and large objects are favored in the existing evaluation metrics. This paper suggests fine-grained mIoU a… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: PRCV 2024

  15. arXiv:2407.20712  [pdf, other

    cs.HC cs.AI

    Cocobo: Exploring Large Language Models as the Engine for End-User Robot Programming

    Authors: Yate Ge, Yi Dai, Run Shan, Kechun Li, Yuanda Hu, Xiaohua Sun

    Abstract: End-user development allows everyday users to tailor service robots or applications to their needs. One user-friendly approach is natural language programming. However, it encounters challenges such as an expansive user expression space and limited support for debugging and editing, which restrict its application in end-user programming. The emergence of large language models (LLMs) offers promisi… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: This is the preprint version of a paper accepted for presentation at the IEEE Symposium on Visual Languages and Human-Centric Computing (VL/HCC), 2024

  16. arXiv:2407.20078  [pdf, other

    cs.CV

    Background Semantics Matter: Cross-Task Feature Exchange Network for Clustered Infrared Small Target Detection With Sky-Annotated Dataset

    Authors: Yimian Dai, Mengxuan Xiao, Yiming Zhu, Huan Wang, Kehua Guo, Jian Yang

    Abstract: Infrared small target detection poses unique challenges due to the scarcity of intrinsic target features and the abundance of similar background distractors. We argue that background semantics play a pivotal role in distinguishing visually similar objects for this task. To address this, we introduce a new task -- clustered infrared small target detection, and present DenseSIRST, a novel benchmark… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

  17. arXiv:2407.19220  [pdf

    physics.ed-ph eess.SY

    A Low-Frequency Vibration Experimental Platform for University Physics Experiment Designed by LabVIEW

    Authors: Yangjie Dai, Leijian Wang, Wenbin Wu, Aiping Chen, Dawei Gu

    Abstract: Virtual instrument technology has been increasingly used in university physics experiment teaching. An experimental platform is specifically constructed for studying low-frequency vibrations in university physics, which is based on a computer and its internal sound card, along with a program developed in LabVIEW programming environment to perform control and measurement on our experimental platfor… ▽ More

    Submitted 27 July, 2024; originally announced July 2024.

    Comments: 13 pages, 8 figures, 2 supplementary files

  18. arXiv:2407.18929  [pdf, other

    cs.IT cs.ET cs.LG

    THEA-Code: an Autoencoder-Based IDS-correcting Code for DNA Storage

    Authors: Alan J. X. Guo, Mengyi Wei, Yufan Dai, Yali Wei, Pengchen Zhang

    Abstract: The insertion, deletion, substitution (IDS) correcting code has garnered increased attention due to significant advancements in DNA storage that emerged recently. Despite this, the pursuit of optimal solutions in IDS-correcting codes remains an open challenge, drawing interest from both theoretical and engineering perspectives. This work introduces a pioneering approach named THEA-code. The propos… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

  19. arXiv:2407.15369  [pdf, other

    cs.CV

    Sparse Prior Is Not All You Need: When Differential Directionality Meets Saliency Coherence for Infrared Small Target Detection

    Authors: Fei Zhou, Maixia Fu, Yulei Qian, Jian Yang, Yimian Dai

    Abstract: Infrared small target detection is crucial for the efficacy of infrared search and tracking systems. Current tensor decomposition methods emphasize representing small targets with sparsity but struggle to separate targets from complex backgrounds due to insufficient use of intrinsic directional information and reduced target visibility during decomposition. To address these challenges, this study… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: Submitted to IEEE TIM, Minor Revision

  20. arXiv:2407.13137  [pdf, other

    cs.CV

    OE-BevSeg: An Object Informed and Environment Aware Multimodal Framework for Bird's-eye-view Vehicle Semantic Segmentation

    Authors: Jian Sun, Yuqi Dai, Chi-Man Vong, Qing Xu, Shengbo Eben Li, Jianqiang Wang, Lei He, Keqiang Li

    Abstract: Bird's-eye-view (BEV) semantic segmentation is becoming crucial in autonomous driving systems. It realizes ego-vehicle surrounding environment perception by projecting 2D multi-view images into 3D world space. Recently, BEV segmentation has made notable progress, attributed to better view transformation modules, larger image encoders, or more temporal information. However, there are still two issu… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

  21. arXiv:2407.13126  [pdf, other

    cs.DC

    Improving GPU Multi-Tenancy Through Dynamic Multi-Instance GPU Reconfiguration

    Authors: Tianyu Wang, Sheng Li, Bingyao Li, Yue Dai, Ao Li, Geng Yuan, Yufei Ding, Youtao Zhang, Xulong Tang

    Abstract: Continuous learning (CL) has emerged as one of the most popular deep learning paradigms deployed in modern cloud GPUs. Specifically, CL has the capability to continuously update the model parameters (through model retraining) and use the updated model (if available) to serve overtime arriving inference requests. It is generally beneficial to co-locate the retraining and inference together to enabl… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

  22. arXiv:2407.12568  [pdf, other

    cs.CV

    LTRL: Boosting Long-tail Recognition via Reflective Learning

    Authors: Qihao Zhao, Yalun Dai, Shen Lin, Wei Hu, Fan Zhang, Jun Liu

    Abstract: In real-world scenarios, where knowledge distributions exhibit long-tail. Humans manage to master knowledge uniformly across imbalanced distributions, a feat attributed to their diligent practices of reviewing, summarizing, and correcting errors. Motivated by this learning process, we propose a novel learning paradigm, called reflecting learning, in handling long-tail recognition. Our method integ… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: ECCV2024

  23. arXiv:2407.12491  [pdf, other

    cs.CV

    Hierarchical and Decoupled BEV Perception Learning Framework for Autonomous Driving

    Authors: Yuqi Dai, Jian Sun, Shengbo Eben Li, Qing Xu, Jianqiang Wang, Lei He, Keqiang Li

    Abstract: Perception is essential for autonomous driving system. Recent approaches based on Bird's-eye-view (BEV) and deep learning have made significant progress. However, there exists challenging issues including lengthy development cycles, poor reusability, and complex sensor setups in perception algorithm development process. To tackle the above challenges, this paper proposes a novel hierarchical BEV p… ▽ More

    Submitted 25 July, 2024; v1 submitted 17 July, 2024; originally announced July 2024.

  24. arXiv:2407.12262  [pdf, other

    cond-mat.mtrl-sci

    Small exciton effective mass in QL Bi2Se2Te: A material platform towards high-temperature excitonic condensate

    Authors: Yuanyuan Wang, Ying Dai, Baibiao Huang, Yee Sin Ang, Wei Wei

    Abstract: Using first-principles simulations combined with many-body calculations, we show that two-dimensional free-standing quintuple-layer Bi2Se2Te is an inversion symmetric monolayer expected to achieve spatially indirect exciton with large exciton radius, small exciton effective mass and long exciton lifetime. Such system is theoretically predicted to be a promising platform for realizing excitonic Bos… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

  25. arXiv:2407.09367  [pdf, other

    cs.CV

    Reshaping the Online Data Buffering and Organizing Mechanism for Continual Test-Time Adaptation

    Authors: Zhilin Zhu, Xiaopeng Hong, Zhiheng Ma, Weijun Zhuang, Yaohui Ma, Yong Dai, Yaowei Wang

    Abstract: Continual Test-Time Adaptation (CTTA) involves adapting a pre-trained source model to continually changing unsupervised target domains. In this paper, we systematically analyze the challenges of this task: online environment, unsupervised nature, and the risks of error accumulation and catastrophic forgetting under continual domain shifts. To address these challenges, we reshape the online data bu… ▽ More

    Submitted 18 July, 2024; v1 submitted 12 July, 2024; originally announced July 2024.

    Comments: This is the preprint version of our paper and supplemental material to appear in ECCV 2024

  26. arXiv:2407.09001  [pdf

    cond-mat.mtrl-sci

    Coupling multi-space topologies in 2D ferromagnetic lattice

    Authors: Zhonglin He, Wenhui Du, Kaiying Dou, Ying Dai, Baibiao Huang, Yandong Ma

    Abstract: Topology can manifest topological magnetism (e.g., skyrmion and bimeron) in real space and quantum anomalous Hall (QAH) state in momentum space, which have changed the modern conceptions of matter phase. While the topologies in different spaces are widely studied separately, their coexistence and coupling in single phase is seldomly explored. Here, we report a novel phenomenon that arises from the… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  27. arXiv:2407.07510  [pdf, other

    cs.CR cs.CV eess.SY

    Invisible Optical Adversarial Stripes on Traffic Sign against Autonomous Vehicles

    Authors: Dongfang Guo, Yuting Wu, Yimin Dai, Pengfei Zhou, Xin Lou, Rui Tan

    Abstract: Camera-based computer vision is essential to autonomous vehicle's perception. This paper presents an attack that uses light-emitting diodes and exploits the camera's rolling shutter effect to create adversarial stripes in the captured images to mislead traffic sign recognition. The attack is stealthy because the stripes on the traffic sign are invisible to human. For the attack to be threatening,… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Journal ref: In Proceedings of the 22nd Annual International Conference on Mobile Systems, Applications and Services (MobiSys 2024), 534-546

  28. arXiv:2407.07206  [pdf

    cond-mat.mes-hall cond-mat.mtrl-sci hep-ex

    Plasmonic Vortices Host Magnetoelectric Interactions

    Authors: Atreyie Ghosh, Sena Yang, Yanan Dai, W. Vincent Liu, Hrvoje Petek

    Abstract: The vector cross product and pseudoscalar dot products of electric (E) and magnetic (H) fields are separately finite in vacuum transverse electric and magnetic (TEM) plane waves, and angular momentum structured light. Current theories of interactions beyond the standard model of particle physics invoke non-zero dot(E,H) as the source term in the axion law that describes interactions with the cosmo… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 5 Figures

    Journal ref: Physical Review RESEARCH 6, 013163 (2024)

  29. arXiv:2407.04995  [pdf

    physics.optics

    A Broadband Algorithm for Adiabatic Mode Evolution and its Application on Polarization Splitter-Rotator on LNOI Platform

    Authors: Geng Chen, Chijun Li, Xuanhao Wang, An Pan, Junjie Wei, Yuankang Huang, Siyu Lu, Yiqi Dai, Xiangyu Meng, Cheng Zeng, Jinsong Xia

    Abstract: Adiabatic mode evolution waveguides (AMEWs) are widely utilized in integrated photonics, including tapered waveguides, edge couplers, mode converters, splitters, etc. An analytical theory and a novel AMEW design algorithm are developed to create shortcuts to adiabaticity (STA). This new algorithm is effective in shortening the total length of the AMEW while maintaining the desired wavelength range… ▽ More

    Submitted 22 July, 2024; v1 submitted 6 July, 2024; originally announced July 2024.

    Comments: 16 pages, 6 figures, 2 tables

  30. arXiv:2407.02973  [pdf, other

    astro-ph.GA

    NOEMA formIng Cluster survEy (NICE): Characterizing eight massive galaxy groups at $1.5 < z < 4$ in the COSMOS field

    Authors: Nikolaj B. Sillassen, Shuowen Jin, Georgios E. Magdis, Emanuele Daddi, Tao Wang, Shiying Lu, Hanwen Sun, Vinod Arumugam, Daizhong Liu, Malte Brinch, Chiara D'Eugenio, Raphael Gobat, Carlos Gómez-Guijarro, Michael Rich, Eva Schinnerer, Veronica Strazzullo, Qinghua Tan, Francesco Valentino, Yijun Wang, Mengyuan Xiao, Luwenjia Zhou, David Blánquez-Sesé, Zheng Cai, Yanmei Chen, Laure Ciesla , et al. (19 additional authors not shown)

    Abstract: The NOEMA formIng Cluster survEy (NICE) is a large program targeting 69 massive galaxy group candidates at $z>2$ in six deep fields. We report spectroscopic confirmation of eight groups at $1.65\leq z\leq3.61$ in COSMOS. Homogeneously selected as significant overdensities of red IRAC sources with red Herschel colors, four groups are confirmed by CO and [CI] with NOEMA 3mm observations, three are c… ▽ More

    Submitted 5 July, 2024; v1 submitted 3 July, 2024; originally announced July 2024.

    Comments: 44 pages (27pp appendix), 32 figures, 18 tables, accepted for publication in A&A

  31. arXiv:2407.01614  [pdf, other

    cs.LG cs.AI

    Enhancing Stability for Large Models Training in Constrained Bandwidth Networks

    Authors: Yun Dai, Tejas Dharamsi, Byron Hsu, Tao Song, Hamed Firooz

    Abstract: Training extremely large language models with billions of parameters is a computationally intensive task that pushes the limits of current data parallel training systems. While techniques like ZeRO++ have enabled efficient distributed training of such giant models on inexpensive low-bandwidth clusters, they can suffer from convergence issues due to potential race conditions in the hierarchical par… ▽ More

    Submitted 31 July, 2024; v1 submitted 27 June, 2024; originally announced July 2024.

  32. arXiv:2406.18169  [pdf, ps, other

    astro-ph.HE hep-ph

    Timing and Scintillation Studies of Pulsars in Globular Cluster M3 (NGC 5272) with FAST

    Authors: Baoda Li, Li-yun Zhang, Jumei Yao, Dejiang Yin, Ralph P. Eatough, Minghui Li, Yifeng Li, Yujie Lian, Yu Pan, Yinfeng Dai, Yaowei Li, Xingnan Zhang, Tianhao Su, Yuxiao Wu, Tong Liu, Kuo Liu, Lin Wang, Lei Qian, Zhichen Pan

    Abstract: We present the phase-connected timing solutions of all the five pulsars in globular cluster (GC) M3 (NGC 5272), namely PSRs M3A to F (PSRs J1342+2822A to F), with the exception of PSR M3C, from FAST archival data. In these timing solutions, those of PSRs M3E, and F are obtained for the first time. We find that PSRs M3E and F have low mass companions, and are in circular orbits with periods of 7.1… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

    Comments: 14 pages, 4 figures, accepted for publication in The Astrophysical Journal

  33. arXiv:2406.16360  [pdf, other

    cs.CV cs.GR

    MIRReS: Multi-bounce Inverse Rendering using Reservoir Sampling

    Authors: Yuxin Dai, Qi Wang, Jingsen Zhu, Dianbing Xi, Yuchi Huo, Chen Qian, Ying He

    Abstract: We present MIRReS, a novel two-stage inverse rendering framework that jointly reconstructs and optimizes the explicit geometry, material, and lighting from multi-view images. Unlike previous methods that rely on implicit irradiance fields or simplified path tracing algorithms, our method extracts an explicit geometry (triangular mesh) in stage one, and introduces a more realistic physically-based… ▽ More

    Submitted 24 June, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

    Comments: 16 pages, 14 figures

  34. arXiv:2406.14417  [pdf

    cond-mat.mes-hall

    Electrical switching of Ising-superconducting nonreciprocity for quantum neuronal transistor

    Authors: Junlin Xiong, Jiao Xie, Bin Cheng, Yudi Dai, Xinyu Cui, Lizheng Wang, Zenglin Liu, Ji Zhou, Naizhou Wang, Xianghan Xu, Xianhui Chen, Sang-Wook Cheong, Shi-Jun Liang, Feng Miao

    Abstract: Nonreciprocal quantum transport effect is mainly governed by the symmetry breaking of the material systems and is gaining extensive attention in condensed matter physics. Realizing electrical switching of the polarity of the nonreciprocal transport without external magnetic field is essential to the development of nonreciprocal quantum devices. However, electrical switching of superconducting nonr… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Journal ref: Nature Communications 15, 4953 (2024)

  35. arXiv:2406.14056  [pdf, other

    cs.CV

    VGA: Vision GUI Assistant -- Minimizing Hallucinations through Image-Centric Fine-Tuning

    Authors: Ziyang Meng, Yu Dai, Zezheng Gong, Shaoxiong Guo, Minglong Tang, Tongquan Wei

    Abstract: Recent advances in Large Vision-Language Models (LVLMs) have significantly improve performance in image comprehension tasks, such as formatted charts and rich-content images. Yet, Graphical User Interface (GUI) pose a greater challenge due to their structured format and detailed textual information. Existing LVLMs often overly depend on internal knowledge and neglect image content, resulting in ha… ▽ More

    Submitted 21 June, 2024; v1 submitted 20 June, 2024; originally announced June 2024.

    Comments: 18 pages

    MSC Class: 68-04 68-04 ACM Class: I.2.7; I.2.10

  36. arXiv:2406.13526  [pdf, other

    eess.SP

    Using Geometrical information to Measure the Vibration of A Swaying Millimeter-wave Radar

    Authors: Chengyao Tang, Yongpeng Dai, Zhi Li, Tian Jin

    Abstract: This paper presents two new, simple yet effective approaches to measure the vibration of a swaying millimeter-wave radar (mmRadar) utilizing geometrical information. Specifically, for the planar vibrations, we firstly establish an equation based on the area difference between the swaying mmRadar and the reference objects at different moments, which enables the quantification of planar displacement… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 5 pages, 4 figures,submitted to the IEEE for publication

  37. arXiv:2406.11882  [pdf

    cs.AI cs.LG

    Applications of Explainable artificial intelligence in Earth system science

    Authors: Feini Huang, Shijie Jiang, Lu Li, Yongkun Zhang, Ye Zhang, Ruqing Zhang, Qingliang Li, Danxi Li, Wei Shangguan, Yongjiu Dai

    Abstract: In recent years, artificial intelligence (AI) rapidly accelerated its influence and is expected to promote the development of Earth system science (ESS) if properly harnessed. In application of AI to ESS, a significant hurdle lies in the interpretability conundrum, an inherent problem of black-box nature arising from the complexity of AI algorithms. To address this, explainable AI (XAI) offers a s… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  38. arXiv:2406.10472  [pdf, ps, other

    math.OC

    Exploiting Overlap Information in Chance-constrained Program with Random Right-hand Side

    Authors: Wei Lv, Wei-Kun Chen, Yu-Hong Dai, Xiao-Jiao Tong

    Abstract: We consider the chance-constrained program (CCP) with random right-hand side under a finite discrete distribution. It is known that the standard mixed integer linear programming (MILP) reformulation of the CCP is generally difficult to solve by general-purpose solvers as the branch-and-cut search trees are enormously large, partly due to the weak linear programming relaxation. In this paper, we id… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 34 pages, 3 figures, submitted for possible publication

    MSC Class: 90C11; 90C15

  39. arXiv:2406.07006  [pdf, other

    cs.CV

    MIPI 2024 Challenge on Few-shot RAW Image Denoising: Methods and Results

    Authors: Xin Jin, Chunle Guo, Xiaoming Li, Zongsheng Yue, Chongyi Li, Shangchen Zhou, Ruicheng Feng, Yuekun Dai, Peiqing Yang, Chen Change Loy, Ruoqi Li, Chang Liu, Ziyi Wang, Yao Du, Jingjing Yang, Long Bao, Heng Sun, Xiangyu Kong, Xiaoxia Xing, Jinlong Wu, Yuanyang Xue, Hyunhee Park, Sejun Song, Changho Kim, Jingfan Tan , et al. (17 additional authors not shown)

    Abstract: The increasing demand for computational photography and imaging on mobile platforms has led to the widespread development and integration of advanced image sensors with novel algorithms in camera systems. However, the scarcity of high-quality data for research and the rare opportunity for in-depth exchange of views from industry and academia constrain the development of mobile intelligent photogra… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: CVPR 2024 Mobile Intelligent Photography and Imaging (MIPI) Workshop--Few-shot RAWImage Denoising Challenge Report. Website: https://mipi-challenge.org/MIPI2024/

  40. arXiv:2406.06885  [pdf, other

    astro-ph.EP

    The Prevalence of Resonance Among Young, Close-in Planets

    Authors: Fei Dai, Max Goldberg, Konstantin Batygin, Jennifer van Saders, Eugene Chiang, Nick Choksi, Rixin Li, Erik A. Petigura, Gregory J. Gilbert, Sarah C. Millholland, Yuan-Zhe Dai, Luke Bouma, Lauren M. Weiss, Joshua N. Winn

    Abstract: Multiple planets undergoing disk migration may be captured into a chain of mean-motion resonances with the innermost planet parked near the disk's inner edge. Subsequent dynamical evolution may disrupt these resonances, leading to the non-resonant configurations typically observed among {\it Kepler} planets that are Gyrs old. In this scenario, resonant configurations are expected to be more common… ▽ More

    Submitted 21 August, 2024; v1 submitted 10 June, 2024; originally announced June 2024.

    Comments: 17 pages, 9 figures, accepted to AAS

  41. arXiv:2406.05775  [pdf, ps, other

    math.OC

    An efficient branch-and-cut approach for large-scale competitive facility location problems with limited choice rule

    Authors: Wei-Kun Chen, Wei-Yang Zhang, Yan-Ru Wang, Shahin Gelareh, Yu-Hong Dai

    Abstract: In the paper, we consider the competitive facility location problem with limited choice rule (CFLPLCR), which attempts to open a subset of facilities to maximize the net profit of a newcomer company, requiring customers to patronize only a limited number of opening facilities and an outside option. We propose an efficient branch-and-cut (B&C) approach for the CFLPLCR based on newly proposed mixed… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: 29 pages, 2 figures, submitted for possible publication

    MSC Class: 90C11

  42. arXiv:2406.05380  [pdf

    cond-mat.mtrl-sci

    Observation of floating surface state in obstructed atomic insulator candidate NiP$_2$

    Authors: Xiang-Rui Liu, Ming-Yuan Zhu, Yuanwen Feng, Meng Zeng, Xiao-Ming Ma, Yu-Jie Hao, Yue Dai, Rong-Hao Luo, Kohei Yamagami, Yi Liu, Shengtao Cui, Zhe Sun, Jia-Yu Liu, Zhengtai Liu, Mao Ye, Dawei Shen, Bing Li, Chang Liu

    Abstract: Obstructed atomic insulator is recently proposed as an unconventional material, in which electric charge centers localized at sites away from the atoms. A half-filling surface state would emerge at specific interfaces cutting through these charge centers and avoid intersecting any atoms. In this article, we utilized angle-resolved photoemission spectroscopy and density functional theory calculatio… ▽ More

    Submitted 16 June, 2024; v1 submitted 8 June, 2024; originally announced June 2024.

    Comments: 21 pages, 5 figures

  43. arXiv:2406.02833  [pdf, other

    cs.CV

    DenoDet: Attention as Deformable Multi-Subspace Feature Denoising for Target Detection in SAR Images

    Authors: Yimian Dai, Minrui Zou, Yuxuan Li, Xiang Li, Kang Ni, Jian Yang

    Abstract: Synthetic Aperture Radar (SAR) target detection has long been impeded by inherent speckle noise and the prevalence of diminutive, ambiguous targets. While deep neural networks have advanced SAR target detection, their intrinsic low-frequency bias and static post-training weights falter with coherent noise and preserving subtle details across heterogeneous terrains. Motivated by traditional SAR ima… ▽ More

    Submitted 10 August, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  44. arXiv:2405.21075  [pdf, other

    cs.CV cs.CL

    Video-MME: The First-Ever Comprehensive Evaluation Benchmark of Multi-modal LLMs in Video Analysis

    Authors: Chaoyou Fu, Yuhan Dai, Yongdong Luo, Lei Li, Shuhuai Ren, Renrui Zhang, Zihan Wang, Chenyu Zhou, Yunhang Shen, Mengdan Zhang, Peixian Chen, Yanwei Li, Shaohui Lin, Sirui Zhao, Ke Li, Tong Xu, Xiawu Zheng, Enhong Chen, Rongrong Ji, Xing Sun

    Abstract: In the quest for artificial general intelligence, Multi-modal Large Language Models (MLLMs) have emerged as a focal point in recent advancements. However, the predominant focus remains on developing their capabilities in static image understanding. The potential of MLLMs in processing sequential visual data is still insufficiently explored, highlighting the absence of a comprehensive, high-quality… ▽ More

    Submitted 16 June, 2024; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: Project Page: https://video-mme.github.io

  45. arXiv:2405.21022  [pdf, other

    cs.CL cs.CV

    You Only Scan Once: Efficient Multi-dimension Sequential Modeling with LightNet

    Authors: Zhen Qin, Yuxin Mao, Xuyang Shen, Dong Li, Jing Zhang, Yuchao Dai, Yiran Zhong

    Abstract: Linear attention mechanisms have gained prominence in causal language models due to their linear computational complexity and enhanced speed. However, the inherent decay mechanism in linear attention presents challenges when applied to multi-dimensional sequence modeling tasks, such as image processing and multi-modal learning. In these scenarios, the utilization of sequential scanning to establis… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Technical report. Yiran Zhong is the corresponding author. The code is available at https://github.com/OpenNLPLab/LightNet

  46. arXiv:2405.20862  [pdf, other

    cs.CR

    BackdoorIndicator: Leveraging OOD Data for Proactive Backdoor Detection in Federated Learning

    Authors: Songze Li, Yanbo Dai

    Abstract: In a federated learning (FL) system, decentralized data owners (clients) could upload their locally trained models to a central server, to jointly train a global model. Malicious clients may plant backdoors into the global model through uploading poisoned local models, causing misclassification to a target class when encountering attacker-defined triggers. Existing backdoor defenses show inconsist… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  47. FAST Discovery of Eight Isolated Millisecond Pulsars in NGC 6517

    Authors: Dejiang Yin, Li-yun Zhang, Lei Qian, Ralph P. Eatough, Baoda Li, Duncan R. Lorimer, Yinfeng Dai, Yaowei Li, Xingnan Zhang, Minghui Li, Tianhao Su, Yuxiao Wu, Yu Pan, Yujie Lian, Tong Liu, Zhen Yan, Zhichen Pan

    Abstract: We present the discovery of 8 isolated millisecond pulsars in Globular Cluster (GC) NGC 6517 using the Five-Hundred-meter Aperture Spherical radio Telescope (FAST). The spin periods of those pulsars (namely PSR J1801-0857K to R, or, NGC 6517K to R) are all shorter than 10 ms. With these discoveries, NGC 6517 is currently the GC with the most known pulsars in the FAST sky. The largest difference in… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: 21 pages, 2 figures, accepted for publication in The Astrophysical Journal Letters

  48. arXiv:2405.14291  [pdf, other

    cs.LG cs.AI cs.DC

    Variational Bayes for Federated Continual Learning

    Authors: Dezhong Yao, Sanmu Li, Yutong Dai, Zhiqiang Xu, Shengshan Hu, Peilin Zhao, Lichao Sun

    Abstract: Federated continual learning (FCL) has received increasing attention due to its potential in handling real-world streaming data, characterized by evolving data distributions and varying client classes over time. The constraints of storage limitations and privacy concerns confine local models to exclusively access the present data within each learning cycle. Consequently, this restriction induces p… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

  49. arXiv:2405.12679  [pdf

    cond-mat.mtrl-sci

    Observation of Spin Splitting in Room-Temperature Metallic Antiferromagnet CrSb

    Authors: Meng Zeng, Ming-Yuan Zhu, Yu-Peng Zhu, Xiang-Rui Liu, Xiao-Ming Ma, Yu-Jie Hao, Pengfei Liu, Gexing Qu, Yichen Yang, Zhicheng Jiang, Kohei Yamagami, Masashi Arita, Xiaoqian Zhang, Tian-Hao Shao, Yue Dai, Kenya Shimada, Zhengtai Liu, Mao Ye, Yaobo Huang, Qihang Liu, Chang Liu

    Abstract: Recently, unconventional antiferromagnets that enable the splitting of electronic spins have been theoretically proposed and experimentally realized, where the magnetic sublattices containing moments pointing at different directions are connected by a novel set of symmetries. Such spin splitting (SS) is substantial, $k$-dependent, and independent of the spin-orbit coupling strength, making these m… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 14 pages, 4 figures

  50. arXiv:2405.12483  [pdf

    physics.optics physics.app-ph

    Molecule-induced surface second-order nonlinearity in an inversion symmetric microcavity

    Authors: Ru Wang, Yue Dai, Jinsong Cheng, Ruoyu Wang, Xiaoqin Shen

    Abstract: Inversion symmetry eliminates the second-order nonlinear responses in materials commonly used in silicon photonics with electric-dipole approximation. The lack of effective methods to induce the second-order nonlinearity in silicon photonic materials prevents their applications in second-order nonlinear integrated photonics. Here, we experimentally demonstrate a surface second-order nonlinear opti… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 22