Showing 101–150 of 891 results for author: Dai, Y

  1. arXiv:2402.18795  [pdf, ps, other]

    math.OC

    Towards Large-scale Probabilistic Set Covering Problem: An Efficient Benders Decomposition Approach

    Authors: Wei-Kun Chen, Yi-Long Chen, Yu-Hong Dai, Wei Lv

    Abstract: In this paper, we investigate the probabilistic set covering problems (PSCP) in which the right-hand side is a random vector ξ and the covering constraint is required to be satisfied with a prespecified probability. We consider the case arising from sample average approximation (or finite discrete distributions). We develop an effective Benders decomposition (BD) algorithm for solving large-scale… ▽ More

    Submitted 28 February, 2024; originally announced February 2024.

    Comments: 13 pages, accepted for publication in IOS 2024

  2. arXiv:2402.16696  [pdf, other]

    cs.CL

    Look Before You Leap: Towards Decision-Aware and Generalizable Tool-Usage for Large Language Models

    Authors: Anchun Gui, Jian Li, Yong Dai, Nan Du, Han Xiao

    Abstract: Tool-augmented large language models (LLMs) are attracting widespread attention when accessing up-to-date knowledge and alleviating hallucination issues. Nowadays, advanced closed-source LLMs (e.g., ChatGPT) have demonstrated surprising tool-usage capabilities through prompting and in-context learning techniques. To empower the capabilities of open-source LLMs (e.g., LLaMA) in manipulating tools,… ▽ More

    Submitted 28 August, 2024; v1 submitted 26 February, 2024; originally announced February 2024.

    Comments: 20 pages, 18 figures

  3. arXiv:2402.15623  [pdf, other]

    cs.CL cs.HC cs.IR cs.LG

    Language-Based User Profiles for Recommendation

    Authors: Joyce Zhou, Yijia Dai, Thorsten Joachims

    Abstract: Most conventional recommendation methods (e.g., matrix factorization) represent user profiles as high-dimensional vectors. Unfortunately, these vectors lack interpretability and steerability, and often perform poorly in cold-start settings. To address these shortcomings, we explore the use of user profiles that are represented as human-readable text. We propose the Language-based Factorization Mod… ▽ More

    Submitted 23 February, 2024; originally announced February 2024.

    Comments: 8 pages (4 in appendix), 22 tables/figures (16 in appendix). Accepted to LLM-IGS@WSDM2024 workshop, now sharing this slightly updated revision version with workshop

  4. arXiv:2402.12659  [pdf, other]

    cs.CL cs.AI cs.CE

    FinBen: A Holistic Financial Benchmark for Large Language Models

    Authors: Qianqian Xie, Weiguang Han, Zhengyu Chen, Ruoyu Xiang, Xiao Zhang, Yueru He, Mengxi Xiao, Dong Li, Yongfu Dai, Duanyu Feng, Yijing Xu, Haoqiang Kang, Ziyan Kuang, Chenhan Yuan, Kailai Yang, Zheheng Luo, Tianlin Zhang, Zhiwei Liu, Guojun Xiong, Zhiyang Deng, Yuechen Jiang, Zhiyuan Yao, Haohang Li, Yangyang Yu, Gang Hu , et al. (9 additional authors not shown)

    Abstract: LLMs have transformed NLP and shown promise in various fields, yet their potential in finance is underexplored due to a lack of comprehensive evaluation benchmarks, the rapid development of LLMs, and the complexity of financial tasks. In this paper, we introduce FinBen, the first extensive open-source evaluation benchmark, including 36 datasets spanning 24 financial tasks, covering seven critical… ▽ More

    Submitted 18 June, 2024; v1 submitted 19 February, 2024; originally announced February 2024.

    Comments: 26 pages, 11 figures

  5. arXiv:2402.07082  [pdf, ps, other]

    cs.LG cs.GT stat.ML

    Refined Sample Complexity for Markov Games with Independent Linear Function Approximation

    Authors: Yan Dai, Qiwen Cui, Simon S. Du

    Abstract: Markov Games (MG) is an important model for Multi-Agent Reinforcement Learning (MARL). It was long believed that the "curse of multi-agents" (i.e., the algorithmic performance drops exponentially with the number of agents) is unavoidable until several recent works (Daskalakis et al., 2023; Cui et al., 2023; Wang et al., 2023). While these works resolved the curse of multi-agents, when the state sp… ▽ More

    Submitted 11 June, 2024; v1 submitted 10 February, 2024; originally announced February 2024.

    Comments: Accepted for presentation at the Conference on Learning Theory (COLT) 2024

  6. arXiv:2402.06859  [pdf, other]

    cs.LG cs.AI cs.IR

    LiRank: Industrial Large Scale Ranking Models at LinkedIn

    Authors: Fedor Borisyuk, Mingzhou Zhou, Qingquan Song, Siyu Zhu, Birjodh Tiwana, Ganesh Parameswaran, Siddharth Dangi, Lars Hertel, Qiang Xiao, Xiaochen Hou, Yunbo Ouyang, Aman Gupta, Sheallika Singh, Dan Liu, Hailing Cheng, Lei Le, Jonathan Hung, Sathiya Keerthi, Ruoyan Wang, Fengyu Zhang, Mohit Kothari, Chen Zhu, Daqi Sun, Yun Dai, Xun Luan , et al. (9 additional authors not shown)

    Abstract: We present LiRank, a large-scale ranking framework at LinkedIn that brings to production state-of-the-art modeling architectures and optimization methods. We unveil several modeling improvements, including Residual DCN, which adds attention and residual connections to the famous DCNv2 architecture. We share insights into combining and tuning SOTA architectures to create a unified model, including… ▽ More

    Submitted 7 August, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

    ACM Class: H.3.3

  7. arXiv:2402.05394  [pdf, other]

    cs.CV

    Enhancing Zero-shot Counting via Language-guided Exemplar Learning

    Authors: Mingjie Wang, Jun Zhou, Yong Dai, Eric Buys, Minglun Gong

    Abstract: Recently, Class-Agnostic Counting (CAC) problem has garnered increasing attention owing to its intriguing generality and superior efficiency compared to Category-Specific Counting (CSC). This paper proposes a novel ExpressCount to enhance zero-shot object counting by delving deeply into language-guided exemplar learning. Specifically, the ExpressCount is comprised of an innovative Language-oriente… ▽ More

    Submitted 7 February, 2024; originally announced February 2024.

  8. arXiv:2402.05141  [pdf, other]

    math.OC cs.LG

    Tensor Completion via Integer Optimization

    Authors: Xin Chen, Sukanya Kudva, Yongzheng Dai, Anil Aswani, Chen Chen

    Abstract: The main challenge with the tensor completion problem is a fundamental tension between computation power and the information-theoretic sample complexity rate. Past approaches either achieve the information-theoretic rate but lack practical algorithms to compute the corresponding solution, or have polynomial-time algorithms that require an exponentially-larger number of samples for low estimation e… ▽ More

    Submitted 3 April, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

  9. arXiv:2402.03909  [pdf, other]

    astro-ph.GA

    Dust and Cold Gas Properties of Starburst HyLIRG-Quasars at $z \sim 2.5$

    Authors: Feng-Yuan Liu, Y. Sophia Dai, Alain Omont, Daizhong Liu, Pierre Cox, Roberto Neri, Melanie Krips, Chentao Yang, Xue-Bing Wu, Jia-Sheng Huang

    Abstract: Some high-z active galactic nuclei (AGNs) are found to reside in extreme star-forming galaxies, such as hyper-luminous infrared galaxies (HyLIRGs), with AGN-removed $L_{\rm{IR}}$ of $>10^{13} L_{\rm{\odot}}$. In this paper, we report NOEMA observations of six apparent starburst HyLIRGs associated with optical quasars at $z\sim2-3$ in the Stripe 82 field, to study their dust and molecular CO proper… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: 22 pages, 6 figures, 7 tables. Accepted for publication in The Astrophysical Journal

  10. arXiv:2402.03817   

    eess.SY

    Improvement of Frequency Source Phase Noise Reduction Design under Vibration Condition

    Authors: Liwei Yin, Yongjiang Shu, Heng Zhang, Yuefei Dai, Xiaopeng Lu, Yunlong Lian, Zhonghua Wang, Yong Ding

    Abstract: Reasonable vibration reduction design is an important way to achieve low phase noise index of airborne frequency source output signal. Aiming at the problem of phase noise deterioration of an airborne frequency source under random condition, this paper proposes to improve the vibration reduction mode crystal oscillator and reduce the distance between the barycenter of frequency source and crystal… ▽ More

    Submitted 16 July, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: There are many errors. 1.Fig. 2 Block Diagram of Frequency Source Circuit is not correct. 2.C-band C1 signal 6000MHz continuous wave signal is error. 3.Fig. 4 Steady State Phase Noise and Spectrum of 2400MHz before Improvement is error. 4.Table 1 Steady State Phase Noise at each Frequency Point of the Output of the Frequency Source before Improvement is error. 5. Frequency range is error

    MSC Class: D.3.2 ACM Class: B.6.2

  11. arXiv:2402.03741  [pdf, other]

    cs.LG cs.AI cs.CR

    SUB-PLAY: Adversarial Policies against Partially Observed Multi-Agent Reinforcement Learning Systems

    Authors: Oubo Ma, Yuwen Pu, Linkang Du, Yang Dai, Ruo Wang, Xiaolei Liu, Yingcai Wu, Shouling Ji

    Abstract: Recent advancements in multi-agent reinforcement learning (MARL) have opened up vast application prospects, such as swarm control of drones, collaborative manipulation by robotic arms, and multi-target encirclement. However, potential security threats during the MARL deployment need more attention and thorough investigation. Recent research reveals that attackers can rapidly exploit the victim's v… ▽ More

    Submitted 26 June, 2024; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: To appear in the ACM Conference on Computer and Communications Security (CCS'24), October 14-18, 2024, Salt Lake City, UT, USA

  12. arXiv:2402.03692  [pdf, ps, other]

    hep-ph hep-ex nucl-ex nucl-th

    On the non-universality of heavy quark hadronization in elementary high-energy collisions

    Authors: Yuxuan Dai, Shouxing Zhao, Min He

    Abstract: It has been traditionally hypothesized that the heavy quark (charm, $c$ and bottom, $b$) fragmentation is universal across different collision systems, based on the notion that hadronization as a soft process should occur at the characteristic non-perturbative QCD scale, $Λ_{QCD}$. However, this universality hypothesis has recently been challenged by the observation that the $c$- and $b$-baryon pr… ▽ More

    Submitted 5 February, 2024; originally announced February 2024.

    Comments: 6 pages, 3 figures

  13. arXiv:2402.03352  [pdf, other]

    math.OC cs.LG stat.ML

    Zeroth-Order primal-dual Alternating Projection Gradient Algorithms for Nonconvex Minimax Problems with Coupled linear Constraints

    Authors: Huiling Zhang, Zi Xu, Yuhong Dai

    Abstract: In this paper, we study zeroth-order algorithms for nonconvex minimax problems with coupled linear constraints under the deterministic and stochastic settings, which have attracted wide attention in machine learning, signal processing and many other fields in recent years, e.g., adversarial attacks in resource allocation problems and network flow problems etc. We propose two single-loop algorithms… ▽ More

    Submitted 26 January, 2024; originally announced February 2024.

    Comments: arXiv admin note: text overlap with arXiv:2212.04672

  14. arXiv:2402.02447  [pdf, other]

    cs.LG cs.CL

    Breaking MLPerf Training: A Case Study on Optimizing BERT

    Authors: Yongdeok Kim, Jaehyung Ahn, Myeongwoo Kim, Changin Choi, Heejae Kim, Narankhuu Tuvshinjargal, Seungwon Lee, Yanzi Zhang, Yuan Pei, Xiongzhan Linghu, Jingkun Ma, Lin Chen, Yuehua Dai, Sungjoo Yoo

    Abstract: Speeding up the large-scale distributed training is challenging in that it requires improving various components of training including load balancing, communication, optimizers, etc. We present novel approaches for fast large-scale training of BERT model which individually ameliorates each component thereby leading to a new level of BERT training performance. Load balancing is imperative in distri… ▽ More

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: Total 15 pages (Appendix 3 pages)

  15. arXiv:2402.01567  [pdf, other]

    cs.LG math.OC

    Understanding Adam Optimizer via Online Learning of Updates: Adam is FTRL in Disguise

    Authors: Kwangjun Ahn, Zhiyu Zhang, Yunbum Kook, Yan Dai

    Abstract: Despite the success of the Adam optimizer in practice, the theoretical understanding of its algorithmic components still remains limited. In particular, most existing analyses of Adam show the convergence rate that can be simply achieved by non-adaptive algorithms like SGD. In this work, we provide a different perspective based on online learning that underscores the importance of Adam's algorithmi… ▽ More

    Submitted 30 May, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

    Comments: Accepted at ICML 2024

  16. arXiv:2401.17186  [pdf, other]

    cs.CV cs.AI cs.IR

    Embracing Language Inclusivity and Diversity in CLIP through Continual Language Learning

    Authors: Bang Yang, Yong Dai, Xuxin Cheng, Yaowei Li, Asif Raza, Yuexian Zou

    Abstract: While vision-language pre-trained models (VL-PTMs) have advanced multimodal research in recent years, their mastery in a few languages like English restricts their applicability in broader communities. To this end, there is an increasing interest in developing multilingual VL models via a joint-learning setup, which, however, could be unrealistic due to expensive costs and data availability. In th… ▽ More

    Submitted 30 January, 2024; originally announced January 2024.

    Comments: Accepted by AAAI'2024, 15 pages (with appendix), 7 figures, 10 tables

  17. arXiv:2401.16720  [pdf, other]

    cs.LG cs.CV

    SmartFRZ: An Efficient Training Framework using Attention-Based Layer Freezing

    Authors: Sheng Li, Geng Yuan, Yue Dai, Youtao Zhang, Yanzhi Wang, Xulong Tang

    Abstract: There has been a proliferation of artificial intelligence applications, where model training is key to promising high-quality services for these applications. However, the model training process is both time-intensive and energy-intensive, inevitably affecting the user's demand for application efficiency. Layer freezing, an efficient model training technique, has been proposed to improve training… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  18. arXiv:2401.16694  [pdf, other]

    cs.LG cs.CV cs.DC

    etuner: A Redundancy-Aware Framework for Efficient Continual Learning Application on Edge Devices

    Authors: Sheng Li, Geng Yuan, Yawen Wu, Yue Dai, Tianyu Wang, Chao Wu, Alex K. Jones, Jingtong Hu, Yanzhi Wang, Xulong Tang

    Abstract: Many emerging applications, such as robot-assisted eldercare and object recognition, generally employ deep learning neural networks (DNNs) and require the deployment of DNN models on edge devices. These applications naturally require i) handling streaming-in inference requests and ii) fine-tuning the deployed models to adapt to possible deployment scenario changes. Continual learning (CL) is widel… ▽ More

    Submitted 22 August, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

  19. arXiv:2401.14579  [pdf]

    cs.CV

    Recognizing Multiple Ingredients in Food Images Using a Single-Ingredient Classification Model

    Authors: Kun Fu, Ying Dai

    Abstract: Recognizing food images presents unique challenges due to the variable spatial layout and shape changes of ingredients with different cooking and cutting methods. This study introduces an advanced approach for recognizing ingredients segmented from food images. The method localizes the candidate regions of the ingredients using the locating and sliding window techniques. Then, these regions are as… ▽ More

    Submitted 18 February, 2024; v1 submitted 25 January, 2024; originally announced January 2024.

    Comments: 9 pages, 21 figures, 6 tables

  20. arXiv:2401.13919  [pdf, other]

    cs.CL cs.AI

    WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models

    Authors: Hongliang He, Wenlin Yao, Kaixin Ma, Wenhao Yu, Yong Dai, Hongming Zhang, Zhenzhong Lan, Dong Yu

    Abstract: The rapid advancement of large language models (LLMs) has led to a new era marked by the development of autonomous applications in real-world scenarios, which drives innovation in creating advanced web agents. Existing web agents typically only handle one input modality and are evaluated only in simplified web simulators or static web snapshots, greatly limiting their applicability in real-world s… ▽ More

    Submitted 6 June, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: Accepted to ACL 2024 (main). Code and data is released at https://github.com/MinorJerry/WebVoyager

  21. arXiv:2401.13250  [pdf, ps, other]

    physics.plasm-ph

    Interferences effects in polarized nonlinear Breit-Wheeler process

    Authors: Jing-Jing Jiang, Ya-Nan Dai, Kai-Hong Zhuang, Yunquan Gao, Suo Tang, Yue-Yue Chen

    Abstract: The creation of polarized electron-positron pairs by the nonlinear Breit-Wheeler process in short laser pulses is investigated using the Baier-Katkov semiclassical method beyond local-constant-field approximation (LCFA), which allows for identifying the interferences effects in the positron polarization. When the laser intensity is in the intermediate %multiphoton regime, the interferences of pair… ▽ More

    Submitted 24 January, 2024; originally announced January 2024.

  22. arXiv:2401.11168  [pdf, ps, other]

    hep-th physics.plasm-ph

    Fermionic signal of vacuum polarization in strong laser fields

    Authors: Ya-Nan Dai, Karen Z. Hatsagortsyan, Christoph H. Keitel, Yue-Yue Chen

    Abstract: Vacuum polarization (VP) is investigated for the interaction of a polarized $γ$-ray beam of GeV photons with a counterpropagating ultraintense laser pulse. In a conventional setup of a vacuum birefringence measurement, a VP signal is the emerging small circular (linear) polarization of the initially linearly (circularly) polarized probe photons. The pair production via the nonlinear Breit-Wheeler… ▽ More

    Submitted 2 June, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

  23. arXiv:2401.07828  [pdf]

    cond-mat.mtrl-sci

    Transient Magnetoelastic Coupling in CrSBr

    Authors: Youn Jue Bae, Taketo Handa, Yanan Dai, Jue Wang, Huicong Liu, Allen Scheie, Daniel G. Chica, Michael E. Ziebel, Andrew D. Kent, Xiaodong Xu, Ka Shen, Xavier Roy, Xiaoyang Zhu

    Abstract: Recent research has revealed remarkable properties of the two-dimensional (2D) van der Waals layered crystal CrSBr, which is both a semiconductor and an A-type antiferromagnet. Here we show the role of strong magnetoelastic coupling in the generation and propagation of coherent magnons in CrSBr. Time and spatially resolved magneto-optical Kerr effect (tr-MOKE) microscopy reveals two time-varying t… ▽ More

    Submitted 15 January, 2024; originally announced January 2024.

    Comments: 12 pages, 4 figures, SI

  24. arXiv:2401.07253  [pdf, ps, other]

    physics.plasm-ph

    Enhanced α particle generation via proton-boron fusion reactions in laser-modulated plasma

    Authors: Yihang Zhang, Zhe Zhang, Yufeng Dong, Ke Fang, Haochen Gu, Yu Dai, Wei Qi, Zhigang Deng, Xiaohui Zhang, Lei Yang, Feng Lu, Zheng Huang, Kainan Zhou, Yuchi Wu, Weimin Zhou, Feng Liu, Guoqiang Zhang, Bingjun Li, Xu Zhao, Xiaohui Yuan, Chen Wang, Yutong Li

    Abstract: Aneutronic and nonradioactive properties make the proton-boron fusion a prospective candidate for fusion energy production through reactions following p+$^{11}$B$\rightarrow$3$α$ (p-$^{11}$B). However, it is difficult to achieve a thermal fusion ignition, since the low reaction cross-sections for center-of-mass energy below $\sim$100 keV. To realize fusion energy gain, it is essential to consider… ▽ More

    Submitted 14 January, 2024; originally announced January 2024.

  25. arXiv:2401.05881  [pdf]

    cs.RO

    Volume Transfer: A New Design Concept for Fabric-Based Pneumatic Exosuits

    Authors: Chendong Liu, Dapeng Yang, Jiachen Chen, Yiming Dai, Li Jiang, Hong Liu

    Abstract: The fabric-based pneumatic exosuit is now a hot research topic because it is lighter and softer than traditional exoskeletons. Existing research focused more on the mechanical properties of the exosuit (e.g., torque and speed), but less on its wearability (e.g., appearance and comfort). This work presents a new design concept for fabric-based pneumatic exosuits Volume Transfer, which means transfe… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

  26. arXiv:2401.03868  [pdf, other]

    cs.AR cs.AI

    FlightLLM: Efficient Large Language Model Inference with a Complete Mapping Flow on FPGAs

    Authors: Shulin Zeng, Jun Liu, Guohao Dai, Xinhao Yang, Tianyu Fu, Hongyi Wang, Wenheng Ma, Hanbo Sun, Shiyao Li, Zixiao Huang, Yadong Dai, Jintao Li, Zehao Wang, Ruoyu Zhang, Kairui Wen, Xuefei Ning, Yu Wang

    Abstract: Transformer-based Large Language Models (LLMs) have made a significant impact on various domains. However, LLMs' efficiency suffers from both heavy computation and memory overheads. Compression techniques like sparsification and quantization are commonly used to mitigate the gap between LLM's computation/memory overheads and hardware capacity. However, existing GPU and transformer-based accelerato… ▽ More

    Submitted 9 January, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

    Comments: Accepted to FPGA'24

  27. arXiv:2401.03416  [pdf, other]

    astro-ph.GA

    Active Galactic Nuclei in a Mid-Infrared Selected Galaxy Sample at z>0.13: [Ne V]3426 Line Emission as a Benchmark

    Authors: Zi-Jian Li, Y. Sophia Dai, Jia-Sheng Huang, Stijn Wuyts, Tian-Wen Cao

    Abstract: We present a 24 um-selected spectroscopic sample z > 0.13 (median z = 0.41) in the Lockman Hole field, consisting of 4035 spectra. Our aim is to identify AGNs and determine their fraction in this mid-infrared selected sample. In this work, we use the [Ne V]3426 emission line to spectroscopically identify AGNs. Combined with broad-line Type I AGNs selected in our previous study, our sample consists… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: 16 pages, 14 figures. Accepted for publication in ApJ

  28. Exploring Multi-Modal Control in Music-Driven Dance Generation

    Authors: Ronghui Li, Yuqin Dai, Yachao Zhang, Jun Li, Jian Yang, Jie Guo, Xiu Li

    Abstract: Existing music-driven 3D dance generation methods mainly concentrate on high-quality dance generation, but lack sufficient control during the generation process. To address these issues, we propose a unified framework capable of generating high-quality dance movements and supporting multi-modal control, including genre control, semantic control, and spatial control. First, we decouple the dance ge… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

  29. arXiv:2401.00926  [pdf, other]

    cs.CV cs.AI

    Accurate Leukocyte Detection Based on Deformable-DETR and Multi-Level Feature Fusion for Aiding Diagnosis of Blood Diseases

    Authors: Yifei Chen, Chenyan Zhang, Ben Chen, Yiyu Huang, Yifei Sun, Changmiao Wang, Xianjun Fu, Yuxing Dai, Feiwei Qin, Yong Peng, Yu Gao

    Abstract: In standard hospital blood tests, the traditional process requires doctors to manually isolate leukocytes from microscopic images of patients' blood using microscopes. These isolated leukocytes are then categorized via automatic leukocyte classifiers to determine the proportion and volume of different types of leukocytes present in the blood samples, aiding disease diagnosis. This methodology is n… ▽ More

    Submitted 10 January, 2024; v1 submitted 1 January, 2024; originally announced January 2024.

    Comments: 15 pages, 11 figures, accepted by Computers in Biology and Medicine 2024

    Journal ref: Computers in Biology and Medicine 2024

  30. Text2Avatar: Text to 3D Human Avatar Generation with Codebook-Driven Body Controllable Attribute

    Authors: Chaoqun Gong, Yuqin Dai, Ronghui Li, Achun Bao, Jun Li, Jian Yang, Yachao Zhang, Xiu Li

    Abstract: Generating 3D human models directly from text helps reduce the cost and time of character modeling. However, achieving multi-attribute controllable and realistic 3D human avatar generation is still challenging due to feature coupling and the scarcity of realistic 3D human avatar datasets. To address these issues, we propose Text2Avatar, which can generate realistic-style 3D avatars based on the co… ▽ More

    Submitted 1 January, 2024; originally announced January 2024.

  31. Host galaxy and nuclear properties of IR-selected AGNs with and without outflow signatures

    Authors: Gabriel A. Oio, Y. Sophia Dai, C. G. Bornancini, Zi-Jian Li

    Abstract: Active galactic nucleus (AGN) driven outflows can have a significant impact on the evolution of the host galaxy. In this work, we compare the properties of galaxies that host AGNs with and without outflows. Our sample consists of 103 AGNs identified by mid-IR color-color selection, and confirmed with optical spectroscopy at a redshift range of 0.3 $\lesssim$ z $\lesssim$ 0.9. We fit the [OIII]… ▽ More

    Submitted 5 March, 2024; v1 submitted 26 December, 2023; originally announced December 2023.

    Comments: Published in ApJ, 23 pages, 16 figures

    Journal ref: ApJ 962 146 (2024)

  32. arXiv:2312.14988  [pdf, other]

    cs.CV

    Emage: Non-Autoregressive Text-to-Image Generation

    Authors: Zhangyin Feng, Runyi Hu, Liangxin Liu, Fan Zhang, Duyu Tang, Yong Dai, Xiaocheng Feng, Jiwei Li, Bing Qin, Shuming Shi

    Abstract: Autoregressive and diffusion models drive the recent breakthroughs on text-to-image generation. Despite their huge success of generating high-realistic images, a common shortcoming of these models is their high inference latency - autoregressive models run more than a thousand times successively to produce image tokens and diffusion models convert Gaussian noise into images with many hundreds of d… ▽ More

    Submitted 22 December, 2023; originally announced December 2023.

  33. arXiv:2312.14161  [pdf, other]

    stat.AP physics.data-an physics.soc-ph

    Statistical Machine Learning Meets High-Dimensional Spatiotemporal Challenges -- A Case Study of COVID-19 Modeling

    Authors: Binbin Lin, Yimin Dai, Lei Zou, Ning Ning

    Abstract: Diverse non-pharmacological interventions (NPIs), serving as the primary approach for COVID-19 control prior to pharmaceutical interventions, showed heterogeneous spatiotemporal effects on pandemic management. Investigating the dynamic compounding impacts of NPIs on pandemic spread is imperative. However, the challenges posed by data availability of high-dimensional human behaviors and the complex… ▽ More

    Submitted 28 November, 2023; originally announced December 2023.

  34. arXiv:2312.10997  [pdf, other]

    cs.CL cs.AI

    Retrieval-Augmented Generation for Large Language Models: A Survey

    Authors: Yunfan Gao, Yun Xiong, Xinyu Gao, Kangxiang Jia, Jinliu Pan, Yuxi Bi, Yi Dai, Jiawei Sun, Meng Wang, Haofen Wang

    Abstract: Large Language Models (LLMs) showcase impressive capabilities but encounter challenges like hallucination, outdated knowledge, and non-transparent, untraceable reasoning processes. Retrieval-Augmented Generation (RAG) has emerged as a promising solution by incorporating knowledge from external databases. This enhances the accuracy and credibility of the generation, particularly for knowledge-inten… ▽ More

    Submitted 27 March, 2024; v1 submitted 18 December, 2023; originally announced December 2023.

    Comments: Ongoing Work

  35. arXiv:2312.10181  [pdf, other]

    cs.LG cs.AI cs.CY

    Coupling Fairness and Pruning in a Single Run: a Bi-level Optimization Perspective

    Authors: Yucong Dai, Gen Li, Feng Luo, Xiaolong Ma, Yongkai Wu

    Abstract: Deep neural networks have demonstrated remarkable performance in various tasks. With a growing need for sparse deep learning, model compression techniques, especially pruning, have gained significant attention. However, conventional pruning techniques can inadvertently exacerbate algorithmic bias, resulting in unequal predictions. To address this, we define a fair pruning task where a sparse model… ▽ More

    Submitted 15 December, 2023; originally announced December 2023.

  36. arXiv:2312.07401  [pdf, other]

    cs.AI

    On Diversified Preferences of Large Language Model Alignment

    Authors: Dun Zeng, Yong Dai, Pengyu Cheng, Longyue Wang, Tianhao Hu, Wanshun Chen, Nan Du, Zenglin Xu

    Abstract: Aligning large language models (LLMs) with human preferences has been recognized as the key to improving LLMs' interaction quality. However, in this pluralistic world, human preferences can be diversified due to annotators' different tastes, which hinders the effectiveness of LLM alignment methods. This paper presents the first quantitative analysis of commonly used human feedback datasets to inve… ▽ More

    Submitted 17 April, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: preprint

  37. arXiv:2312.06964  [pdf, other]

    astro-ph.IM hep-ex physics.ins-det

    Ground Calibration Result of the Lobster Eye Imager for Astronomy

    Authors: Huaqing Cheng, Zhixing Ling, Chen Zhang, Xiaojin Sun, Shengli Sun, Yuan Liu, Yanfeng Dai, Zhenqing Jia, Haiwu Pan, Wenxin Wang, Donghua Zhao, Yifan Chen, Zhiwei Cheng, Wei Fu, Yixiao Han, Junfei Li, Zhengda Li, Xiaohao Ma, Yulong Xue, Ailiang Yan, Qiang Zhang, Yusa Wang, Xiongtao Yang, Zijian Zhao, Weimin Yuan

    Abstract: We report on results of the on-ground X-ray calibration of the Lobster Eye Imager for Astronomy (LEIA), an experimental space wide-field (18.6*18.6 square degrees) X-ray telescope built from novel lobster eye micro-pore optics. LEIA was successfully launched on July 27, 2022 onboard the SATech-01 satellite. To achieve full characterisation of its performance before launch, a series of tests and ca… ▽ More

    Submitted 11 December, 2023; originally announced December 2023.

    Comments: 24 pages, 13 figures. Submitted to Experimental Astronomy

  38. arXiv:2312.06171  [pdf, other]

    cs.CV cs.MM

    Jointly Explicit and Implicit Cross-Modal Interaction Network for Anterior Chamber Inflammation Diagnosis

    Authors: Qian Shao, Ye Dai, Haochao Ying, Kan Xu, Jinhong Wang, Wei Chi, Jian Wu

    Abstract: Uveitis demands the precise diagnosis of anterior chamber inflammation (ACI) for optimal treatment. However, current diagnostic methods only rely on a limited single-modal disease perspective, which leads to poor performance. In this paper, we investigate a promising yet challenging way to fuse multimodal data for ACI diagnosis. Notably, existing fusion paradigms focus on empowering implicit modal… ▽ More

    Submitted 19 December, 2023; v1 submitted 11 December, 2023; originally announced December 2023.

  39. arXiv:2312.06085  [pdf, other]

    cs.CV

    Robust Geometry and Reflectance Disentanglement for 3D Face Reconstruction from Sparse-view Images

    Authors: Daisheng Jin, Jiangbei Hu, Baixin Xu, Yuxin Dai, Chen Qian, Ying He

    Abstract: This paper presents a novel two-stage approach for reconstructing human faces from sparse-view images, a task made challenging by the unique geometry and complex skin reflectance of each individual. Our method focuses on decomposing key facial attributes, including geometry, diffuse reflectance, and specular reflectance, from ambient light. Initially, we create a general facial template from a div… ▽ More

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: 8 pages, 8 figures

  40. arXiv:2312.06067  [pdf, other]

    astro-ph.HE astro-ph.GA

    Three Pulsars Discovered in Globular Cluster M15 (NGC 7078) with FAST

    Authors: Yuxiao Wu, Zhichen Pan, Lei Qian, Scott Ransom, BoJun Wang, Zhen Yan, Jintao Luo, Liyun Zhang, Minghui Li, Dejiang Yin, Baoda Li, Yifeng Li, Yinfeng Dai, Yaowei Li, Xinnan Zhang, Tong Liu, Yu Pan

    Abstract: We present the discovery of three pulsars in Globular Cluster M15 (NGC 7078) by the Five-hundred-meter Aperture Spherical radio Telescope (FAST). In the three pulsars, PSR~J2129+1210J (M15J) is a millisecond pulsar with a spinning period of 11.84 ms and a dispersion measure of 66.68 pc cm$^{-3}$. Both PSR~J2129+1210K and L (M15K and L) are long period pulsars with spinning periods of 1928 ms and 3…

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: 10 pages, 4 figures, 2 tables, submitted to ApJ Letter

  41. arXiv:2312.05385  [pdf, other]

    cs.DC cs.LG

    Apparate: Rethinking Early Exits to Tame Latency-Throughput Tensions in ML Serving

    Authors: Yinwei Dai, Rui Pan, Anand Iyer, Kai Li, Ravi Netravali

    Abstract: Machine learning (ML) inference platforms are tasked with balancing two competing goals: ensuring high throughput given many requests, and delivering low-latency responses to support interactive applications. Unfortunately, existing platform knobs (e.g., batch sizes) fail to ease this fundamental tension, and instead only enable users to harshly trade off one property for the other. This paper exp…

    Submitted 8 December, 2023; originally announced December 2023.

    Comments: The first two authors contributed equally and are alphabetically ordered

  42. arXiv:2312.03284  [pdf]

    eess.SP

    Adaptive Multi-band Modulation for Robust and Low-complexity Faster-than-Nyquist Non-Orthogonal FDM IM-DD System

    Authors: Peiji Song, Zhouyi Hu, Yizhan Dai, Yuan Liu, Chao Gao, Chun-Kit Chan

    Abstract: Faster-than-Nyquist non-orthogonal frequency-division multiplexing (FTN-NOFDM) is robust against the steep frequency roll-off by saving signal bandwidth. Among the FTN-NOFDM techniques, the non-orthogonal matrix precoding (NOM-p) based FTN has high compatibility with the conventional orthogonal frequency division multiplexing (OFDM), in terms of the advanced digital signal processing already used…

    Submitted 5 December, 2023; originally announced December 2023.

  43. arXiv:2312.03175  [pdf]

    physics.app-ph cond-mat.mtrl-sci

    Large electrobending deformation caused by defect dipoles

    Authors: Shuo Tian, Bin Li, Yejing Dai

    Abstract: Ultrahigh electrostrains (greater than 1%) in several piezoceramic systems have been reported since 2022, which attract more and more interest in the field of piezoelectricity; however, the mechanism is still unclear. Here, we have directly observed a novel electric field-induced bending (electrobending) phenomenon that visually exhibits as an alternating concave-convex deformation under an elect…

    Submitted 5 December, 2023; originally announced December 2023.

  44. arXiv:2312.00643  [pdf]

    cond-mat.mtrl-sci

    Voltage-driven 90° switching of bulk perpendicular magnetic anisotropy in ferrimagnets

    Authors: Zhengyu Xiao, Ruiwen Xie, Fernando Maccari, Philipp Klaassen, Benedikt Eggert, Di Wang, Yuting Dai, Raquel Lizarraga, Johanna Lill, Tom Helbig, Heiko Wende, Kurt Kummer, Katharina Ollefs, Konstantin Skokov, Hongbin Zhang, Zhiyong Quan, Xiaohong Xu, Robert Kruk, Horst Hahn, Oliver Gutfleisch, Xinglong Ye

    Abstract: Rare earth-transition metal ferrimagnets, featuring antiferromagnetically coupled, inequivalent magnetic sublattices, have garnered increasing interest in the burgeoning field of ferrimagnetic spintronics. However, controlling their magnetism with low voltages, a key to reducing power consumption, remains challenging, particularly due to the poorly understood mechanisms underlying bulk perpendicular…

    Submitted 14 June, 2024; v1 submitted 1 December, 2023; originally announced December 2023.

    Comments: 4 Figures in manuscript (15 pages); 16 Figures, two Tables in Supplementary materials (21 pages)

  45. arXiv:2312.00267  [pdf, other]

    cs.LG cs.AI stat.ML

    Sample Efficient Reinforcement Learning from Human Feedback via Active Exploration

    Authors: Viraj Mehta, Vikramjeet Das, Ojash Neopane, Yijia Dai, Ilija Bogunovic, Jeff Schneider, Willie Neiswanger

    Abstract: Preference-based feedback is important for many applications in reinforcement learning where direct evaluation of a reward function is not feasible. A notable recent example arises in reinforcement learning from human feedback (RLHF) on large language models. For many applications of RLHF, the cost of acquiring the human feedback can be substantial. In this work, we take advantage of the fact that…

    Submitted 30 November, 2023; originally announced December 2023.

  46. arXiv:2311.17637  [pdf, other]

    astro-ph.SR

    Observations of a Failed Solar Filament Eruption Involving External Reconnection

    Authors: Yuehong Chen, Xin Cheng, Jun Chen, Yu Dai, Mingde Ding

    Abstract: We report a failed solar filament eruption that involves external magnetic reconnection in a quadrupolar magnetic configuration. The evolution exhibits three kinematic evolution phases: a slow-rise phase, an acceleration phase, and a deceleration phase. In the early slow rise, extreme-ultraviolet (EUV) brightenings appear at the expected null point above the filament and are connected to the outer…

    Submitted 29 November, 2023; originally announced November 2023.

    Comments: Accepted by ApJ

  47. arXiv:2311.15835  [pdf, other]

    cond-mat.supr-con cond-mat.mtrl-sci

    Surface skyrmions and dual topological Hall effect in antiferromagnetic topological insulator EuCd$_2$As$_2$

    Authors: Min Wu, R. Yang, Xiangde Zhu, Yixiong Ren, Ang Qian, Yongjie Xie, Changming Yue, Yong Nie, Xiang Yuan, Ning Wang, Daifeng Tu, Ding Li, Yuyan Han, Zhaosheng Wang, Yaomin Dai, Guolin Zheng, Jianhui Zhou, Wei Ning, Xianggang Qiu, Mingliang Tian

    Abstract: In this work, we synthesized single crystals of EuCd$_2$As$_2$, which exhibits A-type antiferromagnetic (AFM) order with in-plane spin orientation below $T_N$ = 9.5~K. Optical spectroscopy and transport measurements suggest its topological insulator (TI) nature with an insulating gap around 0.1 eV. Remarkably, a dual topological Hall resistivity that exhibits same magnitude but opposite signs in the…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 7 pages, 3 figures

  48. arXiv:2311.15834  [pdf, other]

    cond-mat.supr-con cond-mat.mtrl-sci

    Charge-density wave transition in magnetic topological semimetal EuAl$_4$

    Authors: R. Yang, C. C. Le, P. Zhu, Z. W. Wang, T. Shang, Y. M. Dai, J. P. Hu, M. Dressel

    Abstract: The interplay among topology, charge-density wave (CDW), and magnetism can give rise to a plethora of exotic quantum phenomena. Recently, a group of magnetic topological semimetals with tetragonal lattices and CDW order were found to exhibit anomalous magnetic instability, helical spin ordering, and the presence of skyrmions. However, the underlying mechanism responsible for these observations rem…

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: 8 pages, 4 figures

    Report number: RIKEN-iTHEMS-Report-24

  49. arXiv:2311.15664  [pdf, other]

    astro-ph.GA

    The UV luminosity function at 0.6 < z < 1 from UVCANDELS

    Authors: Lei Sun, Xin Wang, Harry I. Teplitz, Vihang Mehta, Anahita Alavi, Marc Rafelski, Rogier A. Windhorst, Claudia Scarlata, Jonathan P. Gardner, Brent M. Smith, Ben Sunnquist, Laura Prichard, Yingjie Cheng, Norman Grogin, Nimish P. Hathi, Matthew Hayes, Anton M. Koekemoer, Bahram Mobasher, Kalina V. Nedkova, Robert O'Connell, Brant Robertson, Sina Taamoli, L. Y. Aaron Yung, Gabriel Brammer, James Colbert, et al. (53 additional authors not shown)

    Abstract: UVCANDELS is a HST Cycle-26 Treasury Program awarded 164 orbits of primary ultraviolet (UV) F275W imaging and coordinated parallel optical F435W imaging in four CANDELS fields: GOODS-N, GOODS-S, EGS, and COSMOS, covering a total area of $\sim426$ arcmin$^2$. This is $\sim2.7$ times larger than the area covered by previous deep-field space UV data combined, reaching a depth of about 27 and 28 ABmag…

    Submitted 2 May, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: 17 pages, 8 figures, Accepted for publication in ApJ

  50. arXiv:2311.15609  [pdf, other]

    cs.LG cs.CV

    A manometric feature descriptor with linear-SVM to distinguish esophageal contraction vigor

    Authors: Jialin Liu, Lu Yan, Xiaowei Liu, Yuzhuo Dai, Fanggen Lu, Yuanting Ma, Muzhou Hou, Zheng Wang

    Abstract: In clinical practice, if a patient presents with nonmechanical obstructive dysphagia, esophageal chest pain, and gastroesophageal reflux symptoms, the physician will usually assess the esophageal dynamic function. High-resolution manometry (HRM) is a commonly used clinical technique for comprehensive and objective detection of esophageal dynamic function. However, after the results of HRM are obtaine…

    Submitted 27 November, 2023; originally announced November 2023.