Showing 1–50 of 273 results for author: Tao, M

  1. arXiv:2408.04284  [pdf, other]

    cs.CL

    LLM-DetectAIve: a Tool for Fine-Grained Machine-Generated Text Detection

    Authors: Mervat Abassy, Kareem Elozeiri, Alexander Aziz, Minh Ngoc Ta, Raj Vardhan Tomar, Bimarsha Adhikari, Saad El Dine Ahmed, Yuxia Wang, Osama Mohammed Afzal, Zhuohan Xie, Jonibek Mansurov, Ekaterina Artemova, Vladislav Mikhailov, Rui Xing, Jiahui Geng, Hasan Iqbal, Zain Muhammad Mujahid, Tarek Mahmoud, Akim Tsvigun, Alham Fikri Aji, Artem Shelmanov, Nizar Habash, Iryna Gurevych, Preslav Nakov

    Abstract: The widespread accessibility of large language models (LLMs) to the general public has significantly amplified the dissemination of machine-generated texts (MGTs). Advancements in prompt manipulation have exacerbated the difficulty in discerning the origin of a text (human-authored vs. machine-generated). This raises concerns regarding the potential misuse of MGTs, particularly within educational an…

    Submitted 8 August, 2024; originally announced August 2024.

  2. arXiv:2407.20523  [pdf, other]

    cs.IT cs.MM

    Wireless Multi-User Interactive Virtual Reality in Metaverse with Edge-Device Collaborative Computing

    Authors: Caolu Xu, Zhiyong Chen, Meixia Tao, Wenjun Zhang

    Abstract: The immersive nature of the metaverse presents significant challenges for wireless multi-user interactive virtual reality (VR), such as ultra-low latency, high throughput and intensive computing, which place substantial demands on the wireless bandwidth and rendering resources of mobile edge computing (MEC). In this paper, we propose a wireless multi-user interactive VR with edge-device collaborat… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

    Comments: submitted to IEEE journal

  3. Observational Evidence for Magnetic Field Amplification in SN 1006

    Authors: Moeri Tao, Jun Kataoka, Takaaki Tanaka

    Abstract: We report the first observational evidence for magnetic field amplification in the north-east/south-west (NE/SW) shells of supernova remnant SN 1006, one of the most promising sites of cosmic ray (CR) acceleration. In previous studies, the strength of magnetic fields in these shells was estimated to be $B_{\rm SED} \simeq 25\,\mu$G from the spectral energy distribution, where the synchrotron emiss…

    Submitted 24 July, 2024; originally announced July 2024.

    Comments: 6 pages, 4 figures

    Journal ref: The Astrophysical Journal Letters, 970:L27 (6pp), 2024 August 1

  4. arXiv:2407.16936  [pdf, ps, other]

    stat.ML cs.LG math.ST stat.CO

    Provable Benefit of Annealed Langevin Monte Carlo for Non-log-concave Sampling

    Authors: Wei Guo, Molei Tao, Yongxin Chen

    Abstract: We address the outstanding problem of sampling from an unnormalized density that may be non-log-concave and multimodal. To enhance the performance of simple Markov chain Monte Carlo (MCMC) methods, techniques of annealing type have been widely used. However, quantitative theoretical guarantees of these techniques are under-explored. This study takes a first step toward providing a non-asymptotic a… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

  5. arXiv:2407.16725  [pdf, other]

    cs.CV

    Category-Extensible Out-of-Distribution Detection via Hierarchical Context Descriptions

    Authors: Kai Liu, Zhihang Fu, Chao Chen, Sheng Jin, Ze Chen, Mingyuan Tao, Rongxin Jiang, Jieping Ye

    Abstract: The key to OOD detection has two aspects: generalized feature representation and precise category description. Recently, vision-language models such as CLIP have provided significant advances on both issues, but constructing precise category descriptions is still in its infancy due to the absence of unseen categories. This work introduces two hierarchical contexts, namely perceptual context and spur…

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: Accepted by 37th Conference on Neural Information Processing Systems (NeurIPS 2023)

  6. arXiv:2407.08439  [pdf, other]

    math.NA

    A fitted space-time finite element method for an advection-diffusion problem with moving interfaces

    Authors: Quang Huy Nguyen, Van Chien Le, Phuong Cuc Hoang, Thi Thanh Mai Ta

    Abstract: This paper presents a fitted space-time finite element method for solving a parabolic advection-diffusion problem with a nonstationary interface. The jumping diffusion coefficient gives rise to the discontinuity of the spatial gradient of the solution across the interface. We use the Banach-Nečas-Babuška theorem to show the well-posedness of the continuous variational problem. A fully discrete finite-…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 19 pages

  7. arXiv:2407.05289  [pdf, other]

    cs.IT eess.SP

    DM-MIMO: Diffusion Models for Robust Semantic Communications over MIMO Channels

    Authors: Yiheng Duan, Tong Wu, Zhiyong Chen, Meixia Tao

    Abstract: This paper investigates robust semantic communications over multiple-input multiple-output (MIMO) fading channels. Current semantic communications over MIMO channels mainly focus on channel adaptive encoding and decoding, which lacks exploration of signal distribution. To leverage the potential of signal distribution in signal space denoising, we develop a diffusion model over MIMO channels (DM-MI… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

  8. arXiv:2407.03994  [pdf, other]

    cs.CL cs.AI

    Unlocking the Potential of Model Merging for Low-Resource Languages

    Authors: Mingxu Tao, Chen Zhang, Quzhe Huang, Tianyao Ma, Songfang Huang, Dongyan Zhao, Yansong Feng

    Abstract: Adapting large language models (LLMs) to new languages typically involves continual pre-training (CT) followed by supervised fine-tuning (SFT). However, this CT-then-SFT approach struggles with limited data in the context of low-resource languages, failing to balance language modeling and task-solving capabilities. We thus propose model merging as an alternative for low-resource languages, combini… ▽ More

    Submitted 9 July, 2024; v1 submitted 4 July, 2024; originally announced July 2024.

  9. arXiv:2407.02015  [pdf, other]

    math.NA

    Robust First and Second-Order Differentiation for Regularized Optimal Transport

    Authors: Xingjie Li, Fei Lu, Molei Tao, Felix X. -F. Ye

    Abstract: Applications such as unbalanced and fully shuffled regression can be approached by optimizing regularized optimal transport (OT) distances, such as the entropic OT and Sinkhorn distances. A common approach for this optimization is to use a first-order optimizer, which requires the gradient of the OT distance. For faster convergence, one might also resort to a second-order optimizer, which addition… ▽ More

    Submitted 2 July, 2024; originally announced July 2024.

    MSC Class: 68Q25; 68R10; 68U05

  10. arXiv:2406.17807  [pdf, other]

    cs.CL cs.AI

    Enhancing Commentary Strategies for Imperfect Information Card Games: A Study of Large Language Models in Guandan Commentary

    Authors: Meiling Tao, Xuechen Liang, Ziyi Wang, Yiling Tao, Tianyu Shi

    Abstract: Recent advancements in large language models (LLMs) have unlocked the potential for generating high-quality game commentary. However, producing insightful and engaging commentary for complex games with incomplete information remains a significant challenge. In this paper, we introduce a novel commentary method that combines Reinforcement Learning (RL) and LLMs, tailored specifically for the Chinese…

    Submitted 3 August, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

  11. arXiv:2406.15354  [pdf, other]

    physics.bio-ph

    Can Specific THz Fields Induce Collective Base-Flipping in DNA? A Stochastic Averaging and Resonant Enhancement Investigation Based on a New Mesoscopic Model

    Authors: Wang Sang Koon, Houman Owhadi, Molei Tao, Tomohiro Yanao

    Abstract: We study the metastability, internal frequencies, activation mechanism, energy transfer, and the collective base-flipping in a mesoscopic DNA via resonance with specific electric fields. Our new mesoscopic DNA model takes into account not only the issues of helicity and the coupling of an electric field with the base dipole moments, but also includes environmental effects such as fluid viscosity a… ▽ More

    Submitted 18 March, 2024; originally announced June 2024.

    Comments: 37 pages, 8 figures

  12. arXiv:2406.12839  [pdf, other]

    cs.LG math.DS math.OC math.PR stat.ML

    Evaluating the design space of diffusion-based generative models

    Authors: Yuqing Wang, Ye He, Molei Tao

    Abstract: Most existing theoretical investigations of the accuracy of diffusion models, albeit significant, assume the score function has been approximated to a certain accuracy, and then use this a priori bound to control the error of generation. This article instead provides a first quantitative understanding of the whole generation process, i.e., both training and sampling. More precisely, it conducts a… ▽ More

    Submitted 25 July, 2024; v1 submitted 18 June, 2024; originally announced June 2024.

    Comments: Comments are welcome. Out of admiration we titled our paper after EDM, and hoped theorists' humor is not too corny

  13. arXiv:2406.10556  [pdf, other]

    cs.IT cs.AI

    Multi-User Semantic Fusion for Semantic Communications over Degraded Broadcast Channels

    Authors: Tong Wu, Zhiyong Chen, Meixia Tao, Bin Xia, Wenjun Zhang

    Abstract: Degraded broadcast channels (DBC) are a typical multiuser communication scenario, but semantic communications over DBC still lack in-depth research. In this paper, we design a semantic communications approach based on multi-user semantic fusion for wireless image transmission over DBC. In the proposed method, the transmitter extracts semantic features for two users separately. It then effectively fuse…

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: accepted by China Communications

  14. arXiv:2406.07915  [pdf, ps, other]

    cs.IT eess.SP

    Aggregation Design for Personalized Federated Multi-Modal Learning over Wireless Networks

    Authors: Benshun Yin, Zhiyong Chen, Meixia Tao

    Abstract: Federated Multi-Modal Learning (FMML) is an emerging field that integrates information from different modalities in federated learning to improve the learning performance. In this letter, we develop a parameter scheduling scheme to improve personalized performance and communication efficiency in personalized FMML, considering the non-independent and non-identically distributed (non-IID) data along…

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: accepted by IEEE Communications Letters

  15. arXiv:2406.01937  [pdf, other]

    cs.IT eess.SP

    Cramér-Rao Bound Analysis and Beamforming Design for Integrated Sensing and Communication with Extended Targets

    Authors: Yiqiu Wang, Meixia Tao, Shu Sun

    Abstract: This paper studies an integrated sensing and communication (ISAC) system, where a multi-antenna base station transmits beamformed signals for joint downlink multi-user communication and radar sensing of an extended target (ET). By considering echo signals as reflections from valid elements on the ET contour, a set of novel Cramér-Rao bounds (CRBs) is derived for parameter estimation of the ET, inc… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: Submitted to IEEE Transactions on Wireless Communications. arXiv admin note: text overlap with arXiv:2312.10641

  16. arXiv:2405.21050  [pdf, other]

    cs.CV cs.LG

    Spectrum-Aware Parameter Efficient Fine-Tuning for Diffusion Models

    Authors: Xinxi Zhang, Song Wen, Ligong Han, Felix Juefei-Xu, Akash Srivastava, Junzhou Huang, Hao Wang, Molei Tao, Dimitris N. Metaxas

    Abstract: Adapting large-scale pre-trained generative models in a parameter-efficient manner is gaining traction. Traditional methods like low rank adaptation achieve parameter efficiency by imposing constraints but may not be optimal for tasks requiring high representation capacity. We propose a novel spectrum-aware adaptation framework for generative models. Our method adjusts both singular values and the… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  17. arXiv:2405.20390  [pdf, other]

    cs.LG math.NA math.OC stat.ML

    Quantitative Convergences of Lie Group Momentum Optimizers

    Authors: Lingkai Kong, Molei Tao

    Abstract: Explicit, momentum-based dynamics that optimize functions defined on Lie groups can be constructed via variational optimization and momentum trivialization. Structure preserving time discretizations can then turn this dynamics into optimization algorithms. This article investigates two types of discretization, Lie Heavy-Ball, which is a known splitting scheme, and Lie NAG-SC, which is newly propos… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  18. arXiv:2405.16381  [pdf, other]

    cs.LG cs.AI stat.ML

    Trivialized Momentum Facilitates Diffusion Generative Modeling on Lie Groups

    Authors: Yuchen Zhu, Tianrong Chen, Lingkai Kong, Evangelos A. Theodorou, Molei Tao

    Abstract: The generative modeling of data on manifolds is an important task, for which diffusion models in flat spaces typically need nontrivial adaptations. This article demonstrates how a technique called 'trivialization' can transfer the effectiveness of diffusion models in Euclidean spaces to Lie groups. In particular, an auxiliary momentum variable was algorithmically introduced to help transport the po…

    Submitted 25 May, 2024; originally announced May 2024.

  19. arXiv:2405.06105  [pdf, ps, other]

    cs.CL

    Can Perplexity Reflect Large Language Model's Ability in Long Text Understanding?

    Authors: Yutong Hu, Quzhe Huang, Mingxu Tao, Chen Zhang, Yansong Feng

    Abstract: Recent studies have shown that Large Language Models (LLMs) have the potential to process extremely long text. Many works only evaluate LLMs' long-text processing ability on the language modeling task, with perplexity (PPL) as the evaluation metric. However, in our study, we find that there is no correlation between PPL and LLMs' long-text understanding ability. Besides, PPL may only reflect the m… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  20. arXiv:2405.03131  [pdf, other]

    cs.IT cs.AI cs.LG

    WDMoE: Wireless Distributed Large Language Models with Mixture of Experts

    Authors: Nan Xue, Yaping Sun, Zhiyong Chen, Meixia Tao, Xiaodong Xu, Liang Qian, Shuguang Cui, Ping Zhang

    Abstract: Large Language Models (LLMs) have achieved significant success in various natural language processing tasks, but how wireless communications can support LLMs has not been extensively studied. In this paper, we propose a wireless distributed LLMs paradigm based on Mixture of Experts (MoE), named WDMoE, deploying LLMs collaboratively across edge servers of base station (BS) and mobile devices in the… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: submitted to IEEE conference

  21. arXiv:2405.03125  [pdf, other]

    cs.IT

    MambaJSCC: Deep Joint Source-Channel Coding with Visual State Space Model

    Authors: Tong Wu, Zhiyong Chen, Meixia Tao, Xiaodong Xu, Wenjun Zhang, Ping Zhang

    Abstract: Lightweight and efficient deep joint source-channel coding (JSCC) is a key technology for semantic communications. In this paper, we design a novel JSCC scheme named MambaJSCC, which utilizes a visual state space model with channel adaptation (VSSM-CA) block as its backbone for transmitting images over wireless channels. The VSSM-CA block utilizes VSSM to integrate two-dimensional images with the… ▽ More

    Submitted 5 May, 2024; originally announced May 2024.

    Comments: submitted to IEEE conference

  22. arXiv:2404.06336  [pdf, other]

    quant-ph cs.LG stat.ML

    Quantum State Generation with Structure-Preserving Diffusion Model

    Authors: Yuchen Zhu, Tianrong Chen, Evangelos A. Theodorou, Xie Chen, Molei Tao

    Abstract: This article considers the generative modeling of the (mixed) states of quantum systems, and an approach based on denoising diffusion model is proposed. The key contribution is an algorithmic innovation that respects the physical nature of quantum states. More precisely, the commonly used density matrix representation of mixed-state has to be complex-valued Hermitian, positive semi-definite, and t… ▽ More

    Submitted 25 May, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  23. arXiv:2404.05979  [pdf, other]

    cs.CV

    StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion

    Authors: Ming Tao, Bing-Kun Bao, Hao Tang, Yaowei Wang, Changsheng Xu

    Abstract: Story visualization aims to generate a series of realistic and coherent images based on a storyline. Current models adopt a frame-by-frame architecture by transforming the pre-trained text-to-image model into an auto-regressive manner. Although these models have shown notable progress, there are still three flaws. 1) The unidirectional generation of auto-regressive manner restricts the usability i… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: 17 pages

  24. arXiv:2404.01663  [pdf, other]

    cs.CL

    CMAT: A Multi-Agent Collaboration Tuning Framework for Enhancing Small Language Models

    Authors: Xuechen Liang, Meiling Tao, Yinghui Xia, Tianyu Shi, Jun Wang, JingSong Yang

    Abstract: Open large language models (LLMs) have significantly advanced the field of natural language processing, showcasing impressive performance across various tasks. Despite the significant advancements in LLMs, their effective operation still relies heavily on human input to accurately guide the dialogue flow, with agent tuning being a crucial optimization technique that involves human adjustments to th…

    Submitted 26 August, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  25. arXiv:2403.12012  [pdf, other]

    math.ST cs.LG math.NA math.PR stat.ML

    Convergence of Kinetic Langevin Monte Carlo on Lie groups

    Authors: Lingkai Kong, Molei Tao

    Abstract: Explicit, momentum-based dynamics for optimizing functions defined on Lie groups was recently constructed, based on techniques such as variational optimization and left trivialization. We appropriately add tractable noise to the optimization dynamics to turn it into a sampling dynamics, leveraging the advantageous feature that the trivialized momentum variable is Euclidean despite that the potenti… ▽ More

    Submitted 17 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  26. arXiv:2403.07652  [pdf, other]

    cs.LG cs.CL

    Harder Tasks Need More Experts: Dynamic Routing in MoE Models

    Authors: Quzhe Huang, Zhenwei An, Nan Zhuang, Mingxu Tao, Chen Zhang, Yang Jin, Kun Xu, Kun Xu, Liwei Chen, Songfang Huang, Yansong Feng

    Abstract: In this paper, we introduce a novel dynamic expert selection framework for Mixture of Experts (MoE) models, aiming to enhance computational efficiency and model performance by adjusting the number of activated experts based on input difficulty. Unlike traditional MoE approaches that rely on fixed Top-K routing, which activates a predetermined number of experts regardless of the input's complexity,… ▽ More

    Submitted 12 March, 2024; originally announced March 2024.

  27. arXiv:2402.17886  [pdf, other]

    stat.ML cs.LG math.PR math.ST stat.ME

    Zeroth-Order Sampling Methods for Non-Log-Concave Distributions: Alleviating Metastability by Denoising Diffusion

    Authors: Ye He, Kevin Rojas, Molei Tao

    Abstract: This paper considers the problem of sampling from a non-log-concave distribution, based on queries of its unnormalized density. It first describes a framework, Diffusion Monte Carlo (DMC), based on the simulation of a denoising diffusion process with its score function approximated by a generic Monte Carlo estimator. DMC is an oracle-based meta-algorithm, where its oracle is the assumed access to sam…

    Submitted 26 May, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  28. arXiv:2402.17304  [pdf, ps, other]

    cs.CL cs.AI

    Probing Multimodal Large Language Models for Global and Local Semantic Representations

    Authors: Mingxu Tao, Quzhe Huang, Kun Xu, Liwei Chen, Yansong Feng, Dongyan Zhao

    Abstract: The advancement of Multimodal Large Language Models (MLLMs) has greatly accelerated the development of applications in understanding integrated texts and images. Recent works leverage image-caption datasets to train MLLMs, achieving state-of-the-art performance on image-to-text tasks. However, there are few studies exploring which layers of MLLMs make the most effort to the global image informatio… ▽ More

    Submitted 26 March, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

    Comments: Accepted by LREC-COLING 2024 as a short paper (Camera Ready)

  29. arXiv:2402.16313  [pdf, other]

    cs.CL cs.AI

    Chain-of-Discussion: A Multi-Model Framework for Complex Evidence-Based Question Answering

    Authors: Mingxu Tao, Dongyan Zhao, Yansong Feng

    Abstract: Open-ended question answering requires models to find appropriate evidence to form well-reasoned, comprehensive and helpful answers. In practical applications, models also need to engage in extended discussions on potential scenarios closely relevant to the question. With augmentation of retrieval module, open-source Large Language Models (LLMs) can produce coherent answers often with different fo… ▽ More

    Submitted 26 February, 2024; originally announced February 2024.

    Comments: Under review

  30. arXiv:2402.10062  [pdf, other]

    cs.LG stat.ML

    Optimal Parameter and Neuron Pruning for Out-of-Distribution Detection

    Authors: Chao Chen, Zhihang Fu, Kai Liu, Ze Chen, Mingyuan Tao, Jieping Ye

    Abstract: For a machine learning model deployed in real-world scenarios, the ability to detect out-of-distribution (OOD) samples is indispensable and challenging. Most existing OOD detection methods focus on exploring advanced training skills or training-free tricks to prevent the model from yielding overconfident scores for unknown samples. The training-based methods require expensive traini…

    Submitted 4 February, 2024; originally announced February 2024.

    Comments: Accepted by NeurIPS 2023. 19 pages

    Journal ref: NeurIPS 2023

  31. arXiv:2402.03744  [pdf, other]

    cs.CL

    INSIDE: LLMs' Internal States Retain the Power of Hallucination Detection

    Authors: Chao Chen, Kai Liu, Ze Chen, Yi Gu, Yue Wu, Mingyuan Tao, Zhihang Fu, Jieping Ye

    Abstract: Knowledge hallucinations have raised widespread concerns for the security and reliability of deployed LLMs. Previous efforts in detecting hallucinations have been employed at logit-level uncertainty estimation or language-level self-consistency evaluation, where the semantic information is inevitably lost during the token-decoding procedure. Thus, we propose to explore the dense semantic informatio…

    Submitted 6 February, 2024; originally announced February 2024.

    Comments: Accepted by ICLR-2024

  32. arXiv:2401.15614  [pdf, other]

    quant-ph cond-mat.mes-hall cond-mat.quant-gas

    Liouvillian skin effect in a one-dimensional open many-body quantum system with generalized boundary conditions

    Authors: Liang Mao, Xuanpu Yang, Ming-Jie Tao, Haiping Hu, Lei Pan

    Abstract: The non-Hermitian skin effect (NHSE), namely that eigenstates of non-Hermitian Hamiltonians are localized at one boundary in the open boundary condition, has attracted great interest recently. In this paper, we investigate the skin effect in one-dimensional dissipative quantum many-body systems, which we call the Liouvillian skin effect (LSE). We rigorously identify the existence of LSE for generalized boun…

    Submitted 16 July, 2024; v1 submitted 28 January, 2024; originally announced January 2024.

    Comments: 15 pages, 5 figures. Comments are welcome

  33. arXiv:2401.15405  [pdf, ps, other]

    math.OC

    On Partly Smoothness, Activity Identification and Faster Algorithms of $L_1$ over $L_2$ Minimization

    Authors: Min Tao, Xiao-Ping Zhang, Zi-Hao Xia

    Abstract: The $L_1/L_2$ norm ratio arose as a sparseness measure and attracted a considerable amount of attention due to three merits: (i) sharper approximations of $L_0$ compared to the $L_1$; (ii) parameter-free and scale-invariant; (iii) more attractive than $L_1$ under highly-coherent matrices. In this paper, we first establish the partly smooth property of $L_1$ over $L_2$ minimization relative to an… ▽ More

    Submitted 27 January, 2024; originally announced January 2024.

  34. arXiv:2401.15344  [pdf, other]

    cs.IT eess.SP

    IRS Aided Millimeter-Wave Sensing and Communication: Beam Scanning, Beam Splitting, and Performance Analysis

    Authors: Renwang Li, Xiaodan Shao, Shu Sun, Meixia Tao, Rui Zhang

    Abstract: Integrated sensing and communication (ISAC) has attracted growing interest for enabling future 6G wireless networks, due to its capability of sharing spectrum and hardware resources between communication and sensing systems. However, existing works on ISAC usually need to modify the communication protocol to cater for the new sensing performance requirement, which may be difficult to implemen…

    Submitted 27 January, 2024; originally announced January 2024.

    Comments: submitted to IEEE TWC

  35. arXiv:2401.09432  [pdf, other]

    cs.CL cs.AI cs.LG

    RoleCraft-GLM: Advancing Personalized Role-Playing in Large Language Models

    Authors: Meiling Tao, Xuechen Liang, Tianyu Shi, Lei Yu, Yiting Xie

    Abstract: This study presents RoleCraft-GLM, an innovative framework aimed at enhancing personalized role-playing with Large Language Models (LLMs). RoleCraft-GLM addresses the key issue of lacking personalized interactions in conversational AI, and offers a solution with detailed and emotionally nuanced character portrayals. We contribute a unique conversational dataset that shifts from conventional celebr… ▽ More

    Submitted 4 April, 2024; v1 submitted 17 December, 2023; originally announced January 2024.

  36. arXiv:2401.06144  [pdf, other]

    cs.CV cs.LG

    DFU: scale-robust diffusion model for zero-shot super-resolution image generation

    Authors: Alex Havrilla, Kevin Rojas, Wenjing Liao, Molei Tao

    Abstract: Diffusion generative models have achieved remarkable success in generating images with a fixed resolution. However, existing models have limited ability to generalize to different resolutions when training data at those resolutions are not available. Leveraging techniques from operator learning, we present a novel deep-learning architecture, Dual-FNO UNet (DFU), which approximates the score operat… ▽ More

    Submitted 22 January, 2024; v1 submitted 30 November, 2023; originally announced January 2024.

  37. arXiv:2401.04335  [pdf]

    physics.optics physics.app-ph

    SiN-on-SOI Optical Phased Array LiDAR for Ultra-Wide Field of View and 4D Sensing

    Authors: Baisong Chen, Yingzhi Li, Qijie Xie, Quanxin Na, Min Tao, Ziming Wang, Zihao Zhi, Heming Hu, Xuetong Li, Huan Qu, Yafang He, Xiaolong Hu, Guoqiang Lo, Junfeng Song

    Abstract: Three-dimensional (3D) imaging techniques are helping autonomous vehicles build intelligent systems. Optical phased arrays (OPAs) featuring all-solid-state configurations are becoming a promising solution for 3D imaging. However, the majority of state-of-the-art OPAs commonly suffer from severe power degradation at the edge of the field of view (FoV), resulting in limited effective FoV and deteri…

    Submitted 8 January, 2024; originally announced January 2024.

    Comments: 18 pages with 13 figures

    Journal ref: Laser Photonics Rev 2024, 2301360

  38. arXiv:2401.03511  [pdf, other]

    math.NA

    Automated construction of effective potential via algorithmic implicit bias

    Authors: Xingjie Helen Li, Molei Tao

    Abstract: We introduce a novel approach for decomposing and learning every scale of a given multiscale objective function in $\mathbb{R}^d$, where $d\ge 1$. This approach leverages a recently demonstrated implicit bias of the optimization method of gradient descent by Kong and Tao, which enables the automatic generation of data that nearly follow Gibbs distribution with an effective potential at any desired… ▽ More

    Submitted 7 January, 2024; originally announced January 2024.

    Comments: 10 Figures

  39. arXiv:2401.01564  [pdf, other]

    cs.IT eess.SP

    Deep Learning Based Superposition Coded Modulation for Hierarchical Semantic Communications over Broadcast Channels

    Authors: Yufei Bo, Shuo Shao, Meixia Tao

    Abstract: We consider multi-user semantic communications over broadcast channels. While most existing works consider that each receiver requires either the same or independent semantic information, this paper explores the scenario where the semantic information desired by different receivers is different but correlated. In particular, we investigate semantic communications over Gaussian broadcast channels w… ▽ More

    Submitted 12 June, 2024; v1 submitted 3 January, 2024; originally announced January 2024.

  40. arXiv:2312.17428  [pdf, other]

    cs.CV

    ChangeNet: Multi-Temporal Asymmetric Change Detection Dataset

    Authors: Deyi Ji, Siqi Gao, Mingyuan Tao, Hongtao Lu, Feng Zhao

    Abstract: Change Detection (CD) has been attracting extensive interest with the availability of bi-temporal datasets. However, due to the huge cost of multi-temporal image acquisition and labeling, existing change detection datasets are small in quantity, short in temporal span, and low in practicability. Therefore, a large-scale practical-oriented dataset covering wide temporal phases is urgently needed to fa…

    Submitted 11 April, 2024; v1 submitted 28 December, 2023; originally announced December 2023.

    Comments: Accepted to ICASSP 2024 Oral/Lecture

  41. arXiv:2312.14341  [pdf, other]

    math.OC

    A full splitting algorithm for fractional programs with structured numerators and denominators

    Authors: Radu Ioan Boţ, Guoyin Li, Min Tao

    Abstract: In this paper, we consider a class of nonconvex and nonsmooth fractional programming problems, which involve the sum of a convex, possibly nonsmooth function composed with a linear operator and a differentiable, possibly nonconvex function in the numerator and a convex, possibly nonsmooth function composed with a linear operator in the denominator. These problems have applications in various field… ▽ More

    Submitted 21 December, 2023; originally announced December 2023.

    Comments: 27 pages, 4 figures

    MSC Class: 90C26; 90C32; 49M27; 65K05

  42. arXiv:2312.10641  [pdf, other]

    cs.IT eess.SP

    Beamforming Design for Integrated Sensing and Communication with Extended Target

    Authors: Yiqiu Wang, Meixia Tao, Shu Sun

    Abstract: This paper studies transmit beamforming design in an integrated sensing and communication (ISAC) system, where a base station sends symbols to perform downlink multi-user communication and sense an extended target simultaneously. We first model the extended target contour with truncated Fourier series. By considering echo signals as reflections from the valid elements on the target contour, a nove… ▽ More

    Submitted 17 December, 2023; originally announced December 2023.

    Comments: 8 pages, 3 figures, published to 8th Workshop on Integrated Sensing and Communications for Internet of Things in IEEE Global Communications Conference 2023

  43. arXiv:2312.07817  [pdf, ps, other]

    math.PR

    Appropriate State-Dependent Friction Coefficient Accelerates Kinetic Langevin Dynamics

    Authors: Keunwoo Lim, Molei Tao

    Abstract: We consider the convergence of kinetic Langevin dynamics to its ergodic invariant measure, which is the Gibbs distribution. Instead of the standard setup where the friction coefficient is a constant scalar, we investigate a position-dependent friction coefficient and the possible accelerated convergence it enables. We show that by choosing this coefficient matrix to be $2\sqrt{\text{Hess}V}$, convergenc…

    Submitted 30 June, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

  44. arXiv:2312.05786  [pdf, other]

    eess.SP cs.IT

    Deep Learning for Joint Design of Pilot, Channel Feedback, and Hybrid Beamforming in FDD Massive MIMO-OFDM Systems

    Authors: Junyi Yang, Weifeng Zhu, Shu Sun, Xiaofeng Li, Xingqin Lin, Meixia Tao

    Abstract: This letter considers the transceiver design in frequency division duplex (FDD) massive multiple-input multiple-output (MIMO) orthogonal frequency division multiplexing (OFDM) systems for high-quality data transmission. We propose a novel deep learning based framework where the procedures of pilot design, channel feedback, and hybrid beamforming are realized by carefully crafted deep neural networ…

    Submitted 10 December, 2023; originally announced December 2023.

    Comments: 5 pages, 4 figures, accepted by IEEE Communications Letters

  45. arXiv:2311.08348  [pdf, other]

    cs.CL

    MC$^2$: Towards Transparent and Culturally-Aware NLP for Minority Languages in China

    Authors: Chen Zhang, Mingxu Tao, Quzhe Huang, Jiuheng Lin, Zhibin Chen, Yansong Feng

    Abstract: Current large language models demonstrate deficiencies in understanding low-resource languages, particularly the minority languages in China. This limitation stems from the scarcity of available pre-training data. To address this accessibility challenge, we present MC$^2$, a Multilingual Corpus of Minority Languages in China, which is the largest open-source corpus of its kind so far. MC$^2$ inclu…

    Submitted 13 June, 2024; v1 submitted 14 November, 2023; originally announced November 2023.

    Comments: ACL 2024 https://github.com/luciusssss/mc2_corpus

  46. arXiv:2311.06500  [pdf, other]

    cs.IT

    Knowledge Distillation and Training Balance for Heterogeneous Decentralized Multi-Modal Learning over Wireless Networks

    Authors: Benshun Yin, Zhiyong Chen, Meixia Tao

    Abstract: Decentralized learning is widely employed for collaboratively training models using distributed data over wireless networks. Existing decentralized learning methods primarily focus on training single-modal networks. For decentralized multi-modal learning (DMML), the modality heterogeneity and the non-independent and non-identically distributed (non-IID) data across devices make it difficult fo…

    Submitted 11 November, 2023; originally announced November 2023.

    Comments: submitted to IEEE Trans. on Mobile Computing

  47. arXiv:2311.05103  [pdf, other]

    math.OC

    PID-inspired Continuous-time Distributed Optimization

    Authors: Meng Tao, Dongdong Yue, Jinde Cao

    Abstract: This paper proposes two novel distributed continuous-time algorithms inspired by PID control to solve distributed optimization problems. The algorithms are referred to as first-order and second-order, respectively, depending on the intrinsic dynamics of the agents in the network. Sufficient conditions are derived so that both algorithms converge exponentially over undirected connected graphs. Finally…

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: 7 pages, 3 figures, The 49th Annual Conference of the IEEE Industrial Electronics Society

  48. arXiv:2311.02958  [pdf, other]

    eess.SP

    Optimization of RIS Placement for Satellite-to-Ground Coverage Enhancement

    Authors: Xingchen Liu, Liuxun Xue, Shu Sun, Meixia Tao

    Abstract: In satellite-to-ground communication, ensuring reliable and efficient connectivity poses significant challenges. The reconfigurable intelligent surface (RIS) offers a promising solution due to its ability to manipulate wireless propagation environments and thus enhance communication performance. In this paper, we propose a method for optimizing the placement of RISs on building facets to improve s…

    Submitted 6 November, 2023; originally announced November 2023.

  49. arXiv:2310.17087  [pdf, other]

    cs.LG math.DS math.OC stat.ML

    Good regularity creates large learning rate implicit biases: edge of stability, balancing, and catapult

    Authors: Yuqing Wang, Zhenghao Xu, Tuo Zhao, Molei Tao

    Abstract: Large learning rates, when applied to gradient descent for nonconvex optimization, yield various implicit biases including the edge of stability (Cohen et al., 2021), balancing (Wang et al., 2022), and catapult (Lewkowycz et al., 2020). These phenomena cannot be well explained by classical optimization theory. Though significant theoretical progress has been made in understanding these implicit bi…

    Submitted 11 December, 2023; v1 submitted 25 October, 2023; originally announced October 2023.

  50. arXiv:2310.08233  [pdf, other]

    cs.RO cs.AI

    The Impact of Time Step Frequency on the Realism of Robotic Manipulation Simulation for Objects of Different Scales

    Authors: Minh Q. Ta, Holly Dinkel, Hameed Abdul-Rashid, Yangfei Dai, Jessica Myers, Tan Chen, Junyi Geng, Timothy Bretl

    Abstract: This work evaluates the impact of time step frequency and component scale on robotic manipulation simulation accuracy. Increasing the time step frequency for small-scale objects is shown to improve simulation accuracy. This simulation, demonstrating pre-assembly part picking for two object geometries, serves as a starting point for discussing how to improve Sim2Real transfer in robotic assembly pr…

    Submitted 12 October, 2023; originally announced October 2023.

    Comments: 3 pages, 3 figures, Best Poster Finalist at the 2023 Robotics and AI in Future Factory Workshop at the IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). Video presentation [https://www.youtube.com/watch?v=JOXrBpMmI0A]. Robotics and AI in Future Factory workshop [https://sites.google.com/view/robot-ai-future-factory/]