Zum Hauptinhalt springen

Showing 51–100 of 1,100 results for author: Tang, S

.
  1. arXiv:2406.15471  [pdf, other

    cs.CL cs.AI cs.LG

    Improving Large Models with Small models: Lower Costs and Better Performance

    Authors: Dong Chen, Shuo Zhang, Yueting Zhuang, Siliang Tang, Qidong Liu, Hua Wang, Mingliang Xu

    Abstract: Pretrained large models (PLMs), such as ChatGPT, have demonstrated remarkable performance across diverse tasks. However, the significant computational requirements of PLMs have discouraged most product teams from running or fine-tuning them. In such cases, to harness the exceptional performance of PLMs, one must rely on expensive APIs, thereby exacerbating the economic burden. Despite the overall… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

    Comments: 11 pages

  2. arXiv:2406.15122  [pdf, ps, other

    cs.IT

    Convolutional dynamical sampling and some new results

    Authors: Longxiu Huang, A. Martina Neuman, Sui Tang, Yuying Xie

    Abstract: In this work, we explore the dynamical sampling problem on $\ell^2(\mathbb{Z})$ driven by a convolution operator defined by a convolution kernel. This problem is inspired by the need to recover a bandlimited heat diffusion field from space-time samples and its discrete analogue. In this book chapter, we review recent results in the finite-dimensional case and extend these findings to the infinite-… ▽ More

    Submitted 4 July, 2024; v1 submitted 21 June, 2024; originally announced June 2024.

  3. arXiv:2406.13586  [pdf, ps, other

    cs.GT cs.AI

    Submodular Participatory Budgeting

    Authors: Jing Yuan, Shaojie Tang

    Abstract: Participatory budgeting refers to the practice of allocating public resources by collecting and aggregating individual preferences. Most existing studies in this field often assume an additive utility function, where each individual holds a private utility for each candidate project, and the total utility of a set of funded projects is simply the sum of the utilities of all projects. We argue that… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  4. arXiv:2406.13198  [pdf, other

    quant-ph

    Single-photon triggered quantum entanglement between two qubits or at least 2000 identical qubits

    Authors: Wangjun Lu, Cuilu Zhai, Hong Tao, Yaju Song, Shiqing Tang, Lan Xu

    Abstract: This paper studies the effect of single-photon light fields on quantum entanglement between two qubits and multiple identical qubits initially in a direct state. For two qubits, we first analyze the impact of the excited state's weight on single-photon-triggered entanglement, finding that excessive weight disrupts this process. We then explore how initial coherence affects entanglement, discoverin… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 19 pages, 11 figures

  5. arXiv:2406.13011  [pdf, other

    astro-ph.SR astro-ph.EP

    Measuring the Spot Variability of T Tauri Stars Using Near-IR Atomic Fe and Molecular OH Lines

    Authors: Shih-Yun Tang, Christopher M. Johns-Krull, L. Prato, Asa G. Stahl

    Abstract: As part of the Young Exoplanets Spectroscopic Survey (YESS), this study explores the spot variability of 13 T Tauri Stars (TTSs) in the near-infrared $H$ band, using spectra from the Immersion GRating INfrared Spectrometer (IGRINS). By analyzing effective temperature ($T_{\rm eff}$) sensitive lines of atomic FeI at ~1.56259 um and ~1.56362 um, and molecular OH at ~1.56310 um and ~1.56317 um, we de… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: 31 pages, 19 figures, and 5 tables. Accepted to ApJ

  6. arXiv:2406.12608  [pdf, other

    cs.CL cs.AI

    Bridging Local Details and Global Context in Text-Attributed Graphs

    Authors: Yaoke Wang, Yun Zhu, Wenqiao Zhang, Yueting Zhuang, Yunfei Li, Siliang Tang

    Abstract: Representation learning on text-attributed graphs (TAGs) is vital for real-world applications, as they combine semantic textual and contextual structural information. Research in this field generally consist of two main perspectives: local-level encoding and global-level aggregating, respectively refer to textual node information unification (e.g., using Language Models) and structure-augmented mo… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

  7. arXiv:2406.11607  [pdf, other

    astro-ph.CO astro-ph.HE gr-qc

    Multi-spectral sirens: Gravitational-wave cosmology with (multi-) sub-populations of binary black holes

    Authors: Yin-Jie Li, Shao-Peng Tang, Yuan-Zhu Wang, Yi-Zhong Fan

    Abstract: The cosmic expansion rate can be directly measured with gravitational waves (GWs) of the compact binary mergers, by jointly constraining the mass function of the population and the cosmological model via the so called spectral sirens. Such a method relies on the features in the mass functions, which may originate from some individual sub-populations, and hence become blurred/indistinct due to the… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 14 pages, 8 figures, 1 table; comments are welcome

  8. arXiv:2406.11253  [pdf, other

    cs.CV

    Holistic-Motion2D: Scalable Whole-body Human Motion Generation in 2D Space

    Authors: Yuan Wang, Zhao Wang, Junhao Gong, Di Huang, Tong He, Wanli Ouyang, Jile Jiao, Xuetao Feng, Qi Dou, Shixiang Tang, Dan Xu

    Abstract: In this paper, we introduce a novel path to $\textit{general}$ human motion generation by focusing on 2D space. Traditional methods have primarily generated human motions in 3D, which, while detailed and realistic, are often limited by the scope of available 3D motion data in terms of both the size and the diversity. To address these limitations, we exploit extensive availability of 2D motion data… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 22 pages, 11figures, 17 tables

  9. Contextual Distillation Model for Diversified Recommendation

    Authors: Fan Li, Xu Si, Shisong Tang, Dingmin Wang, Kunyan Han, Bing Han, Guorui Zhou, Yang Song, Hechang Chen

    Abstract: The diversity of recommendation is equally crucial as accuracy in improving user experience. Existing studies, e.g., Determinantal Point Process (DPP) and Maximal Marginal Relevance (MMR), employ a greedy paradigm to iteratively select items that optimize both accuracy and diversity. However, prior methods typically exhibit quadratic complexity, limiting their applications to the re-ranking stage… ▽ More

    Submitted 14 August, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

    Comments: accepted by KDD 2024 v2

  10. arXiv:2406.07843  [pdf, other

    cs.CV q-bio.NC

    Incremental Learning and Self-Attention Mechanisms Improve Neural System Identification

    Authors: Isaac Lin, Tianye Wang, Shang Gao, Shiming Tang, Tai Sing Lee

    Abstract: Convolutional neural networks (CNNs) have been shown to be the state-of-the-art approach for modeling the transfer functions of visual cortical neurons. Cortical neurons in the primary visual cortex are are sensitive to contextual information mediated by extensive horizontal and feedback connections. Standard CNNs can integrate global spatial image information to model such contextual modulation v… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Preprint NeurIPS 2024

  11. arXiv:2406.07119  [pdf, other

    cs.CV cs.AI

    T2S-GPT: Dynamic Vector Quantization for Autoregressive Sign Language Production from Text

    Authors: Aoxiong Yin, Haoyuan Li, Kai Shen, Siliang Tang, Yueting Zhuang

    Abstract: In this work, we propose a two-stage sign language production (SLP) paradigm that first encodes sign language sequences into discrete codes and then autoregressively generates sign language from text based on the learned codebook. However, existing vector quantization (VQ) methods are fixed-length encodings, overlooking the uneven information density in sign language, which leads to under-encoding… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

    Comments: Accepted by ACL 2024

  12. arXiv:2406.05768  [pdf, other

    cs.CV cs.AI

    MLCM: Multistep Consistency Distillation of Latent Diffusion Model

    Authors: Qingsong Xie, Zhenyi Liao, Chen chen, Zhijie Deng, Shixiang Tang, Haonan Lu

    Abstract: Distilling large latent diffusion models (LDMs) into ones that are fast to sample from is attracting growing research interest. However, the majority of existing methods face a dilemma where they either (i) depend on multiple individual distilled models for different sampling budgets, or (ii) sacrifice generation quality with limited (e.g., 2-4) and/or moderate (e.g., 5-8) sampling steps. To addre… ▽ More

    Submitted 11 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

  13. arXiv:2406.05513  [pdf, ps, other

    cs.CV

    A Two-Stage Adverse Weather Semantic Segmentation Method for WeatherProof Challenge CVPR 2024 Workshop UG2+

    Authors: Jianzhao Wang, Yanyan Wei, Dehua Hu, Yilin Zhang, Shengeng Tang, Kun Li, Zhao Zhang

    Abstract: This technical report presents our team's solution for the WeatherProof Dataset Challenge: Semantic Segmentation in Adverse Weather at CVPR'24 UG2+. We propose a two-stage deep learning framework for this task. In the first stage, we preprocess the provided dataset by concatenating images into video sequences. Subsequently, we leverage a low-rank video deraining method to generate high-fidelity ps… ▽ More

    Submitted 10 July, 2024; v1 submitted 8 June, 2024; originally announced June 2024.

  14. arXiv:2406.05232  [pdf, other

    cs.CL cs.LG

    Improving Logits-based Detector without Logits from Black-box LLMs

    Authors: Cong Zeng, Shengkun Tang, Xianjun Yang, Yuanzhou Chen, Yiyou Sun, zhiqiang xu, Yao Li, Haifeng Chen, Wei Cheng, Dongkuan Xu

    Abstract: The advent of Large Language Models (LLMs) has revolutionized text generation, producing outputs that closely mimic human writing. This blurring of lines between machine- and human-written text presents new challenges in distinguishing one from the other a task further complicated by the frequent updates and closed nature of leading proprietary LLMs. Traditional logits-based detection methods leve… ▽ More

    Submitted 18 August, 2024; v1 submitted 7 June, 2024; originally announced June 2024.

  15. arXiv:2406.03625  [pdf, other

    cs.CV cs.AI

    Degrees of Freedom Matter: Inferring Dynamics from Point Trajectories

    Authors: Yan Zhang, Sergey Prokudin, Marko Mihajlovic, Qianli Ma, Siyu Tang

    Abstract: Understanding the dynamics of generic 3D scenes is fundamentally challenging in computer vision, essential in enhancing applications related to scene reconstruction, motion tracking, and avatar creation. In this work, we address the task as the problem of inferring dense, long-range motion of 3D points. By observing a set of point trajectories, we aim to learn an implicit motion field parameterize… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

    Comments: cvpr24 post camera ready

  16. arXiv:2406.03474  [pdf, other

    cs.CV

    AD-H: Autonomous Driving with Hierarchical Agents

    Authors: Zaibin Zhang, Shiyu Tang, Yuanhang Zhang, Talas Fu, Yifan Wang, Yang Liu, Dong Wang, Jing Shao, Lijun Wang, Huchuan Lu

    Abstract: Due to the impressive capabilities of multimodal large language models (MLLMs), recent works have focused on employing MLLM-based agents for autonomous driving in large-scale and dynamic environments. However, prevalent approaches often directly translate high-level instructions into low-level vehicle control signals, which deviates from the inherent language generation paradigm of MLLMs and fails… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  17. arXiv:2406.02975  [pdf, other

    eess.SP

    A Shared-Aperture Dual-Band sub-6 GHz and mmWave Reconfigurable Intelligent Surface With Independent Operation

    Authors: Junhui Rao, Yujie Zhang, Shiwen Tang, Zan Li, Zhaoyang Ming, Jichen Zhang, Chi Yuk Chiu, Ross Murch

    Abstract: A novel dual-band reconfigurable intelligent surface (DBI-RIS) design that combines the functionalities of millimeter-wave (mmWave) and sub-6 GHz bands within a single aperture is proposed. This design aims to bridge the gap between current single-band reconfigurable intelligent surfaces (RISs) and wireless systems utilizing sub-6 GHz and mmWave bands that require RIS with independently reconfigur… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  18. arXiv:2406.01658  [pdf, other

    cs.CV

    Proxy Denoising for Source-Free Domain Adaptation

    Authors: Song Tang, Wenxin Su, Mao Ye, Jianwei Zhang, Xiatian Zhu

    Abstract: Source-free Domain Adaptation (SFDA) aims to adapt a pre-trained source model to an unlabeled target domain with no access to the source data. Inspired by the success of pre-trained large vision-language (ViL) models in many other applications, the latest SFDA methods have also validated the benefit of ViL models by leveraging their predictions as pseudo supervision. However, we observe that ViL's… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  19. arXiv:2406.00593  [pdf

    physics.optics

    Low threshold optical bistability based on MoS2 in asymmetric Fabry-Perot cavity structure in visible light band

    Authors: Songqing Tang, Mengjiao Ren, Zhiheng Li, Zhiwei Zheng, Leyong Jiang

    Abstract: This article theoretically proposes a multi-layer Fabry-Perot cavity structure based on nonlinear MoS2, whose cavity is composed of asymmetric photonic crystals. In this structure, we observed a low threshold optical bistability phenomenon on the order of a in the visible light band, which is caused by the large third-order nonlinear conductivity of the bilayer MoS2 and the Fabry-Perot cavity reso… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: 22 pages, 6 figures

  20. arXiv:2406.00590  [pdf

    physics.optics

    MoS2-based optical bistability in silver-Bragg reflector multilayer structure at visible light band

    Authors: Songqing Tang, Mengjiao Ren, Zhiheng Li, Zhiwei Zheng, Leyong Jiang

    Abstract: In this paper, we present a theoretical analysis of the optical bistability in a metallic silver-Bragg reflector structure by embedding bilayer MoS2 at the visible band. The nonlinear OB is achieved due to the nonlinear conductivity of the bilayer MoS2 and the excitation of the optical Tamm state at the interface between the silver and the Bragg reflector. It is found that the hysteresis behaviour… ▽ More

    Submitted 1 June, 2024; originally announced June 2024.

    Comments: 23 pages, 6 figures

  21. arXiv:2405.20598  [pdf

    cond-mat.str-el cond-mat.mes-hall

    Mott insulating phase and coherent-incoherent crossover across magnetic phase transition in 2D antiferromagnetic CrSBr

    Authors: Fan Wu, Xuefeng Zhang, Yi Chen, Ding Pei, Mengwen Zhan, Zicheng Tao, Cheng Chen, Shipeng Lu, Jingzhi Chen, Shujie Tang, Xia Wang, Yanfeng Guo, Lexian Yang, Yan Zhang, Yulin Chen, Qixi Mi, Gang Li, Zhongkai Liu

    Abstract: In two-dimensional van der Waals magnetic materials, the interplay between magnetism and electron correlation can give rise to new ground states and lead to novel transport and optical properties. A fundamental question in these materials is how the electron correlation manifests and interacts with the magnetic orders. In this study, we demonstrate that the recently discovered 2D antiferromagnetic… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  22. arXiv:2405.20272  [pdf, other

    cs.LG cs.CR

    Reconstruction Attacks on Machine Unlearning: Simple Models are Vulnerable

    Authors: Martin Bertran, Shuai Tang, Michael Kearns, Jamie Morgenstern, Aaron Roth, Zhiwei Steven Wu

    Abstract: Machine unlearning is motivated by desire for data autonomy: a person can request to have their data's influence removed from deployed models, and those models should be updated as if they were retrained without the person's data. We show that, counter-intuitively, these updates expose individuals to high-accuracy reconstruction attacks which allow the attacker to recover their data in its entiret… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

  23. arXiv:2405.19789  [pdf, other

    cs.LG cs.DC

    Estimating before Debiasing: A Bayesian Approach to Detaching Prior Bias in Federated Semi-Supervised Learning

    Authors: Guogang Zhu, Xuefeng Liu, Xinghao Wu, Shaojie Tang, Chao Tang, Jianwei Niu, Hao Su

    Abstract: Federated Semi-Supervised Learning (FSSL) leverages both labeled and unlabeled data on clients to collaboratively train a model.In FSSL, the heterogeneous data can introduce prediction bias into the model, causing the model's prediction to skew towards some certain classes. Existing FSSL methods primarily tackle this issue by enhancing consistency in model parameters or outputs. However, as the mo… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: Accepted by IJCAI 2024

  24. arXiv:2405.18999  [pdf, other

    stat.AP cs.AI cs.RO

    Continuously Optimizing Radar Placement with Model Predictive Path Integrals

    Authors: Michael Potter, Shuo Tang, Paul Ghanem, Milica Stojanovic, Pau Closas, Murat Akcakaya, Ben Wright, Marius Necsoiu, Deniz Erdogmus, Michael Everett, Tales Imbiriba

    Abstract: Continuously optimizing sensor placement is essential for precise target localization in various military and civilian applications. While information theory has shown promise in optimizing sensor placement, many studies oversimplify sensor measurement models or neglect dynamic constraints of mobile sensors. To address these challenges, we employ a range measurement model that incorporates radar p… ▽ More

    Submitted 29 May, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: Submitted to IEEE Aerospace and Electronic Systems

  25. arXiv:2405.17790  [pdf, other

    cs.CV

    Instruct-ReID++: Towards Universal Purpose Instruction-Guided Person Re-identification

    Authors: Weizhen He, Yiheng Deng, Yunfeng Yan, Feng Zhu, Yizhou Wang, Lei Bai, Qingsong Xie, Donglian Qi, Wanli Ouyang, Shixiang Tang

    Abstract: Human intelligence can retrieve any person according to both visual and language descriptions. However, the current computer vision community studies specific person re-identification (ReID) tasks in different scenarios separately, which limits the applications in the real world. This paper strives to resolve this problem by proposing a novel instruct-ReID task that requires the model to retrieve… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2306.07520

  26. arXiv:2405.16995  [pdf, other

    hep-ph nucl-th

    Electron form factors in Basis Light-front Quantization

    Authors: Lingdi Meng, Shuo Tang, Zhi Hu, Guo-Li Wang, Yang Li, Xingbo Zhao, James P. Vary

    Abstract: In this paper, we evaluate the electromagnetic and gravitational form factors as well as the corresponding generalized parton distributions of the electron using the Basis Light-front Quantization approach to QED. We compare our results with those from light-front perturbation theory. We adopt a novel basis with its scale depending on the constituents' longitudinal momentum fraction. We show that… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  27. High-field magnetoelectric coupling and successive magnetic transitions in Mn-doped polar antiferromagnet Ni3TeO6

    Authors: J. H. Zhang, L. Lin, C. Dong, Y. T. Chang, J. F. Wang, C. L. Lu, P. Z. Chen, W. J. Zhai, G. Z. Zhou, L. Huang, Y. S. Tang, S. H. Zheng, M. F. Liu, X. H. Zhou, Z. B. Yan, J. -M. Liu

    Abstract: Among the 3d transition metal ions doped polar Ni3TeO6, Mn-doped Ni3TeO6 has stimulated great interest due to its high magnetic ordering temperature and complex magnetic phases, but the mechanism of magnetoelectric (ME) coupling is far from understood. Herein we report our systematic investigation of the chemical control of magnetism, metamagnetic transition, and ME properties of Ni3-xMnxTeO6 sing… ▽ More

    Submitted 29 May, 2024; v1 submitted 24 May, 2024; originally announced May 2024.

    Comments: 30 pages with 8 figures

    Journal ref: Phys. Rev. B 109, 184112 (2024)

  28. arXiv:2405.13279  [pdf, other

    gr-qc

    Constraints on Einstein-dilation-Gauss-Bonnet gravity and electric charge of compact binary systems from GW230529

    Authors: Bo Gao, Shao-Peng Tang, Hai-Tian Wang, Jingzhi Yan, Yi-Zhong Fan

    Abstract: In this work, we study the implications of GW230529 on gravity theories and the charge of black holes. The GW230529, which was initially released in O4a, is most likely neutron star-black hole (NSBH) mergers. We reanalyze the data from the GW230529 event to obtain bounds on the Einstein-dilation-Gauss-Bonnet (EdGB) gravity parameter $\sqrt{α_{\rm EdGB}}$ and the electric charge of compact binary s… ▽ More

    Submitted 12 July, 2024; v1 submitted 21 May, 2024; originally announced May 2024.

    Comments: 7 pages, 4 figures, PRD accepted

  29. arXiv:2405.13002  [pdf, other

    cs.CL cs.AI

    DuetRAG: Collaborative Retrieval-Augmented Generation

    Authors: Dian Jiao, Li Cai, Jingsheng Huang, Wenqiao Zhang, Siliang Tang, Yueting Zhuang

    Abstract: Retrieval-Augmented Generation (RAG) methods augment the input of Large Language Models (LLMs) with relevant retrieved passages, reducing factual errors in knowledge-intensive tasks. However, contemporary RAG approaches suffer from irrelevant knowledge retrieval issues in complex domain questions (e.g., HotPot QA) due to the lack of corresponding domain knowledge, leading to low-quality generation… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: 5 pages

  30. arXiv:2405.12796  [pdf, other

    cs.CV

    DisenStudio: Customized Multi-subject Text-to-Video Generation with Disentangled Spatial Control

    Authors: Hong Chen, Xin Wang, Yipeng Zhang, Yuwei Zhou, Zeyang Zhang, Siao Tang, Wenwu Zhu

    Abstract: Generating customized content in videos has received increasing attention recently. However, existing works primarily focus on customized text-to-video generation for single subject, suffering from subject-missing and attribute-binding problems when the video is expected to contain multiple subjects. Furthermore, existing models struggle to assign the desired actions to the corresponding subjects… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

  31. arXiv:2405.10553  [pdf, other

    eess.SP

    Revealing the Trade-off in ISAC Systems: The KL Divergence Perspective

    Authors: Zesong Fei, Shuntian Tang, Xinyi Wang, Fanghao Xia, Fan Liu, J. Andrew Zhang

    Abstract: Integrated sensing and communication (ISAC) is regarded as a promising technique for 6G communication network. In this letter, we investigate the Pareto bound of the ISAC system in terms of a unified Kullback-Leibler (KL) divergence performance metric. We firstly present the relationship between KL divergence and explicit ISAC performance metric, i.e., demodulation error and probability of detecti… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 5 pages, 5 figures; submitted to IEEE journals for possible publication

  32. arXiv:2405.09776  [pdf

    cond-mat.mtrl-sci cond-mat.str-el

    Magnetic structure and magnetoelectric coupling in antiferromagnet Co5(TeO3)4Cl2

    Authors: B. Yu, L. Huang, J. S. Li, L. Lin, V. Ovidiu Garlea, Q. Zhang, T. Zou, J. C. Zhang, J. Peng, Y. S. Tang, G. Z. Zhou, J. H. Zhang, S. H. Zheng, M. F. Liu, Z. B. Yan, X. H. Zhou, S. Dong, J. G. Wan, J. -M. Liu

    Abstract: The van der Waals (vdW) layered multiferroics, which host simultaneous ferroelectric and magnetic orders, have attracted attention not only for their potentials to be utilized in nanoelectric devices and spintronics, but also offer alternative opportunities for emergent physical phenomena. To date, the vdW layered multiferroic materials are still very rare. In this work, we have investigated the m… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 31 pages, 9 figures

    Journal ref: Phys. Rev. B 109, 184106(2024)

  33. arXiv:2405.09350  [pdf, other

    astro-ph.CO astro-ph.GA astro-ph.HE

    Digging into the ultraviolet luminosity functions of galaxies at high redshifts: galaxies evolution, reionization, and cosmological parameters

    Authors: Yi-Ying Wang, Lei Lei, Shao-Peng Tang, Guan-Wen Yuan, Yi-Zhong Fan

    Abstract: Thanks to the successful performance of the James Webb Space Telescope, our understanding of the epoch of reionization of the Universe has been advanced. The ultraviolet luminosity functions (UV LFs) of galaxies span a wide range of redshift, not only revealing the connection between galaxies and dark matter (DM) halos but also providing the information during reionization. In this work, we develo… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

    Comments: 15 pages, 9 figures and 3 tables

  34. arXiv:2405.09234  [pdf, other

    eess.IV

    Enhancing Image Privacy in Semantic Communication over Wiretap Channels leveraging Differential Privacy

    Authors: Weixuan Chen, Shunpu Tang, Qianqian Yang

    Abstract: Semantic communication (SemCom) enhances transmission efficiency by sending only task-relevant information compared to traditional methods. However, transmitting semantic-rich data over insecure or public channels poses security and privacy risks. This paper addresses the privacy problem of transmitting images over wiretap channels and proposes a novel SemCom approach ensuring privacy through a di… ▽ More

    Submitted 15 May, 2024; originally announced May 2024.

  35. arXiv:2405.07442  [pdf

    cs.SD cs.AI eess.AS q-bio.QM

    Rene: A Pre-trained Multi-modal Architecture for Auscultation of Respiratory Diseases

    Authors: Pengfei Zhang, Zhihang Zheng, Shichen Zhang, Minghao Yang, Shaojun Tang

    Abstract: Compared with invasive examinations that require tissue sampling, respiratory sound testing is a non-invasive examination method that is safer and easier for patients to accept. In this study, we introduce Rene, a pioneering large-scale model tailored for respiratory sound recognition. Rene has been rigorously fine-tuned with an extensive dataset featuring a broad array of respiratory audio sample… ▽ More

    Submitted 6 June, 2024; v1 submitted 12 May, 2024; originally announced May 2024.

  36. arXiv:2405.07413  [pdf

    cond-mat.mtrl-sci

    Unraveling Anisotropic Hybridizations of Solid-state Electrolyte Nano-films in Li-ion Batteries

    Authors: Yuanjie Ning, Wenjun Wu, Liang Dai, Shuo Sun, Zhigang Zeng, Dengsong Zhang, Mark B. H. Breese, Chuanbing Cai, Chi Sin Tang, Xinmao Yin

    Abstract: Li2WO4 (LWO) is recognized for its potential as a solid-state electrolyte and it has demonstrated the ability to enhance the electrochemical performance of LiCoO2 (LCO) cathodes in Li-ion batteries. However, prior investigations into LWO have predominantly involved polycrystalline structures, thereby lacking a comprehensive understanding of its behavior when interfaced with single crystal systems,… ▽ More

    Submitted 12 May, 2024; originally announced May 2024.

    Comments: 8 pages, 5 figures

  37. arXiv:2405.06309  [pdf, ps, other

    math.PR math.OC

    Viscosity Solutions of Second Order Path-Dependent Partial Differential Equations and Applications

    Authors: Shanjian Tang, Jianjun Zhou

    Abstract: In this article, a notion of viscosity solutions is introduced for fully nonlinear second order path-dependent partial differential equations in the spirit of [Zhou, Ann. Appl. Probab., 33 (2023), 5564-5612]. We prove the existence, comparison principle, consistency and stability for the viscosity solutions. Application to path-dependent stochastic differential games is given.

    Submitted 10 May, 2024; originally announced May 2024.

    Comments: 27 pages. arXiv admin note: text overlap with arXiv:2005.05309

    MSC Class: 93E20; 60H30; 49L20; 49L25

  38. arXiv:2405.05545  [pdf, other

    cs.LG stat.ML

    Deep Hierarchical Graph Alignment Kernels

    Authors: Shuhao Tang, Hao Tian, Xiaofeng Cao, Wei Ye

    Abstract: Typical R-convolution graph kernels invoke the kernel functions that decompose graphs into non-isomorphic substructures and compare them. However, overlooking implicit similarities and topological position information between those substructures limits their performances. In this paper, we introduce Deep Hierarchical Graph Alignment Kernels (DHGAK) to resolve this problem. Specifically, the relati… ▽ More

    Submitted 9 May, 2024; originally announced May 2024.

  39. arXiv:2405.02965  [pdf, other

    cs.AI cs.RO

    Robust Collaborative Perception without External Localization and Clock Devices

    Authors: Zixing Lei, Zhenyang Ni, Ruize Han, Shuo Tang, Dingju Wang, Chen Feng, Siheng Chen, Yanfeng Wang

    Abstract: A consistent spatial-temporal coordination across multiple agents is fundamental for collaborative perception, which seeks to improve perception abilities through information exchange among agents. To achieve this spatial-temporal alignment, traditional methods depend on external devices to provide localization and clock signals. However, hardware-generated signals could be vulnerable to noise and… ▽ More

    Submitted 31 May, 2024; v1 submitted 5 May, 2024; originally announced May 2024.

    Comments: 6pages, accepted to ICRA 2024

  40. arXiv:2405.01926  [pdf, other

    cs.CV

    Auto-Encoding Morph-Tokens for Multimodal LLM

    Authors: Kaihang Pan, Siliang Tang, Juncheng Li, Zhaoyu Fan, Wei Chow, Shuicheng Yan, Tat-Seng Chua, Yueting Zhuang, Hanwang Zhang

    Abstract: For multimodal LLMs, the synergy of visual comprehension (textual output) and generation (visual output) presents an ongoing challenge. This is due to a conflicting objective: for comprehension, an MLLM needs to abstract the visuals; for generation, it needs to preserve the visuals as much as possible. Thus, the objective is a dilemma for visual-tokens. To resolve the conflict, we propose encoding… ▽ More

    Submitted 3 May, 2024; originally announced May 2024.

    Comments: Accepted by ICML 2024

  41. arXiv:2404.19193  [pdf

    cond-mat.mtrl-sci physics.optics physics.plasm-ph

    Tunable Collective Excitations in Epitaxial Perovskite Nickelates

    Authors: Mengxia Sun, Xu He, Mingyao Chen, Chi Sin Tang, Xiongfang Liu, Liang Dai, Jishan Liu, Zhigang Zeng, Shuo Sun, Mark B. H. Breese, Chuanbing Cai, Yingge Du, Le Wang, Andrew T. S. Wee, Xinmao Yin

    Abstract: The formation of plasmons through the collective excitation of charge density has generated intense discussions, offering insights to fundamental sciences and potential applications. While the underlying physical principles have been well-established, the effects of many-body interactions and orbital hybridization on plasmonic dynamics remain understudied. In this work, we present the observation… ▽ More

    Submitted 1 June, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  42. arXiv:2404.18848  [pdf, other

    cs.LG cs.AI cs.CL

    FeDeRA:Efficient Fine-tuning of Language Models in Federated Learning Leveraging Weight Decomposition

    Authors: Yuxuan Yan, Qianqian Yang, Shunpu Tang, Zhiguo Shi

    Abstract: Despite their exceptional performance on various tasks after fine-tuning, pre-trained language models (PLMs) face significant challenges due to growing privacy concerns with data in centralized training methods. We consider federated learning (FL) to fine-tune PLMs in this paper. However, the substantial number of parameters in PLMs poses significant difficulties for client devices with limited co… ▽ More

    Submitted 25 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  43. arXiv:2404.18430  [pdf

    physics.app-ph cond-mat.mtrl-sci

    Realization of a Two-Dimensional Lieb Lattice in a Metal-Inorganic Framework with Flat Bands and Topological Edge States

    Authors: Wenjun Wu, Shuo Sun, Chi Sin Tang, Jing Wu, Yu Ma, Lingfeng Zhang, Chuanbing Cai, Jianxin Zhong, Milorad V. Milošević, Andrew T. S. Wee, Xinmao Yin

    Abstract: Flat bands and Dirac cones in materials are at the source of the exotic electronic and topological properties. The Lieb lattice is expected to host these electronic structures, arising from quantum destructive interference. Nevertheless, the experimental realization of a two-dimensional Lieb lattice remained challenging to date due to its intrinsic structural instability. After computationally des… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 24 pages,11 figures

  44. arXiv:2404.18412  [pdf

    cond-mat.mtrl-sci cond-mat.str-el

    Uncovering an Interfacial Band Resulting from Orbital Hybridization in Nickelate Heterostructures

    Authors: Mingyao Chen, Huimin Liu, Xu He, Minjuan Li, Chi Sin Tang, Mengxia Sun, Krishna Prasad Koirala, Mark E. Bowden, Yangyang Li, Xiongfang Liu, Difan Zhou, Shuo Sun, Mark B. H. Breese, Chuanbing Cai, Yingge Du, Andrew T. S. Wee, Le Wang, Xinmao Yin

    Abstract: The interaction of atomic orbitals at the interface of perovskite oxide heterostructures has been investigated for its profound impact on the band structures and electronic properties, giving rise to unique electronic states and a variety of tunable functionalities. In this study, we conducted an extensive investigation of the optical and electronic properties of epitaxial NdNiO3 thin films grown… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: 26 pages,4 figures

  45. arXiv:2404.18202  [pdf, other

    cs.AI cs.MM

    WorldGPT: Empowering LLM as Multimodal World Model

    Authors: Zhiqi Ge, Hongzhe Huang, Mingze Zhou, Juncheng Li, Guoming Wang, Siliang Tang, Yueting Zhuang

    Abstract: World models are progressively being employed across diverse fields, extending from basic environment simulation to complex scenario construction. However, existing models are mainly trained on domain-specific states and actions, and confined to single-modality state representations. In this paper, We introduce WorldGPT, a generalist world model built upon Multimodal Large Language Model (MLLM). W… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

  46. arXiv:2404.14059  [pdf, ps, other

    math.PR

    Dual Representation of Unbounded Dynamic Concave Utilities

    Authors: Shengjun Fan, Ying Hu, Shanjian Tang

    Abstract: In several linear spaces of possibly unbounded endowments, we represent the dynamic concave utilities (hence the dynamic convex risk measures) as the solutions of backward stochastic differential equations (BSDEs) with unbounded terminal values, with the help of our recent existence and uniqueness results on unbounded solutions of scalar BSDEs whose generators have a linear, super-linear, sub-quad… ▽ More

    Submitted 22 April, 2024; originally announced April 2024.

    Comments: 35 pages

  47. arXiv:2404.13558  [pdf, other

    cs.CV

    LASER: Tuning-Free LLM-Driven Attention Control for Efficient Text-conditioned Image-to-Animation

    Authors: Haoyu Zheng, Wenqiao Zhang, Yaoke Wang, Hao Zhou, Jiang Liu, Juncheng Li, Zheqi Lv, Siliang Tang, Yueting Zhuang

    Abstract: Revolutionary advancements in text-to-image models have unlocked new dimensions for sophisticated content creation, e.g., text-conditioned image editing, allowing us to edit the diverse images that convey highly complex visual concepts according to the textual guidance. Despite being promising, existing methods focus on texture- or non-rigid-based visual manipulation, which struggles to produce th… ▽ More

    Submitted 23 April, 2024; v1 submitted 21 April, 2024; originally announced April 2024.

    Comments: 10 pages, 7 figures

  48. arXiv:2404.12170  [pdf, other

    eess.SP cs.IT

    Secure Semantic Communication for Image Transmission in the Presence of Eavesdroppers

    Authors: Shunpu Tang, Chen Liu, Qianqian Yang, Shibo He, Dusit Niyato

    Abstract: Semantic communication (SemCom) has emerged as a key technology for the forthcoming sixth-generation (6G) network, attributed to its enhanced communication efficiency and robustness against channel noise. However, the open nature of wireless channels renders them vulnerable to eavesdropping, posing a serious threat to privacy. To address this issue, we propose a novel secure semantic communication… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

  49. arXiv:2404.11129  [pdf, other

    cs.CV

    Fact :Teaching MLLMs with Faithful, Concise and Transferable Rationales

    Authors: Minghe Gao, Shuang Chen, Liang Pang, Yuan Yao, Jisheng Dang, Wenqiao Zhang, Juncheng Li, Siliang Tang, Yueting Zhuang, Tat-Seng Chua

    Abstract: The remarkable performance of Multimodal Large Language Models (MLLMs) has unequivocally demonstrated their proficient understanding capabilities in handling a wide array of visual tasks. Nevertheless, the opaque nature of their black-box reasoning processes persists as an enigma, rendering them uninterpretable and struggling with hallucination. Their ability to execute intricate compositional rea… ▽ More

    Submitted 5 August, 2024; v1 submitted 17 April, 2024; originally announced April 2024.

  50. arXiv:2404.10394  [pdf, other

    cs.CV

    Portrait3D: Text-Guided High-Quality 3D Portrait Generation Using Pyramid Representation and GANs Prior

    Authors: Yiqian Wu, Hao Xu, Xiangjun Tang, Xien Chen, Siyu Tang, Zhebin Zhang, Chen Li, Xiaogang Jin

    Abstract: Existing neural rendering-based text-to-3D-portrait generation methods typically make use of human geometry prior and diffusion models to obtain guidance. However, relying solely on geometry information introduces issues such as the Janus problem, over-saturation, and over-smoothing. We present Portrait3D, a novel neural rendering-based framework with a novel joint geometry-appearance prior to ach… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.