Zum Hauptinhalt springen

Showing 1–50 of 336 results for author: Shi, T

.
  1. arXiv:2408.15965  [pdf, other

    cond-mat.quant-gas quant-ph

    Novel ground states and emergent quantum many-body scars in a two-species Rydberg atom array

    Authors: Lei-Yi-Nan Liu, Shun-Yao Yu, Shi-Rong Peng, Jie Sheng, Su Yi, Peng Xu, Shou-Shu Gong, Tao Shi, Jian Cui

    Abstract: Rydberg atom array has been established as one appealing platform for quantum simulation and quantum computation. Recent experimental development of trapping and controlling two-species atoms using optical tweezer arrays has brought more complex interactions in this game, enabling much versatile novel quantum states and phenomena to emerge and thus leading to a growing need for both theoretical an… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: 19 pages, 19 figures

  2. arXiv:2408.15549  [pdf, other

    cs.CL

    WildFeedback: Aligning LLMs With In-situ User Interactions And Feedback

    Authors: Taiwei Shi, Zhuoer Wang, Longqi Yang, Ying-Chun Lin, Zexue He, Mengting Wan, Pei Zhou, Sujay Jauhar, Xiaofeng Xu, Xia Song, Jennifer Neville

    Abstract: As large language models (LLMs) continue to advance, aligning these models with human preferences has emerged as a critical challenge. Traditional alignment methods, relying on human or LLM annotated datasets, are limited by their resource-intensive nature, inherent subjectivity, and the risk of feedback loops that amplify model biases. To overcome these limitations, we introduce WildFeedback, a n… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: 24 pages

  3. arXiv:2408.08394  [pdf

    cond-mat.str-el cond-mat.mtrl-sci physics.app-ph

    A topological Hund nodal line antiferromagnet

    Authors: Xian P. Yang, Yueh-Ting Yao, Pengyu Zheng, Shuyue Guan, Huibin Zhou, Tyler A. Cochran, Che-Min Lin, Jia-Xin Yin, Xiaoting Zhou, Zi-Jia Cheng, Zhaohu Li, Tong Shi, Md Shafayat Hossain, Shengwei Chi, Ilya Belopolski, Yu-Xiao Jiang, Maksim Litskevich, Gang Xu, Zhaoming Tian, Arun Bansil, Zhiping Yin, Shuang Jia, Tay-Rong Chang, M. Zahid Hasan

    Abstract: The interplay of topology, magnetism, and correlations gives rise to intriguing phases of matter. In this study, through state-of-the-art angle-resolved photoemission spectroscopy, density functional theory and dynamical mean-field theory calculations, we visualize a fourfold degenerate Dirac nodal line at the boundary of the bulk Brillouin zone in the antiferromagnet YMn2Ge2. We further demonstra… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

    Journal ref: Nature Communications volume 15, Article number: 7052 (2024)

  4. arXiv:2408.08209  [pdf, other

    cs.IR

    Modeling Domain and Feedback Transitions for Cross-Domain Sequential Recommendation

    Authors: Changshuo Zhang, Teng Shi, Xiao Zhang, Qi Liu, Ruobing Xie, Jun Xu, Ji-Rong Wen

    Abstract: Nowadays, many recommender systems encompass various domains to cater to users' diverse needs, leading to user behaviors transitioning across different domains. In fact, user behaviors across different domains reveal changes in preference toward recommended items. For instance, a shift from negative feedback to positive feedback indicates improved user satisfaction. However, existing cross-domain… ▽ More

    Submitted 15 August, 2024; originally announced August 2024.

  5. arXiv:2408.07791  [pdf, other

    cs.MM cs.AI cs.CV cs.LG

    An Efficient and Explanatory Image and Text Clustering System with Multimodal Autoencoder Architecture

    Authors: Tiancheng Shi, Yuanchen Wei, John R. Kender

    Abstract: We demonstrate the efficiencies and explanatory abilities of extensions to the common tools of Autoencoders and LLM interpreters, in the novel context of comparing different cultural approaches to the same international news event. We develop a new Convolutional-Recurrent Variational Autoencoder (CRVAE) model that extends the modalities of previous CVAE models, by using fully-connected latent laye… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

  6. arXiv:2408.04998  [pdf, other

    cs.CL cs.AI

    ProFuser: Progressive Fusion of Large Language Models

    Authors: Tianyuan Shi, Fanqi Wan, Canbin Huang, Xiaojun Quan, Chenliang Li, Ming Yan, Ji Zhang

    Abstract: While fusing the capacities and advantages of various large language models (LLMs) offers a pathway to construct more powerful and versatile models, a fundamental challenge is to properly select advantageous model during the training. Existing fusion methods primarily focus on the training mode that uses cross entropy on ground truth in a teacher-forcing setup to measure a model's advantage, which… ▽ More

    Submitted 9 August, 2024; originally announced August 2024.

  7. arXiv:2408.02559  [pdf, other

    cs.CL cs.AI

    Evaluating and Enhancing LLMs Agent based on Theory of Mind in Guandan: A Multi-Player Cooperative Game under Imperfect Information

    Authors: Yauwai Yim, Chunkit Chan, Tianyu Shi, Zheye Deng, Wei Fan, Tianshi Zheng, Yangqiu Song

    Abstract: Large language models (LLMs) have shown success in handling simple games with imperfect information and enabling multi-agent coordination, but their ability to facilitate practical collaboration against other agents in complex, imperfect information environments, especially in a non-English environment, still needs to be explored. This study investigates the applicability of knowledge acquired by… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

  8. arXiv:2407.19256  [pdf

    cs.AI cs.CL cs.LG

    Stochastic Parrots or ICU Experts? Large Language Models in Critical Care Medicine: A Scoping Review

    Authors: Tongyue Shi, Jun Ma, Zihan Yu, Haowei Xu, Minqi Xiong, Meirong Xiao, Yilin Li, Huiying Zhao, Guilan Kong

    Abstract: With the rapid development of artificial intelligence (AI), large language models (LLMs) have shown strong capabilities in natural language understanding, reasoning, and generation, attracting amounts of research interest in applying LLMs to health and medicine. Critical care medicine (CCM) provides diagnosis and treatment for critically ill patients who often require intensive monitoring and inte… ▽ More

    Submitted 27 July, 2024; originally announced July 2024.

    Comments: 28 pages, 5 figures

  9. arXiv:2407.17702  [pdf, other

    cond-mat.quant-gas

    Universal clusters in quasi-two-dimensional ultracold Fermi mixtures

    Authors: Ruijin Liu, Tingting Shi, Matteo Zaccanti, Xiaoling Cui

    Abstract: We study universal clusters in quasi-two dimensions (q2D) that consist of a light (L) atom interacting with two or three heavy (H) identical fermions, forming the trimer or tetramer bound state. The axial confinement in q2D is shown to lift the three-fold degeneracy of 3D trimer (tetramer) in $p$-wave channel and uniquely select the ground state with magnetic angular momentum $|m|=1$ ($m=0$). By v… ▽ More

    Submitted 3 August, 2024; v1 submitted 24 July, 2024; originally announced July 2024.

    Comments: 6 pages, 4 figures, with supplementary material (8 pages, 4 figures)

  10. arXiv:2407.16857  [pdf, other

    cs.RO cs.LG stat.ML

    SECRM-2D: RL-Based Efficient and Comfortable Route-Following Autonomous Driving with Analytic Safety Guarantees

    Authors: Tianyu Shi, Ilia Smirnov, Omar ElSamadisy, Baher Abdulhai

    Abstract: Over the last decade, there has been increasing interest in autonomous driving systems. Reinforcement Learning (RL) shows great promise for training autonomous driving controllers, being able to directly optimize a combination of criteria such as efficiency comfort, and stability. However, RL- based controllers typically offer no safety guarantees, making their readiness for real deployment questi… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

  11. arXiv:2407.11290  [pdf, other

    math.NA

    Distributed memory parallel adaptive tensor-train cross approximation

    Authors: Tianyi Shi, Daniel Hayes, Jing-Mei Qiu

    Abstract: The tensor-train (TT) format is a data-sparse tensor representation commonly used in high dimensional function approximations arising from computational and data sciences. Various sequential and parallel TT decomposition algorithms have been proposed for different tensor inputs and assumptions. In this paper, we propose subtensor parallel adaptive TT cross, which partitions a tensor onto distribut… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    MSC Class: 15A69; 65Y05; 65F99

  12. arXiv:2407.09066  [pdf

    physics.optics eess.SP

    Physical encryption and decryption for secure data transmission in optical networks leveraging the temporal Talbot effect and microwave photonics

    Authors: Chulun Lin, Taixia Shi, Yiqing Liu, Yang Chen

    Abstract: A novel microwave photonic scheme for secure data transmission in optical networks is proposed. The security of the scheme is guaranteed by physical encryption and decryption via the temporal Talbot effect in dispersive mediums. First, the original data is randomized in the digital domain by performing an exclusive OR operation using a random matrix. Subsequently, a time-varying multi-tone electri… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 19 pages, 15 figures, 1 table

  13. arXiv:2407.06083  [pdf, other

    cs.LG cs.IR

    A Survey of Controllable Learning: Methods and Applications in Information Retrieval

    Authors: Chenglei Shen, Xiao Zhang, Teng Shi, Changshuo Zhang, Guofu Xie, Jun Xu

    Abstract: Controllable learning (CL) emerges as a critical component in trustworthy machine learning, ensuring that learners meet predefined targets and can adaptively adjust without retraining according to the changes in those targets. We provide a formal definition of CL, and discuss its applications in information retrieval (IR) where information needs are often complex and dynamic. The survey categorize… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  14. arXiv:2407.06067  [pdf, other

    physics.atom-ph

    Faraday laser pumped cesium beam clock

    Authors: Hangbo Shi, Xiaomin Qin, Haijun Chen, Yufei Yan, Ziqi Lu, Zhiyang Wang, Zijie Liu, Xiaolei Guan, Qiang Wei, Tiantian Shi, Jingbiao Chen

    Abstract: We realize a high-performance compact optically pumped cesium beam clock using Faraday laser simultaneously as pumping and detection lasers. The Faraday laser, which is frequency stabilized by modulation transfer spectroscopy (MTS) technique, has narrow linewidth and superior frequency stability. Measured by optical heterodyne method between two identical systems, the linewidth of the Faraday lase… ▽ More

    Submitted 11 July, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

  15. arXiv:2407.03332  [pdf, other

    cs.CV cs.AI

    DDPM-MoCo: Advancing Industrial Surface Defect Generation and Detection with Generative and Contrastive Learning

    Authors: Yangfan He, Xinyan Wang, Tianyu Shi

    Abstract: The task of industrial detection based on deep learning often involves solving two problems: (1) obtaining sufficient and effective data samples, (2) and using efficient and convenient model training methods. In this paper, we introduce a novel defect-generation method, named DDPM-MoCo, to address these issues. Firstly, we utilize the Denoising Diffusion Probabilistic Model (DDPM) to generate high… ▽ More

    Submitted 9 May, 2024; originally announced July 2024.

  16. arXiv:2407.01219  [pdf, other

    cs.CL

    Searching for Best Practices in Retrieval-Augmented Generation

    Authors: Xiaohua Wang, Zhenghua Wang, Xuan Gao, Feiran Zhang, Yixin Wu, Zhibo Xu, Tianyuan Shi, Zhengyuan Wang, Shizheng Li, Qi Qian, Ruicheng Yin, Changze Lv, Xiaoqing Zheng, Xuanjing Huang

    Abstract: Retrieval-augmented generation (RAG) techniques have proven to be effective in integrating up-to-date information, mitigating hallucinations, and enhancing response quality, particularly in specialized domains. While many RAG approaches have been proposed to enhance large language models through query-dependent retrievals, these approaches still suffer from their complex implementation and prolong… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  17. arXiv:2406.17807  [pdf, other

    cs.CL cs.AI

    Enhancing Commentary Strategies for Imperfect Information Card Games: A Study of Large Language Models in Guandan Commentary

    Authors: Meiling Tao, Xuechen Liang, Ziyi Wang, Yiling Tao, Tianyu Shi

    Abstract: Recent advancements in large language models (LLMs) have unlocked the potential for generating high-quality game commentary. However, producing insightful and engaging commentary for complex games with incomplete information remains a significant challenge. In this paper, we introduce a novel commentary method that combine Reinforcement Learning (RL) and LLMs, tailored specifically for the Chinese… ▽ More

    Submitted 3 August, 2024; v1 submitted 23 June, 2024; originally announced June 2024.

  18. arXiv:2406.16942  [pdf, other

    eess.IV cs.AI cs.CV

    Enhancing Diagnostic Reliability of Foundation Model with Uncertainty Estimation in OCT Images

    Authors: Yuanyuan Peng, Aidi Lin, Meng Wang, Tian Lin, Ke Zou, Yinglin Cheng, Tingkun Shi, Xulong Liao, Lixia Feng, Zhen Liang, Xinjian Chen, Huazhu Fu, Haoyu Chen

    Abstract: Inability to express the confidence level and detect unseen classes has limited the clinical implementation of artificial intelligence in the real-world. We developed a foundation model with uncertainty estimation (FMUE) to detect 11 retinal conditions on optical coherence tomography (OCT). In the internal test set, FMUE achieved a higher F1 score of 96.76% than two state-of-the-art algorithms, RE… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: All codes are available at https://github.com/yuanyuanpeng0129/FMUE

  19. arXiv:2406.16062  [pdf, other

    cs.NE

    Towards Biologically Plausible Computing: A Comprehensive Comparison

    Authors: Changze Lv, Yufei Gu, Zhengkang Guo, Zhibo Xu, Yixin Wu, Feiran Zhang, Tianyuan Shi, Zhenghua Wang, Ruicheng Yin, Yu Shang, Siqi Zhong, Xiaohua Wang, Muling Wu, Wenhao Liu, Tianlong Li, Jianhao Zhu, Cenyuan Zhang, Zixuan Ling, Xiaoqing Zheng

    Abstract: Backpropagation is a cornerstone algorithm in training neural networks for supervised learning, which uses a gradient descent method to update network weights by minimizing the discrepancy between actual and desired outputs. Despite its pivotal role in propelling deep learning advancements, the biological plausibility of backpropagation is questioned due to its requirements for weight symmetry, gl… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

  20. arXiv:2406.14067  [pdf

    physics.optics eess.SP

    A microwave photonic prototype for concurrent radar detection and spectrum sensing over an 8 to 40 GHz bandwidth

    Authors: Taixia Shi, Dingding Liang, Lu Wang, Lin Li, Shaogang Guo, Jiawei Gao, Xiaowei Li, Chulun Lin, Lei Shi, Baogang Ding, Shiyang Liu, Fangyi Yang, Chi Jiang, Yang Chen

    Abstract: In this work, a microwave photonic prototype for concurrent radar detection and spectrum sensing is proposed, designed, built, and investigated. A direct digital synthesizer and an analog electronic circuit are integrated to generate an intermediate frequency (IF) linearly frequency-modulated (LFM) signal with a tunable center frequency from 2.5 to 9.5 GHz and an instantaneous bandwidth of 1 GHz.… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 18 pages, 12 figures, 1 table

  21. arXiv:2406.09317  [pdf, other

    eess.IV cs.CV

    Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases

    Authors: Meng Wang, Tian Lin, Aidi Lin, Kai Yu, Yuanyuan Peng, Lianyu Wang, Cheng Chen, Ke Zou, Huiyu Liang, Man Chen, Xue Yao, Meiqin Zhang, Binwei Huang, Chaoxin Zheng, Peixin Zhang, Wei Chen, Yilong Luo, Yifan Chen, Honghe Xia, Tingkun Shi, Qi Zhang, Jinming Guo, Xiaolin Chen, Jingcheng Wang, Yih Chung Tham , et al. (24 additional authors not shown)

    Abstract: Previous foundation models for retinal images were pre-trained with limited disease categories and knowledge base. Here we introduce RetiZero, a vision-language foundation model that leverages knowledge from over 400 fundus diseases. To RetiZero's pre-training, we compiled 341,896 fundus images paired with text descriptions, sourced from public datasets, ophthalmic literature, and online resources… ▽ More

    Submitted 30 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

  22. arXiv:2406.07590  [pdf, other

    cs.LG cs.AI

    StreamPrompt: Learnable Prompt-guided Data Selection for Efficient Stream Learning

    Authors: Tongjun Shi, Shuhao Zhang

    Abstract: Stream Learning (SL) requires models to rapidly adapt to continuous data streams, setting it apart from traditional Continual Learning (CL). Recent SL methods emphasize efficiency by selecting data subsets for training, but they often struggle due to their reliance on static, rule-based selection algorithms that cannot effectively adapt to the changing importance of data. In this work, we introduc… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  23. arXiv:2406.06412  [pdf, ps, other

    cond-mat.quant-gas

    Bose-Einstein condensates of microwave-shielded polar molecules

    Authors: Wei-Jian Jin, Fulin Deng, Su Yi, Tao Shi

    Abstract: We investigate the ground-state properties of the ultracold gases of bosonic microwave-shielded polar molecules. To account for the large shielding core of the inter-molecular potential, we adopt a variational ansatz incorporating the Jastrow correlation factor. We show that the system is always stable and supports a self-bound gas phase and an expanding gas phase. We also calculate the condensate… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

  24. arXiv:2406.05628  [pdf, other

    cs.LG

    Domain Generalization Guided by Large-Scale Pre-Trained Priors

    Authors: Zongbin Wang, Bin Pan, Shiyu Shen, Tianyang Shi, Zhenwei Shi

    Abstract: Domain generalization (DG) aims to train a model from limited source domains, allowing it to generalize to unknown target domains. Typically, DG models only employ large-scale pre-trained models during the initialization of fine-tuning. However, large-scale pre-trained models already possess the ability to resist domain shift. If we reference pre-trained models continuously during fine-tuning to m… ▽ More

    Submitted 8 June, 2024; originally announced June 2024.

  25. arXiv:2406.04828  [pdf, other

    cs.IR

    QAGCF: Graph Collaborative Filtering for Q&A Recommendation

    Authors: Changshuo Zhang, Teng Shi, Xiao Zhang, Yanping Zheng, Ruobing Xie, Qi Liu, Jun Xu, Ji-Rong Wen

    Abstract: Question and answer (Q&A) platforms usually recommend question-answer pairs to meet users' knowledge acquisition needs, unlike traditional recommendations that recommend only one item. This makes user behaviors more complex, and presents two challenges for Q&A recommendation, including: the collaborative information entanglement, which means user feedback is influenced by either the question or th… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  26. arXiv:2406.04712  [pdf, other

    cs.CL

    AICoderEval: Improving AI Domain Code Generation of Large Language Models

    Authors: Yinghui Xia, Yuyan Chen, Tianyu Shi, Jun Wang, Jinsong Yang

    Abstract: Automated code generation is a pivotal capability of large language models (LLMs). However, assessing this capability in real-world scenarios remains challenging. Previous methods focus more on low-level code generation, such as model loading, instead of generating high-level codes catering for real-world tasks, such as image-to-text, text classification, in various domains. Therefore, we construc… ▽ More

    Submitted 7 June, 2024; originally announced June 2024.

  27. arXiv:2405.16847  [pdf, other

    cs.CV cs.AI

    TokenUnify: Scalable Autoregressive Visual Pre-training with Mixture Token Prediction

    Authors: Yinda Chen, Haoyuan Shi, Xiaoyu Liu, Te Shi, Ruobing Zhang, Dong Liu, Zhiwei Xiong, Feng Wu

    Abstract: Autoregressive next-token prediction is a standard pretraining method for large-scale language models, but its application to vision tasks is hindered by the non-sequential nature of image data, leading to cumulative errors. Most vision models employ masked autoencoder (MAE) based pretraining, which faces scalability issues. To address these challenges, we introduce \textbf{TokenUnify}, a novel pr… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  28. arXiv:2405.16701  [pdf, other

    cs.CV

    Detail-Enhanced Intra- and Inter-modal Interaction for Audio-Visual Emotion Recognition

    Authors: Tong Shi, Xuri Ge, Joemon M. Jose, Nicolas Pugeault, Paul Henderson

    Abstract: Capturing complex temporal relationships between video and audio modalities is vital for Audio-Visual Emotion Recognition (AVER). However, existing methods lack attention to local details, such as facial state changes between video frames, which can reduce the discriminability of features and thus lower recognition accuracy. In this paper, we propose a Detail-Enhanced Intra- and Inter-modal Intera… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: Submitted to 27th International Conference of Pattern Recognition (ICPR 2024)

  29. arXiv:2405.16553  [pdf, ps, other

    cond-mat.quant-gas

    Unveiling quantum phases in quasi-one-dimensional dipolar gases using continuous matrix product state

    Authors: Li Peng, Junqiao Pan, Su Yi, Tao Shi

    Abstract: We investigate the ground-state properties of the quasi-one-dimensional dipolar gases using continuous matrix product states techniques. Making use of the first- and second-order correlation functions, we find that the system supports the superfluid, super-Tonks-Girardeau, and quasicrystal phases according to the Luttinger liquid theory. We also map out the phase diagram on the parameter plane con… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

  30. arXiv:2405.15271  [pdf

    eess.SY physics.ins-det physics.optics

    Seamless Integration and Implementation of Distributed Contact and Contactless Vital Sign Monitoring

    Authors: Dingding Liang, Yang Chen, Jiawei Gao, Taixia Shi, Jianping Yao

    Abstract: Real-time vital sign monitoring is gaining immense significance not only in the medical field but also in personal health management. Facing the needs of different application scenarios of the smart and healthy city in the future, the low-cost, large-scale, scalable, and distributed vital sign monitoring system is of great significance. In this work, a seamlessly integrated contact and contactless… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 14 pages,9 figures

  31. arXiv:2405.13645  [pdf, other

    quant-ph

    Formation and Dissociation of Field-Linked Tetramers

    Authors: Fulin Deng, Xing-Yan Chen, Xin-Yu Luo, Wenxian Zhang, Su Yi, Tao Shi

    Abstract: We investigate the static and dynamic properties of tetratomic molecules formed by two microwave-shielded polar molecules across field-linked resonances. In particular, we focus on two-body physics and experimental techniques unexplored in the recent experiment [X.-Y. Chen {\it et al}., Nature {\bf626}, 283 (2024)]. We show that, compared to the lowest tetramer state, higher tetramer states typica… ▽ More

    Submitted 22 May, 2024; originally announced May 2024.

  32. arXiv:2405.04135  [pdf, other

    cs.AI

    In-context Learning for Automated Driving Scenarios

    Authors: Ziqi Zhou, Jingyue Zhang, Jingyuan Zhang, Boyue Wang, Tianyu Shi, Alaa Khamis

    Abstract: One of the key challenges in current Reinforcement Learning (RL)-based Automated Driving (AD) agents is achieving flexible, precise, and human-like behavior cost-effectively. This paper introduces an innovative approach utilizing Large Language Models (LLMs) to intuitively and effectively optimize RL reward functions in a human-centric way. We developed a framework where instructions and dynamic e… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

    Comments: 7 pages, 6 figures, 35 references

  33. arXiv:2405.02289  [pdf, other

    cs.RO

    TSDiT: Traffic Scene Diffusion Models With Transformers

    Authors: Chen Yang, Tianyu Shi

    Abstract: In this paper, we introduce a novel approach to trajectory generation for autonomous driving, combining the strengths of Diffusion models and Transformers. First, we use the historical trajectory data for efficient preprocessing and generate action latent using a diffusion model with DiT(Diffusion with Transformers) Blocks to increase scene diversity and stochasticity of agent actions. Then, we co… ▽ More

    Submitted 21 December, 2023; originally announced May 2024.

  34. arXiv:2405.01063  [pdf, other

    cs.IR cs.CY cs.LG

    Fair Recommendations with Limited Sensitive Attributes: A Distributionally Robust Optimization Approach

    Authors: Tianhao Shi, Yang Zhang, Jizhi Zhang, Fuli Feng, Xiangnan He

    Abstract: As recommender systems are indispensable in various domains such as job searching and e-commerce, providing equitable recommendations to users with different sensitive attributes becomes an imperative requirement. Prior approaches for enhancing fairness in recommender systems presume the availability of all sensitive attributes, which can be difficult to obtain due to privacy concerns or inadequat… ▽ More

    Submitted 27 May, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Comments: 8 pages, 5 figures, accepted by SIGIR'24

  35. arXiv:2405.00478  [pdf, ps, other

    physics.atom-ph physics.optics quant-ph

    Dual-frequency optical-microwave atomic clocks based on cesium atoms

    Authors: Tiantian Shi, Qiang Wei, Xiaomin Qin, Zhenfeng Liu, Kunkun Chen, Shiying Cao, Hangbo Shi, Zijie Liu, Jingbiao Chen

    Abstract: $^{133}$Cs, which is the only stable cesium (Cs) isotope, is one of the most investigated elements in atomic spectroscopy and was used to realize the atomic clock in 1955. Among all atomic clocks, the cesium atomic clock has a special place, since the current unit of time is based on a microwave transition in the Cs atom. In addition, the long lifetime of the $6{\text{P}}_{3/2}… ▽ More

    Submitted 1 May, 2024; originally announced May 2024.

    Comments: 8 pages, 4 figures

  36. arXiv:2404.12529  [pdf, other

    cs.NI cs.HC

    A Survey of Bluetooth Indoor Localization

    Authors: Taolei Shi, Wei Gong

    Abstract: Nowadays, indoor localization has received extensive research interest due to more and more applications' needs for location information to provide a more precise and effective service [1], [2]. There are various wireless techniques and mechanisms that have been proposed; some of them have been studied in depth and come into use, such as Wi-Fi, RFID, and sensor networks. In comparison, the develop… ▽ More

    Submitted 18 April, 2024; originally announced April 2024.

    Comments: 8 pages, 2 figures

  37. arXiv:2404.11168  [pdf

    physics.optics eess.SP

    Microwave photonic short-time Fourier transform based on stabilized period-one nonlinear laser dynamics and stimulated Brillouin scattering

    Authors: Sunan Zhang, Taixia Shi, Lizhong Jiang, Yang Chen

    Abstract: A microwave photonic short-time Fourier transform (STFT) system based on stabilized period-one (P1) nonlinear laser dynamics and stimulated Brillouin scattering (SBS) is proposed. By using an optoelectronic feedback loop, the frequency-sweep optical signal generated by the P1 nonlinear laser dynamics is stabilized, which is further used in conjunction with an optical bandpass filter implemented by… ▽ More

    Submitted 17 April, 2024; originally announced April 2024.

    Comments: 9 pages, 6 figures

  38. arXiv:2404.09520  [pdf, other

    cs.IR

    UniSAR: Modeling User Transition Behaviors between Search and Recommendation

    Authors: Teng Shi, Zihua Si, Jun Xu, Xiao Zhang, Xiaoxue Zang, Kai Zheng, Dewei Leng, Yanan Niu, Yang Song

    Abstract: Nowadays, many platforms provide users with both search and recommendation services as important tools for accessing information. The phenomenon has led to a correlation between user search and recommendation behaviors, providing an opportunity to model user interests in a fine-grained way. Existing approaches either model user search and recommendation behaviors separately or overlook the differe… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted by SIGIR 2024

  39. arXiv:2404.08226  [pdf, other

    cs.CV

    Improving Continuous Sign Language Recognition with Adapted Image Models

    Authors: Lianyu Hu, Tongkai Shi, Liqing Gao, Zekang Liu, Wei Feng

    Abstract: The increase of web-scale weakly labelled image-text pairs have greatly facilitated the development of large-scale vision-language models (e.g., CLIP), which have shown impressive generalization performance over a series of downstream tasks. However, the massive model size and scarcity of available data limit their applications to fine-tune the whole model in downstream tasks. Besides, fully fine-… ▽ More

    Submitted 11 April, 2024; originally announced April 2024.

  40. arXiv:2404.05680  [pdf, other

    cs.CV

    SphereHead: Stable 3D Full-head Synthesis with Spherical Tri-plane Representation

    Authors: Heyuan Li, Ce Chen, Tianhao Shi, Yuda Qiu, Sizhe An, Guanying Chen, Xiaoguang Han

    Abstract: While recent advances in 3D-aware Generative Adversarial Networks (GANs) have aided the development of near-frontal view human face synthesis, the challenge of comprehensively synthesizing a full 3D head viewable from all angles still persists. Although PanoHead proves the possibilities of using a large-scale dataset with images of both frontal and back views for full-head synthesis, it often caus… ▽ More

    Submitted 16 July, 2024; v1 submitted 8 April, 2024; originally announced April 2024.

    Comments: Accepted by ECCV 2024. Project page: https://lhyfst.github.io/spherehead

  41. arXiv:2404.05374  [pdf

    eess.SP

    Seamlessly merging radar ranging/imaging, wireless communications, and spectrum sensing, for 6G empowered by microwave photonics

    Authors: Taixia Shi, Yang Chen, Jianping Yao

    Abstract: Integration of radar, wireless communications, and spectrum sensing is being investigated for 6G with an increased spectral efficiency. Microwave photonics (MWP), a technique that combines microwave engineering and photonic technology to take advantage of the wide bandwidth offered by photonics for microwave signal generation and processing is considered an effective solution for the implementatio… ▽ More

    Submitted 8 April, 2024; originally announced April 2024.

    Comments: 18 pages, 10 figures

  42. arXiv:2404.02082  [pdf, other

    cs.CV

    WcDT: World-centric Diffusion Transformer for Traffic Scene Generation

    Authors: Chen Yang, Aaron Xuxiang Tian, Dong Chen, Tianyu Shi, Arsalan Heydarian

    Abstract: In this paper, we introduce a novel approach for autonomous driving trajectory generation by harnessing the complementary strengths of diffusion probabilistic models (a.k.a., diffusion models) and transformers. Our proposed framework, termed the "World-Centric Diffusion Transformer" (WcDT), optimizes the entire trajectory generation process, from feature extraction to model inference. To enhance t… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

    Comments: 12 pages, 6 figures

  43. arXiv:2404.01663  [pdf, other

    cs.CL

    CMAT: A Multi-Agent Collaboration Tuning Framework for Enhancing Small Language Models

    Authors: Xuechen Liang, Meiling Tao, Yinghui Xia, Tianyu Shi, Jun Wang, JingSong Yang

    Abstract: Open large language models (LLMs) have significantly advanced the field of natural language processing, showcasing impressive performance across various tasks.Despite the significant advancements in LLMs, their effective operation still relies heavily on human input to accurately guide the dialogue flow, with agent tuning being a crucial optimization technique that involves human adjustments to th… ▽ More

    Submitted 26 August, 2024; v1 submitted 2 April, 2024; originally announced April 2024.

  44. Exact Work Distribution and Jarzynski's Equality of a Relativistic Particle in an Expanding Piston

    Authors: Xianghang Zhang, Tingzhang Shi, H. T. Quan

    Abstract: We study the non-equilibrium work in a pedagogical model of relativistic ideal gas. We obtain the exact work distribution and verify the Jarzynski's equality. In the non-relativistic limit, our results recover the non-relativistic results [arXiv:cond-mat/0502434]. We also find that, unlike the non-relativistic case, the work distribution no longer has zeros and the number of collisions in this rel… ▽ More

    Submitted 23 March, 2024; originally announced March 2024.

  45. arXiv:2403.06384  [pdf, other

    physics.atom-ph

    Precision Spectroscopy and Nuclear Structure Parameters in 7Li+ ion

    Authors: Hua Guan, Xiao-Qiu Qi, Peng-Peng Zhou, Wei Sun, Shao-Long Chen, Xu-Rui Chang, Yao Huang, Pei-Pei Zhang, Zong-Chao Yan, G. W. F. Drake, Ai-Xi Chen, Zhen-Xiang Zhong, Ting-Yun Shi, Ke-Lin Gao

    Abstract: The optical Ramsey technique is used to obtain precise measurements of the hyperfine splittings in the $2\,^3\!S_1$ and $2\,^3\!P_J$ states of $^7$Li$^+$. Together with bound-state quantum electrodynamic theory, the Zemach radius and quadrupole moment of the $^7$Li nucleus are determined to be $3.35(1)$~fm and $-3.86(5)$~fm$^2$ respectively, with the quadrupole moment deviating from the recommende… ▽ More

    Submitted 10 March, 2024; originally announced March 2024.

  46. arXiv:2402.11725  [pdf, other

    cs.CL cs.CR cs.CY

    How Susceptible are Large Language Models to Ideological Manipulation?

    Authors: Kai Chen, Zihao He, Jun Yan, Taiwei Shi, Kristina Lerman

    Abstract: Large Language Models (LLMs) possess the potential to exert substantial influence on public perceptions and interactions with information. This raises concerns about the societal impact that could arise if the ideologies within these models can be easily manipulated. In this work, we investigate how effectively LLMs can learn and generalize ideological biases from their instruction-tuning data. Ou… ▽ More

    Submitted 18 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

  47. arXiv:2402.04159  [pdf, ps, other

    math.AP math.DS

    Optimal transport in the frame of abstract Lax-Oleinik operator revisited

    Authors: Wei Cheng, Jiahui Hong, Tianqi Shi

    Abstract: This is our first paper on the extension of our recent work on the Lax-Oleinik commutators and its applications to the intrinsic approach of propagation of singularities of the viscosity solutions of Hamilton-Jacobi equations. We reformulate Kantorovich-Rubinstein duality theorem in the theory of optimal transport in terms of abstract Lax-Oleinik operators, and analyze the relevant optimal transpo… ▽ More

    Submitted 6 February, 2024; originally announced February 2024.

  48. arXiv:2401.16501  [pdf

    cs.LG cond-mat.mtrl-sci cs.AI

    AFSD-Physics: Exploring the governing equations of temperature evolution during additive friction stir deposition by a human-AI teaming approach

    Authors: Tony Shi, Mason Ma, Jiajie Wu, Chase Post, Elijah Charles, Tony Schmitz

    Abstract: This paper presents a modeling effort to explore the underlying physics of temperature evolution during additive friction stir deposition (AFSD) by a human-AI teaming approach. AFSD is an emerging solid-state additive manufacturing technology that deposits materials without melting. However, both process modeling and modeling of the AFSD tool are at an early stage. In this paper, a human-AI teamin… ▽ More

    Submitted 29 January, 2024; originally announced January 2024.

  49. arXiv:2401.10785  [pdf, ps, other

    eess.SY

    Composite learning backstepping control with guaranteed exponential stability and robustness

    Authors: Tian Shi, Changyun Wen, Yongping Pan

    Abstract: Adaptive backstepping control provides a feasible solution to achieve asymptotic tracking for mismatched uncertain nonlinear systems. However, input-to-state stability depends on high-gain feedback generated by nonlinear damping terms, and closed-loop exponential stability with parameter convergence involves a stringent condition named persistent excitation (PE). This paper proposes a composite le… ▽ More

    Submitted 19 January, 2024; originally announced January 2024.

  50. arXiv:2401.09432  [pdf, other

    cs.CL cs.AI cs.LG

    RoleCraft-GLM: Advancing Personalized Role-Playing in Large Language Models

    Authors: Meiling Tao, Xuechen Liang, Tianyu Shi, Lei Yu, Yiting Xie

    Abstract: This study presents RoleCraft-GLM, an innovative framework aimed at enhancing personalized role-playing with Large Language Models (LLMs). RoleCraft-GLM addresses the key issue of lacking personalized interactions in conversational AI, and offers a solution with detailed and emotionally nuanced character portrayals. We contribute a unique conversational dataset that shifts from conventional celebr… ▽ More

    Submitted 4 April, 2024; v1 submitted 17 December, 2023; originally announced January 2024.