
Showing 1–50 of 1,717 results for author: Yan, Y

  1. arXiv:2408.16765  [pdf, ps, other]

    cs.LG cs.AI math.PR math.ST stat.ML

    A Score-Based Density Formula, with Applications in Diffusion Generative Models

    Authors: Gen Li, Yuling Yan

    Abstract: Score-based generative models (SGMs) have revolutionized the field of generative modeling, achieving unprecedented success in generating realistic and diverse content. Despite empirical advances, the theoretical basis for why optimizing the evidence lower bound (ELBO) on the log-likelihood is effective for training diffusion generative models, such as DDPMs, remains largely unexplored. In this pap…

    Submitted 29 August, 2024; originally announced August 2024.

  2. arXiv:2408.16288  [pdf, other]

    cs.LG cs.AI cs.DB cs.SI

    OpenFGL: A Comprehensive Benchmarks for Federated Graph Learning

    Authors: Xunkai Li, Yinlin Zhu, Boyang Pang, Guochen Yan, Yeyu Yan, Zening Li, Zhengyu Wu, Wentao Zhang, Rong-Hua Li, Guoren Wang

    Abstract: Federated graph learning (FGL) has emerged as a promising distributed training paradigm for graph neural networks across multiple local systems without direct data sharing. This approach is particularly beneficial in privacy-sensitive scenarios and offers a new perspective on addressing scalability challenges in large-scale graph learning. Despite the proliferation of FGL, the diverse motivations…

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: Under Review

  3. arXiv:2408.15493  [pdf, ps, other]

    hep-ph nucl-th

    Investigating the $p$-$Ω$ Interaction and Correlation Functions

    Authors: Ye Yan, Youchang Yang, Qi Huang, Hongxia Huang, Jialun Ping

    Abstract: Motivated by the experimental measurements, we investigate the $p$-$Ω$ correlation functions and interactions. By solving the inverse scattering problem, we derive the $p$-$Ω$ potentials from a quark model. The effects of Coulomb interaction and spin-averaging are discussed. According to our results, the depletion of the $p$-$Ω$ correlation functions, attributed to the $J^P = 2^+$ bound state not…

    Submitted 27 August, 2024; originally announced August 2024.

    Comments: 9 pages, 6 figures

  4. arXiv:2408.14917  [pdf, other]

    cs.NE

    PMSN: A Parallel Multi-compartment Spiking Neuron for Multi-scale Temporal Processing

    Authors: Xinyi Chen, Jibin Wu, Chenxiang Ma, Yinsong Yan, Yujie Wu, Kay Chen Tan

    Abstract: Spiking Neural Networks (SNNs) hold great potential to realize brain-inspired, energy-efficient computational systems. However, current SNNs still fall short in terms of multi-scale temporal processing compared to their biological counterparts. This limitation has resulted in poor performance in many pattern recognition tasks with information that varies across different timescales. To address thi…

    Submitted 27 August, 2024; originally announced August 2024.

  5. arXiv:2408.14520  [pdf, other]

    cs.LG cs.AI cs.SI

    Towards Graph Prompt Learning: A Survey and Beyond

    Authors: Qingqing Long, Yuchen Yan, Peiyan Zhang, Chen Fang, Wentao Cui, Zhiyuan Ning, Meng Xiao, Ning Cao, Xiao Luo, Lingjun Xu, Shiyue Jiang, Zheng Fang, Chong Chen, Xian-Sheng Hua, Yuanchun Zhou

    Abstract: Large-scale "pre-train and prompt learning" paradigms have demonstrated remarkable adaptability, enabling broad applications across diverse domains such as question answering, image recognition, and multimodal retrieval. This approach fully leverages the potential of large-scale pre-trained models, reducing downstream data requirements and computational costs while enhancing model applicability ac…

    Submitted 29 August, 2024; v1 submitted 26 August, 2024; originally announced August 2024.

    Comments: 19 pages, 2 figures

  6. arXiv:2408.14506  [pdf, other]

    cs.LG

    Distilling Long-tailed Datasets

    Authors: Zhenghao Zhao, Haoxuan Wang, Yuzhang Shang, Kai Wang, Yan Yan

    Abstract: Dataset distillation (DD) aims to distill a small, information-rich dataset from a larger one for efficient neural network training. However, existing DD methods struggle with long-tailed datasets, which are prevalent in real-world scenarios. By investigating the reasons behind this unexpected result, we identified two main causes: 1) Expert networks trained on imbalanced data develop biased gradi…

    Submitted 24 August, 2024; originally announced August 2024.

  7. arXiv:2408.13430  [pdf, other]

    stat.AP cs.DL cs.GT cs.LG stat.ML

    Analysis of the ICML 2023 Ranking Data: Can Authors' Opinions of Their Own Papers Assist Peer Review in Machine Learning?

    Authors: Buxin Su, Jiayao Zhang, Natalie Collina, Yuling Yan, Didong Li, Kyunghyun Cho, Jianqing Fan, Aaron Roth, Weijie J. Su

    Abstract: We conducted an experiment during the review process of the 2023 International Conference on Machine Learning (ICML) that requested authors with multiple submissions to rank their own papers based on perceived quality. We received 1,342 rankings, each from a distinct author, pertaining to 2,592 submissions. In this paper, we present an empirical analysis of how author-provided rankings could be le…

    Submitted 23 August, 2024; originally announced August 2024.

    Comments: See more details about the experiment at https://openrank.cc/

  8. arXiv:2408.12710  [pdf, other]

    cs.HC

    CasualGaze: Towards Modeling and Recognizing Casual Gaze Behavior for Efficient Gaze-based Object Selection

    Authors: Yingtian Shi, Yukang Yan, Zisu Li, Chen Liang, Yuntao Wang, Chun Yu, Yuanchun Shi

    Abstract: We present CasualGaze, a novel eye-gaze-based target selection technique to support natural and casual eye-gaze input. Unlike existing solutions that require users to keep the eye-gaze center on the target actively, CasualGaze allows users to glance at the target object to complete the selection simply. To understand casual gaze behavior, we studied the spatial distribution of casual gaze for diff…

    Submitted 22 August, 2024; originally announced August 2024.

  9. arXiv:2408.12352  [pdf, other]

    cs.CV

    GarmentAligner: Text-to-Garment Generation via Retrieval-augmented Multi-level Corrections

    Authors: Shiyue Zhang, Zheng Chong, Xujie Zhang, Hanhui Li, Yuhao Cheng, Yiqiang Yan, Xiaodan Liang

    Abstract: General text-to-image models bring revolutionary innovation to the fields of arts, design, and media. However, when applied to garment generation, even the state-of-the-art text-to-image models suffer from fine-grained semantic misalignment, particularly concerning the quantity, position, and interrelations of garment components. Addressing this, we propose GarmentAligner, a text-to-garment diffus…

    Submitted 23 August, 2024; v1 submitted 22 August, 2024; originally announced August 2024.

    Comments: Accepted by ECCV 2024

  10. arXiv:2408.11660  [pdf, other]

    cs.AR cs.NI

    Anteumbler: Non-Invasive Antenna Orientation Error Measurement for WiFi APs

    Authors: Dawei Yan, Panlong Yang, Fei Shang, Nikolaos M. Freris, Yubo Yan

    Abstract: The performance of WiFi-based localization systems is affected by the spatial accuracy of WiFi AP. Compared with the imprecision of AP location and antenna separation, the imprecision of AP's or antenna's orientation is more important in real scenarios, including AP rotation and antenna irregular tilt. In this paper, we propose Anteumbler that non-invasively, accurately and efficiently measures th…

    Submitted 21 August, 2024; originally announced August 2024.

  11. arXiv:2408.11366  [pdf, other]

    cs.CL cs.LG

    GeoReasoner: Reasoning On Geospatially Grounded Context For Natural Language Understanding

    Authors: Yibo Yan, Joey Lee

    Abstract: In human reading and communication, individuals tend to engage in geospatial reasoning, which involves recognizing geographic entities and making informed inferences about their interrelationships. To mimic such cognitive process, current methods either utilize conventional natural language understanding toolkits, or directly apply models pretrained on geo-related natural language corpora. However…

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: Accepted by International Conference on Information and Knowledge Management 2024

  12. arXiv:2408.09452  [pdf, other]

    cs.CL

    Identifying Speakers and Addressees of Quotations in Novels with Prompt Learning

    Authors: Yuchen Yan, Hanjie Zhao, Senbin Zhu, Hongde Liu, Zhihong Zhang, Yuxiang Jia

    Abstract: Quotations in literary works, especially novels, are important to create characters, reflect character relationships, and drive plot development. Current research on quotation extraction in novels primarily focuses on quotation attribution, i.e., identifying the speaker of the quotation. However, the addressee of the quotation is also important to construct the relationship between the speaker and…

    Submitted 18 August, 2024; originally announced August 2024.

    Comments: This paper has been accepted by NLPCC 2024

  13. arXiv:2408.09429  [pdf, other]

    cs.LG cs.CL cs.CV

    Reefknot: A Comprehensive Benchmark for Relation Hallucination Evaluation, Analysis and Mitigation in Multimodal Large Language Models

    Authors: Kening Zheng, Junkai Chen, Yibo Yan, Xin Zou, Xuming Hu

    Abstract: Hallucination issues persistently plagued current multimodal large language models (MLLMs). While existing research primarily focuses on object-level or attribute-level hallucinations, sidelining the more sophisticated relation hallucinations that necessitate advanced reasoning abilities from MLLMs. Besides, recent benchmarks regarding relation hallucinations lack in-depth evaluation and effective…

    Submitted 18 August, 2024; originally announced August 2024.

  14. arXiv:2408.09320  [pdf, other]

    cs.HC cs.SD eess.AS

    Auptimize: Optimal Placement of Spatial Audio Cues for Extended Reality

    Authors: Hyunsung Cho, Alexander Wang, Divya Kartik, Emily Liying Xie, Yukang Yan, David Lindlbauer

    Abstract: Spatial audio in Extended Reality (XR) provides users with better awareness of where virtual elements are placed, and efficiently guides them to events such as notifications, system alerts from different windows, or approaching avatars. Humans, however, are inaccurate in localizing sound cues, especially with multiple sources due to limitations in human auditory perception such as angular discrimi…

    Submitted 17 August, 2024; originally announced August 2024.

    Comments: UIST 2024

    ACM Class: H.5.1; H.5.2; H.5.5

  15. arXiv:2408.07522  [pdf, other]

    cs.SD cs.LG eess.AS

    Optimising MFCC parameters for the automatic detection of respiratory diseases

    Authors: Yuyang Yan, Sami O. Simons, Loes van Bemmel, Lauren Reinders, Frits M. E. Franssen, Visara Urovi

    Abstract: Voice signals originating from the respiratory tract are utilized as valuable acoustic biomarkers for the diagnosis and assessment of respiratory diseases. Among the employed acoustic features, Mel Frequency Cepstral Coefficients (MFCC) is widely used for automatic analysis, with MFCC extraction commonly relying on default parameters. However, no comprehensive study has systematically investigated…

    Submitted 14 August, 2024; originally announced August 2024.

  16. arXiv:2408.07098  [pdf, other]

    cs.MA cs.AI

    QTypeMix: Enhancing Multi-Agent Cooperative Strategies through Heterogeneous and Homogeneous Value Decomposition

    Authors: Songchen Fu, Shaojing Zhao, Ta Li, YongHong Yan

    Abstract: In multi-agent cooperative tasks, the presence of heterogeneous agents is familiar. Compared to cooperation among homogeneous agents, collaboration requires considering the best-suited sub-tasks for each agent. However, the operation of multi-agent systems often involves a large amount of complex interaction information, making it more challenging to learn heterogeneous strategies. Related multi-a…

    Submitted 12 August, 2024; originally announced August 2024.

    Comments: 16 pages, 8 figures

    ACM Class: I.2.6; I.2.11

  17. InfinityMATH: A Scalable Instruction Tuning Dataset in Programmatic Mathematical Reasoning

    Authors: Bo-Wen Zhang, Yan Yan, Lin Li, Guang Liu

    Abstract: Recent advancements in Chain-of-Thoughts (CoT) and Program-of-Thoughts (PoT) methods have greatly enhanced language models' mathematical reasoning capabilities, facilitating their integration into instruction tuning datasets with LLMs. However, existing methods for large-scale dataset creation require substantial seed data and high computational costs for data synthesis, posing significant challen…

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: Accepted by CIKM 2024

    ACM Class: I.2.7

  18. arXiv:2408.05687  [pdf, other]

    nucl-th astro-ph.HE hep-ph

    Investigating the competition between the deconfinement and chiral phase transitions in light of the multimessenger observations of neutron stars

    Authors: Wen-Li Yuan, Bikai Gao, Yan Yan, Bolin Li, Renxin Xu

    Abstract: We extend the parity doublet model for hadronic matter and study the possible presence of quark matter inside the cores of neutron stars with the Nambu-Jona-Lasinio (NJL) model. Considering the uncertainties of the QCD phase diagram and the location of the critical endpoint, we aim to explore the competition between the chiral phase transition and the deconfinement phase transition systematically,…

    Submitted 12 August, 2024; v1 submitted 10 August, 2024; originally announced August 2024.

    Comments: 10 pages, 7 figures

  19. arXiv:2408.05112  [pdf, other]

    cs.LG cs.AI eess.IV

    Semantic Successive Refinement: A Generative AI-aided Semantic Communication Framework

    Authors: Kexin Zhang, Lixin Li, Wensheng Lin, Yuna Yan, Rui Li, Wenchi Cheng, Zhu Han

    Abstract: Semantic Communication (SC) is an emerging technology aiming to surpass the Shannon limit. Traditional SC strategies often minimize signal distortion between the original and reconstructed data, neglecting perceptual quality, especially in low Signal-to-Noise Ratio (SNR) environments. To address this issue, we introduce a novel Generative AI Semantic Communication (GSC) system for single-user scen…

    Submitted 31 July, 2024; originally announced August 2024.

  20. arXiv:2408.05006  [pdf, other]

    cs.SE cs.AI

    Enhancing the Code Debugging Ability of LLMs via Communicative Agent Based Data Refinement

    Authors: Weiqing Yang, Hanbin Wang, Zhenghao Liu, Xinze Li, Yukun Yan, Shuo Wang, Yu Gu, Minghe Yu, Zhiyuan Liu, Ge Yu

    Abstract: Debugging is a vital aspect of software development, yet the debugging capabilities of Large Language Models (LLMs) remain largely unexplored. This paper first introduces DEBUGEVAL, a comprehensive benchmark designed to evaluate the debugging capabilities of LLMs. DEBUGEVAL collects data from existing high-quality datasets and designs four different tasks to evaluate the debugging effectiveness, i…

    Submitted 9 August, 2024; originally announced August 2024.

  21. arXiv:2408.04130  [pdf, ps, other]

    hep-ph nucl-th

    Predicting X(2370) glueball-like particle production in pp collisions at the LHC energy with PACIAE model

    Authors: Jian Cao, Jin-Peng Zhang, Jia-Hao Shi, Zhi-Ying Qin, Wen-Chao Zhang, Hua Zheng, An-Ke Lei, Zhi-Lei She, Dai-Mei Zhou, Yu-Liang Yan, Ben-Hao Sa

    Abstract: Inspired by the BESIII newest observation of X(2370) glueball-like particle production in $e^+e^-$ collisions, we search its production in proton-proton (pp) collisions at $\sqrt{s}=$ 13 TeV with a parton and hadron cascade model PACIAE. In this model, the final partonic state (FPS) and final hadronic state (FHS) are consecutively simulated and recorded. The X(2370) glueball- or tetraquark-state i…

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: 6 pages, 5 figures

  22. Computational Trichromacy Reconstruction: Empowering the Color-Vision Deficient to Recognize Colors Using Augmented Reality

    Authors: Yuhao Zhu, Ethan Chen, Colin Hascup, Yukang Yan, Gaurav Sharma

    Abstract: We propose an assistive technology that helps individuals with Color Vision Deficiencies (CVD) to recognize/name colors. A dichromat's color perception is a reduced two-dimensional (2D) subset of a normal trichromat's three dimensional color (3D) perception, leading to confusion when visual stimuli that appear identical to the dichromat are referred to by different color names. Using our proposed…

    Submitted 3 August, 2024; originally announced August 2024.

  23. arXiv:2408.01431  [pdf]

    cs.CY cs.AI

    Building an Ethical and Trustworthy Biomedical AI Ecosystem for the Translational and Clinical Integration of Foundational Models

    Authors: Simha Sankar Baradwaj, Destiny Gilliland, Jack Rincon, Henning Hermjakob, Yu Yan, Irsyad Adam, Gwyneth Lemaster, Dean Wang, Karol Watson, Alex Bui, Wei Wang, Peipei Ping

    Abstract: Foundational Models (FMs) are gaining increasing attention in the biomedical AI ecosystem due to their ability to represent and contextualize multimodal biomedical data. These capabilities make FMs a valuable tool for a variety of tasks, including biomedical reasoning, hypothesis generation, and interpreting complex imaging data. In this review paper, we address the unique challenges associated wi…

    Submitted 13 August, 2024; v1 submitted 18 July, 2024; originally announced August 2024.

    Comments: 3 figures, 3 tables

  24. arXiv:2408.01262  [pdf, other]

    cs.CL cs.IR

    RAGEval: Scenario Specific RAG Evaluation Dataset Generation Framework

    Authors: Kunlun Zhu, Yifan Luo, Dingling Xu, Ruobing Wang, Shi Yu, Shuo Wang, Yukun Yan, Zhenghao Liu, Xu Han, Zhiyuan Liu, Maosong Sun

    Abstract: Retrieval-Augmented Generation (RAG) systems have demonstrated their advantages in alleviating the hallucination of Large Language Models (LLMs). Existing RAG benchmarks mainly focus on evaluating whether LLMs can correctly answer the general knowledge. However, they are unable to evaluate the effectiveness of the RAG system in dealing with the data from different vertical domains. This paper intr…

    Submitted 26 August, 2024; v1 submitted 2 August, 2024; originally announced August 2024.

    Comments: add github repo

  25. arXiv:2408.00971  [pdf, other]

    gr-qc

    Two distinct types of echoes in compact objects

    Authors: Shui-Fa Shen, Kai Lin, Tao Zhu, Yu-Peng Yan, Cheng-Gang Shao, Wei-Liang Qian

    Abstract: In the black hole perturbation theory framework, two different physical pictures for echoes in compact objects have been proposed. The first mechanism interprets echoes as repeated reflections of gravitational waves within a potential well, where the echo period is defined by twice the distance related to the spatial displacement operator that separates two local maxima of the effective potential.…

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: 17 pages and 6 figures

  26. arXiv:2408.00469  [pdf]

    cond-mat.supr-con cond-mat.str-el

    Evidence of electron interaction with an unidentified bosonic mode in superconductor CsCa$_2$Fe$_4$As$_4$F$_2$

    Authors: Peng Li, Sen Liao, Zhicheng Wang, Huaxun Li, Shiwu Su, Jiakang Zhang, Ziyuan Chen, Zhicheng Jiang, Zhengtai Liu, Lexian Yang, Linwei Huai, Junfeng He, Shengtao Cui, Zhe Sun, Yajun Yan, Guanghan Cao, Dawei Shen, Juan Jiang, Donglai Feng

    Abstract: The kink structure in band dispersion usually refers to a certain electron-boson interaction, which is crucial in understanding the pairing in unconventional superconductors. Here we report the evidence of the observation of a kink structure in Fe-based superconductor CsCa$_2$Fe$_4$As$_4$F$_2$ using angle-resolved photoemission spectroscopy. The kink shows an orbital selective and momentum depende…

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: 14 pages, 4 figures

    Journal ref: Nature Communications 15, 2024, 6433

  27. arXiv:2408.00247  [pdf, other]

    cs.IR

    Simple but Efficient: A Multi-Scenario Nearline Retrieval Framework for Recommendation on Taobao

    Authors: Yingcai Ma, Ziyang Wang, Yuliang Yan, Jian Wu, Yuning Jiang, Longbin Li, Wen Chen, Jianhang Huang

    Abstract: In recommendation systems, the matching stage is becoming increasingly critical, serving as the upper limit for the entire recommendation process. Recently, some studies have started to explore the use of multi-scenario information for recommendations, such as model-based and data-based approaches. However, the matching stage faces significant challenges due to the need for ultra-large-scale retri…

    Submitted 5 August, 2024; v1 submitted 31 July, 2024; originally announced August 2024.

  28. arXiv:2407.21507  [pdf, other]

    cs.AI cs.LG eess.IV

    FSSC: Federated Learning of Transformer Neural Networks for Semantic Image Communication

    Authors: Yuna Yan, Xin Zhang, Lixin Li, Wensheng Lin, Rui Li, Wenchi Cheng, Zhu Han

    Abstract: In this paper, we address the problem of image semantic communication in a multi-user deployment scenario and propose a federated learning (FL) strategy for a Swin Transformer-based semantic communication system (FSSC). Firstly, we demonstrate that the adoption of a Swin Transformer for joint source-channel coding (JSCC) effectively extracts semantic information in the communication system. Next,…

    Submitted 31 July, 2024; originally announced July 2024.

  29. arXiv:2407.20499  [pdf, other]

    cs.LG

    Optimizing Long-tailed Link Prediction in Graph Neural Networks through Structure Representation Enhancement

    Authors: Yakun Wang, Daixin Wang, Hongrui Liu, Binbin Hu, Yingcui Yan, Qiyang Zhang, Zhiqiang Zhang

    Abstract: Link prediction, as a fundamental task for graph neural networks (GNNs), has boasted significant progress in varied domains. Its success is typically influenced by the expressive power of node representation, but recent developments reveal the inferior performance of low-degree nodes owing to their sparse neighbor connections, known as the degree-based long-tailed problem. Will the degree-based lo…

    Submitted 29 July, 2024; originally announced July 2024.

  30. arXiv:2407.20272  [pdf, other]

    cs.CL cs.AI cs.LG

    An Efficient Inference Framework for Early-exit Large Language Models

    Authors: Ruijie Miao, Yihan Yan, Xinshuo Yao, Tong Yang

    Abstract: Building efficient inference framework has gained increasing interests for research community. Early-exit models, a variant of LLMs, improves the inference efficiency of LLMs by skipping rest layers and directly generate output tokens when they are confident enough. However, there is no work of LLM inference framework that takes early-exit models into consideration. This is non-trivial as prior ar…

    Submitted 25 July, 2024; originally announced July 2024.

  31. arXiv:2407.19499  [pdf, other]

    quant-ph

    Optimization for expectation value estimation with shallow quantum circuits

    Authors: Bujiao Wu, Yuxuan Yan, Fuchuan Wei, Zhenhuan Liu

    Abstract: Estimating linear properties of quantum states, such as fidelities, molecular energies, and correlation functions, is a fundamental task in quantum information science. The classical shadow has emerged as a prevalent tool due to its efficiency in estimating many independent observables simultaneously. However, it does not utilize the information of the target observable and the constraints of quan…

    Submitted 28 July, 2024; originally announced July 2024.

    Comments: 14 pages, 4 figures

  32. arXiv:2407.15452  [pdf, other]

    cs.LG cs.DC cs.SI

    GraphScale: A Framework to Enable Machine Learning over Billion-node Graphs

    Authors: Vipul Gupta, Xin Chen, Ruoyun Huang, Fanlong Meng, Jianjun Chen, Yujun Yan

    Abstract: Graph Neural Networks (GNNs) have emerged as powerful tools for supervised machine learning over graph-structured data, while sampling-based node representation learning is widely utilized in unsupervised learning. However, scalability remains a major challenge in both supervised and unsupervised learning for large graphs (e.g., those with over 1 billion nodes). The scalability bottleneck largely…

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: Published in the Proceedings of the 33rd ACM International Conference on Information and Knowledge Management (CIKM 2024), 8 Pages, 12 Figures

    Journal ref: Proceedings of the 33rd ACM International Conference on Information and Knowledge Management (CIKM 2024), October 21-25, 2024, Boise, ID, USA

  33. arXiv:2407.15345  [pdf, other]

    quant-ph physics.chem-ph

    Stability of Quantum Systems beyond Canonical Typicality

    Authors: Yu Su, Zi-Fan Zhu, Yao Wang, Rui-Xue Xu, YiJing Yan

    Abstract: Involvement of the environment is indispensable for establishing the statistical distribution of system. We analyze the statistical distribution of a quantum system coupled strongly with a heat bath. This distribution is determined by tracing over the bath's degrees of freedom for the equilibrium system-plus-bath composite. The stability of system distribution is largely affected by the system--ba…

    Submitted 21 July, 2024; originally announced July 2024.

    Comments: 5 pages, 4 figures

  34. arXiv:2407.14769  [pdf, other]

    cs.HC

    A Two-Phase Visualization System for Continuous Human-AI Collaboration in Sequelae Analysis and Modeling

    Authors: Yang Ouyang, Chenyang Zhang, He Wang, Tianle Ma, Chang Jiang, Yuheng Yan, Zuoqin Yan, Xiaojuan Ma, Chuhan Shi, Quan Li

    Abstract: In healthcare, AI techniques are widely used for tasks like risk assessment and anomaly detection. Despite AI's potential as a valuable assistant, its role in complex medical data analysis often oversimplifies human-AI collaboration dynamics. To address this, we collaborated with a local hospital, engaging six physicians and one data scientist in a formative study. From this collaboration, we prop…

    Submitted 20 July, 2024; originally announced July 2024.

    Comments: To appear at the IEEE VIS Conference 2024

  35. arXiv:2407.13598  [pdf, other]

    cs.HC

    KNOWNET: Guided Health Information Seeking from LLMs via Knowledge Graph Integration

    Authors: Youfu Yan, Yu Hou, Yongkang Xiao, Rui Zhang, Qianwen Wang

    Abstract: The increasing reliance on Large Language Models (LLMs) for health information seeking can pose severe risks due to the potential for misinformation and the complexity of these topics. This paper introduces KNOWNET a visualization system that integrates LLMs with Knowledge Graphs (KG) to provide enhanced accuracy and structured exploration. Specifically, for enhanced accuracy, KNOWNET extracts tri…

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: 9 pages, 9 figures, accepted by IEEE VIS 2024

  36. arXiv:2407.12996  [pdf, other]

    stat.ML cs.LG

    Sharpness-diversity tradeoff: improving flat ensembles with SharpBalance

    Authors: Haiquan Lu, Xiaotian Liu, Yefan Zhou, Qunli Li, Kurt Keutzer, Michael W. Mahoney, Yujun Yan, Huanrui Yang, Yaoqing Yang

    Abstract: Recent studies on deep ensembles have identified the sharpness of the local minima of individual learners and the diversity of the ensemble members as key factors in improving test-time performance. Building on this, our study investigates the interplay between sharpness and diversity within deep ensembles, illustrating their crucial role in robust generalization to both in-distribution (ID) and o…

    Submitted 17 July, 2024; originally announced July 2024.

  37. arXiv:2407.12888  [pdf]

    cs.CL cs.AI

    Explainable Biomedical Hypothesis Generation via Retrieval Augmented Generation enabled Large Language Models

    Authors: Alexander R. Pelletier, Joseph Ramirez, Irsyad Adam, Simha Sankar, Yu Yan, Ding Wang, Dylan Steinecke, Wei Wang, Peipei Ping

    Abstract: The vast amount of biomedical information available today presents a significant challenge for investigators seeking to digest, process, and understand these findings effectively. Large Language Models (LLMs) have emerged as powerful tools to navigate this complex and challenging data landscape. However, LLMs may lead to hallucinatory responses, making Retrieval Augmented Generation (RAG) crucial…

    Submitted 17 July, 2024; originally announced July 2024.

  38. arXiv:2407.12735  [pdf, other]

    cs.CV

    EchoSight: Advancing Visual-Language Models with Wiki Knowledge

    Authors: Yibin Yan, Weidi Xie

    Abstract: Knowledge-based Visual Question Answering (KVQA) tasks require answering questions about images using extensive background knowledge. Despite significant advancements, generative models often struggle with these tasks due to the limited integration of external knowledge. In this paper, we introduce EchoSight, a novel multimodal Retrieval-Augmented Generation (RAG) framework that enables large lang…

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: Technical Report; Project Page: https://go2heart.github.io/echosight

  39. arXiv:2407.12393  [pdf, other]

    cs.CL cs.AI cs.CY

    PersLLM: A Personified Training Approach for Large Language Models

    Authors: Zheni Zeng, Jiayi Chen, Huimin Chen, Yukun Yan, Yuxuan Chen, Zhenghao Liu, Zhiyuan Liu, Maosong Sun

    Abstract: Large language models exhibit aspects of human-level intelligence that catalyze their application as human-like agents in domains such as social simulations, human-machine interactions, and collaborative multi-agent systems. However, the absence of distinct personalities, such as displaying ingratiating behaviors, inconsistent opinions, and uniform response patterns, diminish LLMs utility in pract…

    Submitted 8 August, 2024; v1 submitted 17 July, 2024; originally announced July 2024.

    Comments: 10 pages for main text, 5 figures

  40. arXiv:2407.12385  [pdf, other]

    cs.IR

    RankTower: A Synergistic Framework for Enhancing Two-Tower Pre-Ranking Model

    Authors: YaChen Yan, Liubo Li

    Abstract: In large-scale ranking systems, cascading architectures have been widely adopted to achieve a balance between efficiency and effectiveness. The pre-ranking module plays a vital role in selecting a subset of candidates for the subsequent ranking module. It is crucial for the pre-ranking model to maintain a balance between efficiency and accuracy to adhere to online latency constraints. In this pape…

    Submitted 17 July, 2024; originally announced July 2024.

  41. arXiv:2407.12371  [pdf, other]

    cs.CV cs.AI

    HIMO: A New Benchmark for Full-Body Human Interacting with Multiple Objects

    Authors: Xintao Lv, Liang Xu, Yichao Yan, Xin Jin, Congsheng Xu, Shuwen Wu, Yifan Liu, Lincheng Li, Mengxiao Bi, Wenjun Zeng, Xiaokang Yang

    Abstract: Generating human-object interactions (HOIs) is critical with the tremendous advances of digital avatars. Existing datasets are typically limited to humans interacting with a single object while neglecting the ubiquitous manipulation of multiple objects. Thus, we propose HIMO, a large-scale MoCap dataset of full-body human interacting with multiple objects, containing 3.3K 4D HOI sequences and 4.08…

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: Project page: https://lvxintao.github.io/himo, accepted by ECCV 2024

  42. arXiv:2407.12228  [pdf, other]

    quant-ph

    Variational approach to light-matter interaction: Bridging quantum and semiclassical limits

    Authors: Yiying Yan, Zhiguo Lü, JunYan Luo

    Abstract: We present a time-dependent variational approach with the multiple Davydov $D_2$ trial state to simulate the dynamics of light-matter systems when the field is in a coherent state with an arbitrary finite mean photon number. The variational approach captures not only the system dynamics but also the field dynamics and is applicable to a variety of quantum models of light-matter interaction such as…

    Submitted 16 July, 2024; originally announced July 2024.

    Comments: 14 pages, 8 figures

  43. arXiv:2407.09935  [pdf, other]

    cs.CV cs.MM eess.IV

    LeRF: Learning Resampling Function for Adaptive and Efficient Image Interpolation

    Authors: Jiacheng Li, Chang Chen, Fenglong Song, Youliang Yan, Zhiwei Xiong

    Abstract: Image resampling is a basic technique that is widely employed in daily applications, such as camera photo editing. Recent deep neural networks (DNNs) have made impressive progress in performance by introducing learned data priors. Still, these methods are not the perfect substitute for interpolation, due to the drawbacks in efficiency and versatility. In this work, we propose a novel method of Lea…

    Submitted 13 July, 2024; originally announced July 2024.

    Comments: Code: https://github.com/ddlee-cn/LeRF-PyTorch

  44. arXiv:2407.08916  [pdf]

    cs.LG cs.IR

    Transforming Movie Recommendations with Advanced Machine Learning: A Study of NMF, SVD, and K-Means Clustering

    Authors: Yubing Yan, Camille Moreau, Zhuoyue Wang, Wenhan Fan, Chengqian Fu

    Abstract: This study develops a robust movie recommendation system using various machine learning techniques, including Non-Negative Matrix Factorization (NMF), Truncated Singular Value Decomposition (SVD), and K-Means clustering. The primary objective is to enhance user experience by providing personalized movie recommendations. The research encompasses data preprocessing, model training, and evaluation,…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: Accepted by 2024 4th International Symposium on Computer Technology and Information Science, IEEE

  45. arXiv:2407.08610  [pdf, other]

    cs.SE cs.LG

    Semantic GUI Scene Learning and Video Alignment for Detecting Duplicate Video-based Bug Reports

    Authors: Yanfu Yan, Nathan Cooper, Oscar Chaparro, Kevin Moran, Denys Poshyvanyk

    Abstract: Video-based bug reports are increasingly being used to document bugs for programs centered around a graphical user interface (GUI). However, developing automated techniques to manage video-based reports is challenging as it requires identifying and understanding often nuanced visual patterns that capture key information about a reported bug. In this paper, we aim to overcome these challenges by ad…

    Submitted 11 July, 2024; originally announced July 2024.

    Comments: 13 pages, accepted to 46th International Conference on Software Engineering (ICSE 2024)

  46. arXiv:2407.08468  [pdf, other]

    math.ST

    Matching-Based Policy Learning

    Authors: Xuqiao Li, Ying Yan

    Abstract: Treatment heterogeneity is ubiquitous in many areas, motivating practitioners to search for the optimal policy that maximizes the expected outcome based on individualized characteristics. However, most existing policy learning methods rely on weighting-based approaches, which may suffer from high instability in observational studies. To enhance the robustness of the estimated policy, we propose a…

    Submitted 11 July, 2024; originally announced July 2024.

  47. arXiv:2407.07661  [pdf, ps, other]

    hep-ph

    Confirming the glueball-like particle $X(2370)$ productions in $e^+e^-$ collisions at BESIII energy with PACIAE model

    Authors: Zhi-Lei She, An-Ke Lei, Wen-Chao Zhang, Yu-Liang Yan, Dai-Mei Zhou, Hua Zheng, Ben-Hao Sa

    Abstract: The parton and hadron cascade model PACIAE is employed to confirm the BESIII newest observation of glueball-like particle $\rm X(2370)$ production in $e^+e^-$ collisions at $\sqrt{s}=4.95\,\mathrm{GeV}$. We coalesce the $\rm X(2370)$ glueball state with two gluons in the simulated partonic final state by the Dynamically Constrained Phase-space Coalescence (DCPC) mod…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: 4 pages, 2 figures

  48. arXiv:2407.07268  [pdf, other]

    cs.CV

    Dataset Quantization with Active Learning based Adaptive Sampling

    Authors: Zhenghao Zhao, Yuzhang Shang, Junyi Wu, Yan Yan

    Abstract: Deep learning has made remarkable progress recently, largely due to the availability of large, well-labeled datasets. However, the training on such datasets elevates costs and computational demands. To address this, various techniques like coreset selection, dataset distillation, and dataset quantization have been explored in the literature. Unlike traditional techniques that depend on uniform sam…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: Accepted to ECCV 2024

  49. arXiv:2407.06067  [pdf, other]

    physics.atom-ph

    Faraday laser pumped cesium beam clock

    Authors: Hangbo Shi, Xiaomin Qin, Haijun Chen, Yufei Yan, Ziqi Lu, Zhiyang Wang, Zijie Liu, Xiaolei Guan, Qiang Wei, Tiantian Shi, Jingbiao Chen

    Abstract: We realize a high-performance compact optically pumped cesium beam clock using Faraday laser simultaneously as pumping and detection lasers. The Faraday laser, which is frequency stabilized by modulation transfer spectroscopy (MTS) technique, has narrow linewidth and superior frequency stability. Measured by optical heterodyne method between two identical systems, the linewidth of the Faraday lase…

    Submitted 11 July, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

  50. arXiv:2407.05771  [pdf, other]

    cs.CV

    Multi-times Monte Carlo Rendering for Inter-reflection Reconstruction

    Authors: Tengjie Zhu, Zhuo Chen, Jingnan Gao, Yichao Yan, Xiaokang Yang

    Abstract: Inverse rendering methods have achieved remarkable performance in reconstructing high-fidelity 3D objects with disentangled geometries, materials, and environmental light. However, they still face huge challenges in reflective surface reconstruction. Although recent methods model the light trace to learn specularity, the ignorance of indirect illumination makes it hard to handle inter-reflections…

    Submitted 7 August, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

    Comments: 10 pages, 6 figures, submitted to NeurIPS 2024