Showing 1–50 of 220 results for author: Xi, Y

  1. arXiv:2408.10520  [pdf, other]

    cs.IR

    Efficient and Deployable Knowledge Infusion for Open-World Recommendations via Large Language Models

    Authors: Yunjia Xi, Weiwen Liu, Jianghao Lin, Muyan Weng, Xiaoling Cai, Hong Zhu, Jieming Zhu, Bo Chen, Ruiming Tang, Yong Yu, Weinan Zhang

    Abstract: Recommender systems (RSs) play a pervasive role in today's online services, yet their closed-loop nature constrains their access to open-world knowledge. Recently, large language models (LLMs) have shown promise in bridging this gap. However, previous attempts to directly implement LLMs as recommenders fall short in meeting the requirements of industrial RSs, particularly in terms of online infere…

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: arXiv admin note: text overlap with arXiv:2306.10933

  2. arXiv:2408.07379  [pdf, other]

    stat.ML cs.LG math.NA math.ST

    Posterior Covariance Structures in Gaussian Processes

    Authors: Difeng Cai, Edmond Chow, Yuanzhe Xi

    Abstract: In this paper, we present a comprehensive analysis of the posterior covariance field in Gaussian processes, with applications to the posterior covariance matrix. The analysis is based on the Gaussian prior covariance but the approach also applies to other covariance kernels. Our geometric analysis reveals how the Gaussian kernel's bandwidth parameter and the spatial distribution of the observation…

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: 22 pages

  3. arXiv:2408.05676  [pdf, other]

    cs.IR

    A Decoding Acceleration Framework for Industrial Deployable LLM-based Recommender Systems

    Authors: Yunjia Xi, Hangyu Wang, Bo Chen, Jianghao Lin, Menghui Zhu, Weiwen Liu, Ruiming Tang, Weinan Zhang, Yong Yu

    Abstract: Recently, increasing attention has been paid to LLM-based recommender systems, but their deployment is still under exploration in the industry. Most deployments utilize LLMs as feature enhancers, generating augmentation knowledge in the offline stage. However, in recommendation scenarios, involving numerous users and items, even offline generation with LLMs consumes considerable time and resources…

    Submitted 10 August, 2024; originally announced August 2024.

  4. arXiv:2408.04299  [pdf, other]

    cs.CV

    Respiratory Subtraction for Pulmonary Microwave Ablation Evaluation

    Authors: Wan Li, Xinyun Zhong, Wei Li, Song Zhang, Moheng Rong, Yan Xi, Peng Yuan, Zechen Wang, Xiaolei Jiang, Rongxi Yi, Hui Tang, Yang Chen, Chaohui Tong, Zhan Wu, Feng Wang

    Abstract: Currently, lung cancer is a leading cause of global cancer mortality, often necessitating minimally invasive interventions. Microwave ablation (MWA) is extensively utilized for both primary and secondary lung tumors. Although numerous clinical guidelines and standards for MWA have been established, the clinical evaluation of ablation surgery remains challenging and requires long-term patient follo…

    Submitted 8 August, 2024; originally announced August 2024.

  5. arXiv:2407.18797  [pdf, ps, other]

    math.AP math.SP

    Hearing the shape of a drum by knocking around

    Authors: Xing Wang, Emmett L. Wyman, Yakun Xi

    Abstract: We study a variation of Kac's question, "Can one hear the shape of a drum?" if we allow ourselves access to some additional information. In particular, we allow ourselves to "hear" the local Weyl counting function at each point on the manifold and ask if this is enough to uniquely recover the Riemannian metric. This is physically equivalent to asking whether one can determine the shape of a drum…

    Submitted 26 July, 2024; originally announced July 2024.

    Comments: 7 pages

  6. arXiv:2407.15469  [pdf, other]

    physics.flu-dyn

    Numerical simulations of attachment-line boundary layer in hypersonic flow, Part II: the features of three-dimensional turbulent boundary layer

    Authors: Youcheng Xi, Bowen Yan, Guangwen Yang, Song Fu

    Abstract: In this study, we investigate the characteristics of three-dimensional turbulent boundary layers influenced by transverse flow and pressure gradients. Our findings reveal that even without assuming an infinite sweep, a fully developed turbulent boundary layer over the present swept blunt body maintains spanwise homogeneity, consistent with infinite sweep assumptions. We critically examine the law-of…

    Submitted 22 July, 2024; originally announced July 2024.

  7. arXiv:2407.15465  [pdf, other]

    physics.flu-dyn

    Numerical simulations of attachment-line boundary layer in hypersonic flow, Part I: roughness-induced subcritical transitions

    Authors: Youcheng Xi, Bowen Yan, Guangwen Yang, Xinguo Sha, Dehua Zhu, Song Fu

    Abstract: The attachment-line boundary layer is critical in hypersonic flows because of its significant impact on heat transfer and aerodynamic performance. In this study, high-fidelity numerical simulations are conducted to analyze the subcritical roughness-induced laminar-turbulent transition at the leading-edge attachment-line boundary layer of a blunt swept body under hypersonic conditions. This simulat…

    Submitted 22 July, 2024; originally announced July 2024.

  8. arXiv:2407.12447  [pdf]

    physics.ao-ph

    Low latency carbon budget analysis reveals a large decline of the land carbon sink in 2023

    Authors: Piyu Ke, Philippe Ciais, Stephen Sitch, Wei Li, Ana Bastos, Zhu Liu, Yidi Xu, Xiaofan Gui, Jiang Bian, Daniel S Goll, Yi Xi, Wanjing Li, Michael O'Sullivan, Jeffeson Goncalves de Souza, Pierre Friedlingstein, Frederic Chevallier

    Abstract: In 2023, the CO2 growth rate was 3.37 +/- 0.11 ppm at Mauna Loa, 86% above the previous year, and hitting a record high since observations began in 1958, while global fossil fuel CO2 emissions only increased by 0.6 +/- 0.5%. This implies an unprecedented weakening of land and ocean sinks, and raises the question of where and why this reduction happened. Here we show a global net land CO2 sink of 0…

    Submitted 17 July, 2024; originally announced July 2024.

  9. arXiv:2407.10208  [pdf, other]

    physics.app-ph cond-mat.mes-hall

    Achieving Peta-Ohm Resistance for Semi-Insulating 4H-SiC Devices by Atomic Layer Deposition

    Authors: Yuying Xi, Helios Y. Li, Guohui Li, Qingmei Su, Kaili Mao, Bingshe Xu, Yuying Hao, Nicholas X. Fang, Yanxia Cui

    Abstract: Growing demands for precise current measurements, such as atto-ampere-level measurement of cross-cellular biological current transduction, have spotlighted a pressing need for low-noise resistors with ultra-high resistance immune to voltage fluctuations. Traditional semi-insulating materials, however, struggle to provide consistent resistance across varying voltages. To bridge this gap, we introdu…

    Submitted 14 July, 2024; originally announced July 2024.

  10. arXiv:2407.04960  [pdf, other]

    cs.IR

    MemoCRS: Memory-enhanced Sequential Conversational Recommender Systems with Large Language Models

    Authors: Yunjia Xi, Weiwen Liu, Jianghao Lin, Bo Chen, Ruiming Tang, Weinan Zhang, Yong Yu

    Abstract: Conversational recommender systems (CRSs) aim to capture user preferences and provide personalized recommendations through multi-round natural language dialogues. However, most existing CRS models mainly focus on dialogue comprehension and preferences mining from the current dialogue session, overlooking user preferences in historical dialogue sessions. The preferences embedded in the user's histo…

    Submitted 6 July, 2024; originally announced July 2024.

  11. arXiv:2407.04368  [pdf, other]

    cs.CL cs.SD eess.AS

    Romanization Encoding For Multilingual ASR

    Authors: Wen Ding, Fei Jia, Hainan Xu, Yu Xi, Junjie Lai, Boris Ginsburg

    Abstract: We introduce romanization encoding for script-heavy languages to optimize multilingual and code-switching Automatic Speech Recognition (ASR) systems. By adopting romanization encoding alongside a balanced concatenated tokenizer within a FastConformer-RNNT framework equipped with a Roman2Char module, we significantly reduce vocabulary and output dimensions, enabling larger training batches and redu…

    Submitted 5 July, 2024; originally announced July 2024.

  12. arXiv:2407.04219  [pdf, other]

    eess.AS

    Semi-supervised Learning for Code-Switching ASR with Large Language Model Filter

    Authors: Yu Xi, Wen Ding, Kai Yu, Junjie Lai

    Abstract: Code-switching (CS) phenomenon occurs when words or phrases from different languages are alternated in a single sentence. Due to data scarcity, building an effective CS Automatic Speech Recognition (ASR) system remains challenging. In this paper, we propose to enhance CS-ASR systems by utilizing rich unsupervised monolingual speech data within a semi-supervised learning framework, particularly whe…

    Submitted 4 July, 2024; originally announced July 2024.

  13. arXiv:2407.03581  [pdf, ps, other]

    cond-mat.str-el

    Topologically nontrivial $1/3$-magnetization plateau state in a spin-1/2 trimer chain

    Authors: Y. Y. Han, B. C. Yu, Z. Du, L. S. Ling, L. Zhang, W. Tong, C. Y. Xi, J. L. Zhang, T. Shang, Li Pi, Long Ma

    Abstract: Topologically nontrivial Haldane phase is theoretically proposed to be realized in the 1/3-magnetization ($M$) plateau of spin-1/2 trimer systems. However, the spin excitation gap, typical characteristic of Haldane phase, is not yet experimentally verified. Here, we report the nuclear magnetic resonance investigations into the low-energy spin dynamics in the $S=1/2$ spin-trimer antiferromagnetic c…

    Submitted 3 July, 2024; originally announced July 2024.

    Comments: 6 pages, 4 figures

  14. arXiv:2407.03204  [pdf, other]

    cs.CV

    Expressive Gaussian Human Avatars from Monocular RGB Video

    Authors: Hezhen Hu, Zhiwen Fan, Tianhao Wu, Yihan Xi, Seoyoung Lee, Georgios Pavlakos, Zhangyang Wang

    Abstract: Nuanced expressiveness, particularly through fine-grained hand and facial expressions, is pivotal for enhancing the realism and vitality of digital human representations. In this work, we focus on investigating the expressiveness of human avatars when learned from monocular RGB video; a setting that introduces new challenges in capturing and animating fine-grained details. To this end, we introduc…

    Submitted 3 July, 2024; originally announced July 2024.

  15. arXiv:2407.01098  [pdf, other]

    math.NA

    Randomized linear solvers for computational architectures with straggling workers

    Authors: Vassilis Kalantzis, Yuanzhe Xi, Lior Horesh, Yousef Saad

    Abstract: In this paper, we consider the iterative solution of sparse systems of linear algebraic equations under the condition that sparse matrix-vector products with the coefficient matrix are computed only partially. At the same time, non-computed entries are set to zeros. We assume that both the number of computed entries and their associated row index set are random variables, with the row index set sa…

    Submitted 1 July, 2024; originally announced July 2024.

    MSC Class: 65F08; 65F10; 68M15; 68Q87

  16. arXiv:2407.00676  [pdf, other]

    cs.CV

    Instruct-IPT: All-in-One Image Processing Transformer via Weight Modulation

    Authors: Yuchuan Tian, Jianhong Han, Hanting Chen, Yuanyuan Xi, Guoyang Zhang, Jie Hu, Chao Xu, Yunhe Wang

    Abstract: Due to the unaffordable size and intensive computation costs of low-level vision models, All-in-One models that are designed to address a handful of low-level vision tasks simultaneously have been popular. However, existing All-in-One models are limited in terms of the range of tasks and performance. To overcome these limitations, we propose Instruct-IPT -- an All-in-One Image Processing Transform…

    Submitted 30 June, 2024; originally announced July 2024.

    Comments: 15 pages, 4 figures

  17. arXiv:2406.12447  [pdf, other]

    eess.AS

    Text-aware Speech Separation for Multi-talker Keyword Spotting

    Authors: Haoyu Li, Baochen Yang, Yu Xi, Linfeng Yu, Tian Tan, Hao Li, Kai Yu

    Abstract: For noisy environments, ensuring the robustness of keyword spotting (KWS) systems is essential. While much research has focused on noisy KWS, less attention has been paid to multi-talker mixed speech scenarios. Unlike the usual cocktail party problem where multi-talker speech is separated using speaker clues, the key challenge here is to extract the target speech for KWS based on text clues. To ad…

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepted by INTERSPEECH2024

  18. arXiv:2406.11683  [pdf, other]

    cs.CL

    HoLLMwood: Unleashing the Creativity of Large Language Models in Screenwriting via Role Playing

    Authors: Jing Chen, Xinyu Zhu, Cheng Yang, Chufan Shi, Yadong Xi, Yuxiang Zhang, Junjie Wang, Jiashu Pu, Rongsheng Zhang, Yujiu Yang, Tian Feng

    Abstract: Generative AI has demonstrated unprecedented creativity in the field of computer vision, yet such phenomena have not been observed in natural language processing. In particular, large language models (LLMs) can hardly produce written works at the level of human experts due to the extremely high complexity of literature writing. In this paper, we present HoLLMwood, an automated framework for unleas…

    Submitted 17 June, 2024; originally announced June 2024.

  19. arXiv:2406.11282  [pdf, other]

    cs.CV cs.AI

    From Pixels to Progress: Generating Road Network from Satellite Imagery for Socioeconomic Insights in Impoverished Areas

    Authors: Yanxin Xi, Yu Liu, Zhicheng Liu, Sasu Tarkoma, Pan Hui, Yong Li

    Abstract: The Sustainable Development Goals (SDGs) aim to resolve societal challenges, such as eradicating poverty and improving the lives of vulnerable populations in impoverished areas. Those areas rely on road infrastructure construction to promote accessibility and economic development. Although publicly available data like OpenStreetMap is available to monitor road status, data completeness in impoveri…

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: 12 pages, 13 figures, IJCAI2024 (AI and Social Good)

  20. arXiv:2406.00011  [pdf, other]

    cs.IR cs.AI

    DisCo: Towards Harmonious Disentanglement and Collaboration between Tabular and Semantic Space for Recommendation

    Authors: Kounianhua Du, Jizheng Chen, Jianghao Lin, Yunjia Xi, Hangyu Wang, Xinyi Dai, Bo Chen, Ruiming Tang, Weinan Zhang

    Abstract: Recommender systems play important roles in various applications such as e-commerce, social media, etc. Conventional recommendation methods usually model the collaborative signals within the tabular representation space. Despite the personalization modeling and the efficiency, the latent semantic dependencies are omitted. Methods that introduce semantics into recommendation then emerge, injecting…

    Submitted 4 June, 2024; v1 submitted 20 May, 2024; originally announced June 2024.

  21. arXiv:2405.17792  [pdf, other]

    hep-ex hep-ph

    JUNO Sensitivity to Invisible Decay Modes of Neutrons

    Authors: JUNO Collaboration, Angel Abusleme, Thomas Adam, Kai Adamowicz, Shakeel Ahmad, Rizwan Ahmed, Sebastiano Aiello, Fengpeng An, Qi An, Giuseppe Andronico, Nikolay Anfimov, Vito Antonelli, Tatiana Antoshkina, João Pedro Athayde Marcondes de André, Didier Auguste, Weidong Bai, Nikita Balashov, Wander Baldini, Andrea Barresi, Davide Basilico, Eric Baussan, Marco Bellato, Marco Beretta, Antonio Bergnoli, Daniel Bick, et al. (635 additional authors not shown)

    Abstract: We explore the bound neutrons decay into invisible particles (e.g., $n\rightarrow 3 ν$ or $nn \rightarrow 2 ν$) in the JUNO liquid scintillator detector. The invisible decay includes two decay modes: $ n \rightarrow { inv} $ and $ nn \rightarrow { inv} $. The invisible decays of $s$-shell neutrons in $^{12}{\rm C}$ will leave a highly excited residual nucleus. Subsequently, some de-excitation mode…

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 28 pages, 7 figures, 4 tables

  22. arXiv:2405.17211  [pdf, other]

    cs.LG math.NA physics.flu-dyn

    Spectral-Refiner: Fine-Tuning of Accurate Spatiotemporal Neural Operator for Turbulent Flows

    Authors: Shuhao Cao, Francesco Brarda, Ruipeng Li, Yuanzhe Xi

    Abstract: Recent advancements in operator-type neural networks have shown promising results in approximating the solutions of spatiotemporal Partial Differential Equations (PDEs). However, these neural networks often entail considerable training expenses, and may not always achieve the desired accuracy required in many scientific and engineering disciplines. In this paper, we propose a new Spatiotemporal Fo…

    Submitted 27 May, 2024; originally announced May 2024.

    MSC Class: 65M70 (Primary); 35Q30; 76M22; 65M50; 68T07 (Secondary)

  23. arXiv:2405.16361  [pdf, other]

    cs.LG cs.CR cs.CY

    LDPKiT: Recovering Utility in LDP Schemes by Training with Noise^2

    Authors: Kexin Li, Yang Xi, Aastha Mehta, David Lie

    Abstract: The adoption of large cloud-based models for inference has been hampered by concerns about the privacy leakage of end-user data. One method to mitigate this leakage is to add local differentially private noise to queries before sending them to the cloud, but this degrades utility as a side effect. Our key insight is that knowledge available in the noisy labels returned from performing inference on…

    Submitted 25 May, 2024; originally announced May 2024.

  24. arXiv:2405.13785  [pdf, other]

    cs.LG cs.AI math.PR stat.ML

    Efficient Two-Stage Gaussian Process Regression Via Automatic Kernel Search and Subsampling

    Authors: Shifan Zhao, Jiaying Lu, Ji Yang, Edmond Chow, Yuanzhe Xi

    Abstract: Gaussian Process Regression (GPR) is widely used in statistics and machine learning for prediction tasks requiring uncertainty measures. Its efficacy depends on the appropriate specification of the mean function, covariance kernel function, and associated hyperparameters. Severe misspecifications can lead to inaccurate results and problematic consequences, especially in safety-critical application…

    Submitted 22 May, 2024; originally announced May 2024.

    ACM Class: G.3; J.3

  25. arXiv:2404.10614  [pdf, other]

    cond-mat.soft nlin.AO

    Emergent intelligence of buckling-driven elasto-active structures

    Authors: Yuchen Xi, Trevor J. Jones, Richard Huang, Tom Marzin, P. -T. Brun

    Abstract: Active systems of self-propelled agents, e.g., birds, fish, and bacteria, can organize their collective motion into myriad autonomous behaviors. Ubiquitous in nature and across length scales, such phenomena are also amenable to artificial settings, e.g., where brainless self-propelled robots orchestrate their movements into spatio-temporal patterns via the application of external cues or when con…

    Submitted 16 April, 2024; originally announced April 2024.

  26. arXiv:2404.09300  [pdf, other]

    math.NA

    Analysis of a finite element DtN method for scattering resonances of sound hard obstacles

    Authors: Yingxia Xi, Bo Gong, Jiguang Sun

    Abstract: Scattering resonances have important applications in many areas of science and engineering. They are the replacement of discrete spectral data for problems on non-compact domains. In this paper, we consider the computation of scattering resonances defined on the exterior to a compact sound hard obstacle. The resonances are the eigenvalues of a holomorphic Fredholm operator function. We truncate th…

    Submitted 14 April, 2024; originally announced April 2024.

  27. arXiv:2404.09000  [pdf, other]

    eess.IV cs.CV cs.LG

    MaSkel: A Model for Human Whole-body X-rays Generation from Human Masking Images

    Authors: Yingjie Xi, Boyuan Cheng, Jingyao Cai, Jian Jun Zhang, Xiaosong Yang

    Abstract: The human whole-body X-rays could offer a valuable reference for various applications, including medical diagnostics, digital animation modeling, and ergonomic design. The traditional method of obtaining X-ray information requires the use of CT (Computed Tomography) scan machines, which emit potentially harmful radiation. Thus it faces a significant limitation for realistic applications because it…

    Submitted 13 April, 2024; originally announced April 2024.

  28. arXiv:2403.16378  [pdf, other]

    cs.IR

    Play to Your Strengths: Collaborative Intelligence of Conventional Recommender Models and Large Language Models

    Authors: Yunjia Xi, Weiwen Liu, Jianghao Lin, Chuhan Wu, Bo Chen, Ruiming Tang, Weinan Zhang, Yong Yu

    Abstract: The rise of large language models (LLMs) has opened new opportunities in Recommender Systems (RSs) by enhancing user behavior modeling and content understanding. However, current approaches that integrate LLMs into RSs solely utilize either LLM or conventional recommender model (CRM) to generate final recommendations, without considering which data segments LLM or CRM excel in. To fill in this gap…

    Submitted 24 March, 2024; originally announced March 2024.

  29. arXiv:2403.16361  [pdf, other]

    eess.IV cs.CV

    RSTAR: Rotational Streak Artifact Reduction in 4D CBCT using Separable and Circular Convolutions

    Authors: Ziheng Deng, Hua Chen, Haibo Hu, Zhiyong Xu, Jiayuan Sun, Tianling Lyu, Yan Xi, Yang Chen, Jun Zhao

    Abstract: Four-dimensional cone-beam computed tomography (4D CBCT) provides respiration-resolved images and can be used for image-guided radiation therapy. However, the ability to reveal respiratory motion comes at the cost of image artifacts. As raw projection data are sorted into multiple respiratory phases, the cone-beam projections become much sparser and the reconstructed 4D CBCT images will be covered…

    Submitted 22 August, 2024; v1 submitted 24 March, 2024; originally announced March 2024.

  30. arXiv:2403.14961  [pdf, ps, other]

    math.NA

    Anderson Acceleration with Truncated Gram-Schmidt

    Authors: Ziyuan Tang, Tianshi Xu, Huan He, Yousef Saad, Yuanzhe Xi

    Abstract: Anderson Acceleration (AA) is a popular algorithm designed to enhance the convergence of fixed-point iterations. In this paper, we introduce a variant of AA based on a Truncated Gram-Schmidt process (AATGS) which has a few advantages over the classical AA. In particular, an attractive feature of AATGS is that its iterates obey a three-term recurrence in the situation when it is applied to solving…

    Submitted 16 July, 2024; v1 submitted 22 March, 2024; originally announced March 2024.

    MSC Class: 65F10; 68W25; 65B99; 65N22

  31. arXiv:2403.13332  [pdf, other]

    eess.AS cs.SD

    TDT-KWS: Fast And Accurate Keyword Spotting Using Token-and-duration Transducer

    Authors: Yu Xi, Hao Li, Baochen Yang, Haoyu Li, Hainan Xu, Kai Yu

    Abstract: Designing an efficient keyword spotting (KWS) system that delivers exceptional performance on resource-constrained edge devices has long been a subject of significant attention. Existing KWS search algorithms typically follow a frame-synchronous approach, where search decisions are made repeatedly at each frame despite the fact that most frames are keyword-irrelevant. In this paper, we propose TDT…

    Submitted 20 March, 2024; originally announced March 2024.

    Comments: Accepted by ICASSP2024

  32. arXiv:2403.10245  [pdf, other]

    cs.CV

    CoLeCLIP: Open-Domain Continual Learning via Joint Task Prompt and Vocabulary Learning

    Authors: Yukun Li, Guansong Pang, Wei Suo, Chenchen Jing, Yuling Xi, Lingqiao Liu, Hao Chen, Guoqiang Liang, Peng Wang

    Abstract: This paper explores the problem of continual learning (CL) of vision-language models (VLMs) in open domains, where the models need to perform continual updating and inference on a streaming of datasets from diverse seen and unseen domains with novel classes. Such a capability is crucial for various applications in open environments, e.g., AI assistants, autonomous driving systems, and robotics. Cu…

    Submitted 15 March, 2024; originally announced March 2024.

  33. arXiv:2403.04514  [pdf, other]

    math.NA

    A finite element contour integral method for computing the resonances of metallic grating structures with subwavelength holes

    Authors: Yingxia Xi, Junshan Lin, Jiguang Sun

    Abstract: We consider the numerical computation of resonances for metallic grating structures with dispersive media and small slit holes. The underlying eigenvalue problem is nonlinear and the mathematical model is multiscale due to the existence of several length scales in problem geometry and material contrast. We discretize the partial differential equation model over the truncated domain using the finit…

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: 21 pages and 27 figures

    MSC Class: 35B34; 68U01; 68W25

  34. arXiv:2402.03302  [pdf, other]

    eess.IV cs.CV cs.LG

    Swin-UMamba: Mamba-based UNet with ImageNet-based pretraining

    Authors: Jiarun Liu, Hao Yang, Hong-Yu Zhou, Yan Xi, Lequan Yu, Yizhou Yu, Yong Liang, Guangming Shi, Shaoting Zhang, Hairong Zheng, Shanshan Wang

    Abstract: Accurate medical image segmentation demands the integration of multi-scale information, spanning from local features to global dependencies. However, it is challenging for existing methods to model long-range global information, where convolutional neural networks (CNNs) are constrained by their local receptive fields, and vision transformers (ViTs) suffer from high quadratic complexity of their a…

    Submitted 6 March, 2024; v1 submitted 5 February, 2024; originally announced February 2024.

    Comments: Code and models of Swin-UMamba are publicly available at: https://github.com/JiarunLiu/Swin-UMamba

  35. arXiv:2401.16150  [pdf]

    cond-mat.mes-hall

    Sliding ferroelectric memories and synapses

    Authors: Xiuzhen Li, Biao Qin, Yaxian Wang, Yue Xi, Zhiheng Huang, Mengze Zhao, Yalin Peng, Zitao Chen, Zitian Pan, Jundong Zhu, Chenyang Cui, Rong Yang, Wei Yang, Sheng Meng, Dongxia Shi, Xuedong Bai, Can Liu, Na Li, Jianshi Tang, Kaihui Liu, Luojun Du, Guangyu Zhang

    Abstract: Ferroelectric materials with switchable electric polarization hold great promise for a plethora of emergent applications, such as post-Moore's law nanoelectronics, beyond-Boltzmann transistors, non-volatile memories, and above-bandgap photovoltaic devices. Recent advances have uncovered an exotic sliding ferroelectric mechanism, which endows to design atomically thin ferroelectrics from non-ferroe…

    Submitted 29 January, 2024; originally announced January 2024.

    Comments: 16 pages, 4 figures

  36. arXiv:2401.06485  [pdf, other]

    eess.AS cs.SD

    Contrastive Learning With Audio Discrimination For Customizable Keyword Spotting In Continuous Speech

    Authors: Yu Xi, Baochen Yang, Hao Li, Jiaqi Guo, Kai Yu

    Abstract: Customizable keyword spotting (KWS) in continuous speech has attracted increasing attention due to its real-world application potential. While contrastive learning (CL) has been widely used to extract keyword representations, previous CL approaches all operate on pre-segmented isolated words and employ only audio-text representations matching strategy. However, for KWS in continuous speech, co-art…

    Submitted 12 January, 2024; originally announced January 2024.

    Comments: Accepted by ICASSP2024

  37. arXiv:2401.06022  [pdf, other]

    math.CO math.SP

    Cospectral vertices, walk-regular planar graphs and the echolocation problem

    Authors: Shi-Lei Kong, Emmett L. Wyman, Yakun Xi

    Abstract: We study cospectral vertices on finite graphs in relation to the echolocation problem on Riemannian manifolds. First, we prove a computationally simple criterion to determine whether two vertices are cospectral. Then, we use this criterion in conjunction with a computer search to find minimal examples of various types of graphs on which cospectral but non-similar vertices exist, including minimal…

    Submitted 18 July, 2024; v1 submitted 11 January, 2024; originally announced January 2024.

    Comments: 25 pages, 23 figures. Fixed a gap in the argument, added a toroidal graph example

  38. arXiv:2312.13117  [pdf, ps, other]

    math.NA

    Parallel Multi-Step Contour Integral Methods for Nonlinear Eigenvalue Problems

    Authors: Yingxia Xi, Jiguang Sun

    Abstract: We consider nonlinear eigenvalue problems to compute all eigenvalues in a bounded region on the complex plane. Based on domain decomposition and contour integrals, two robust and scalable parallel multi-step methods are proposed. The first method 1) uses the spectral indicator method to find eigenvalues and 2) calls a linear eigensolver to compute the associated eigenvectors. The second method 1)…

    Submitted 17 January, 2024; v1 submitted 20 December, 2023; originally announced December 2023.

    MSC Class: 15A18; 35P30; 65N25

  39. Devil in the Landscapes: Inferring Epidemic Exposure Risks from Street View Imagery

    Authors: Zhenyu Han, Yanxin Xi, Tong Xia, Yu Liu, Yong Li

    Abstract: Built environment supports all the daily activities and shapes our health. Leveraging informative street view imagery, previous research has established the profound correlation between the built environment and chronic, non-communicable diseases; however, predicting the exposure risk of infectious diseases remains largely unexplored. The person-to-person contacts and interactions contribute to th…

    Submitted 8 November, 2023; originally announced November 2023.

    Comments: Published in ACM SIGSPATIAL 2023

  40. arXiv:2311.05816  [pdf, other]

    physics.med-ph

    Full-length-body CBCT imaging in upright position with robotic-arm system: a simulation study

    Authors: Tong Lin, Tianling Lyu, Zhan Wu, Yan Xi, Wentao Zhu, Yang Chen

    Abstract: Upright position CT scans make it possible for full-length-body imaging at conditions more relevant to daily situations, but the substantial weight of the upright CT scanners increases the risks to floor's stability and patients' safety. Robotic-arm CBCT systems are supposed to be a better solution for this task, but such systems still face challenges including long scanning time and low reconstruc…

    Submitted 9 November, 2023; originally announced November 2023.

    Comments: Submitted to ISBI'24

  41. arXiv:2310.09234  [pdf, other]

    cs.IR cs.AI

    ClickPrompt: CTR Models are Strong Prompt Generators for Adapting Language Models to CTR Prediction

    Authors: Jianghao Lin, Bo Chen, Hangyu Wang, Yunjia Xi, Yanru Qu, Xinyi Dai, Kangning Zhang, Ruiming Tang, Yong Yu, Weinan Zhang

    Abstract: Click-through rate (CTR) prediction has become increasingly indispensable for various Internet applications. Traditional CTR models convert the multi-field categorical data into ID features via one-hot encoding, and extract the collaborative signals among features. Such a paradigm suffers from the problem of semantic information loss. Another line of research explores the potential of pretrained l…

    Submitted 26 June, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Accepted by WWW 2024

  42. arXiv:2309.15019  [pdf, other]

    cs.CV

    IFT: Image Fusion Transformer for Ghost-free High Dynamic Range Imaging

    Authors: Hailing Wang, Wei Li, Yuanyuan Xi, Jie Hu, Hanting Chen, Longyu Li, Yunhe Wang

    Abstract: Multi-frame high dynamic range (HDR) imaging aims to reconstruct ghost-free images with photo-realistic details from content-complementary but spatially misaligned low dynamic range (LDR) images. Existing HDR algorithms are prone to producing ghosting artifacts as their methods fail to capture long-range dependencies between LDR frames with large motion in dynamic scenes. To address this issue, we…

    Submitted 8 October, 2023; v1 submitted 26 September, 2023; originally announced September 2023.

  43. arXiv:2309.07925  [pdf, other]

    eess.AS cs.AI cs.MM cs.SD

    Hierarchical Audio-Visual Information Fusion with Multi-label Joint Decoding for MER 2023

    Authors: Haotian Wang, Yuxuan Xi, Hang Chen, Jun Du, Yan Song, Qing Wang, Hengshun Zhou, Chenxi Wang, Jiefeng Ma, Pengfei Hu, Ya Jiang, Shi Cheng, Jie Zhang, Yuzhe Weng

    Abstract: In this paper, we propose a novel framework for recognizing both discrete and dimensional emotions. In our framework, deep features extracted from foundation models are used as robust acoustic and visual representations of raw video. Three different structures based on attention-guided feature gathering (AFG) are designed for deep feature fusion. Then, we introduce a joint decoding structure for e…

    Submitted 10 September, 2023; originally announced September 2023.

    Comments: 5 pages, 4 figures

    Journal ref: The 31st ACM International Conference on Multimedia (MM'23), 2023

  44. arXiv:2309.07109  [pdf, ps, other]

    hep-ex astro-ph.HE hep-ph

    Real-time Monitoring for the Next Core-Collapse Supernova in JUNO

    Authors: Angel Abusleme, Thomas Adam, Shakeel Ahmad, Rizwan Ahmed, Sebastiano Aiello, Muhammad Akram, Abid Aleem, Fengpeng An, Qi An, Giuseppe Andronico, Nikolay Anfimov, Vito Antonelli, Tatiana Antoshkina, Burin Asavapibhop, João Pedro Athayde Marcondes de André, Didier Auguste, Weidong Bai, Nikita Balashov, Wander Baldini, Andrea Barresi, Davide Basilico, Eric Baussan, Marco Bellato, Marco Beretta, Antonio Bergnoli , et al. (606 additional authors not shown)

    Abstract: The core-collapse supernova (CCSN) is considered one of the most energetic astrophysical events in the universe. The early and prompt detection of neutrinos before (pre-SN) and during the supernova (SN) burst presents a unique opportunity for multi-messenger observations of CCSN events. In this study, we describe the monitoring concept and present the sensitivity of the system to pre-SN and SN neu…

    Submitted 4 December, 2023; v1 submitted 13 September, 2023; originally announced September 2023.

    Comments: 24 pages, 9 figures, accepted for publication in JCAP

  45. arXiv:2309.01204  [pdf, ps, other]

    math.CA math.AP

    Falconer distance problem on Riemannian manifolds

    Authors: Changbiao Jian, Bochen Liu, Yakun Xi

    Abstract: We prove that on a $d$-dimensional Riemannian manifold, the distance set of a Borel set $E$ has a positive Lebesgue measure if $\dim_{\mathcal H} E>\frac d2+\frac14+\frac{3}{8d+4}.$ Moreover, on a Riemannian manifold with constant sectional curvature, we show that the distance set of $E$ has a positive Lebesgue measure if $\dim_{\mathcal{H}}(E)>\frac d2+\frac14+\frac{1-(-1)^d}{8d}.$

    Submitted 22 January, 2024; v1 submitted 3 September, 2023; originally announced September 2023.

    Comments: 22 pages. Added results for the general metric case, and for the odd-dimensional constant curvature case

  46. arXiv:2308.12831  [pdf, other]

    cs.CV

    EFormer: Enhanced Transformer towards Semantic-Contour Features of Foreground for Portraits Matting

    Authors: Zitao Wang, Qiguang Miao, Peipei Zhao, Yue Xi

    Abstract: The portrait matting task aims to extract an alpha matte with complete semantics and finely-detailed contours. In comparison to CNN-based approaches, transformers with self-attention module have a better capacity to capture long-range dependencies and low-frequency semantic information of a portrait. However, the recent research shows that self-attention mechanism struggles with modeling high-freq…

    Submitted 30 November, 2023; v1 submitted 24 August, 2023; originally announced August 2023.

    Comments: 10 pages, 5 figures

  47. arXiv:2308.04952  [pdf, other]

    cs.CV cs.AI

    Prototypical Kernel Learning and Open-set Foreground Perception for Generalized Few-shot Semantic Segmentation

    Authors: Kai Huang, Feigege Wang, Ye Xi, Yutao Gao

    Abstract: Generalized Few-shot Semantic Segmentation (GFSS) extends Few-shot Semantic Segmentation (FSS) to simultaneously segment unseen classes and seen classes during evaluation. Previous works leverage additional branch or prototypical aggregation to eliminate the constrained setting of FSS. However, representation division and embedding prejudice, which heavily results in poor performance of GFSS, have…

    Submitted 18 August, 2023; v1 submitted 9 August, 2023; originally announced August 2023.

    Comments: Accepted by ICCV2023

  48. arXiv:2308.00465  [pdf, other]

    cs.CV cs.AI

    A Satellite Imagery Dataset for Long-Term Sustainable Development in United States Cities

    Authors: Yanxin Xi, Yu Liu, Tong Li, Jintao Ding, Yunke Zhang, Sasu Tarkoma, Yong Li, Pan Hui

    Abstract: Cities play an important role in achieving sustainable development goals (SDGs) to promote economic growth and meet social needs. Especially satellite imagery is a potential data source for studying sustainable urban development. However, a comprehensive dataset in the United States (U.S.) covering multiple cities, multiple years, multiple scales, and multiple indicators for SDG monitoring is lack…

    Submitted 1 August, 2023; originally announced August 2023.

    Comments: 20 pages, 5 figures

  49. arXiv:2307.07695  [pdf, other]

    math.NA cs.LG math.AP

    Reducing operator complexity in Algebraic Multigrid with Machine Learning Approaches

    Authors: Ru Huang, Kai Chang, Huan He, Ruipeng Li, Yuanzhe Xi

    Abstract: We propose a data-driven and machine-learning-based approach to compute non-Galerkin coarse-grid operators in algebraic multigrid (AMG) methods, addressing the well-known issue of increasing operator complexity. Guided by the AMG theory on spectrally equivalent coarse-grid operators, we have developed novel ML algorithms that utilize neural networks (NNs) combined with smooth test vectors from mul…

    Submitted 14 July, 2023; originally announced July 2023.

    Comments: Sparse Operator, Attention, PDE

  50. arXiv:2307.06224  [pdf, other]

    math.AP math.CA math.DG math.SP

    Surfaces in which every point sounds the same

    Authors: Feng Wang, Emmett L. Wyman, Yakun Xi

    Abstract: We address a maximally structured case of the question, "Can you hear your location on a manifold," posed in arXiv:2304.04659 for dimension $2$. In short, we show that if a compact surface without boundary sounds the same at every point, then the surface has a transitive action by the isometry group. In the process, we show that you can hear your location on Klein bottles and that you can hear the…

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: 9 pages, 1 figure