
Showing 1–50 of 1,229 results for author: Lin, W

  1. arXiv:2408.16226  [pdf, other]

    physics.optics

    On-Chip Optical Skyrmionic Beam Generators

    Authors: Wenbo Lin, Yasutomo Ota, Yasuhiko Arakawa, Satoshi Iwamoto

    Abstract: Optical skyrmion beams, which encompass two-dimensional topology in their spatial structures, are promising for ultra-dense optical communications and advanced matter manipulation. Generating such light beams via a chip-based approach will vastly broaden their applications and promote the advancement of untapped fundamental science. Here, we present a breakthrough in chip-based technology by exper…

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: 8 pages, 4 figures

  2. arXiv:2408.15861  [pdf, other]

    cs.CR cs.LG

    Fusing Pruned and Backdoored Models: Optimal Transport-based Data-free Backdoor Mitigation

    Authors: Weilin Lin, Li Liu, Jianze Li, Hui Xiong

    Abstract: Backdoor attacks present a serious security threat to deep neuron networks (DNNs). Although numerous effective defense techniques have been proposed in recent years, they inevitably rely on the availability of either clean or poisoned data. In contrast, data-free defense techniques have evolved slowly and still lag significantly in performance. To address this issue, different from the traditional…

    Submitted 28 August, 2024; originally announced August 2024.

  3. arXiv:2408.15252  [pdf, other]

    eess.SP cs.AI

    Generative AI on SpectrumNet: An Open Benchmark of Multiband 3D Radio Maps

    Authors: Shuhang Zhang, Shuai Jiang, Wanjie Lin, Zheng Fang, Kangjun Liu, Hongliang Zhang, Ke Chen

    Abstract: Radio map is an efficient demonstration for visually displaying the wireless signal coverage within a certain region. It has been considered to be increasingly helpful for the future sixth generation (6G) of wireless networks, as wireless nodes are becoming more crowded and complicated. However, the construction of high resolution radio map is very challenging due to the sparse sampling in practic…

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: 30 pages, 15 figures

  4. arXiv:2408.14968  [pdf, other]

    cs.IR cs.CL

    MRSE: An Efficient Multi-modality Retrieval System for Large Scale E-commerce

    Authors: Hao Jiang, Haoxiang Zhang, Qingshan Hou, Chaofeng Chen, Weisi Lin, Jingchang Zhang, Annan Wang

    Abstract: Providing high-quality item recall for text queries is crucial in large-scale e-commerce search systems. Current Embedding-based Retrieval Systems (ERS) embed queries and items into a shared low-dimensional space, but uni-modality ERS rely too heavily on textual features, making them unreliable in complex contexts. While multi-modality ERS incorporate various data sources, they often overlook indi…

    Submitted 27 August, 2024; originally announced August 2024.

  5. arXiv:2408.14180  [pdf, other]

    cs.CV cs.AI

    I2EBench: A Comprehensive Benchmark for Instruction-based Image Editing

    Authors: Yiwei Ma, Jiayi Ji, Ke Ye, Weihuang Lin, Zhibin Wang, Yonghan Zheng, Qiang Zhou, Xiaoshuai Sun, Rongrong Ji

    Abstract: Significant progress has been made in the field of Instruction-based Image Editing (IIE). However, evaluating these models poses a significant challenge. A crucial requirement in this field is the establishment of a comprehensive evaluation benchmark for accurately assessing editing results and providing valuable insights for its further development. In response to this need, we propose I2EBench,…

    Submitted 26 August, 2024; originally announced August 2024.

    Comments: Tech report, 39 pages, 41 figures

  6. arXiv:2408.13380  [pdf, other]

    physics.ins-det nucl-ex

    The MUSE Beamline Calorimeter

    Authors: W. Lin, T. Rostomyan, R. Gilman, S. Strauch, C. Meier, C. Nestler, M. Ali, H. Atac, J. C. Bernauer, W. J. Briscoe, A. Christopher Ndukwe, E. W. Cline, K. Deiters, S. Dogra, E. J. Downie, Z. Duan, I. P. Fernando, A. Flannery, D. Ghosal, A. Golossanov, J. Guo, N. S. Ifat, Y. Ilieva, M. Kohl, I. Lavrukhin, et al. (18 additional authors not shown)

    Abstract: The MUon Scattering Experiment (MUSE) was motivated by the proton radius puzzle arising from the discrepancy between muonic hydrogen spectroscopy and electron-proton measurements. The MUSE physics goals also include testing lepton universality, precisely measuring two-photon exchange contribution, and testing radiative corrections. MUSE addresses these physics goals through simultaneous measuremen…

    Submitted 23 August, 2024; originally announced August 2024.

  7. arXiv:2408.12867  [pdf, other]

    cs.CV

    Semantic Alignment for Multimodal Large Language Models

    Authors: Tao Wu, Mengze Li, Jingyuan Chen, Wei Ji, Wang Lin, Jinyang Gao, Kun Kuang, Zhou Zhao, Fei Wu

    Abstract: Research on Multi-modal Large Language Models (MLLMs) towards the multi-image cross-modal instruction has received increasing attention and made significant progress, particularly in scenarios involving closely resembling images (e.g., change captioning). Existing MLLMs typically follow a two-step process in their pipelines: first, extracting visual tokens independently for each input image, and t…

    Submitted 23 August, 2024; originally announced August 2024.

    Comments: Accepted by MM 2024

  8. arXiv:2408.12104  [pdf, other]

    astro-ph.SR

    Minute-Cadence Observations of the LAMOST Fields with the TMTS: IV -- Catalog of Cataclysmic Variables from the First 3-yr Survey

    Authors: Qichun Liu, Jie Lin, Xiaofeng Wang, Zhibin Dai, Yongkang Sun, Gaobo Xi, Jun Mo, Jialian Liu, Shengyu Yan, Alexei V. Filippenko, Thomas G. Brink, Yi Yang, Kishore C. Patra, Yongzhi Cai, Zhihao Chen, Liyang Chen, Fangzhou Guo, Xiaojun Jiang, Gaici Li, Wenxiong Li, Weili Lin, Cheng Miao, Xiaoran Ma, Haowei Peng, Qiqi Xia, et al. (2 additional authors not shown)

    Abstract: The Tsinghua University--Ma Huateng Telescopes for Survey (TMTS) started to monitor the LAMOST plates in 2020, leading to the discovery of numerous short-period eclipsing binaries, peculiar pulsators, flare stars, and other variable objects. Here, we present the uninterrupted light curves for a sample of 64 cataclysmic variables (CVs) observed/discovered using the TMTS during its first three-year…

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: 27 pages, 12 figures in main text, accepted for the publication in Universe

  9. LARR: Large Language Model Aided Real-time Scene Recommendation with Semantic Understanding

    Authors: Zhizhong Wan, Bin Yin, Junjie Xie, Fei Jiang, Xiang Li, Wei Lin

    Abstract: Click-Through Rate (CTR) prediction is crucial for Recommendation System(RS), aiming to provide personalized recommendation services for users in many aspects such as food delivery, e-commerce and so on. However, traditional RS relies on collaborative signals, which lacks semantic understanding to real-time scenes. We also noticed that a major challenge in utilizing Large Language Models (LLMs) fo…

    Submitted 21 August, 2024; originally announced August 2024.

  10. arXiv:2408.11393  [pdf, other]

    cs.CL cs.LG

    First Activations Matter: Training-Free Methods for Dynamic Activation in Large Language Models

    Authors: Chi Ma, Mincong Huang, Ying Zhang, Chao Wang, Yujie Wang, Lei Yu, Chuan Liu, Wei Lin

    Abstract: Dynamic activation (DA) techniques, such as DejaVu and MoEfication, have demonstrated their potential to significantly enhance the inference efficiency of large language models (LLMs). However, these techniques often rely on ReLU activation functions or require additional parameters and training to maintain performance. This paper introduces a training-free Threshold-based Dynamic Activation(TDA)…

    Submitted 21 August, 2024; originally announced August 2024.

  11. arXiv:2408.11003  [pdf, other]

    stat.ME

    DEEPEAST technique to enhance power in two-sample tests via the same-attraction function

    Authors: Yiting Chen, Min Gao, Wei Lin, Andrew Jirasek, Kirsty Milligan, Xiaoping Shi

    Abstract: Data depth has emerged as an invaluable nonparametric measure for the ranking of multivariate samples. The main contribution of depth-based two-sample comparisons is the introduction of the Q statistic (Liu and Singh, 1993), a quality index. Unlike traditional methods, data depth does not require the assumption of normal distributions and adheres to four fundamental properties. Many existing two-s…

    Submitted 20 August, 2024; originally announced August 2024.

  12. arXiv:2408.08926  [pdf, other]

    cs.CR cs.AI cs.CL cs.CY cs.LG

    Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risk of Language Models

    Authors: Andy K. Zhang, Neil Perry, Riya Dulepet, Eliot Jones, Justin W. Lin, Joey Ji, Celeste Menders, Gashon Hussein, Samantha Liu, Donovan Jasper, Pura Peetathawatchai, Ari Glenn, Vikram Sivashankar, Daniel Zamoshchin, Leo Glikbarg, Derek Askaryar, Mike Yang, Teddy Zhang, Rishi Alluri, Nathan Tran, Rinnara Sangpisit, Polycarpos Yiorkadjis, Kenny Osele, Gautham Raghupathi, Dan Boneh, et al. (2 additional authors not shown)

    Abstract: Language Model (LM) agents for cybersecurity that are capable of autonomously identifying vulnerabilities and executing exploits have the potential to cause real-world impact. Policymakers, model providers, and other researchers in the AI and cybersecurity communities are interested in quantifying the capabilities of such agents to help mitigate cyberrisk and investigate opportunities for penetrat…

    Submitted 15 August, 2024; originally announced August 2024.

    Comments: 86 pages, 7 figures

  13. arXiv:2408.08586  [pdf, other]

    cs.DC

    Rubick: Exploiting Job Reconfigurability for Deep Learning Cluster Scheduling

    Authors: Xinyi Zhang, Hanyu Zhao, Wencong Xiao, Xianyan Jia, Fei Xu, Yong Li, Wei Lin, Fangming Liu

    Abstract: The era of large deep learning models has given rise to advanced training strategies such as 3D parallelism and the ZeRO series. These strategies enable various (re-)configurable execution plans for a training job, which exhibit remarkably different requirements of multiple resource types. Existing cluster scheduling systems, however, treat such reconfigurable training jobs as black boxes: they re…

    Submitted 16 August, 2024; originally announced August 2024.

  14. arXiv:2408.08315  [pdf, other]

    cs.CV cs.AI

    Segment Anything for Videos: A Systematic Survey

    Authors: Chunhui Zhang, Yawen Cui, Weilin Lin, Guanjie Huang, Yan Rong, Li Liu, Shiguang Shan

    Abstract: The recent wave of foundation models has witnessed tremendous success in computer vision (CV) and beyond, with the segment anything model (SAM) having sparked a passion for exploring task-agnostic visual foundation models. Empowered by its remarkable zero-shot generalization, SAM is currently challenging numerous traditional paradigms in CV, delivering extraordinary performance not only in various…

    Submitted 30 July, 2024; originally announced August 2024.

    Comments: https://github.com/983632847/SAM-for-Videos

  15. arXiv:2408.08044  [pdf, other]

    cs.CE

    Crystalline Material Discovery in the Era of Artificial Intelligence

    Authors: Zhenzhong Wang, Haowei Hua, Wanyu Lin, Ming Yang, Kay Chen Tan

    Abstract: Crystalline materials, with their symmetrical and periodic structures, possess a diverse array of properties and have been widely used in various fields, ranging from electronic devices to energy applications. To discover crystalline materials, traditional experimental and computational approaches are often time-consuming and expensive. In these years, thanks to the explosive amount of crystalline…

    Submitted 23 August, 2024; v1 submitted 15 August, 2024; originally announced August 2024.

  16. arXiv:2408.07299  [pdf, ps, other]

    physics.plasm-ph

    On the singularity of Lie-transform perturbation approach to the guiding-center problem

    Authors: W. H. Lin, J. Garcia, J. Q. Li

    Abstract: We present a novel scheme of carrying out the Lie-transform perturbation for the guiding-center motion, with an aim at addressing directly the problem of singularity which exists intrinsically in the determining equation for the generating vector, and which gives rise to the formidable gauge functions in the pure oscillating part of the Lie transformation. Whereas in most applications of Lie-trans…

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: 12 pages

  17. arXiv:2408.06969  [pdf, ps, other]

    cs.NI cs.LG

    IRS-Assisted Lossy Communications Under Correlated Rayleigh Fading: Outage Probability Analysis and Optimization

    Authors: Guanchang Li, Wensheng Lin, Lixin Li, Yixuan He, Fucheng Yang, Zhu Han

    Abstract: This paper focuses on an intelligent reflecting surface (IRS)-assisted lossy communication system with correlated Rayleigh fading. We analyze the correlated channel model and derive the outage probability of the system. Then, we design a deep reinforce learning (DRL) method to optimize the phase shift of IRS, in order to maximize the received signal power. Moreover, this paper presents results of…

    Submitted 13 August, 2024; originally announced August 2024.

  18. arXiv:2408.06608  [pdf, other]

    cs.AR cs.GR

    Potamoi: Accelerating Neural Rendering via a Unified Streaming Architecture

    Authors: Yu Feng, Weikai Lin, Zihan Liu, Jingwen Leng, Minyi Guo, Han Zhao, Xiaofeng Hou, Jieru Zhao, Yuhao Zhu

    Abstract: Neural Radiance Field (NeRF) has emerged as a promising alternative for photorealistic rendering. Despite recent algorithmic advancements, achieving real-time performance on today's resource-constrained devices remains challenging. In this paper, we identify the primary bottlenecks in current NeRF algorithms and introduce a unified algorithm-architecture co-design, Potamoi, designed to accommodate…

    Submitted 12 August, 2024; originally announced August 2024.

    Comments: arXiv admin note: substantial text overlap with arXiv:2404.11852

  19. arXiv:2408.05631  [pdf, other]

    cs.CV cs.AI

    PRTGaussian: Efficient Relighting Using 3D Gaussians with Precomputed Radiance Transfer

    Authors: Libo Zhang, Yuxuan Han, Wenbin Lin, Jingwang Ling, Feng Xu

    Abstract: We present PRTGaussian, a realtime relightable novel-view synthesis method made possible by combining 3D Gaussians and Precomputed Radiance Transfer (PRT). By fitting relightable Gaussians to multi-view OLAT data, our method enables real-time, free-viewpoint relighting. By estimating the radiance transfer based on high-order spherical harmonics, we achieve a balance between capturing detailed reli…

    Submitted 10 August, 2024; originally announced August 2024.

  20. arXiv:2408.05112  [pdf, other]

    cs.LG cs.AI eess.IV

    Semantic Successive Refinement: A Generative AI-aided Semantic Communication Framework

    Authors: Kexin Zhang, Lixin Li, Wensheng Lin, Yuna Yan, Rui Li, Wenchi Cheng, Zhu Han

    Abstract: Semantic Communication (SC) is an emerging technology aiming to surpass the Shannon limit. Traditional SC strategies often minimize signal distortion between the original and reconstructed data, neglecting perceptual quality, especially in low Signal-to-Noise Ratio (SNR) environments. To address this issue, we introduce a novel Generative AI Semantic Communication (GSC) system for single-user scen…

    Submitted 31 July, 2024; originally announced August 2024.

  21. arXiv:2408.05019  [pdf, other]

    cs.CV

    Instruction Tuning-free Visual Token Complement for Multimodal LLMs

    Authors: Dongsheng Wang, Jiequan Cui, Miaoge Li, Wang Lin, Bo Chen, Hanwang Zhang

    Abstract: As the open community of large language models (LLMs) matures, multimodal LLMs (MLLMs) have promised an elegant bridge between vision and language. However, current research is inherently constrained by challenges such as the need for high-quality instruction pairs and the loss of visual information in image-to-text training objectives. To this end, we propose a Visual Token Complement framework (…

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: Accepted by ECCV2024 (20pages)

  22. arXiv:2408.04947  [pdf, other]

    astro-ph.EP astro-ph.SR

    Revealing the Fate of Exoplanet Systems: Asteroseismic Identification of Host Star in the Red Clump or Red Giant Branch

    Authors: Wen-Xu Lin, Sheng-Bang Qian, Li-Ying Zhu

    Abstract: Determining the evolutionary stage of stars is crucial for understanding the evolution of exoplanetary systems. In this context, Red Giant Branch (RGB) and Red Clump (RC) stars, stages in the later evolution of stars situated before and after the helium flash, harbor critical clues to unveiling the evolution of planets. The first step in revealing these clues is to confirm the evolutionary stage o…

    Submitted 9 August, 2024; originally announced August 2024.

  23. arXiv:2408.04158  [pdf, other]

    eess.IV cs.CV

    Efficient Single Image Super-Resolution with Entropy Attention and Receptive Field Augmentation

    Authors: Xiaole Zhao, Linze Li, Chengxing Xie, Xiaoming Zhang, Ting Jiang, Wenjie Lin, Shuaicheng Liu, Tianrui Li

    Abstract: Transformer-based deep models for single image super-resolution (SISR) have greatly improved the performance of lightweight SISR tasks in recent years. However, they often suffer from heavy computational burden and slow inference due to the complex calculation of multi-head self-attention (MSA), seriously hindering their practical application and deployment. In this work, we present an efficient S…

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: Accepted to ACM MM 2024

  24. arXiv:2408.03790  [pdf, other]

    cs.CV

    Vision-Language Guidance for LiDAR-based Unsupervised 3D Object Detection

    Authors: Christian Fruhwirth-Reisinger, Wei Lin, Dušan Malić, Horst Bischof, Horst Possegger

    Abstract: Accurate 3D object detection in LiDAR point clouds is crucial for autonomous driving systems. To achieve state-of-the-art performance, the supervised training of detectors requires large amounts of human-annotated data, which is expensive to obtain and restricted to predefined object categories. To mitigate manual labeling efforts, recent unsupervised object detection approaches generate class-agn…

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: Accepted to BMVC 2024

  25. arXiv:2408.02657  [pdf, other]

    cs.CV

    Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining

    Authors: Dongyang Liu, Shitian Zhao, Le Zhuo, Weifeng Lin, Yu Qiao, Hongsheng Li, Peng Gao

    Abstract: We present Lumina-mGPT, a family of multimodal autoregressive models capable of various vision and language tasks, particularly excelling in generating flexible photorealistic images from text descriptions. Unlike existing autoregressive image generation approaches, Lumina-mGPT employs a pretrained decoder-only transformer as a unified framework for modeling multimodal token sequences. Our key ins…

    Submitted 5 August, 2024; originally announced August 2024.

    Comments: Code available at: https://github.com/Alpha-VLLM/Lumina-mGPT

  26. Mock Observations: Three Different Types of Galaxy Alignment in TNG100 Simulations

    Authors: Yanyao Lan, Lin Tang, Weipeng Lin, Junyu Gong

    Abstract: In this study, galaxy samples have been generated using mock observation techniques based on the results of TNG100-1 simulations to investigate three forms of intrinsic alignment: satellite-central alignment between the orientation of the brightest group galaxies (BGG) and the spatial distribution of their satellites, radial alignment between the satellites' orientation and the direction towards t…

    Submitted 6 August, 2024; v1 submitted 5 August, 2024; originally announced August 2024.

    Comments: 18 pages, 10 figures, 2 tables, ApJ accepted. As suggested by the TNG team, we have changed "IllustrisTNG100" to "TNG100"

  27. arXiv:2407.21507  [pdf, other]

    cs.AI cs.LG eess.IV

    FSSC: Federated Learning of Transformer Neural Networks for Semantic Image Communication

    Authors: Yuna Yan, Xin Zhang, Lixin Li, Wensheng Lin, Rui Li, Wenchi Cheng, Zhu Han

    Abstract: In this paper, we address the problem of image semantic communication in a multi-user deployment scenario and propose a federated learning (FL) strategy for a Swin Transformer-based semantic communication system (FSSC). Firstly, we demonstrate that the adoption of a Swin Transformer for joint source-channel coding (JSCC) effectively extracts semantic information in the communication system. Next,…

    Submitted 31 July, 2024; originally announced July 2024.

  28. arXiv:2407.21142  [pdf, other]

    astro-ph.EP

    Candidate Distant Trans-Neptunian Objects Detected by the New Horizons Subaru TNO Survey

    Authors: Wesley C. Fraser, Simon B. Porter, Lowell Peltier, JJ Kavelaars, Anne J. Verbiscer, Marc W. Buie, S. Alan Stern, John R. Spencer, Susan D. Benecchi, Tsuyoshi Terai, Takashi Ito, Fumi Yoshida, David W. Gerdes, Kevin J. Napier, Hsing Wen Lin, Stephen D. J. Gwyn, Hayden Smotherman, Sebastien Fabbro, Kelsi N. Singer, Amanda M. Alexander, Ko Arimatsu, Maria E. Banks, Veronica J. Bray, Mohamed Ramy El-Maarry, Chelsea L. Ferrell, et al. (19 additional authors not shown)

    Abstract: We report the detection of 239 trans-Neptunian Objects discovered through the on-going New Horizons survey for distant minor bodies being performed with the Hyper Suprime-Cam mosaic imager on the Subaru Telescope. These objects were discovered in images acquired with either the r2 or the recently commissioned EB-gri filter using shift and stack routines. Due to the extremely high stellar density o…

    Submitted 30 July, 2024; originally announced July 2024.

    Comments: Accepted for publication in the Planetary Science Journal, 28 pages, 7 figures, 3 tables

  29. arXiv:2407.21118  [pdf, other]

    cs.AI cs.LG

    Palu: Compressing KV-Cache with Low-Rank Projection

    Authors: Chi-Chih Chang, Wei-Cheng Lin, Chien-Yu Lin, Chong-Yan Chen, Yu-Fang Hu, Pei-Shuo Wang, Ning-Chi Huang, Luis Ceze, Kai-Chiang Wu

    Abstract: KV-Cache compression methods generally sample a KV-Cache of effectual tokens or quantize it into lower bits. However, these methods cannot exploit the redundancy of the hidden dimension of KV tensors. This paper investigates a unique hidden dimension approach called Palu, a novel KV-Cache compression framework that utilizes low-rank projection. Palu decomposes the linear layers into low-rank matri…

    Submitted 30 July, 2024; originally announced July 2024.

  30. arXiv:2407.20956  [pdf, other]

    cs.LG cs.AI

    An Effective Dynamic Gradient Calibration Method for Continual Learning

    Authors: Weichen Lin, Jiaxiang Chen, Ruomin Huang, Hu Ding

    Abstract: Continual learning (CL) is a fundamental topic in machine learning, where the goal is to train a model with continuously incoming data and tasks. Due to the memory limit, we cannot store all the historical data, and therefore confront the ``catastrophic forgetting'' problem, i.e., the performance on the previous tasks can substantially decrease because of the missing information in the latter peri…

    Submitted 30 July, 2024; originally announced July 2024.

  31. arXiv:2407.20119  [pdf, ps, other]

    cs.LG cs.AI

    Adaptive Self-supervised Robust Clustering for Unstructured Data with Unknown Cluster Number

    Authors: Chen-Lu Ding, Jiancan Wu, Wei Lin, Shiyang Shen, Xiang Wang, Yancheng Yuan

    Abstract: We introduce a novel self-supervised deep clustering approach tailored for unstructured data without requiring prior knowledge of the number of clusters, termed Adaptive Self-supervised Robust Clustering (ASRC). In particular, ASRC adaptively learns the graph structure and edge weights to capture both local and global structural information. The obtained graph enables us to learn clustering-friend…

    Submitted 30 July, 2024; v1 submitted 29 July, 2024; originally announced July 2024.

  32. arXiv:2407.19727  [pdf, other]

    cs.IR

    Adaptive Utilization of Cross-scenario Information for Multi-scenario Recommendation

    Authors: Xiufeng Shu, Ruidong Han, Xiang Li, Wei Lin

    Abstract: Recommender system of the e-commerce platform usually serves multiple business scenarios. Multi-scenario Recommendation (MSR) is an important topic that improves ranking performance by leveraging information from different scenarios. Recent methods for MSR mostly construct scenario shared or specific modules to model commonalities and differences among scenarios. However, when the amount of data a…

    Submitted 29 July, 2024; originally announced July 2024.

  33. arXiv:2407.19704  [pdf, other]

    eess.IV cs.MM cs.SD eess.AS

    UNQA: Unified No-Reference Quality Assessment for Audio, Image, Video, and Audio-Visual Content

    Authors: Yuqin Cao, Xiongkuo Min, Yixuan Gao, Wei Sun, Weisi Lin, Guangtao Zhai

    Abstract: As multimedia data flourishes on the Internet, quality assessment (QA) of multimedia data becomes paramount for digital media applications. Since multimedia data includes multiple modalities including audio, image, video, and audio-visual (A/V) content, researchers have developed a range of QA methods to evaluate the quality of different modality data. While they exclusively focus on addressing th…

    Submitted 29 July, 2024; originally announced July 2024.

  34. Enhancing CTR Prediction through Sequential Recommendation Pre-training: Introducing the SRP4CTR Framework

    Authors: Ruidong Han, Qianzhong Li, He Jiang, Rui Li, Yurou Zhao, Xiang Li, Wei Lin

    Abstract: Understanding user interests is crucial for Click-Through Rate (CTR) prediction tasks. In sequential recommendation, pre-training from user historical behaviors through self-supervised learning can better comprehend user dynamic preferences, presenting the potential for direct integration with CTR tasks. Previous methods have integrated pre-trained models into downstream tasks with the sole purpos…

    Submitted 28 July, 2024; originally announced July 2024.

  35. arXiv:2407.19482  [pdf]

    physics.optics

    Bistability in spatiotemporal mode-locking with dynamic multimode gain

    Authors: Zhijin Xiong, Yuankai Guo, Wei Lin, Hao Xiu, Yuncong Ma, Xuewen Chen, Zhaoheng Liang, Lin Ling, Tao Liu, Xiaoming Wei, Zhongmin Yang

    Abstract: Three-dimensional (3D) dissipative soliton existed in spatiotemporal mode-locked (STML) multimode fiber laser has been demonstrated to be a promising formalism for generating high-energy femtosecond pulses, which unfortunately exhibit diverse spatiotemporal dynamics that have not been fully understood. Completely modeling the STML multimode fiber lasers can shed new light on the underlying physics…

    Submitted 30 July, 2024; v1 submitted 28 July, 2024; originally announced July 2024.

  36. arXiv:2407.19387  [pdf, other]

    hep-th

    BPS Chaos

    Authors: Yiming Chen, Henry W. Lin, Stephen H. Shenker

    Abstract: Black holes are chaotic quantum systems that are expected to exhibit random matrix statistics in their finite energy spectrum. Lin, Maldacena, Rozenberg and Shan (LMRS) have proposed a related characterization of chaos for the ground states of BPS black holes with finite area horizons. On a separate front, the "fuzzball program" has uncovered large families of horizon-free geometries that account…

    Submitted 28 July, 2024; originally announced July 2024.

    Comments: 52 pages plus appendices, 23 figures

  37. arXiv:2407.19272  [pdf, other]

    math.NA

    Isovolumetric Energy Minimization for Ball-Shaped Volume-Preserving Parameterizations of 3-Manifolds

    Authors: Shu-Yung Liu, Tsung-Ming Huang, Wen-Wei Lin, Mei-Heng Yueh

    Abstract: A volume-preserving parameterization is a bijective mapping that maps a 3-manifold onto a specified canonical domain that preserves the local volume. This paper formulates the computation of ball-shaped volume-preserving parameterizations as an isovolumetric energy minimization (IEM) problem with the boundary points constrained on a unit sphere. In addition, we develop a new preconditioned nonline…

    Submitted 27 July, 2024; originally announced July 2024.

    Comments: 23 pages, 10 figures

    MSC Class: 65D18; 68U05; 68U01; 65D17

  38. DAC: 2D-3D Retrieval with Noisy Labels via Divide-and-Conquer Alignment and Correction

    Authors: Chaofan Gan, Yuanpeng Tu, Yuxi Li, Weiyao Lin

    Abstract: With the recent burst of 2D and 3D data, cross-modal retrieval has attracted increasing attention recently. However, manual labeling by non-experts will inevitably introduce corrupted annotations given ambiguous 2D/3D content. Though previous works have addressed this issue by designing a naive division strategy with hand-crafted thresholds, their performance generally exhibits great sensitivity t…

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: accepted by ACM MM 2024

  39. arXiv:2407.17035  [pdf, other]

    cs.CV

    Q-Ground: Image Quality Grounding with Large Multi-modality Models

    Authors: Chaofeng Chen, Sensen Yang, Haoning Wu, Liang Liao, Zicheng Zhang, Annan Wang, Wenxiu Sun, Qiong Yan, Weisi Lin

    Abstract: Recent advances of large multi-modality models (LMM) have greatly improved the ability of image quality assessment (IQA) method to evaluate and explain the quality of visual content. However, these advancements are mostly focused on overall quality assessment, and the detailed examination of local quality, which is crucial for comprehensive visual understanding, is still largely unexplored. In thi…

    Submitted 24 July, 2024; originally announced July 2024.

    Comments: ACM Multimedia 2024 (Oral)

  40. arXiv:2407.16198  [pdf, other]

    cs.CV cs.AI

    INF-LLaVA: Dual-perspective Perception for High-Resolution Multimodal Large Language Model

    Authors: Yiwei Ma, Zhibin Wang, Xiaoshuai Sun, Weihuang Lin, Qiang Zhou, Jiayi Ji, Rongrong Ji

    Abstract: With advancements in data availability and computing resources, Multimodal Large Language Models (MLLMs) have showcased capabilities across various fields. However, the quadratic complexity of the vision encoder in MLLMs constrains the resolution of input images. Most current approaches mitigate this issue by cropping high-resolution images into smaller sub-images, which are then processed indepen…

    Submitted 23 July, 2024; originally announced July 2024.

  41. arXiv:2407.15783  [pdf, other]

    quant-ph cond-mat.mes-hall

    24 days-stable CNOT-gate on fluxonium qubits with over 99.9% fidelity

    Authors: Wei-Ju Lin, Hyunheung Cho, Yinqi Chen, Maxim G. Vavilov, Chen Wang, Vladimir E. Manucharyan

    Abstract: Fluxonium qubit is a promising building block for quantum information processing due to its long coherence time and strong anharmonicity. In this paper, we realize a 60 ns direct CNOT-gate on two inductively-coupled fluxonium qubits using selective darkening approach, resulting in a gate fidelity as high as 99.94%. The fidelity remains above 99.9% for 24 days without any recalibration between rand…

    Submitted 22 July, 2024; originally announced July 2024.

  42. arXiv:2407.15450  [pdf, other]

    quant-ph cond-mat.mes-hall

    Verifying the analogy between transversely coupled spin-1/2 systems and inductively-coupled fluxoniums

    Authors: Wei-Ju Lin, Hyunheung Cho, Yinqi Chen, Maxim G. Vavilov, Chen Wang, Vladimir E. Manucharyan

    Abstract: We report a detailed characterization of two inductively coupled superconducting fluxonium qubits for implementing high-fidelity cross-resonance gates. Our circuit stands out because it behaves very closely to the case of two transversely coupled spin-1/2 systems. In particular, the generally unwanted static ZZ-term due to the non-computational transitions is nearly absent despite a strong qubit-q…

    Submitted 22 July, 2024; originally announced July 2024.

  43. arXiv:2407.14829  [pdf, other]

    cs.CL

    Overview of AI-Debater 2023: The Challenges of Argument Generation Tasks

    Authors: Jiayu Lin, Guanrong Chen, Bojun Jin, Chenyang Li, Shutong Jia, Wancong Lin, Yang Sun, Yuhang He, Caihua Yang, Jianzhu Bao, Jipeng Wu, Wen Su, Jinglu Chen, Xinyi Li, Tianyu Chen, Mingjie Han, Shuaiwen Du, Zijian Wang, Jiyin Li, Fuzhong Suo, Hao Wang, Nuanchen Lin, Xuanjing Huang, Changjian Jiang, RuiFeng Xu, et al. (4 additional authors not shown)

    Abstract: In this paper we present the results of the AI-Debater 2023 Challenge held by the Chinese Conference on Affect Computing (CCAC 2023), and introduce the related datasets. We organize two tracks to handle the argumentative generation tasks in different scenarios, namely, Counter-Argument Generation (Track 1) and Claim-based Argument Generation (Track 2). Each track is equipped with its distinct data…

    Submitted 24 July, 2024; v1 submitted 20 July, 2024; originally announced July 2024.

  44. arXiv:2407.14697  [pdf, ps, other]

    nucl-ex

    Single-proton removal reaction in the IQMD+GEMINI model benchmarked by elemental fragmentation cross sections of $^{29-33}\mathrm{Si}$ on carbon at $\sim$230~MeV/nucleon

    Authors: Guang-Shuai Li, Jun Su, Satoru Terashima, Jian-Wei Zhao, Er-Xi Xiao, Ji-Chao Zhang, Liu-Chun He, Ge Guo, Wei-Ping Lin, Wen-Jian Lin, Chuan-Ye Liu, Chen-Gui Lu, Bo Mei, Dan-Yang Pang, Ye-Lei Sun, Zhi-Yu Sun, Meng Wang, Feng Wang, Jing Wang, Shi-Tao Wang, Xiu-Lin Wei, Xiao-Dong Xu, Jun-Yao Xu, Li-Hua Zhu, Yong Zheng, et al. (2 additional authors not shown)

    Abstract: We report on the first measurement of the elemental fragmentation cross sections (EFCSs) of $^{29-33}\mathrm{Si}$ on a carbon target at $\sim$230~MeV/nucleon. The experimental data covering charge changes of $ΔZ$ = 1-4 are reproduced well by the isospin-dependent quantum molecular dynamics (IQMD) coupled with the evaporation GEMINI (IQMD+GEMINI) model. We further explore the mechanisms underlying…

    Submitted 19 July, 2024; originally announced July 2024.

    Comments: 7 pages, 4 figures

  45. arXiv:2407.13664  [pdf, other]

    cs.LG

    Decision Focused Causal Learning for Direct Counterfactual Marketing Optimization

    Authors: Hao Zhou, Rongxiao Huang, Shaoming Li, Guibin Jiang, Jiaqi Zheng, Bing Cheng, Wei Lin

    Abstract: Marketing optimization plays an important role in enhancing user engagement on online Internet platforms. Existing studies usually formulate this problem as a budget allocation problem and solve it by utilizing two fully decoupled stages, i.e., machine learning (ML) and operations research (OR). However, the learning objective in ML does not take into account the downstream optimization task in OR, whi…

    Submitted 18 July, 2024; originally announced July 2024.

    Comments: Accepted by KDD 2024

  46. arXiv:2407.13274  [pdf, other]

    cs.IR

    Aligning Explanations for Recommendation with Rating and Feature via Maximizing Mutual Information

    Authors: Yurou Zhao, Yiding Sun, Ruidong Han, Fei Jiang, Lu Guan, Xiang Li, Wei Lin, Weizhi Ma, Jiaxin Mao

    Abstract: Providing natural language-based explanations to justify recommendations helps to improve users' satisfaction and gain users' trust. However, as current explanation generation methods are commonly trained with an objective to mimic existing user reviews, the generated explanations are often not aligned with the predicted ratings or some important features of the recommended items, and thus, are su…

    Submitted 20 August, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

    Comments: This paper has been accepted by CIKM 2024, and the code repository will be updated soon

  47. arXiv:2407.10648  [pdf, other]

    cs.RO

    Back to Newton's Laws: Learning Vision-based Agile Flight via Differentiable Physics

    Authors: Yuang Zhang, Yu Hu, Yunlong Song, Danping Zou, Weiyao Lin

    Abstract: Swarm navigation in cluttered environments is a grand challenge in robotics. This work combines deep learning with first-principle physics through differentiable simulation to enable autonomous navigation of multiple aerial robots through complex environments at high speed. Our approach optimizes a neural network control policy directly by backpropagating loss gradients through the robot simulatio…

    Submitted 15 July, 2024; v1 submitted 15 July, 2024; originally announced July 2024.

  48. arXiv:2407.10199  [pdf, other]

    nucl-ex nucl-th

    Charge radii of $^{11-16}$C, $^{13-17}$N and $^{15-18}$O determined from their charge-changing cross-sections and the mirror-difference charge radii

    Authors: J. W. Zhao, B. -H. Sun, I. Tanihata, J. Y. Xu, K. Y. Zhang, A. Prochazka, L. H. Zhu, S. Terashima, J. Meng, L. C. He, C. Y. Liu, G. S. Li, C. G. Lu, W. J. Lin, W. P. Lin, Z. Liu, P. P Ren, Z. Y. Sun, F. Wang, J. Wang, M. Wang, S. T. Wang, X. L. Wei, X. D. Xu, J. C. Zhang, et al. (2 additional authors not shown)

    Abstract: Charge-changing cross-sections of $^{11-16}$C, $^{13-17}$N and $^{15-18}$O on a carbon target have been determined at energies around 300 MeV/nucleon. A nucleon separation energy dependent correction factor has been introduced to the Glauber model calculation for extracting the nuclear charge radii from the experimental CCCSs. The charge radii of $^{11}$C, $^{13,16}$N and $^{15}$O thus were determ…

    Submitted 4 August, 2024; v1 submitted 14 July, 2024; originally announced July 2024.

    Comments: 3 figures, submitted to Physics Letters B

  49. arXiv:2407.06766  [pdf, other]

    cs.DB

    Relational Perspective on Graph Query Languages

    Authors: Diego Figueira, Anthony W. Lin, Liat Peterfreund

    Abstract: We study a relational perspective of graph database querying. Such a perspective underlies various graph database systems but very few theoretical investigations have been conducted on it. This perspective offers a powerful and unified framework to study graph database querying, by which algorithms and complexity follow from classical results. We provide two concrete applications. The first is q…

    Submitted 9 July, 2024; originally announced July 2024.

  50. arXiv:2407.05382  [pdf, other]

    cs.CV

    Rethinking Unsupervised Outlier Detection via Multiple Thresholding

    Authors: Zhonghang Liu, Panzhong Lu, Guoyang Xie, Zhichao Lu, Wen-Yan Lin

    Abstract: In the realm of unsupervised image outlier detection, assigning outlier scores holds greater significance than its subsequent task: thresholding for predicting labels. This is because determining the optimal threshold on non-separable outlier score functions is an ill-posed problem. However, the lack of predicted labels not only hinders some real applications of current outlier detectors but also c…

    Submitted 14 July, 2024; v1 submitted 7 July, 2024; originally announced July 2024.