Zum Hauptinhalt springen

Showing 1–50 of 832 results for author: Wu, R

.
  1. The NIRSpec Micro-Shutter Array: Operability and Operations After Two Years of JWST Science

    Authors: Katie Bechtold, Torsten Böker, David E. Franz, Maurice te Plate, Timothy D. Rawle, Rai Wu, Peter Zeidler

    Abstract: The Near Infrared Spectrograph (NIRSpec) on the James Webb Space Telescope affords the astronomical community an unprecedented space-based Multi-Object Spectroscopy (MOS) capability through the use of a programmable array of micro-electro-mechanical shutters. Launched in December 2021 and commissioned along with a suite of other observatory instruments throughout the first half of 2022, NIRSpec ha… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

    Comments: SPIE Astronomical Telescopes + Instrumentation: Ground-based and Airborne Instrumentation for Astronomy X (Yokohama 2024), paper number 13092-38

  2. arXiv:2408.15568  [pdf, other

    cs.AR

    Affordable HPC: Leveraging Small Clusters for Big Data and Graph Computing

    Authors: Ruilong Wu, Yisu Wang, Dirk Kutscher

    Abstract: This study explores strategies for academic researchers to optimize computational resources within limited budgets, focusing on building small, efficient computing clusters. It delves into the comparative costs of purchasing versus renting servers, guided by market research and economic theories on tiered pricing. The paper offers detailed insights into the selection and assembly of hardware compo… ▽ More

    Submitted 28 August, 2024; originally announced August 2024.

  3. arXiv:2408.10236  [pdf, other

    eess.IV cs.CV

    AID-DTI: Accelerating High-fidelity Diffusion Tensor Imaging with Detail-preserving Model-based Deep Learning

    Authors: Wenxin Fan, Jian Cheng, Cheng Li, Jing Yang, Ruoyou Wu, Juan Zou, Shanshan Wang

    Abstract: Deep learning has shown great potential in accelerating diffusion tensor imaging (DTI). Nevertheless, existing methods tend to suffer from Rician noise and eddy current, leading to detail loss in reconstructing the DTI-derived parametric maps especially when sparsely sampled q-space data are used. To address this, this paper proposes a novel method, AID-DTI (\textbf{A}ccelerating h\textbf{I}gh fi\… ▽ More

    Submitted 4 August, 2024; originally announced August 2024.

    Comments: 12 pages, 3 figures, MICCAI 2024 Workshop on Computational Diffusion MRI. arXiv admin note: text overlap with arXiv:2401.01693, arXiv:2405.03159

  4. arXiv:2408.06301  [pdf, other

    astro-ph.GA astro-ph.HE

    Density Distribution of Plasmas Resembling Dark Matter Halo Due to Ionization lag and Ambipolar Electric Field

    Authors: Haibin Chen, Rong Wu

    Abstract: In a spherically symmetric plasma constrained by its own gravity, the ionization degree lags behind changes in temperature and density. The ambipolar electric field accelerates ions radially and cools electrons. Ions lose energy and angular momentum in collisions with low-temperature electrons. The angular momentum of ions decreases much faster than their energy in cycles. The trajectories of ions… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

    Comments: 10 pages, 4 figures

  5. arXiv:2408.06185  [pdf, other

    eess.SY cs.CY cs.GT cs.NI

    Hi-SAM: A high-scalable authentication model for satellite-ground Zero-Trust system using mean field game

    Authors: Xuesong Wu, Tianshuai Zheng, Runfang Wu, Jie Ren, Junyan Guo, Ye Du

    Abstract: As more and more Internet of Thing (IoT) devices are connected to satellite networks, the Zero-Trust Architecture brings dynamic security to the satellite-ground system, while frequent authentication creates challenges for system availability. To make the system's accommodate more IoT devices, this paper proposes a high-scalable authentication model (Hi-SAM). Hi-SAM introduces the Proof-of-Work id… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

  6. arXiv:2408.03190  [pdf

    cond-mat.mtrl-sci physics.app-ph physics.chem-ph physics.comp-ph

    Phase field simulations of thermal annealing for all-small molecule organic solar cells

    Authors: Yasin Ameslon, Olivier J. J. Ronsin, Christina Harreiss, Johannes Will, Stefanie Rechberger Mingjian Wu, Erdmann Spiecker, Jens Harting

    Abstract: Interest in organic solar cells (OSCs) is constantly rising in the field of photovoltaic devices. The device performance relies on the bulk heterojunction (BHJ) nanomorphology, which develops during the drying process and additional post-treatment. This work studies the effect of thermal annealing (TA) on an all-small molecule DRCN5T: PC71 BM blend with phase field simulations. The objective is to… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

    Comments: 39 pages incl. SI

  7. arXiv:2408.01953  [pdf, other

    cs.RO cs.CV cs.LG

    EqvAfford: SE(3) Equivariance for Point-Level Affordance Learning

    Authors: Yue Chen, Chenrui Tie, Ruihai Wu, Hao Dong

    Abstract: Humans perceive and interact with the world with the awareness of equivariance, facilitating us in manipulating different objects in diverse poses. For robotic manipulation, such equivariance also exists in many scenarios. For example, no matter what the pose of a drawer is (translation, rotation and tilt), the manipulation strategy is consistent (grasp the handle and pull in a line). While tradit… ▽ More

    Submitted 7 August, 2024; v1 submitted 4 August, 2024; originally announced August 2024.

    Comments: Accept to CVPRWorkshop on Equivariant Vision: From Theory to Practice 2024

  8. arXiv:2407.18610  [pdf

    cond-mat.supr-con

    Direct observation of quantum vortex fractionalization in multiband superconductors

    Authors: Yu Zheng, Quanxin Hu, Haijiao Ji, Igor Timoshuk, Hanxiang Xu, Yongwei Li, Ye Gao, Xin Yu, Rui Wu, Xingye Lu, Vadim Grinenko, Egor Babaev, Noah F. Q. Yuan, Baiqing Lv, Chi-Ming Yim, Hong Ding

    Abstract: Magnetic field is expelled from a superconductor, unless it forms quantum vortices, consisting of a core singularity with current circulating around it. The London quantization condition implies that there is one core singularity per quantum of magnetic flux in single-component superconductors, while in multiband materials fractional vortices are possible. Here, we report the first observation of… ▽ More

    Submitted 27 August, 2024; v1 submitted 26 July, 2024; originally announced July 2024.

    Comments: 16 pages, 4 figures

  9. arXiv:2407.15102  [pdf, other

    quant-ph

    Experimental demonstration of reconstructing quantum states with generative models

    Authors: Xuegang Li, Wenjie Jiang, Ziyue Hua, Weiting Wang, Xiaoxuan Pan, Weizhou Cai, Zhide Lu, Jiaxiu Han, Rebing Wu, Chang-Ling Zou, Dong-Ling Deng, Luyan Sun

    Abstract: Quantum state tomography, a process that reconstructs a quantum state from measurements on an ensemble of identically prepared copies, plays a crucial role in benchmarking quantum devices. However, brute-force approaches to quantum state tomography would become impractical for large systems, as the required resources scale exponentially with the system size. Here, we explore a machine learning app… ▽ More

    Submitted 21 July, 2024; originally announced July 2024.

  10. arXiv:2407.13690  [pdf, other

    cs.CL cs.AI

    DART-Math: Difficulty-Aware Rejection Tuning for Mathematical Problem-Solving

    Authors: Yuxuan Tong, Xiwen Zhang, Rui Wang, Ruidong Wu, Junxian He

    Abstract: Solving mathematical problems requires advanced reasoning abilities and presents notable challenges for large language models. Previous works usually synthesize data from proprietary models to augment existing datasets, followed by instruction tuning to achieve top-tier results. However, our analysis of these datasets reveals severe biases towards easy queries, with frequent failures to generate a… ▽ More

    Submitted 18 June, 2024; originally announced July 2024.

    Comments: Preprint. Data and model checkpoints are available at https://github.com/hkust-nlp/dart-math

  11. arXiv:2407.13113  [pdf, other

    cs.AI

    Multiobjective Vehicle Routing Optimization with Time Windows: A Hybrid Approach Using Deep Reinforcement Learning and NSGA-II

    Authors: Rixin Wu, Ran Wang, Jie Hao, Qiang Wu, Ping Wang, Dusit Niyato

    Abstract: This paper proposes a weight-aware deep reinforcement learning (WADRL) approach designed to address the multiobjective vehicle routing problem with time windows (MOVRPTW), aiming to use a single deep reinforcement learning (DRL) model to solve the entire multiobjective optimization problem. The Non-dominated sorting genetic algorithm-II (NSGA-II) method is then employed to optimize the outcomes pr… ▽ More

    Submitted 17 July, 2024; originally announced July 2024.

    Comments: 13 pages; Under Review; Submitted to IEEE Transactions on Intelligent Transportation Systems

  12. arXiv:2407.10737  [pdf, other

    cs.CV cs.AI

    Aligning Neuronal Coding of Dynamic Visual Scenes with Foundation Vision Models

    Authors: Rining Wu, Feixiang Zhou, Ziwei Yin, Jian K. Liu

    Abstract: Our brains represent the ever-changing environment with neurons in a highly dynamic fashion. The temporal features of visual pixels in dynamic natural scenes are entrapped in the neuronal responses of the retina. It is crucial to establish the intrinsic temporal relationship between visual pixels and neuronal responses. Recent foundation vision models have paved an advanced way of understanding im… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

    Comments: This article is accepted by ECCV 2024, which ID is 12149. Accepted papers' id can be found in: https://eccv2024.ecva.net/Conferences/2024/AcceptedPapers

  13. arXiv:2407.06172  [pdf, other

    cs.AI cs.CL

    On Speeding Up Language Model Evaluation

    Authors: Jin Peng Zhou, Christian K. Belardi, Ruihan Wu, Travis Zhang, Carla P. Gomes, Wen Sun, Kilian Q. Weinberger

    Abstract: Developing prompt-based methods with Large Language Models (LLMs) requires making numerous decisions, which give rise to a combinatorial search problem. For example, selecting the right pre-trained LLM, prompt, and hyperparameters to attain the best performance for a task typically necessitates evaluating an expoential number of candidates on large validation sets. This exhaustive evaluation can b… ▽ More

    Submitted 14 August, 2024; v1 submitted 8 July, 2024; originally announced July 2024.

  14. arXiv:2407.05282  [pdf, other

    cs.CV

    UltraEdit: Instruction-based Fine-Grained Image Editing at Scale

    Authors: Haozhe Zhao, Xiaojian Ma, Liang Chen, Shuzheng Si, Rujie Wu, Kaikai An, Peiyu Yu, Minjia Zhang, Qing Li, Baobao Chang

    Abstract: This paper presents UltraEdit, a large-scale (approximately 4 million editing samples), automatically generated dataset for instruction-based image editing. Our key idea is to address the drawbacks in existing image editing datasets like InstructPix2Pix and MagicBrush, and provide a systematic approach to producing massive and high-quality image editing samples. UltraEdit offers several distinct a… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: 32 pages, 14 figures

  15. arXiv:2407.05149  [pdf

    physics.bio-ph physics.app-ph physics.chem-ph physics.optics

    Quantized Acoustic Phonons Map the Dynamics of a Single Virus

    Authors: Yaqing Zhang, Rihan Wu, Md Shahjahan, Canchai Yang, Dohun Pyeon, Elad Harel

    Abstract: The natural vibrational frequencies of biological particles such as viruses and bacteria encode critical information about their mechanical and biological states as they interact with their local environment and undergo structural evolution. However, detecting and tracking these vibrations within a biological context at the single particle level has remained elusive. In this study, we track the vi… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: Main Manuscript: 19 pages, 4 figures Supplementary Information: 29 pages, 17 figures

  16. arXiv:2407.04346  [pdf, other

    cs.CV

    MobileFlow: A Multimodal LLM For Mobile GUI Agent

    Authors: Songqin Nong, Jiali Zhu, Rui Wu, Jiongchao Jin, Shuo Shan, Xiutian Huang, Wenhao Xu

    Abstract: Currently, the integration of mobile Graphical User Interfaces (GUIs) is ubiquitous in most people's daily lives. And the ongoing evolution of multimodal large-scale models, such as GPT-4v, Qwen-VL-Max, has significantly bolstered the capabilities of GUI comprehension and user action analysis, showcasing the potentiality of intelligent GUI assistants. However, current GUI Agents often need to acce… ▽ More

    Submitted 7 August, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

  17. arXiv:2407.03951  [pdf, other

    cs.LG

    Uncertainty-Guided Optimization on Large Language Model Search Trees

    Authors: Julia Grosse, Ruotian Wu, Ahmad Rashid, Philipp Hennig, Pascal Poupart, Agustinus Kristiadi

    Abstract: Beam search is a standard tree search algorithm when it comes to finding sequences of maximum likelihood, for example, in the decoding processes of large language models. However, it is myopic since it does not take the whole path from the root to a leaf into account. Moreover, it is agnostic to prior knowledge available about the process: For example, it does not consider that the objective being… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

    Comments: 10 pages

  18. arXiv:2407.01649  [pdf, other

    q-bio.QM cs.LG

    FAFE: Immune Complex Modeling with Geodesic Distance Loss on Noisy Group Frames

    Authors: Ruidong Wu, Ruihan Guo, Rui Wang, Shitong Luo, Yue Xu, Jiahan Li, Jianzhu Ma, Qiang Liu, Yunan Luo, Jian Peng

    Abstract: Despite the striking success of general protein folding models such as AlphaFold2(AF2, Jumper et al. (2021)), the accurate computational modeling of antibody-antigen complexes remains a challenging task. In this paper, we first analyze AF2's primary loss function, known as the Frame Aligned Point Error (FAPE), and raise a previously overlooked issue that FAPE tends to face gradient vanishing probl… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  19. arXiv:2406.17248  [pdf, other

    quant-ph

    MindSpore Quantum: A User-Friendly, High-Performance, and AI-Compatible Quantum Computing Framework

    Authors: Xusheng Xu, Jiangyu Cui, Zidong Cui, Runhong He, Qingyu Li, Xiaowei Li, Yanling Lin, Jiale Liu, Wuxin Liu, Jiale Lu, Maolin Luo, Chufan Lyu, Shijie Pan, Mosharev Pavel, Runqiu Shu, Jialiang Tang, Ruoqian Xu, Shu Xu, Kang Yang, Fan Yu, Qingguo Zeng, Haiying Zhao, Qiang Zheng, Junyuan Zhou, Xu Zhou , et al. (14 additional authors not shown)

    Abstract: We introduce MindSpore Quantum, a pioneering hybrid quantum-classical framework with a primary focus on the design and implementation of noisy intermediate-scale quantum (NISQ) algorithms. Leveraging the robust support of MindSpore, an advanced open-source deep learning training/inference framework, MindSpore Quantum exhibits exceptional efficiency in the design and training of variational quantum… ▽ More

    Submitted 10 July, 2024; v1 submitted 24 June, 2024; originally announced June 2024.

  20. arXiv:2406.15774  [pdf, other

    cs.RO

    Observation Time Difference: an Online Dynamic Objects Removal Method for Ground Vehicles

    Authors: Rongguang Wu, Chenglin Pang, Xuankang Wu, Zheng Fang

    Abstract: In the process of urban environment mapping, the sequential accumulations of dynamic objects will leave a large number of traces in the map. These traces will usually have bad influences on the localization accuracy and navigation performance of the robot. Therefore, dynamic objects removal plays an important role for creating clean map. However, conventional dynamic objects removal methods usuall… ▽ More

    Submitted 22 June, 2024; originally announced June 2024.

  21. arXiv:2406.14288  [pdf, other

    cs.LG cs.AI

    Revisiting Modularity Maximization for Graph Clustering: A Contrastive Learning Perspective

    Authors: Yunfei Liu, Jintang Li, Yuehe Chen, Ruofan Wu, Ericbk Wang, Jing Zhou, Sheng Tian, Shuheng Shen, Xing Fu, Changhua Meng, Weiqiang Wang, Liang Chen

    Abstract: Graph clustering, a fundamental and challenging task in graph mining, aims to classify nodes in a graph into several disjoint clusters. In recent years, graph contrastive learning (GCL) has emerged as a dominant line of research in graph clustering and advances the new state-of-the-art. However, GCL-based methods heavily rely on graph augmentations and contrastive schemes, which may potentially in… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: KDD 2024 research track. Code available at https://github.com/EdisonLeeeee/MAGI

  22. arXiv:2406.12316  [pdf, other

    cs.CV cs.AI cs.MM

    Enhancing Visible-Infrared Person Re-identification with Modality- and Instance-aware Visual Prompt Learning

    Authors: Ruiqi Wu, Bingliang Jiao, Wenxuan Wang, Meng Liu, Peng Wang

    Abstract: The Visible-Infrared Person Re-identification (VI ReID) aims to match visible and infrared images of the same pedestrians across non-overlapped camera views. These two input modalities contain both invariant information, such as shape, and modality-specific details, such as color. An ideal model should utilize valuable information from both modalities during training for enhanced representational… ▽ More

    Submitted 18 June, 2024; originally announced June 2024.

    Comments: Accepyed by ACM International Conference on Multimedia Retrieval (ICMR'24)

    Journal ref: ICMR'24: Proceedings of the 2024 International Conference on Multimedia Retrieval (2024) 579 - 588

  23. arXiv:2406.11810  [pdf, ps, other

    cs.LG cs.RO eess.SY

    Computationally Efficient RL under Linear Bellman Completeness for Deterministic Dynamics

    Authors: Runzhe Wu, Ayush Sekhari, Akshay Krishnamurthy, Wen Sun

    Abstract: We study computationally and statistically efficient Reinforcement Learning algorithms for the linear Bellman Complete setting, a setting that uses linear function approximation to capture value functions and unifies existing models like linear Markov Decision Processes (MDP) and Linear Quadratic Regulators (LQR). While it is known from the prior works that this setting is statistically tractable,… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

  24. arXiv:2406.08698  [pdf, other

    astro-ph.HE hep-ph

    Constraints on Ultra Heavy Dark Matter Properties from Dwarf Spheroidal Galaxies with LHAASO Observations

    Authors: Zhen Cao, F. Aharonian, Q. An, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, J. T. Cai, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. H. Chen, S. Z. Chen , et al. (255 additional authors not shown)

    Abstract: In this work we try to search for signals generated by ultra-heavy dark matter at the Large High Altitude Air Shower Observatory (LHAASO) data. We look for possible gamma-ray by dark matter annihilation or decay from 16 dwarf spheroidal galaxies in the field of view of LHAASO. Dwarf spheroidal galaxies are among the most promising targets for indirect detection of dark matter which have low fluxes… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 17 pages, 12 figures, accepted by PRL

  25. arXiv:2406.08177  [pdf, other

    eess.IV cs.CV

    One-Step Effective Diffusion Network for Real-World Image Super-Resolution

    Authors: Rongyuan Wu, Lingchen Sun, Zhiyuan Ma, Lei Zhang

    Abstract: The pre-trained text-to-image diffusion models have been increasingly employed to tackle the real-world image super-resolution (Real-ISR) problem due to their powerful generative image priors. Most of the existing methods start from random noise to reconstruct the high-quality (HQ) image under the guidance of the given low-quality (LQ) image. While promising results have been achieved, such Real-… ▽ More

    Submitted 14 June, 2024; v1 submitted 12 June, 2024; originally announced June 2024.

  26. arXiv:2406.07928  [pdf, other

    cs.RO

    Undergraduate Robotics Education with General Instructors using a Student-Centered Personalized Learning Framework

    Authors: Rui Wu, David J Feil-Seifer, Ponkoj C Shill, Hossein Jamali, Sergiu Dascalu, Fred Harris, Laura Rosof, Bryan Hutchins, Marjorie Campo Ringler, Zhen Zhu

    Abstract: Recent advancements in robotics, including applications like self-driving cars, unmanned systems, and medical robots, have had a significant impact on the job market. On one hand, big robotics companies offer training programs based on the job requirements. However, these training programs may not be as beneficial as general robotics programs offered by universities or community colleges. On the o… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

    Comments: 11 pages, 3 figures, 1 table, 2024 ASEE Conference

  27. arXiv:2406.07780  [pdf, other

    cs.LG cs.CL

    A Critical Look At Tokenwise Reward-Guided Text Generation

    Authors: Ahmad Rashid, Ruotian Wu, Julia Grosse, Agustinus Kristiadi, Pascal Poupart

    Abstract: Large language models (LLMs) can significantly be improved by aligning to human preferences -- the so-called reinforcement learning from human feedback (RLHF). However, the cost of fine-tuning an LLM is prohibitive for many users. Due to their ability to bypass LLM finetuning, tokenwise reward-guided text generation (RGTG) methods have recently been proposed. They use a reward model trained on ful… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  28. arXiv:2406.06612  [pdf, other

    cs.CV cs.LG cs.SD eess.AS

    SEE-2-SOUND: Zero-Shot Spatial Environment-to-Spatial Sound

    Authors: Rishit Dagli, Shivesh Prakash, Robert Wu, Houman Khosravani

    Abstract: Generating combined visual and auditory sensory experiences is critical for the consumption of immersive content. Recent advances in neural generative models have enabled the creation of high-resolution content across multiple modalities such as images, text, speech, and videos. Despite these successes, there remains a significant gap in the generation of high-quality spatial audio that complement… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Project Page: https://see2sound.github.io/

  29. arXiv:2406.05917  [pdf, other

    econ.GN

    China's Rising Leadership in Global Science

    Authors: Renli Wu, Christopher Esposito, James Evans

    Abstract: Major shifts in the global system of science and technology are destabilizing the global status order and demonstrating the capacity for emerging countries like China and India to exert greater influence. In order to measure changes in the global scientific system, we develop a framework to assess the hierarchical position of countries in the international scientific collaboration network. Using a… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

  30. arXiv:2406.04002  [pdf, other

    cs.CV

    3rd Place Solution for PVUW Challenge 2024: Video Panoptic Segmentation

    Authors: Ruipu Wu, Jifei Che, Han Li, Chengjing Wu, Ting Liu, Luoqi Liu

    Abstract: Video panoptic segmentation is an advanced task that extends panoptic segmentation by applying its concept to video sequences. In the hope of addressing the challenge of video panoptic segmentation in diverse conditions, We utilize DVIS++ as our baseline model and enhance it by introducing a comprehensive approach centered on the query-wise ensemble, supplemented by additional techniques. Our prop… ▽ More

    Submitted 6 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: 3nd Place Solution for CVPR 2024 PVUW VPS Track

  31. arXiv:2406.03835  [pdf, other

    cs.CV cs.RO

    Monocular Localization with Semantics Map for Autonomous Vehicles

    Authors: Jixiang Wan, Xudong Zhang, Shuzhou Dong, Yuwei Zhang, Yuchen Yang, Ruoxi Wu, Ye Jiang, Jijunnan Li, Jinquan Lin, Ming Yang

    Abstract: Accurate and robust localization remains a significant challenge for autonomous vehicles. The cost of sensors and limitations in local computational efficiency make it difficult to scale to large commercial applications. Traditional vision-based approaches focus on texture features that are susceptible to changes in lighting, season, perspective, and appearance. Additionally, the large storage siz… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  32. arXiv:2406.02976  [pdf, other

    cs.CV cs.AI

    DA-Flow: Dual Attention Normalizing Flow for Skeleton-based Video Anomaly Detection

    Authors: Ruituo Wu, Yang Chen, Jian Xiao, Bing Li, Jicong Fan, Frédéric Dufaux, Ce Zhu, Yipeng Liu

    Abstract: Cooperation between temporal convolutional networks (TCN) and graph convolutional networks (GCN) as a processing module has shown promising results in skeleton-based video anomaly detection (SVAD). However, to maintain a lightweight model with low computational and storage complexity, shallow GCN and TCN blocks are constrained by small receptive fields and a lack of cross-dimension interaction cap… ▽ More

    Submitted 5 June, 2024; originally announced June 2024.

  33. arXiv:2406.02283  [pdf, other

    cs.RO

    Broadcasting Support Relations Recursively from Local Dynamics for Object Retrieval in Clutters

    Authors: Yitong Li, Ruihai Wu, Haoran Lu, Chuanruo Ning, Yan Shen, Guanqi Zhan, Hao Dong

    Abstract: In our daily life, cluttered objects are everywhere, from scattered stationery and books cluttering the table to bowls and plates filling the kitchen sink. Retrieving a target object from clutters is an essential while challenging skill for robots, for the difficulty of safely manipulating an object without disturbing others, which requires the robot to plan a manipulation sequence and first move… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: RSS 2024

  34. arXiv:2406.01007  [pdf, other

    hep-ex

    Measurement of Electron Antineutrino Oscillation Amplitude and Frequency via Neutron Capture on Hydrogen at Daya Bay

    Authors: Daya Bay collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, J. Cheng, Y. -C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng , et al. (177 additional authors not shown)

    Abstract: This Letter reports the first measurement of the oscillation amplitude and frequency of reactor antineutrinos at Daya Bay via neutron capture on hydrogen using 1958 days of data. With over 3.6 million signal candidates, an optimized candidate selection, improved treatment of backgrounds and efficiencies, refined energy calibration, and an energy response model for the capture-on-hydrogen sensitive… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  35. arXiv:2406.00943  [pdf, other

    cs.LG cs.AI

    State Space Models on Temporal Graphs: A First-Principles Study

    Authors: Jintang Li, Ruofan Wu, Xinzhou Jin, Boqun Ma, Liang Chen, Zibin Zheng

    Abstract: Over the past few years, research on deep graph learning has shifted from static graphs to temporal graphs in response to real-world complex systems that exhibit dynamic behaviors. In practice, temporal graphs are formalized as an ordered sequence of static graph snapshots observed at discrete time points. Sequence models such as RNNs or Transformers have long been the predominant backbone network… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

    Comments: Preprint; Code will be made available at https://github.com/EdisonLeeeee/GraphSSM

  36. arXiv:2405.19207  [pdf

    cs.IR cs.AI

    A Multi-Source Retrieval Question Answering Framework Based on RAG

    Authors: Ridong Wu, Shuhong Chen, Xiangbiao Su, Yuankai Zhu, Yifei Liao, Jianming Wu

    Abstract: With the rapid development of large-scale language models, Retrieval-Augmented Generation (RAG) has been widely adopted. However, existing RAG paradigms are inevitably influenced by erroneous retrieval information, thereby reducing the reliability and correctness of generated results. Therefore, to improve the relevance of retrieval information, this study proposes a method that replaces tradition… ▽ More

    Submitted 29 May, 2024; originally announced May 2024.

    Comments: 4 pages,3 figures

  37. arXiv:2405.19005  [pdf, other

    cs.CV

    Auto-selected Knowledge Adapters for Lifelong Person Re-identification

    Authors: Xuelin Qian, Ruiqi Wu, Gong Cheng, Junwei Han

    Abstract: Lifelong Person Re-Identification (LReID) extends traditional ReID by requiring systems to continually learn from non-overlapping datasets across different times and locations, adapting to new identities while preserving knowledge of previous ones. Existing approaches, either rehearsal-free or rehearsal-based, still suffer from the problem of catastrophic forgetting since they try to cram diverse… ▽ More

    Submitted 30 May, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

  38. arXiv:2405.18334  [pdf, other

    cs.DB cs.CV cs.LG

    SketchQL Demonstration: Zero-shot Video Moment Querying with Sketches

    Authors: Renzhi Wu, Pramod Chunduri, Dristi J Shah, Ashmitha Julius Aravind, Ali Payani, Xu Chu, Joy Arulraj, Kexin Rong

    Abstract: In this paper, we will present SketchQL, a video database management system (VDBMS) for retrieving video moments with a sketch-based query interface. This novel interface allows users to specify object trajectory events with simple mouse drag-and-drop operations. Users can use trajectories of single objects as building blocks to compose complex events. Using a pre-trained model that encodes trajec… ▽ More

    Submitted 30 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Journal ref: Published on International Conference on Very Large Databases 2024

  39. arXiv:2405.17767  [pdf, other

    cs.LG cs.CL stat.ML

    Linguistic Collapse: Neural Collapse in (Large) Language Models

    Authors: Robert Wu, Vardan Papyan

    Abstract: Neural collapse ($\mathcal{NC}$) is a phenomenon observed in classification tasks where top-layer representations collapse into their class means, which become equinorm, equiangular and aligned with the classifiers. These behaviors -- associated with generalization and robustness -- would manifest under specific conditions: models are trained towards zero loss, with noise-free labels belonging to… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 29 pages, 27 figures

    MSC Class: 68T07 (Primary) 68T50 (Secondary) ACM Class: I.2.6; I.2.7

  40. arXiv:2405.16989  [pdf, other

    stat.ME

    Uncertainty Learning for High-dimensional Mean-variance Portfolio

    Authors: Han Lin Shang, Ruike Wu, Yanrong Yang

    Abstract: Accounting for uncertainty in Data quality is important for accurate statistical inference. We aim to an optimal conservative allocation for a large universe of assets in mean-variance portfolio (MVP), which is the worst choice within uncertainty in data distribution. Unlike the low dimensional MVP studied in Blanchet et al. (2022, Management Science), the large number of assets raises a challengi… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 28 pages, 2 figures, 4 tables

    MSC Class: 91G10; 62P05

  41. arXiv:2405.16886  [pdf, other

    cs.CV

    Hawk: Learning to Understand Open-World Video Anomalies

    Authors: Jiaqi Tang, Hao Lu, Ruizheng Wu, Xiaogang Xu, Ke Ma, Cheng Fang, Bin Guo, Jiangbo Lu, Qifeng Chen, Ying-Cong Chen

    Abstract: Video Anomaly Detection (VAD) systems can autonomously monitor and identify disturbances, reducing the need for manual labor and associated costs. However, current VAD systems are often limited by their superficial semantic understanding of scenes and minimal user interaction. Additionally, the prevalent data scarcity in existing datasets restricts their applicability in open-world scenarios. In t… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  42. arXiv:2405.16720  [pdf, other

    cs.CL

    Large Scale Knowledge Washing

    Authors: Yu Wang, Ruihan Wu, Zexue He, Xiusi Chen, Julian McAuley

    Abstract: Large language models show impressive abilities in memorizing world knowledge, which leads to concerns regarding memorization of private information, toxic or sensitive knowledge, and copyrighted content. We introduce the problem of Large Scale Knowledge Washing, focusing on unlearning an extensive amount of factual knowledge. Previous unlearning methods usually define the reverse loss and update… ▽ More

    Submitted 28 May, 2024; v1 submitted 26 May, 2024; originally announced May 2024.

  43. arXiv:2405.16583  [pdf

    physics.optics

    An erbium-doped waveguide amplifier on thin film lithium niobate with an output power exceeding 100 mW

    Authors: Rui Bao, Zhiwei Fang, Jian Liu, Zhaoxiang Liu, Jinming Chen, Min Wang, Rongbo Wu, Haisu Zhang, Ya Cheng

    Abstract: We demonstrate high-power thin film lithium niobate (TFLN) erbium-doped waveguide amplifier (EDWA) with a maximum on-chip output power of 113 mW and a gain of 16 dB. The on-chip integrated EDWA is composed of large mode area (LMA) waveguide structures with a total length of 7 cm and a footprint of 1x1 cm2. Particularly, we connect segmented LMA waveguides with waveguide tapers to achieve on-chip m… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: 13 pages, 4 figures

  44. Development of a Virtual Reality Application for Oculomotor Examination Education Based on Student-Centered Pedagogy

    Authors: Austin Finlayson, Rui Wu, Chia-Cheng Lin, Brian Sylcott

    Abstract: This work-in-progress paper discusses the use of student-centered pedagogy to teach clinical oculomotor examination via Virtual Reality (VR). Traditional methods, such as PowerPoint slides and lab activities, are often insufficient for providing hands-on experience due to the high cost of clinical equipment. To address this, a VR-based application was developed using Unity and the HTC Vive Pro hea… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  45. arXiv:2405.15140  [pdf, other

    cs.LG

    Better Membership Inference Privacy Measurement through Discrepancy

    Authors: Ruihan Wu, Pengrun Huang, Kamalika Chaudhuri

    Abstract: Membership Inference Attacks have emerged as a dominant method for empirically measuring privacy leakage from machine learning models. Here, privacy is measured by the {\em{advantage}} or gap between a score or a function computed on the training and the test data. A major barrier to the practical deployment of these attacks is that they do not scale to large well-generalized models -- either the… ▽ More

    Submitted 23 May, 2024; originally announced May 2024.

    Comments: 9 pages

  46. arXiv:2405.14868  [pdf, other

    cs.CV cs.AI cs.LG cs.RO

    Generative Camera Dolly: Extreme Monocular Dynamic Novel View Synthesis

    Authors: Basile Van Hoorick, Rundi Wu, Ege Ozguroglu, Kyle Sargent, Ruoshi Liu, Pavel Tokmakov, Achal Dave, Changxi Zheng, Carl Vondrick

    Abstract: Accurate reconstruction of complex dynamic scenes from just a single viewpoint continues to be a challenging task in computer vision. Current dynamic novel view synthesis methods typically require videos from many different camera viewpoints, necessitating careful recording setups, and significantly restricting their utility in the wild as well as in terms of embodied AI applications. In this pape… ▽ More

    Submitted 5 July, 2024; v1 submitted 23 May, 2024; originally announced May 2024.

    Comments: Accepted to ECCV 2024. Project webpage is available at: https://gcd.cs.columbia.edu/

  47. arXiv:2405.11826  [pdf, other

    astro-ph.IM hep-ex physics.ins-det

    Data quality control system and long-term performance monitor of the LHAASO-KM2A

    Authors: Zhen Cao, F. Aharonian, Axikegu, Y. X. Bai, Y. W. Bao, D. Bastieri, X. J. Bi, Y. J. Bi, W. Bian, A. V. Bukevich, Q. Cao, W. Y. Cao, Zhe Cao, J. Chang, J. F. Chang, A. M. Chen, E. S. Chen, H. X. Chen, Liang Chen, Lin Chen, Long Chen, M. J. Chen, M. L. Chen, Q. H. Chen, S. Chen , et al. (263 additional authors not shown)

    Abstract: The KM2A is the largest sub-array of the Large High Altitude Air Shower Observatory (LHAASO). It consists of 5216 electromagnetic particle detectors (EDs) and 1188 muon detectors (MDs). The data recorded by the EDs and MDs are used to reconstruct primary information of cosmic ray and gamma-ray showers. This information is used for physical analysis in gamma-ray astronomy and cosmic ray physics. To… ▽ More

    Submitted 13 June, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

    Comments: 15 pages, 9 figures

  48. arXiv:2405.11793  [pdf, other

    cs.CV

    MM-Retinal: Knowledge-Enhanced Foundational Pretraining with Fundus Image-Text Expertise

    Authors: Ruiqi Wu, Chenran Zhang, Jianle Zhang, Yi Zhou, Tao Zhou, Huazhu Fu

    Abstract: Current fundus image analysis models are predominantly built for specific tasks relying on individual datasets. The learning process is usually based on data-driven paradigm without prior knowledge, resulting in poor transferability and generalizability. To address this issue, we propose MM-Retinal, a multi-modal dataset that encompasses high-quality image-text pairs collected from professional fu… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

    Comments: Early Accepted by The International Conference on Medical Image Computing and Computer Assisted Intervention(MICCAI)2024

  49. arXiv:2405.11130  [pdf, other

    cs.RO cs.HC cs.SE

    WIP: A Unit Testing Framework for Self-Guided Personalized Online Robotics Learning

    Authors: Ponkoj Chandra Shill, David Feil-Seifer, Jiullian-Lee Vargas Ruiz, Rui Wu

    Abstract: Our ongoing development and deployment of an online robotics education platform highlighted a gap in providing an interactive, feedback-rich learning environment essential for mastering programming concepts in robotics, which they were not getting with the traditional code-simulate-turn in workflow. Since teaching resources are limited, students would benefit from feedback in real-time to find and… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

    Comments: 5 pages, 4 figures, IEEE FIE 2024

  50. arXiv:2405.09923  [pdf, other

    cs.CV eess.IV

    NTIRE 2024 Restore Any Image Model (RAIM) in the Wild Challenge

    Authors: Jie Liang, Radu Timofte, Qiaosi Yi, Shuaizheng Liu, Lingchen Sun, Rongyuan Wu, Xindong Zhang, Hui Zeng, Lei Zhang

    Abstract: In this paper, we review the NTIRE 2024 challenge on Restore Any Image Model (RAIM) in the Wild. The RAIM challenge constructed a benchmark for image restoration in the wild, including real-world images with/without reference ground truth in various scenarios from real applications. The participants were required to restore the real-captured images from complex and unknown degradation, where gener… ▽ More

    Submitted 16 May, 2024; originally announced May 2024.