Showing 1–50 of 2,069 results for author: Liu, R

  1. arXiv:2408.15892  [pdf, other]

    astro-ph.SR

    Fast Downflows Observed during a Polar Crown Filament Eruption

    Authors: Zheng Sun, Hui Tian, Ting Li, Rui Liu, Yadan Duan

    Abstract: Solar filaments can undergo eruptions and result in the formation of coronal mass ejections (CMEs), which could significantly impact planetary space environments. Observations of eruptions involving polar crown filaments, situated in the polar regions of the Sun, are limited. In this study, we report a polar crown filament eruption (SOL2023-06-12), characterized by fast downflows below the filamen…

    Submitted 28 August, 2024; originally announced August 2024.

  2. arXiv:2408.15051  [pdf, other]

    physics.optics

    Optical Routing via High Efficiency Composite Acoustic Diffraction

    Authors: Yuxiang Zhao, Jiangyong Hu, Ruijuan Liu, Ruochen Gao, Yiming Li, Xiao Zhang, Huanfeng Zhu, Saijun Wu

    Abstract: Acousto-optical modulation (AOM) is a powerful and widely used technique for rapidly controlling the frequency, phase, intensity, and direction of light. Based on Bragg diffraction, AOMs typically exhibit moderate diffraction efficiency, often less than 90% even for collimated inputs. In this work, we demonstrate that this efficiency can be significantly improved using a composite (CP) setup comp…

    Submitted 27 August, 2024; originally announced August 2024.

    Comments: 11 pages, 5 figures

  3. arXiv:2408.14620  [pdf, ps, other]

    stat.ML cs.LG

    General targeted machine learning for modern causal mediation analysis

    Authors: Richard Liu, Nicholas T. Williams, Kara E. Rudolph, Iván Díaz

    Abstract: Causal mediation analyses investigate the mechanisms through which causes exert their effects, and are therefore central to scientific progress. The literature on the non-parametric definition and identification of mediational effects in rigorous causal models has grown significantly in recent years, and there has been important progress to address challenges in the interpretation and identificat…

    Submitted 26 August, 2024; originally announced August 2024.

  4. arXiv:2408.13016  [pdf, other]

    gr-qc

    The Frame-Dragging effect on the excitation rate of atoms

    Authors: Rui-Chen Liu, C. P. Sun

    Abstract: The frame-dragging phenomenon in gravitational fields is revisited to explore the geometric effects induced by spacetime curvature. We quantize a massless scalar field in the spacetime of a rotating sphere, incorporating the frame-dragging frequency into the field modes. The excitation rate for an atom undergoing uniform circular motion and interacting with the scalar field is calculated. Our resu…

    Submitted 23 August, 2024; originally announced August 2024.

  5. arXiv:2408.12758  [pdf, other]

    quant-ph cond-mat.mes-hall

    Month-long-lifetime microwave spectral holes in an erbium-doped scheelite crystal at millikelvin temperature

    Authors: Zhiren Wang, Sen Lin, Marianne Le Dantec, Miloš Rančić, Philippe Goldner, Sylvain Bertaina, Thierry Chanelière, Ren-Bao Liu, Daniel Esteve, Denis Vion, Emmanuel Flurin, Patrice Bertet

    Abstract: Rare-earth-ion (REI) ensembles in crystals have remarkable optical and spin properties characterized by narrow homogeneous linewidths relative to the inhomogeneous ensemble broadening. This makes it possible to precisely tailor the ensemble spectral density and therefore the absorption profile by applying narrow-linewidth radiation to transfer population into auxiliary levels, a process broadly kn…

    Submitted 22 August, 2024; originally announced August 2024.

  6. arXiv:2408.11593  [pdf, other]

    cs.MM cs.CV cs.SD eess.AS

    MCDubber: Multimodal Context-Aware Expressive Video Dubbing

    Authors: Yuan Zhao, Zhenqi Jia, Rui Liu, De Hu, Feilong Bao, Guanglai Gao

    Abstract: Automatic Video Dubbing (AVD) aims to take the given script and generate speech that aligns with lip motion and prosody expressiveness. Current AVD models mainly utilize visual information of the current sentence to enhance the prosody of synthesized speech. However, it is crucial to consider whether the prosody of the generated dubbing aligns with the multimodal context, as the dubbing will be co…

    Submitted 21 August, 2024; originally announced August 2024.

  7. arXiv:2408.10926  [pdf, other]

    astro-ph.IM hep-ex hep-ph

    GRANDlib: A simulation pipeline for the Giant Radio Array for Neutrino Detection (GRAND)

    Authors: GRAND Collaboration, Rafael Alves Batista, Aurélien Benoit-Lévy, Teresa Bister, Martina Bohacova, Mauricio Bustamante, Washington Carvalho, Yiren Chen, LingMei Cheng, Simon Chiche, Jean-Marc Colley, Pablo Correa, Nicoleta Cucu Laurenciu, Zigao Dai, Rogerio M. de Almeida, Beatriz de Errico, Sijbrand de Jong, João R. T. de Mello Neto, Krijn D. de Vries, Valentin Decoene, Peter B. Denton, Bohao Duan, Kaikai Duan, Ralph Engel, William Erba, et al. (90 additional authors not shown)

    Abstract: The operation of upcoming ultra-high-energy cosmic-ray, gamma-ray, and neutrino radio-detection experiments, like the Giant Radio Array for Neutrino Detection (GRAND), poses significant computational challenges involving the production of numerous simulations of particle showers and their detection, and a high data throughput. GRANDlib is an open-source software tool designed to meet these challen…

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 11 pages, 9 figures, plus appendices

  8. arXiv:2408.10575  [pdf, other]

    cs.CV

    MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval

    Authors: Haoran Tang, Meng Cao, Jinfa Huang, Ruyang Liu, Peng Jin, Ge Li, Xiaodan Liang

    Abstract: Text-Video Retrieval (TVR) aims to align and associate relevant video content with corresponding natural language queries. Most existing TVR methods are based on large-scale pre-trained vision-language models (e.g., CLIP). However, due to the inherent plain structure of CLIP, few TVR methods explore the multi-scale representations which offer richer contextual information for a more thorough under…

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 8 pages

  9. arXiv:2408.10162  [pdf, other]

    cs.RO cs.LG

    Physics-Aware Combinatorial Assembly Planning using Deep Reinforcement Learning

    Authors: Ruixuan Liu, Alan Chen, Weiye Zhao, Changliu Liu

    Abstract: Combinatorial assembly uses standardized unit primitives to build objects that satisfy user specifications. Lego is a widely used platform for combinatorial assembly, in which people use unit primitives (i.e., Lego bricks) to build highly customizable 3D objects. This paper studies sequence planning for physical combinatorial assembly using Lego. Given the shape of the desired object, we want to find…

    Submitted 19 August, 2024; originally announced August 2024.

  10. arXiv:2408.10115  [pdf, other]

    cs.CL

    GLIMMER: Incorporating Graph and Lexical Features in Unsupervised Multi-Document Summarization

    Authors: Ran Liu, Ming Liu, Min Yu, Jianguo Jiang, Gang Li, Dan Zhang, Jingyuan Li, Xiang Meng, Weiqing Huang

    Abstract: Pre-trained language models are increasingly being used in multi-document summarization tasks. However, these models need large-scale corpora for pre-training and are domain-dependent. Other non-neural unsupervised summarization approaches mostly rely on key sentence extraction, which can lead to information loss. To address these challenges, we propose a lightweight yet effective unsupervised app…

    Submitted 19 August, 2024; originally announced August 2024.

    Comments: 19 pages, 7 figures. Accepted by ECAI 2024

  11. arXiv:2408.09731  [pdf, other]

    eess.IV cs.CV

    Reconstruct Spine CT from Biplanar X-Rays via Diffusion Learning

    Authors: Zhi Qiao, Xuhui Liu, Xiaopeng Wang, Runkun Liu, Xiantong Zhen, Pei Dong, Zhen Qian

    Abstract: Intraoperative CT imaging serves as a crucial resource for surgical guidance; however, it may not always be readily accessible or practical to implement. In scenarios where CT imaging is not an option, reconstructing CT scans from X-rays can offer a viable alternative. In this paper, we introduce an innovative method for 3D CT reconstruction utilizing biplanar X-rays. Distinct from previous resear…

    Submitted 20 August, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

  12. arXiv:2408.09265  [pdf, other]

    cs.CR cs.LG cs.NI eess.SY

    ByCAN: Reverse Engineering Controller Area Network (CAN) Messages from Bit to Byte Level

    Authors: Xiaojie Lin, Baihe Ma, Xu Wang, Guangsheng Yu, Ying He, Ren Ping Liu, Wei Ni

    Abstract: As the primary standard protocol for modern cars, the Controller Area Network (CAN) is a critical research target for automotive cybersecurity threats and autonomous applications. As the decoding specification of CAN is a proprietary black-box maintained by Original Equipment Manufacturers (OEMs), conducting related research and industry developments can be challenging without a comprehensive unde…

    Submitted 17 August, 2024; originally announced August 2024.

    Comments: Accepted by IEEE Internet of Things Journal, 15 pages, 5 figures, 6 tables

  13. arXiv:2408.08493  [pdf, other]

    cs.LG stat.ML

    Fishers Harvest Parallel Unlearning in Inherited Model Networks

    Authors: Xiao Liu, Mingyuan Li, Xu Wang, Guangsheng Yu, Wei Ni, Lixiang Li, Haipeng Peng, Renping Liu

    Abstract: Unlearning in various learning frameworks remains challenging, with the continuous growth and updates of models exhibiting complex inheritance relationships. This paper presents a novel unlearning framework, which enables fully parallel unlearning among models exhibiting inheritance. A key enabler is the new Unified Model Inheritance Graph (UMIG), which captures the inheritance using a Directed Ac…

    Submitted 20 August, 2024; v1 submitted 15 August, 2024; originally announced August 2024.

  14. arXiv:2408.08188  [pdf, other]

    cs.RO cs.AI cs.LO

    Scaling Up Natural Language Understanding for Multi-Robots Through the Lens of Hierarchy

    Authors: Shaojun Xu, Xusheng Luo, Yutong Huang, Letian Leng, Ruixuan Liu, Changliu Liu

    Abstract: Long-horizon planning is hindered by challenges such as uncertainty accumulation, computational complexity, delayed rewards and incomplete information. This work proposes an approach to exploit the task hierarchy from human instructions to facilitate multi-robot planning. Using Large Language Models (LLMs), we propose a two-step approach to translate multi-sentence instructions into a structured l…

    Submitted 15 August, 2024; originally announced August 2024.

  15. arXiv:2408.08011  [pdf, other]

    quant-ph

    Intensity correlations in measurement-device-independent quantum key distribution

    Authors: Junxuan Liu, Tianyi Xing, Ruiyin Liu, Zihao Chen, Hao Tan, Anqi Huang

    Abstract: The intensity correlations due to imperfect modulation during the quantum-state preparation in a measurement-device-independent quantum key distribution (MDI QKD) system compromise its security performance. Therefore, it is crucial to assess the impact of intensity correlations on the practical security of MDI QKD systems. In this work, we propose a theoretical model that quantitatively analyzes t…

    Submitted 18 August, 2024; v1 submitted 15 August, 2024; originally announced August 2024.

  16. arXiv:2408.07960  [pdf, other]

    quant-ph

    Characterization of Intensity Correlation via Single-photon Detection in Quantum Key Distribution

    Authors: Tianyi Xing, Junxuan Liu, Likang Zhang, Min-Yan Wang, Yu-Huai Li, Ruiyin Liu, Qingquan Peng, Dongyang Wang, Yaxuan Wang, Hongwei Liu, Wei Li, Yuan Cao, Anqi Huang

    Abstract: One of the most significant vulnerabilities in the source unit of quantum key distribution (QKD) is the correlation between quantum states after modulation, which shall be characterized and evaluated for its practical security performance. In this work, we propose a methodology to characterize the intensity correlation according to the single-photon detection results in the measurement unit withou…

    Submitted 18 August, 2024; v1 submitted 15 August, 2024; originally announced August 2024.

  17. arXiv:2408.07852  [pdf, other]

    cs.CL cs.AI cs.LG

    Training Language Models on the Knowledge Graph: Insights on Hallucinations and Their Detectability

    Authors: Jiri Hron, Laura Culp, Gamaleldin Elsayed, Rosanne Liu, Ben Adlam, Maxwell Bileschi, Bernd Bohnet, JD Co-Reyes, Noah Fiedel, C. Daniel Freeman, Izzeddin Gur, Kathleen Kenealy, Jaehoon Lee, Peter J. Liu, Gaurav Mishra, Igor Mordatch, Azade Nova, Roman Novak, Aaron Parisi, Jeffrey Pennington, Alex Rizkowsky, Isabelle Simpson, Hanie Sedghi, Jascha Sohl-dickstein, Kevin Swersky, et al. (6 additional authors not shown)

    Abstract: While many capabilities of language models (LMs) improve with increased training budget, the influence of scale on hallucinations is not yet fully understood. Hallucinations come in many forms, and there is no universally accepted definition. We thus focus on studying only those hallucinations where a correct answer appears verbatim in the training set. To fully control the training data content,…

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: Published at COLM 2024. 16 pages, 11 figures

  18. arXiv:2408.07344  [pdf]

    cs.CV cs.AI

    RTAT: A Robust Two-stage Association Tracker for Multi-Object Tracking

    Authors: Song Guo, Rujie Liu, Narishige Abe

    Abstract: Data association is an essential part in the tracking-by-detection based Multi-Object Tracking (MOT). Most trackers focus on how to design a better data association strategy to improve the tracking performance. The rule-based handcrafted association methods are simple and highly efficient but lack generalization capability to deal with complex scenes. While the learnt association methods can learn…

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: ICPR2024

  19. Flexible 3D Lane Detection by Hierarchical Shape Matching

    Authors: Zhihao Guan, Ruixin Liu, Zejian Yuan, Ao Liu, Kun Tang, Tong Zhou, Erlong Li, Chao Zheng, Shuqi Mei

    Abstract: As one of the basic yet vital technologies for HD map construction, 3D lane detection is still an open problem due to varying visual conditions, complex typologies, and strict demands for precision. In this paper, an end-to-end flexible and hierarchical lane detector is proposed to precisely predict 3D lane lines from point clouds. Specifically, we design a hierarchical network predicting flexib…

    Submitted 13 August, 2024; originally announced August 2024.

  20. arXiv:2408.07147  [pdf, other]

    cs.CV

    Controlling the World by Sleight of Hand

    Authors: Sruthi Sudhakar, Ruoshi Liu, Basile Van Hoorick, Carl Vondrick, Richard Zemel

    Abstract: Humans naturally build mental models of object interactions and dynamics, allowing them to imagine how their surroundings will change if they take a certain action. While generative models today have shown impressive results on generating/editing images unconditionally or conditioned on text, current methods do not provide the ability to perform object manipulation conditioned on actions, an impor…

    Submitted 13 August, 2024; originally announced August 2024.

  21. arXiv:2408.06312  [pdf, other]

    astro-ph.GA astro-ph.HE

    A magnetised Galactic halo from inner Galaxy outflows

    Authors: He-Shou Zhang, Gabriele Ponti, Ettore Carretti, Ruo-Yu Liu, Mark R. Morris, Marijke Haverkorn, Nicola Locatelli, Xueying Zheng, Felix Aharonian, Haiming Zhang, Yi Zhang, Giovanni Stel, Andrew Strong, Micheal Yeung, Andrea Merloni

    Abstract: Large-scale magnetic fields are observed off the midplanes of disk galaxies, indicating that they harbour magnetised halos. These halos are crucial to studies of galaxy evolution, galactic-scale outflows, and feedback from star formation activity. Identifying the magnetised halo of the Milky Way is challenging because of the potential contamination from foreground emission arising in local spiral…

    Submitted 12 August, 2024; originally announced August 2024.

    Comments: Initially submitted on March 2nd, 2024

  22. arXiv:2408.04243  [pdf, other]

    cs.CV cs.MM

    MU-MAE: Multimodal Masked Autoencoders-Based One-Shot Learning

    Authors: Rex Liu, Xin Liu

    Abstract: With the exponential growth of multimedia data, leveraging multimodal sensors presents a promising approach for improving accuracy in human activity recognition. Nevertheless, accurately identifying these activities using both video data and wearable sensor data presents challenges due to the labor-intensive data annotation, and reliance on external pretrained models or additional data. To address…

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: IEEE MIPR 2024

  23. arXiv:2408.03259  [pdf, other]

    quant-ph gr-qc physics.optics

    Single-photon interference over 8.4 km urban atmosphere: towards testing quantum effects in curved spacetime with photons

    Authors: Hui-Nan Wu, Yu-Huai Li, Bo Li, Xiang You, Run-Ze Liu, Ji-Gang Ren, Juan Yin, Chao-Yang Lu, Yuan Cao, Cheng-Zhi Peng, Jian-Wei Pan

    Abstract: The emergence of quantum mechanics and general relativity has transformed our understanding of the natural world significantly. However, integrating these two theories presents immense challenges, and their interplay remains untested. Recent theoretical studies suggest that the single-photon interference covering huge space can effectively probe the interface between quantum mechanics and general…

    Submitted 18 August, 2024; v1 submitted 6 August, 2024; originally announced August 2024.

    Comments: 22 pages, 6 figures

    Journal ref: Phys. Rev. Lett. 133, 020201 (2024)

  24. arXiv:2408.03046  [pdf, other]

    cs.CV

    Comb, Prune, Distill: Towards Unified Pruning for Vision Model Compression

    Authors: Jonas Schmitt, Ruiping Liu, Junwei Zheng, Jiaming Zhang, Rainer Stiefelhagen

    Abstract: Lightweight and effective models are essential for devices with limited resources, such as intelligent vehicles. Structured pruning offers a promising approach to model compression and efficiency enhancement. However, existing methods often tie pruning techniques to specific model architectures or vision tasks. To address this limitation, we propose a novel unified pruning framework Comb, Prune, D…

    Submitted 6 August, 2024; originally announced August 2024.

    Comments: Accepted by ITSC 2024. Code is publicly available at: https://github.com/Cranken/CPD

  25. arXiv:2408.01172  [pdf, other]

    physics.soc-ph

    Cascading failures with group support in interdependent hypergraphs

    Authors: Lei Chen, Chunxiao Jia, Run-Ran Liu, Fanyuan Meng

    Abstract: The functionality of an entity frequently necessitates the support of a group situated in another layer of the system. To unravel the profound impact of such group support on a system's resilience against cascading failures, we devise a framework comprising a double-layer interdependent hypergraph system, wherein nodes are capable of receiving support via hyperedges. Our central hypothesis posits…

    Submitted 2 August, 2024; originally announced August 2024.

    Comments: 18 pages, 5 figures

  26. arXiv:2408.00368  [pdf, other]

    eess.SP

    Illumination Design for Joint Imaging and Wireless Power Transfer Systems

    Authors: Qianyu Yang, Haiyang Zhang, Chunguo Li, Ruiqi Liu, Baoyun Wang

    Abstract: This paper presents a novel concept termed Integrated Imaging and Wireless Power Transfer (IWPT), wherein the integration of imaging and wireless power transfer functionalities is achieved on a unified hardware platform. IWPT leverages a transmitting array to efficiently illuminate a specific Region of Interest (ROI), enabling the extraction of ROI's scattering coefficients while concurrently prov…

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: 10 pages, 5 figures

  27. arXiv:2408.00229  [pdf]

    physics.comp-ph cond-mat.mtrl-sci cs.LG physics.app-ph

    Invariant Discovery of Features Across Multiple Length Scales: Applications in Microscopy and Autonomous Materials Characterization

    Authors: Aditya Raghavan, Utkarsh Pratiush, Mani Valleti, Richard Liu, Reece Emery, Hiroshi Funakubo, Yongtao Liu, Philip Rack, Sergei Kalinin

    Abstract: Physical imaging is a foundational characterization method in areas from condensed matter physics and chemistry to astronomy and spans length scales from atomic to universe. Images encapsulate crucial data regarding atomic bonding, materials microstructures, and dynamic phenomena such as microstructural evolution and turbulence, among other phenomena. The challenge lies in effectively extracting a…

    Submitted 31 July, 2024; originally announced August 2024.

  28. arXiv:2407.21491  [pdf]

    cs.CL cs.SD eess.AS

    Generative Expressive Conversational Speech Synthesis

    Authors: Rui Liu, Yifan Hu, Yi Ren, Xiang Yin, Haizhou Li

    Abstract: Conversational Speech Synthesis (CSS) aims to express a target utterance with the proper speaking style in a user-agent conversation setting. Existing CSS methods employ effective multi-modal context modeling techniques to achieve empathy understanding and expression. However, they often need to design complex network architectures and meticulously optimize the modules within them. In addition, du…

    Submitted 31 July, 2024; v1 submitted 31 July, 2024; originally announced July 2024.

    Comments: 14 pages, 6 figures, 8 tables. Accepted by ACM MM 2024

  29. arXiv:2407.21336  [pdf, ps, other]

    math.AP

    Regularization by noise for the inviscid primitive equations

    Authors: Ruimeng Hu, Quyuan Lin, Rongchang Liu

    Abstract: The deterministic inviscid primitive equations (also called the hydrostatic Euler equations) are known to be ill-posed in Sobolev spaces and in Gevrey classes of order strictly greater than 1, and some of their analytic solutions exist only locally in time and exhibit finite-time blowup. This work demonstrates that introducing suitable random noise can restore the local well-posedness and prevent…

    Submitted 31 July, 2024; originally announced July 2024.

    Comments: 20 pages

  30. arXiv:2407.18424  [pdf, other]

    cs.SD cs.LG eess.AS

    Model-driven Heart Rate Estimation and Heart Murmur Detection based on Phonocardiogram

    Authors: Jingping Nie, Ran Liu, Behrooz Mahasseni, Erdrin Azemi, Vikramjit Mitra

    Abstract: Acoustic signals are crucial for health monitoring, particularly heart sounds which provide essential data like heart rate and detect cardiac anomalies such as murmurs. This study utilizes a publicly available phonocardiogram (PCG) dataset to estimate heart rate using model-driven methods and extends the best-performing model to a multi-task learning (MTL) framework for simultaneous heart rate est…

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: 6 pages, 10 figures

  31. arXiv:2407.17816  [pdf, other]

    cs.LG cs.AI

    NC-NCD: Novel Class Discovery for Node Classification

    Authors: Yue Hou, Xueyuan Chen, He Zhu, Romei Liu, Bowen Shi, Jiaheng Liu, Junran Wu, Ke Xu

    Abstract: Novel Class Discovery (NCD) involves identifying new categories within unlabeled data by utilizing knowledge acquired from previously established categories. However, existing NCD methods often struggle to maintain a balance between the performance of old and new categories. Discovering unlabeled new categories in a class-incremental way is more practical but also more challenging, as it is freque…

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: Accepted by CIKM'24

  32. arXiv:2407.17702  [pdf, other]

    cond-mat.quant-gas

    Universal clusters in quasi-two-dimensional ultracold Fermi mixtures

    Authors: Ruijin Liu, Tingting Shi, Matteo Zaccanti, Xiaoling Cui

    Abstract: We study universal clusters in quasi-two dimensions (q2D) that consist of a light (L) atom interacting with two or three heavy (H) identical fermions, forming the trimer or tetramer bound state. The axial confinement in q2D is shown to lift the three-fold degeneracy of 3D trimer (tetramer) in $p$-wave channel and uniquely select the ground state with magnetic angular momentum $|m|=1$ ($m=0$). By v…

    Submitted 3 August, 2024; v1 submitted 24 July, 2024; originally announced July 2024.

    Comments: 6 pages, 4 figures, with supplementary material (8 pages, 4 figures)

  33. arXiv:2407.17379  [pdf, other]

    cs.CV cs.CL

    MMRA: A Benchmark for Evaluating Multi-Granularity and Multi-Image Relational Association Capabilities in Large Visual Language Models

    Authors: Siwei Wu, Kang Zhu, Yu Bai, Yiming Liang, Yizhi Li, Haoning Wu, J. H. Liu, Ruibo Liu, Xingwei Qu, Xuxin Cheng, Ge Zhang, Wenhao Huang, Chenghua Lin

    Abstract: Given the remarkable success that large visual language models (LVLMs) have achieved in image perception tasks, the endeavor to make LVLMs perceive the world like humans is drawing increasing attention. Current multi-modal benchmarks primarily focus on facts or specific topic-related knowledge contained within individual images. However, they often overlook the associative relations between multip…

    Submitted 5 August, 2024; v1 submitted 24 July, 2024; originally announced July 2024.

    Comments: VLMs, Multi-Image Association

  34. arXiv:2407.16310  [pdf, ps, other]

    cs.IT math.CO

    Some $3$-designs invariant under $2.PΣL(2,49).$

    Authors: Minjia Shi, Ruowen Liu, Patrick Solé

    Abstract: We construct a ternary [49,25,7] code from the row span of a Jacobsthal matrix. It is equivalent to a Generalized Quadratic Residue (GQR) code in the sense of van Lint and MacWilliams (1978). These codes are the abelian generalizations of the quadratic residue (QR) codes which are cyclic. The union of the [50,25,8] extension of the said code and its dual supports a 3-(50,14,1248) design. The autom…

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: 9 pages

    MSC Class: 94 B15; 05 B05

  35. arXiv:2407.15148  [pdf, ps, other]

    astro-ph.SR physics.space-ph

    Nonparametric Statistics on Magnetic Properties at the Footpoints of Erupting Magnetic Flux Ropes

    Authors: Rui Liu, Wensi Wang

    Abstract: It is under debate whether the magnetic field in the solar atmosphere carries neutralized electric currents; particularly, whether a magnetic flux rope (MFR), which is considered the core structure of coronal mass ejections, carries neutralized electric currents. Recently Wang et al. (2023, ApJ, 943, 80) studied magnetic flux and electric current measured at the footpoints of 28 eruptive MFRs from…

    Submitted 21 July, 2024; originally announced July 2024.

    Comments: Accepted for publication in ApJ

  36. arXiv:2407.15087  [pdf, other]

    cs.CV

    Navigation Instruction Generation with BEV Perception and Large Language Models

    Authors: Sheng Fan, Rui Liu, Wenguan Wang, Yi Yang

    Abstract: Navigation instruction generation, which requires embodied agents to describe the navigation routes, has been of great interest in robotics and human-computer interaction. Existing studies directly map the sequence of 2D perspective observations to route descriptions. Though straightforward, they overlook the geometric information and object semantics of the 3D environment. To address these challe…

    Submitted 21 July, 2024; originally announced July 2024.

    Comments: ECCV 2024; Project Page: https://github.com/FanScy/BEVInstructor

  37. arXiv:2407.13725  [pdf, other]

    cs.CR

    Scalable Optimization for Locally Relevant Geo-Location Privacy

    Authors: Chenxi Qiu, Ruiyao Liu, Primal Pappachan, Anna Squicciarini, Xinpeng Xie

    Abstract: Geo-obfuscation functions as a location privacy protection mechanism (LPPM), enabling mobile users to share obfuscated locations with servers instead of their exact locations. This technique protects users' location privacy during server-side data breaches since the obfuscation process is irreversible. To minimize the utility loss caused by data obfuscation, linear programming (LP) is widely used.…

    Submitted 29 August, 2024; v1 submitted 18 July, 2024; originally announced July 2024.

  38. arXiv:2407.13691  [pdf, other]

    eess.SP

    Unsupervised and Interpretable Synthesizing for Electrical Time Series Based on Information Maximizing Generative Adversarial Nets

    Authors: Zhenghao Zhou, Yiyan Li, Runlong Liu, Zheng Yan, Mo-Yuen Chow

    Abstract: Generating synthetic data has become a popular alternative solution to deal with the difficulties in accessing and sharing field measurement data in power systems. However, to make the generation results controllable, existing methods (e.g., Conditional Generative Adversarial Nets, cGAN) require a labeled dataset to train the model, which is demanding in practice because many field measurement data l…

    Submitted 18 July, 2024; originally announced July 2024.

  39. arXiv:2407.13545  [pdf, other]

    eess.IV cs.CV

    DiffuX2CT: Diffusion Learning to Reconstruct CT Images from Biplanar X-Rays

    Authors: Xuhui Liu, Zhi Qiao, Runkun Liu, Hong Li, Juan Zhang, Xiantong Zhen, Zhen Qian, Baochang Zhang

    Abstract: Computed tomography (CT) is widely utilized in clinical settings because it delivers detailed 3D images of the human body. However, performing CT scans is not always feasible due to radiation exposure and limitations in certain surgical environments. As an alternative, reconstructing CT images from ultra-sparse X-rays offers a valuable solution and has gained significant interest in scientific res…

    Submitted 18 July, 2024; originally announced July 2024.

  40. arXiv:2407.12038  [pdf, ps, other]

    eess.AS cs.AI

    ICAGC 2024: Inspirational and Convincing Audio Generation Challenge 2024

    Authors: Ruibo Fu, Rui Liu, Chunyu Qiang, Yingming Gao, Yi Lu, Shuchen Shi, Tao Wang, Ya Li, Zhengqi Wen, Chen Zhang, Hui Bu, Yukun Liu, Xin Qi, Guanjun Li

    Abstract: The Inspirational and Convincing Audio Generation Challenge 2024 (ICAGC 2024) is part of the ISCSLP 2024 Competitions and Challenges track. While current text-to-speech (TTS) technology can generate high-quality audio, its ability to convey complex emotions and controlled detail content remains limited. This constraint leads to a discrepancy between the generated audio and human subjective percept…

    Submitted 31 July, 2024; v1 submitted 1 July, 2024; originally announced July 2024.

    Comments: ISCSLP 2024 Challenge description and results

  41. arXiv:2407.11084  [pdf, other]

    eess.IV cs.CV

    A Survey of Distance-Based Vessel Trajectory Clustering: Data Pre-processing, Methodologies, Applications, and Experimental Evaluation

    Authors: Maohan Liang, Ryan Wen Liu, Ruobin Gao, Zhe Xiao, Xiaocai Zhang, Hua Wang

    Abstract: Vessel trajectory clustering, a crucial component of the maritime intelligent transportation systems, provides valuable insights for applications such as anomaly detection and trajectory prediction. This paper presents a comprehensive survey of the most prevalent distance-based vessel trajectory clustering methods, which encompass two main steps: trajectory similarity measurement and clustering. I…

    Submitted 19 July, 2024; v1 submitted 13 July, 2024; originally announced July 2024.

  42. arXiv:2407.10441  [pdf]

    cs.AI cs.LG

    Enhancing Building Safety Design for Active Shooter Incidents: Exploration of Building Exit Parameters using Reinforcement Learning-Based Simulations

    Authors: Ruying Liu, Wanjing Wu, Burcin Becerik-Gerber, Gale M. Lucas

    Abstract: With the alarming rise in active shooter incidents (ASIs) in the United States, enhancing public safety through building design has become a pressing need. This study proposes a reinforcement learning-based simulation approach addressing gaps in existing research that has neglected the dynamic behaviours of shooters. We developed an autonomous agent to simulate an active shooter within a realistic…

    Submitted 15 July, 2024; originally announced July 2024.

    Journal ref: 31st EG-ICE International Workshop on Intelligent Computing in Engineering 2024

  43. arXiv:2407.07720  [pdf, other]

    eess.IV cs.CV

    Exploiting Scale-Variant Attention for Segmenting Small Medical Objects

    Authors: Wei Dai, Rui Liu, Zixuan Wu, Tianyi Wu, Min Wang, Junxian Zhou, Yixuan Yuan, Jun Liu

    Abstract: Early detection and accurate diagnosis can predict the risk of malignant disease transformation, thereby increasing the probability of effective treatment. Identifying mild syndrome with small pathological regions serves as an ominous warning and is fundamental in the early diagnosis of diseases. While deep learning algorithms, particularly convolutional neural networks (CNNs), have shown promise…

    Submitted 5 August, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: 14 pages, 9 figures, under review

  44. Distributed multi-robot potential-field-based exploration with submap-based mapping and noise-augmented strategy

    Authors: Khattiya Pongsirijinda, Zhiqiang Cao, Kaushik Bhowmik, Muhammad Shalihan, Billy Pik Lik Lau, Ran Liu, Chau Yuen, U-Xuan Tan

    Abstract: Multi-robot collaboration has become a needed component in unknown environment exploration due to its ability to accomplish various challenging situations. Potential-field-based methods are widely used for autonomous exploration because of their high efficiency and low travel cost. However, exploration speed and collaboration ability are still challenging topics. Therefore, we propose a Distribute…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: This paper has been accepted by Robotics and Autonomous Systems

  45. arXiv:2407.07402  [pdf, other]

    cs.CV

    ActionVOS: Actions as Prompts for Video Object Segmentation

    Authors: Liangyang Ouyang, Ruicong Liu, Yifei Huang, Ryosuke Furuta, Yoichi Sato

    Abstract: Delving into the realm of egocentric vision, the advancement of referring video object segmentation (RVOS) stands as pivotal in understanding human activities. However, existing RVOS task primarily relies on static attributes such as object names to segment target objects, posing challenges in distinguishing target objects from background objects and in identifying objects undergoing state changes…

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: This paper is accepted by ECCV2024. Code will be released at https://github.com/ut-vision/ActionVOS

  46. arXiv:2407.07152  [pdf, other]

    astro-ph.CO astro-ph.GA

    Evidence for large baryonic feedback at low and intermediate redshifts from kinematic Sunyaev-Zel'dovich observations with ACT and DESI photometric galaxies

    Authors: B. Hadzhiyska, S. Ferraro, B. Ried Guachalla, E. Schaan, J. Aguilar, N. Battaglia, J. R. Bond, D. Brooks, E. Calabrese, S. K. Choi, T. Claybaugh, W. R. Coulton, K. Dawson, M. Devlin, B. Dey, P. Doel, A. J. Duivenvoorden, J. Dunkley, G. S. Farren, A. Font-Ribera, J. E. Forero-Romero, P. A. Gallardo, E. Gaztañaga, S. Gontcho Gontcho, M. Gralla, et al. (48 additional authors not shown)

    Abstract: Recent advances in cosmological observations have provided an unprecedented opportunity to investigate the distribution of baryons relative to the underlying matter. In this work, we robustly show that the gas is much more extended than the dark matter at 40$σ$ and the amount of baryonic feedback at $z \lesssim 1$ strongly disfavors low-feedback models such as that of state-of-the-art hydrodynamic…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: 20 pages, 8 figures, submitting to PRL

  47. arXiv:2407.06628  [pdf, other]

    cs.CV

    Masked Video and Body-worn IMU Autoencoder for Egocentric Action Recognition

    Authors: Mingfang Zhang, Yifei Huang, Ruicong Liu, Yoichi Sato

    Abstract: Compared with visual signals, Inertial Measurement Units (IMUs) placed on human limbs can capture accurate motion signals while being robust to lighting variation and occlusion. While these characteristics are intuitively valuable to help egocentric action recognition, the potential of IMUs remains under-explored. In this work, we present a novel method for action recognition that integrates motio…

    Submitted 9 July, 2024; originally announced July 2024.

    Comments: ECCV 2024

  48. arXiv:2407.06567  [pdf, other]

    cs.CL

    FinCon: A Synthesized LLM Multi-Agent System with Conceptual Verbal Reinforcement for Enhanced Financial Decision Making

    Authors: Yangyang Yu, Zhiyuan Yao, Haohang Li, Zhiyang Deng, Yupeng Cao, Zhi Chen, Jordan W. Suchow, Rong Liu, Zhenyu Cui, Denghui Zhang, Koduvayur Subbalakshmi, Guojun Xiong, Yueru He, Jimin Huang, Dong Li, Qianqian Xie

    Abstract: Large language models (LLMs) have demonstrated notable potential in conducting complex tasks and are increasingly utilized in various financial applications. However, high-quality sequential financial investment decision-making remains challenging. These tasks require multiple interactions with a volatile environment for every decision, demanding sufficient intelligence to maximize returns and man…

    Submitted 10 July, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

    Comments: LLM Applications, LLM Agents, Financial Technology, Quantitative Finance, Algorithmic Trading, Cognitive Science

  49. arXiv:2407.06087  [pdf, other]

    cs.LG cs.CV

    Analytic Convolutional Layer: A Step to Analytic Neural Network

    Authors: Jingmao Cui, Donglai Tao, Linmi Tao, Ruiyang Liu, Yu Cheng

    Abstract: The prevailing approach to embedding prior knowledge within convolutional layers typically includes the design of steerable kernels or their modulation using designated kernel banks. In this study, we introduce the Analytic Convolutional Layer (ACL), an innovative model-driven convolutional layer, which is a mosaic of analytical convolution kernels (ACKs) and traditional convolution kernels. ACKs…

    Submitted 3 July, 2024; originally announced July 2024.

  50. arXiv:2407.05858  [pdf, other]

    cs.AI

    Empowering 1000 tokens/second on-device LLM prefilling with mllm-NPU

    Authors: Daliang Xu, Hao Zhang, Liming Yang, Ruiqi Liu, Gang Huang, Mengwei Xu, Xuanzhe Liu

    Abstract: On-device large language models (LLMs) are catalyzing novel mobile applications such as UI task automation and personalized email auto-reply, without giving away users' private data. However, on-device LLMs still suffer from unacceptably long inference latency, especially the time to first token (prefill stage) due to the need of long context for accurate, personalized content generation, as well…

    Submitted 8 July, 2024; originally announced July 2024.