Zum Hauptinhalt springen

Showing 1–50 of 714 results for author: He, M

.
  1. arXiv:2408.15217  [pdf, other

    eess.IV cs.AI cs.CV

    Fundus2Video: Cross-Modal Angiography Video Generation from Static Fundus Photography with Clinical Knowledge Guidance

    Authors: Weiyi Zhang, Siyu Huang, Jiancheng Yang, Ruoyu Chen, Zongyuan Ge, Yingfeng Zheng, Danli Shi, Mingguang He

    Abstract: Fundus Fluorescein Angiography (FFA) is a critical tool for assessing retinal vascular dynamics and aiding in the diagnosis of eye diseases. However, its invasive nature and less accessibility compared to Color Fundus (CF) images pose significant challenges. Current CF to FFA translation methods are limited to static generation. In this work, we pioneer dynamic FFA video generation from static CF… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

    Comments: The paper has been accepted by Medical Image Computing and Computer Assisted Intervention Society (MICCAI) 2024

  2. arXiv:2408.14955  [pdf, other

    nucl-th hep-ex hep-ph

    De-excitations of highly excited $^{11}$B$^*$ and $^{15}$N$^*$ based on the GEMINI++ code

    Authors: Yujie Niu, Wan-Lei Guo, Miao He, Jun Su

    Abstract: Nuclear de-excitations associated with neutrino-nucleus interactions and nucleon decays are playing an increasingly significant role in neutrino experiments. We explore the GEMINI++ code and estimate its ability to account for the de-excitation processes of highly excited $^{11}$B$^*$ and $^{15}$N$^*$, which can be created in the liquid scintillator and water Cherenkov detectors respectively. It i… ▽ More

    Submitted 27 August, 2024; originally announced August 2024.

    Comments: 7 pages, 4 figures, 2 tables

  3. arXiv:2408.13401  [pdf, ps, other

    math.GT math.DS

    Relative train tracks and endperiodic graph maps

    Authors: Yan Mary He, Chenxi Wu

    Abstract: We study endperiodic maps of an infinite graph with finitely many ends. We prove that any such map is homotopic to an endperiodic relative train track map. Moreover, we show that the (largest) Perron-Frobenius eigenvalue of the transition matrix is a canonical quantity associated to the map.

    Submitted 23 August, 2024; originally announced August 2024.

  4. arXiv:2408.12910  [pdf, other

    cs.AI

    What Do You Want? User-centric Prompt Generation for Text-to-image Synthesis via Multi-turn Guidance

    Authors: Yilun Liu, Minggui He, Feiyu Yao, Yuhe Ji, Shimin Tao, Jingzhou Du, Duan Li, Jian Gao, Li Zhang, Hao Yang, Boxing Chen, Osamu Yoshie

    Abstract: The emergence of text-to-image synthesis (TIS) models has significantly influenced digital image creation by producing high-quality visuals from written descriptions. Yet these models heavily rely on the quality and specificity of textual prompts, posing a challenge for novice users who may not be familiar with TIS-model-preferred prompt writing. Existing solutions relieve this via automatic model… ▽ More

    Submitted 23 August, 2024; originally announced August 2024.

  5. arXiv:2408.12725  [pdf, other

    physics.ins-det hep-ex

    DUNE Phase II: Scientific Opportunities, Detector Concepts, Technological Solutions

    Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, T. Alves, H. Amar, P. Amedo, J. Anderson, C. Andreopoulos, M. Andreotti , et al. (1347 additional authors not shown)

    Abstract: The international collaboration designing and constructing the Deep Underground Neutrino Experiment (DUNE) at the Long-Baseline Neutrino Facility (LBNF) has developed a two-phase strategy toward the implementation of this leading-edge, large-scale science project. The 2023 report of the US Particle Physics Project Prioritization Panel (P5) reaffirmed this vision and strongly endorsed DUNE Phase I… ▽ More

    Submitted 22 August, 2024; originally announced August 2024.

    Report number: FERMILAB-TM-2833-LBNF

  6. arXiv:2408.11787  [pdf, other

    eess.IV cs.CV

    NuSegDG: Integration of Heterogeneous Space and Gaussian Kernel for Domain-Generalized Nuclei Segmentation

    Authors: Zhenye Lou, Qing Xu, Zekun Jiang, Xiangjian He, Zhen Chen, Yi Wang, Chenxin Li, Maggie M. He, Wenting Duan

    Abstract: Domain-generalized nuclei segmentation refers to the generalizability of models to unseen domains based on knowledge learned from source domains and is challenged by various image conditions, cell types, and stain strategies. Recently, the Segment Anything Model (SAM) has made great success in universal image segmentation by interactive prompt modes (e.g., point and box). Despite its strengths, th… ▽ More

    Submitted 24 August, 2024; v1 submitted 21 August, 2024; originally announced August 2024.

    Comments: Under Reivew

  7. arXiv:2408.10636  [pdf

    eess.IV cs.CV

    UWF-RI2FA: Generating Multi-frame Ultrawide-field Fluorescein Angiography from Ultrawide-field Retinal Imaging Improves Diabetic Retinopathy Stratification

    Authors: Ruoyu Chen, Kezheng Xu, Kangyan Zheng, Weiyi Zhang, Yan Lu, Danli Shi, Mingguang He

    Abstract: Ultrawide-field fluorescein angiography (UWF-FA) facilitates diabetic retinopathy (DR) detection by providing a clear visualization of peripheral retinal lesions. However, the intravenous dye injection with potential risks hamper its application. We aim to acquire dye-free UWF-FA images from noninvasive UWF retinal imaging (UWF-RI) using generative artificial intelligence (GenAI) and evaluate its… ▽ More

    Submitted 27 August, 2024; v1 submitted 20 August, 2024; originally announced August 2024.

    Comments: 22 pages, 2 figures

  8. arXiv:2408.09671  [pdf, other

    cs.IR

    GANPrompt: Enhancing Robustness in LLM-Based Recommendations with GAN-Enhanced Diversity Prompts

    Authors: Xinyu Li, Chuang Zhao, Hongke Zhao, Likang Wu, Ming HE

    Abstract: In recent years, LLM has demonstrated remarkable proficiency in comprehending and generating natural language, with a growing prevalence in the domain of recommender systems. However, LLM continues to face a significant challenge in that it is highly susceptible to the influence of prompt words. This inconsistency in response to minor alterations in prompt input may compromise the accuracy and res… ▽ More

    Submitted 18 August, 2024; originally announced August 2024.

  9. arXiv:2408.07301  [pdf

    physics.optics physics.class-ph

    Imaginary Poynting momentum driven particle rotation by cylindrically polarized Gaussian beams

    Authors: Xue Yun, Yansheng Liang, Linquan Guo, Minru He, Tianyu Zhao, Shaowei Wang, Ming Lei

    Abstract: Imaginary Poynting momentum (IPM) provides a new degree of freedom for particle manipulation. However, the application of IPM in experiments has been largely unexplored. Here, we demonstrate the IPM driven particle rotation by cylindrically polarized Gaussian beams with no spin or orbital angular momentum. Theoretical analysis and experimental measurements demonstrate that gold microparticles will… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

    Comments: 10 pages, 6 figures

    MSC Class: 78A10 Physical optics

  10. arXiv:2408.01599  [pdf

    cond-mat.mes-hall cond-mat.str-el

    Strongly interacting Hofstadter states in magic-angle twisted bilayer graphene

    Authors: Minhao He, Xiaoyu Wang, Jiaqi Cai, Jonah Herzog-Arbeitman, Takashi Taniguchi, Kenji Watanabe, Ady Stern, B. Andrei Bernevig, Matthew Yankowitz, Oskar Vafek, Xiaodong Xu

    Abstract: Magic-angle twisted bilayer graphene (MATBG) hosts a multitude of strongly correlated states at partial fillings of its flat bands. In a magnetic field, these flat bands further evolve into a unique Hofstadter spectrum renormalized by strong Coulomb interactions. Here, we study the interacting Hofstadter states spontaneously formed within the topological magnetic subbands of an ultraclean MATBG de… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

  11. arXiv:2408.00582  [pdf, other

    hep-ex physics.ins-det

    First Measurement of the Total Inelastic Cross-Section of Positively-Charged Kaons on Argon at Energies Between 5.0 and 7.5 GeV

    Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, T. Alves, H. Amar, P. Amedo, J. Anderson, C. Andreopoulos, M. Andreotti , et al. (1341 additional authors not shown)

    Abstract: ProtoDUNE Single-Phase (ProtoDUNE-SP) is a 770-ton liquid argon time projection chamber that operated in a hadron test beam at the CERN Neutrino Platform in 2018. We present a measurement of the total inelastic cross section of charged kaons on argon as a function of kaon energy using 6 and 7 GeV/$c$ beam momentum settings. The flux-weighted average of the extracted inelastic cross section at each… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

    Report number: CERN-EP-2024-211, FERMILAB-PUB-24-0216-V

  12. arXiv:2407.21333  [pdf, other

    cs.CV

    Chat2Layout: Interactive 3D Furniture Layout with a Multimodal LLM

    Authors: Can Wang, Hongliang Zhong, Menglei Chai, Mingming He, Dongdong Chen, Jing Liao

    Abstract: Automatic furniture layout is long desired for convenient interior design. Leveraging the remarkable visual reasoning capabilities of multimodal large language models (MLLMs), recent methods address layout generation in a static manner, lacking the feedback-driven refinement essential for interactive user engagement. We introduce Chat2Layout, a novel interactive furniture layout generation system… ▽ More

    Submitted 31 July, 2024; originally announced July 2024.

    Comments: Main paper with supplemental materials

  13. arXiv:2407.18460  [pdf, other

    cond-mat.mtrl-sci cond-mat.str-el

    Large Nernst Effect in a layered metallic antiferromagnet EuAl$_2$Si$_2$

    Authors: Kunya Yang, Wei Xia, Xinrun Mi, Yiyue zhang, Long zhang, Aifeng Wang, Yisheng Chai, Xiaoyuan Zhou, Yanfeng Guo, Mingquan He

    Abstract: The large Nernst effect is advantageous for developing transverse Nernst thermoelectric generators or Ettingshausen coolers within a single component, avoiding the complexity of electron- and hole-modules in longitudinal Seebeck thermoelectric devices. We report a large Nernst signal reaching 130 uV/K at 8 K and 13 T in the layered metallic antiferromagnet EuAl$_2$Si$_2$. Notably, this large trans… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: 13 pages, 3 figures

  14. arXiv:2407.18441  [pdf, ps, other

    math.DS math.GT

    Pressure metrics in geometry and dynamics

    Authors: Yan Mary He, Homin Lee, Insung Park

    Abstract: In this article, we first provide a survey of pressure metrics on various deformation spaces in geometry, topology, and dynamics. Then we discuss pressure metrics and their degeneracy loci on the space of quasi-Blaschke products

    Submitted 29 July, 2024; v1 submitted 25 July, 2024; originally announced July 2024.

    Comments: 19 pages

    MSC Class: 37F10; 37F30; 32G15

  15. arXiv:2407.18043  [pdf, other

    cs.RO cs.CV

    YOCO: You Only Calibrate Once for Accurate Extrinsic Parameter in LiDAR-Camera Systems

    Authors: Tianle Zeng, Dengke He, Feifan Yan, Meixi He

    Abstract: In a multi-sensor fusion system composed of cameras and LiDAR, precise extrinsic calibration contributes to the system's long-term stability and accurate perception of the environment. However, methods based on extracting and registering corresponding points still face challenges in terms of automation and precision. This paper proposes a novel fully automatic extrinsic calibration method for LiDA… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Comments: IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT

    Journal ref: IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT2024

  16. arXiv:2407.17267  [pdf, other

    cs.CV

    M4: Multi-Proxy Multi-Gate Mixture of Experts Network for Multiple Instance Learning in Histopathology Image Analysis

    Authors: Junyu Li, Ye Zhang, Wen Shu, Xiaobing Feng, Yingchun Wang, Pengju Yan, Xiaolin Li, Chulin Sha, Min He

    Abstract: Multiple instance learning (MIL) has been successfully applied for whole slide images (WSIs) analysis in computational pathology, enabling a wide range of prediction tasks from tumor subtyping to inferring genetic mutations and multi-omics biomarkers. However, existing MIL methods predominantly focus on single-task learning, resulting in not only overall low efficiency but also the overlook of int… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

    Comments: 25pages,5figures

  17. arXiv:2407.15926  [pdf, other

    hep-ph astro-ph.CO

    Thermalization and hotspot formation around small primordial black holes

    Authors: Minxi He, Kazunori Kohri, Kyohei Mukaida, Masaki Yamada

    Abstract: We quantitatively analyze a basic question: what is the stationary solution of the background plasma temperature profile around a black hole (BH)? One may naively expect that the temperature profile continuously decreases from the Hawking temperature at the surface of the BH towards an outer region. We show analytically and numerically that this is not the case because local thermal equilibrium ca… ▽ More

    Submitted 22 July, 2024; originally announced July 2024.

    Comments: 24 pages, 9 figures

    Report number: KEK-TH-2639, TU-1238, CTPU-PTC-24-22, KEK-Cosmo-0351, KEK-QUP-2024-0018

  18. PolyFormer: Scalable Node-wise Filters via Polynomial Graph Transformer

    Authors: Jiahong Ma, Mingguo He, Zhewei Wei

    Abstract: Spectral Graph Neural Networks have demonstrated superior performance in graph representation learning. However, many current methods focus on employing shared polynomial coefficients for all nodes, i.e., learning node-unified filters, which limits the filters' flexibility for node-level tasks. The recent DSF attempts to overcome this limitation by learning node-wise coefficients based on position… ▽ More

    Submitted 19 July, 2024; originally announced July 2024.

    Comments: ACM SIGKDD 2024

  19. arXiv:2407.14153  [pdf, other

    eess.IV cs.CV

    ESP-MedSAM: Efficient Self-Prompting SAM for Universal Domain-Generalized Medical Image Segmentation

    Authors: Qing Xu, Jiaxuan Li, Xiangjian He, Ziyu Liu, Zhen Chen, Wenting Duan, Chenxin Li, Maggie M. He, Fiseha B. Tesema, Wooi P. Cheah, Yi Wang, Rong Qu, Jonathan M. Garibaldi

    Abstract: The universality of deep neural networks across different modalities and their generalization capabilities to unseen domains play an essential role in medical image segmentation. The recent Segment Anything Model (SAM) has demonstrated its potential in both settings. However, the huge computational costs, demand for manual annotations as prompts and conflict-prone decoding process of SAM degrade i… ▽ More

    Submitted 17 August, 2024; v1 submitted 19 July, 2024; originally announced July 2024.

    Comments: Under Review

  20. arXiv:2407.10339  [pdf, other

    hep-ex astro-ph.HE astro-ph.IM astro-ph.SR nucl-ex physics.ins-det

    Supernova Pointing Capabilities of DUNE

    Authors: DUNE Collaboration, A. Abed Abud, B. Abi, R. Acciarri, M. A. Acero, M. R. Adames, G. Adamov, M. Adamowski, D. Adams, M. Adinolfi, C. Adriano, A. Aduszkiewicz, J. Aguilar, B. Aimard, F. Akbar, K. Allison, S. Alonso Monsalve, M. Alrashed, A. Alton, R. Alvarez, T. Alves, H. Amar, P. Amedo, J. Anderson, D. A. Andrade , et al. (1340 additional authors not shown)

    Abstract: The determination of the direction of a stellar core collapse via its neutrino emission is crucial for the identification of the progenitor for a multimessenger follow-up. A highly effective method of reconstructing supernova directions within the Deep Underground Neutrino Experiment (DUNE) is introduced. The supernova neutrino pointing resolution is studied by simulating and reconstructing electr… ▽ More

    Submitted 14 July, 2024; originally announced July 2024.

    Comments: 25 pages, 16 figures

    Report number: FERMILAB-PUB-24-0319-LBNF

  21. arXiv:2407.08150  [pdf, other

    cs.CV

    Hypergraph Multi-modal Large Language Model: Exploiting EEG and Eye-tracking Modalities to Evaluate Heterogeneous Responses for Video Understanding

    Authors: Minghui Wu, Chenxu Zhao, Anyang Su, Donglin Di, Tianyu Fu, Da An, Min He, Ya Gao, Meng Ma, Kun Yan, Ping Wang

    Abstract: Understanding of video creativity and content often varies among individuals, with differences in focal points and cognitive levels across different ages, experiences, and genders. There is currently a lack of research in this area, and most existing benchmarks suffer from several drawbacks: 1) a limited number of modalities and answers with restrictive length; 2) the content and scenarios within… ▽ More

    Submitted 16 July, 2024; v1 submitted 10 July, 2024; originally announced July 2024.

    Comments: Accepted by ACM MULTIMEDIA 2024

  22. arXiv:2407.07053  [pdf, other

    cs.CV

    Multimodal Self-Instruct: Synthetic Abstract Image and Visual Reasoning Instruction Using Language Model

    Authors: Wenqi Zhang, Zhenglin Cheng, Yuanyu He, Mengna Wang, Yongliang Shen, Zeqi Tan, Guiyang Hou, Mingqian He, Yanna Ma, Weiming Lu, Yueting Zhuang

    Abstract: Although most current large multimodal models (LMMs) can already understand photos of natural scenes and portraits, their understanding of abstract images, e.g., charts, maps, or layouts, and visual reasoning capabilities remains quite rudimentary. They often struggle with simple daily tasks, such as reading time from a clock, understanding a flowchart, or planning a route using a road map. In lig… ▽ More

    Submitted 8 August, 2024; v1 submitted 9 July, 2024; originally announced July 2024.

    Comments: code: https://github.com/zwq2018/Multi-modal-Self-instruct dataset: https://huggingface.co/datasets/zwq2018/Multi-modal-Self-instruct Leaderboard: https://multi-modal-self-instruct.github.io/

  23. arXiv:2407.05234  [pdf, ps, other

    nucl-th hep-ph nucl-ex

    Statistical Production of $B_c$ Mesons in Heavy-Ion Collisions at the LHC Energy

    Authors: Shouxing Zhao, Min He

    Abstract: The recombination production of $B_c$ mesons in heavy-ion collisions at the LHC energy is facilitated by the abundant and highly thermalized charm ($c$) quarks transported in the deconfined medium created. We study the production of $B_c$ mesons via $c$ and bottom ($b$) quark recombination in a statistical fashion by placing $B_c$ in the position of a member of the family of open $b$ hadrons, whic… ▽ More

    Submitted 6 July, 2024; originally announced July 2024.

    Comments: 10 pages, 4 figures

  24. arXiv:2407.03913  [pdf, other

    cs.AI cs.HC

    MobileExperts: A Dynamic Tool-Enabled Agent Team in Mobile Devices

    Authors: Jiayi Zhang, Chuang Zhao, Yihan Zhao, Zhaoyang Yu, Ming He, Jianping Fan

    Abstract: The attainment of autonomous operations in mobile computing devices has consistently been a goal of human pursuit. With the development of Large Language Models (LLMs) and Visual Language Models (VLMs), this aspiration is progressively turning into reality. While contemporary research has explored automation of simple tasks on mobile devices via VLMs, there remains significant room for improvement… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  25. arXiv:2407.01903  [pdf, other

    cs.LG cs.AI cs.CV

    Text-Aware Diffusion for Policy Learning

    Authors: Calvin Luo, Mandy He, Zilai Zeng, Chen Sun

    Abstract: Training an agent to achieve particular goals or perform desired behaviors is often accomplished through reinforcement learning, especially in the absence of expert demonstrations. However, supporting novel goals or behaviors through reinforcement learning requires the ad-hoc design of appropriate reward functions, which quickly becomes intractable. To address this challenge, we propose Text-Aware… ▽ More

    Submitted 1 July, 2024; originally announced July 2024.

  26. arXiv:2407.00390  [pdf, other

    cs.CL

    Advancing Process Verification for Large Language Models via Tree-Based Preference Learning

    Authors: Mingqian He, Yongliang Shen, Wenqi Zhang, Zeqi Tan, Weiming Lu

    Abstract: Large Language Models (LLMs) have demonstrated remarkable potential in handling complex reasoning tasks by generating step-by-step rationales.Some methods have proven effective in boosting accuracy by introducing extra verifiers to assess these paths. However, existing verifiers, typically trained on binary-labeled reasoning paths, fail to fully utilize the relative merits of intermediate steps, t… ▽ More

    Submitted 29 June, 2024; originally announced July 2024.

  27. arXiv:2406.17555  [pdf, ps, other

    physics.plasm-ph

    A response to commenter Ke Lan's comment on our paper published in Nature Communications (2023)14:5782 by J. Yan et al

    Authors: Ji Yan, Jiwei Li, X. T. He, Lifeng Wang, Yaohua Chen, Feng Wang, Xiaoying Han, Kaiqiang Pan, Juxi Liang, Yulong Li, Zanyang Guan, Xiangming Liu, Xingsen Che, Zhongjing Chen, Xing Zhang, Yan Xu, Bin Li, Minging He, Hongbo Cai, Liang. Hao, Zhanjun Liu, Chunyang Zheng, Zhensheng Dai, Zhengfeng Fan, Bin Qiao , et al. (4 additional authors not shown)

    Abstract: A response to commenter Ke Lan's comment on our paper published in Nature Communications (2023)14:5782 by J. Yan et al

    Submitted 25 June, 2024; originally announced June 2024.

  28. Performative Debias with Fair-exposure Optimization Driven by Strategic Agents in Recommender Systems

    Authors: Zhichen Xiang, Hongke Zhao, Chuang Zhao, Ming He, Jianping Fan

    Abstract: Data bias, e.g., popularity impairs the dynamics of two-sided markets within recommender systems. This overshadows the less visible but potentially intriguing long-tail items that could capture user interest. Despite the abundance of research surrounding this issue, it still poses challenges and remains a hot topic in academic circles. Along this line, in this paper, we developed a re-ranking appr… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

    Comments: SIGKDD 2024 accepted paper

  29. arXiv:2406.16494  [pdf, other

    cs.IR cs.AI

    Cross-domain Transfer of Valence Preferences via a Meta-optimization Approach

    Authors: Chuang Zhao, Hongke Zhao, Ming He, Xiaomeng Li, Jianping Fan

    Abstract: Cross-domain recommendation offers a potential avenue for alleviating data sparsity and cold-start problems. Embedding and mapping, as a classic cross-domain research genre, aims to identify a common mapping function to perform representation transformation between two domains. Nevertheless, previous coarse-grained preference representations, non-personalized mapping functions, and excessive relia… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

  30. arXiv:2406.16251  [pdf, other

    cond-mat.str-el

    Probing critical spin fluctuations with a composite magnetoelectric method: A case study on a Kitaev spin liquid candidate Na$_3$Co$_2$SbO$_6$

    Authors: Xinrun Mi, Xintong Li, Long Zhang, Aifeng Wang, Yuan Li, Yisheng Chai, Mingquan He

    Abstract: In correlated quantum materials, divergent critical fluctuations near the quantum critical point are often closely associated with exotic quantum phases of matter, such as unconventional superconductivity and quantum spin liquids. Here we present a simple yet highly sensitive composite magnetoelectric (ME) method for detecting the critical spin fluctuations in quantum magnets. The ME signal is pro… ▽ More

    Submitted 23 June, 2024; originally announced June 2024.

    Comments: 6 pages, 4 figures

  31. arXiv:2406.15504  [pdf, other

    cs.CL cs.LG

    Dr.E Bridges Graphs with Large Language Models through Words

    Authors: Zipeng Liu, Likang Wu, Ming He, Zhong Guan, Hongke Zhao, Nan Feng

    Abstract: Significant efforts have been dedicated to integrating the powerful Large Language Models (LLMs) with diverse modalities, particularly focusing on the fusion of language, vision and audio data. However, the graph-structured data, which is inherently rich in structural and domain-specific knowledge, has not yet been gracefully adapted to LLMs. Existing methods either describe the graph with raw tex… ▽ More

    Submitted 27 August, 2024; v1 submitted 19 June, 2024; originally announced June 2024.

  32. arXiv:2406.13250  [pdf, other

    cs.AI cs.CL cs.LG

    LangTopo: Aligning Language Descriptions of Graphs with Tokenized Topological Modeling

    Authors: Zhong Guan, Hongke Zhao, Likang Wu, Ming He, Jianpin Fan

    Abstract: Recently, large language models (LLMs) have been widely researched in the field of graph machine learning due to their outstanding abilities in language comprehension and learning. However, the significant gap between natural language tasks and topological structure modeling poses a nonnegligible challenge. Specifically, since natural language descriptions are not sufficient for LLMs to understand… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

  33. arXiv:2406.13235  [pdf, other

    cs.IR cs.AI

    Enhancing Collaborative Semantics of Language Model-Driven Recommendations via Graph-Aware Learning

    Authors: Zhong Guan, Likang Wu, Hongke Zhao, Ming He, Jianpin Fan

    Abstract: Large Language Models (LLMs) are increasingly prominent in the recommendation systems domain. Existing studies usually utilize in-context learning or supervised fine-tuning on task-specific data to align LLMs into recommendations. However, the substantial bias in semantic spaces between language processing tasks and recommendation tasks poses a nonnegligible challenge. Specifically, without the ad… ▽ More

    Submitted 19 June, 2024; originally announced June 2024.

    Comments: 10pages

  34. arXiv:2406.10988  [pdf, other

    quant-ph

    Quantum coupon collector with mixed-state encoding

    Authors: Jing-Peng Zhang, Min-Quan He, Dan-Bo Zhang

    Abstract: The coupon collector is a prototypical model for evaluating the number of samples for identifying a set. By superposing all elements in the set as a pure quantum state, a quantum version of the coupon collector aims to learn the state, which is shown to reduce the sample complexity. Here we propose a quantum coupon collector by encoding the set into a mixed state, where the information of missing… ▽ More

    Submitted 16 June, 2024; originally announced June 2024.

  35. arXiv:2406.10638  [pdf, other

    cs.CV

    Seeing Clearly, Answering Incorrectly: A Multimodal Robustness Benchmark for Evaluating MLLMs on Leading Questions

    Authors: Yexin Liu, Zhengyang Liang, Yueze Wang, Muyang He, Jian Li, Bo Zhao

    Abstract: Multimodal Large Language Models (MLLMs) have exhibited impressive capabilities in visual understanding and reasoning, providing sightly reasonable answers, such as image descriptions. This has spurred extensive research on the evaluation of MLLMs. Most evaluation benchmarks assume that incorrect answers indicate a lack of understanding of the visual content. However, our findings reveal that, in… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  36. arXiv:2406.09755  [pdf, other

    cs.AI cs.RO

    Mix Q-learning for Lane Changing: A Collaborative Decision-Making Method in Multi-Agent Deep Reinforcement Learning

    Authors: Xiaojun Bi, Mingjie He, Yiwen Sun

    Abstract: Lane-changing decisions, which are crucial for autonomous vehicle path planning, face practical challenges due to rule-based constraints and limited data. Deep reinforcement learning has become a major research focus due to its advantages in data acquisition and interpretability. However, current models often overlook collaboration, which affects not only impacts overall traffic efficiency but als… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

  37. arXiv:2406.07546  [pdf, other

    cs.CV cs.AI cs.CL

    Commonsense-T2I Challenge: Can Text-to-Image Generation Models Understand Commonsense?

    Authors: Xingyu Fu, Muyu He, Yujie Lu, William Yang Wang, Dan Roth

    Abstract: We present a novel task and benchmark for evaluating the ability of text-to-image(T2I) generation models to produce images that align with commonsense in real life, which we call Commonsense-T2I. Given two adversarial text prompts containing an identical set of action words with minor differences, such as "a lightbulb without electricity" v.s. "a lightbulb with electricity", we evaluate whether T2… ▽ More

    Submitted 12 August, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

    Comments: COLM 2024, Project Url: https://zeyofu.github.io/CommonsenseT2I/

  38. arXiv:2406.05848  [pdf, other

    physics.space-ph

    Nonlinear Interactions of Planetary-Scale Waves in Mesospheric Winds Observed at 52°N Latitude and Two Longitudes

    Authors: Maosheng He, Jeffrey M. Forbes, Gunter Stober, Christoph Jacobi, Guozhu Li, Libo Liu, Jiyao Xu

    Abstract: Nine years of mesospheric wind data from two meteor radars at 52°N latitude were analyzed to investigate planetary waves (PWs) and tides by estimating their zonal wavenumber through longitudinal phase differences. Our results reveal that PW normal modes (NMs) primarily drive multi-day oscillations, showing seasonal variability and statistical associations with Sudden Stratospheric Warming (SSW) ev… ▽ More

    Submitted 11 June, 2024; v1 submitted 9 June, 2024; originally announced June 2024.

  39. arXiv:2406.04180  [pdf, other

    hep-ph

    Cogenesis by a sliding pNGB with symmetry non-restoration

    Authors: Eung Jin Chun, Suruj Jyoti Das, Minxi He, Tae Hyun Jung, Jin Sun

    Abstract: We show that a pseudo-Nambu-Goldstone boson (pNGB) with an initial misalignment angle can drive successful spontaneous baryogenesis, and become a good dark matter candidate if the corresponding global symmetry is non-restored at high temperatures. Considering a dimension-five explicit breaking operator, we find that the pNGB starts its motion with a sliding across rapidly decreasing potential barr… ▽ More

    Submitted 21 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: 5 pages, 2 figures with supplemental material, v2: discussion on the isocurvature perturbation constraint and references added

    Report number: CTPU-PTC-24-16

  40. arXiv:2406.01993  [pdf

    eess.IV cs.CV

    Choroidal Vessel Segmentation on Indocyanine Green Angiography Images via Human-in-the-Loop Labeling

    Authors: Ruoyu Chen, Ziwei Zhao, Mayinuer Yusufu, Xianwen Shang, Danli Shi, Mingguang He

    Abstract: Human-in-the-loop (HITL) strategy has been recently introduced into the field of medical image processing. Indocyanine green angiography (ICGA) stands as a well-established examination for visualizing choroidal vasculature and detecting chorioretinal diseases. However, the intricate nature of choroidal vascular networks makes large-scale manual segmentation of ICGA images challenging. Thus, the st… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 25 pages,4 figures

  41. arXiv:2406.01435  [pdf, other

    cs.LG stat.ML

    Learning Analysis of Kernel Ridgeless Regression with Asymmetric Kernel Learning

    Authors: Fan He, Mingzhen He, Lei Shi, Xiaolin Huang, Johan A. K. Suykens

    Abstract: Ridgeless regression has garnered attention among researchers, particularly in light of the ``Benign Overfitting'' phenomenon, where models interpolating noisy samples demonstrate robust generalization. However, kernel ridgeless regression does not always perform well due to the lack of flexibility. This paper enhances kernel ridgeless regression with Locally-Adaptive-Bandwidths (LAB) RBF kernels,… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2310.05236

  42. arXiv:2406.01007  [pdf, other

    hep-ex

    Measurement of Electron Antineutrino Oscillation Amplitude and Frequency via Neutron Capture on Hydrogen at Daya Bay

    Authors: Daya Bay collaboration, F. P. An, W. D. Bai, A. B. Balantekin, M. Bishai, S. Blyth, G. F. Cao, J. Cao, J. F. Chang, Y. Chang, H. S. Chen, H. Y. Chen, S. M. Chen, Y. Chen, Y. X. Chen, Z. Y. Chen, J. Cheng, J. Cheng, Y. -C. Cheng, Z. K. Cheng, J. J. Cherwinka, M. C. Chu, J. P. Cummings, O. Dalager, F. S. Deng , et al. (177 additional authors not shown)

    Abstract: This Letter reports the first measurement of the oscillation amplitude and frequency of reactor antineutrinos at Daya Bay via neutron capture on hydrogen using 1958 days of data. With over 3.6 million signal candidates, an optimized candidate selection, improved treatment of backgrounds and efficiencies, refined energy calibration, and an energy response model for the capture-on-hydrogen sensitive… ▽ More

    Submitted 3 June, 2024; originally announced June 2024.

  43. arXiv:2405.20659  [pdf

    physics.atom-ph physics.app-ph physics.pop-ph quant-ph

    Realization of a cold atom gyroscope in space

    Authors: Jinting Li, Xi Chen, Danfang Zhang, Wenzhang Wang, Yang Zhou, Meng He, Jie Fang, Lin Zhou, Chuan He, Junjie Jiang, Huanyao Sun, Qunfeng Chen, Lei Qin, Xiao Li, Yibo Wang, Xiaowei Zhang, Jiaqi Zhong, Runbing Li, Meizhen An, Long Zhang, Shuquan Wang, Zongfeng Li, Jin Wang, Mingsheng Zhan

    Abstract: High precision gyroscopes in space are important for sophisticated scientific experiments and deep space navigation. Microgravity in the space provides an ideal condition for operation of a cold atom gyroscope. To demonstrate this advantage, an atom interferometer (AI) was launched and installed in the China Space Station in 2022. Here reported is a realization of the cold atom gyroscope with this… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: 12 pages, 5 figures

  44. arXiv:2405.20621  [pdf, other

    physics.flu-dyn

    A critical comparison of the implementation of granular pressure gradient term in Euler-Euler simulation of gas-solid flows

    Authors: Yige Liu, Mingming He, Jianhua Chen, Wen Li, Bidan Zhao, Ji Xu, Junwu Wang

    Abstract: Numerical solution of Euler-Euler model using different in-house, open source and commercial software can generate significantly different results, even when the governing equations and the initial and boundary conditions are exactly same. Unfortunately, the underlying reasons have not been identified yet. In this article, three methods for calculating the granular pressure gradient term are prese… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  45. arXiv:2405.17792  [pdf, other

    hep-ex hep-ph

    JUNO Sensitivity to Invisible Decay Modes of Neutrons

    Authors: JUNO Collaboration, Angel Abusleme, Thomas Adam, Kai Adamowicz, Shakeel Ahmad, Rizwan Ahmed, Sebastiano Aiello, Fengpeng An, Qi An, Giuseppe Andronico, Nikolay Anfimov, Vito Antonelli, Tatiana Antoshkina, João Pedro Athayde Marcondes de André, Didier Auguste, Weidong Bai, Nikita Balashov, Wander Baldini, Andrea Barresi, Davide Basilico, Eric Baussan, Marco Bellato, Marco Beretta, Antonio Bergnoli, Daniel Bick , et al. (635 additional authors not shown)

    Abstract: We explore the bound neutrons decay into invisible particles (e.g., $n\rightarrow 3 ν$ or $nn \rightarrow 2 ν$) in the JUNO liquid scintillator detector. The invisible decay includes two decay modes: $ n \rightarrow { inv} $ and $ nn \rightarrow { inv} $. The invisible decays of $s$-shell neutrons in $^{12}{\rm C}$ will leave a highly excited residual nucleus. Subsequently, some de-excitation mode… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

    Comments: 28 pages, 7 figures, 4 tables

  46. arXiv:2405.11338  [pdf

    cs.CV cs.AI

    EyeFound: A Multimodal Generalist Foundation Model for Ophthalmic Imaging

    Authors: Danli Shi, Weiyi Zhang, Xiaolan Chen, Yexin Liu, Jiancheng Yang, Siyu Huang, Yih Chung Tham, Yingfeng Zheng, Mingguang He

    Abstract: Artificial intelligence (AI) is vital in ophthalmology, tackling tasks like diagnosis, classification, and visual question answering (VQA). However, existing AI models in this domain often require extensive annotation and are task-specific, limiting their clinical utility. While recent developments have brought about foundation models for ophthalmology, they are limited by the need to train separa… ▽ More

    Submitted 21 May, 2024; v1 submitted 18 May, 2024; originally announced May 2024.

    Comments: 21 pages, 2 figures, 4 tables

  47. arXiv:2405.11236  [pdf, other

    cs.CV

    TriLoRA: Integrating SVD for Advanced Style Personalization in Text-to-Image Generation

    Authors: Chengcheng Feng, Mu He, Qiuyu Tian, Haojie Yin, Xiaofang Zhao, Hongwei Tang, Xingqiang Wei

    Abstract: As deep learning technology continues to advance, image generation models, especially models like Stable Diffusion, are finding increasingly widespread application in visual arts creation. However, these models often face challenges such as overfitting, lack of stability in generated results, and difficulties in accurately capturing the features desired by creators during the fine-tuning process.… ▽ More

    Submitted 13 June, 2024; v1 submitted 18 May, 2024; originally announced May 2024.

  48. arXiv:2405.10739  [pdf, other

    cs.CV cs.AI

    Efficient Multimodal Large Language Models: A Survey

    Authors: Yizhang Jin, Jian Li, Yexin Liu, Tianjun Gu, Kai Wu, Zhengkai Jiang, Muyang He, Bo Zhao, Xin Tan, Zhenye Gan, Yabiao Wang, Chengjie Wang, Lizhuang Ma

    Abstract: In the past year, Multimodal Large Language Models (MLLMs) have demonstrated remarkable performance in tasks such as visual question answering, visual understanding and reasoning. However, the extensive model size and high training and inference costs have hindered the widespread application of MLLMs in academia and industry. Thus, studying efficient and lightweight MLLMs has enormous potential, e… ▽ More

    Submitted 9 August, 2024; v1 submitted 17 May, 2024; originally announced May 2024.

  49. arXiv:2405.10676  [pdf, other

    physics.plasm-ph

    Identifying L-H transition in HL-2A through deep learning

    Authors: Meihuizi He, Songfen Liu, Fan Xia, Zongyu Yang, Wulyu Zhong

    Abstract: During the operation of tokamak devices, addressing the thermal load issues caused by Edge Localized Modes (ELMs) eruption is crucial. Ideally, mitigation and suppression measures for ELMs should be promptly initiated as soon as the first low-to-high confinement (L-H) transition occurs, which necessitates the real-time monitoring and accurate identification of the L-H transition process. Motivated… ▽ More

    Submitted 17 May, 2024; originally announced May 2024.

  50. arXiv:2405.09059  [pdf, other

    cs.CV

    Task-adaptive Q-Face

    Authors: Haomiao Sun, Mingjie He, Shiguang Shan, Hu Han, Xilin Chen

    Abstract: Although face analysis has achieved remarkable improvements in the past few years, designing a multi-task face analysis model is still challenging. Most face analysis tasks are studied as separate problems and do not benefit from the synergy among related tasks. In this work, we propose a novel task-adaptive multi-task face analysis method named as Q-Face, which simultaneously performs multiple fa… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: Ever submitted to ECCV2024