Zum Hauptinhalt springen

Showing 1–50 of 570 results for author: Han, M

.
  1. The MICADO first light imager for the ELT: overview and current Status

    Authors: E. Sturm, R. Davies, J. Alves, Y. Clénet, J. Kotilainen, A. Monna, H. Nicklas, J. -U. Pott, E. Tolstoy, B. Vulcani, J. Achren, S. Annadevara, H. Anwand-Heerwart, C. Arcidiacono, S. Barboza, L. Barl, P. Baudoz, R. Bender, N. Bezawada, F. Biondi, P. Bizenberger, A. Blin, A. Boné, P. Bonifacio, B. Borgo , et al. (129 additional authors not shown)

    Abstract: MICADO is a first light instrument for the Extremely Large Telescope (ELT), set to start operating later this decade. It will provide diffraction limited imaging, astrometry, high contrast imaging, and long slit spectroscopy at near-infrared wavelengths. During the initial phase operations, adaptive optics (AO) correction will be provided by its own natural guide star wavefront sensor. In its fina… ▽ More

    Submitted 29 August, 2024; originally announced August 2024.

    Comments: Proceedings of the SPIE, Volume 13096, id. 1309611 11 pp. (2024)

  2. arXiv:2408.13006  [pdf, other

    cs.CL

    Systematic Evaluation of LLM-as-a-Judge in LLM Alignment Tasks: Explainable Metrics and Diverse Prompt Templates

    Authors: Hui Wei, Shenghua He, Tian Xia, Andy Wong, Jingyang Lin, Mei Han

    Abstract: Alignment approaches such as RLHF and DPO are actively investigated to align large language models (LLMs) with human preferences. Commercial large language models (LLMs) like GPT-4 have been recently employed to evaluate and compare different LLM alignment approaches. These models act as surrogates for human evaluators due to their promising abilities to approximate human preferences with remarkab… ▽ More

    Submitted 23 August, 2024; originally announced August 2024.

    Comments: Preprint, under review. 17 pages, 7 figures, 16 tables

  3. arXiv:2408.11505  [pdf, other

    cs.CV

    MSCPT: Few-shot Whole Slide Image Classification with Multi-scale and Context-focused Prompt Tuning

    Authors: Minghao Han, Linhao Qu, Dingkang Yang, Xukun Zhang, Xiaoying Wang, Lihua Zhang

    Abstract: Multiple instance learning (MIL) has become a standard paradigm for weakly supervised classification of whole slide images (WSI). However, this paradigm relies on the use of a large number of labelled WSIs for training. The lack of training data and the presence of rare diseases present significant challenges for these methods. Prompt tuning combined with the pre-trained Vision-Language models (VL… ▽ More

    Submitted 21 August, 2024; originally announced August 2024.

    Comments: 11 pages, 5 figures, 5tables

  4. arXiv:2408.10532  [pdf, other

    cs.CV cs.AI

    NutrifyAI: An AI-Powered System for Real-Time Food Detection, Nutritional Analysis, and Personalized Meal Recommendations

    Authors: Michelle Han, Junyao Chen

    Abstract: With diet and nutrition apps reaching 1.4 billion users in 2022 [1], it's no surprise that health apps like MyFitnessPal, Noom, and Calorie Counter, are surging in popularity. However, one major setback [2] of nearly all nutrition applications is that users must enter food data manually, which is time-consuming and tedious. Thus, there has been an increasing demand for applications that can accura… ▽ More

    Submitted 20 August, 2024; originally announced August 2024.

    Comments: 7 pages, 12 figures

  5. arXiv:2408.07285  [pdf, ps, other

    cs.LG

    DDIM Redux: Mathematical Foundation and Some Extension

    Authors: Manhyung Han

    Abstract: This note provides a critical review of the mathematical concepts underlying the generalized diffusion denoising implicit model (gDDIM) and the exponential integrator (EI) scheme. We present enhanced mathematical results, including an exact expression for the reverse trajectory in the probability flow ODE and an exact expression for the covariance matrix in the gDDIM scheme. Furthermore, we offer… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

  6. arXiv:2408.04968  [pdf, other

    physics.optics

    One-dimensional spin-flipping topological edge state laser

    Authors: Jhih-Sheng Wu, Zhen-Ting Huang, Meng-Ting Han, Yen-Hsun Chen, Tien-Chang Lu

    Abstract: Topological edge states manifest spin-momentum-locking propagation as a primary consequence of topological crystals. However, experimental studies on spin manipulation and the resulting propagation of these states are lacking. Here, we demonstrate experimentally spin manipulation of topological edge states by the boundary conditions of the one-dimensional path. Armchair boundaries at the endpoints… ▽ More

    Submitted 9 August, 2024; originally announced August 2024.

    Comments: 9 pages, 6 figures

  7. arXiv:2408.03653  [pdf, other

    eess.SY

    Self-tuning moving horizon estimation of nonlinear systems via physics-informed machine learning Koopman modeling

    Authors: Mingxue Yan, Minghao Han, Adrian Wing-Keung Law, Xunyuan Yin

    Abstract: In this paper, we propose a physics-informed learning-based Koopman modeling approach and present a Koopman-based self-tuning moving horizon estimation design for a class of nonlinear systems. Specifically, we train Koopman operators and two neural networks - the state lifting network and the noise characterization network - using both data and available physical information. The two neural networ… ▽ More

    Submitted 7 August, 2024; originally announced August 2024.

    Comments: 31 pages, 7 figures

  8. arXiv:2408.02315  [pdf, ps, other

    eess.SY

    Machine learning-based input-augmented Koopman modeling and predictive control of nonlinear processes

    Authors: Zhaoyang Li, Minghao Han, Dat-Nguyen Vo, Xunyuan Yin

    Abstract: Koopman-based modeling and model predictive control have been a promising alternative for optimal control of nonlinear processes. Good Koopman modeling performance significantly depends on an appropriate nonlinear mapping from the original state-space to a lifted state space. In this work, we propose an input-augmented Koopman modeling and model predictive control approach. Both the states and the… ▽ More

    Submitted 5 August, 2024; originally announced August 2024.

  9. arXiv:2407.20981  [pdf, other

    cs.GT

    Escape Sensing Games: Detection-vs-Evasion in Security Applications

    Authors: Niclas Boehmer, Minbiao Han, Haifeng Xu, Milind Tambe

    Abstract: Traditional game-theoretic research for security applications primarily focuses on the allocation of external protection resources to defend targets. This work puts forward the study of a new class of games centered around strategically arranging targets to protect them against a constrained adversary, with motivations from varied domains such as peacekeeping resource transit and cybersecurity. Sp… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

  10. arXiv:2407.20143  [pdf, other

    cs.AI

    ByteCheckpoint: A Unified Checkpointing System for LLM Development

    Authors: Borui Wan, Mingji Han, Yiyao Sheng, Zhichao Lai, Mofan Zhang, Junda Zhang, Yanghua Peng, Haibin Lin, Xin Liu, Chuan Wu

    Abstract: The development of real-world Large Language Models (LLMs) necessitates checkpointing of training states in persistent storage to mitigate potential software and hardware failures, as well as to facilitate checkpoint transferring within the training pipeline and across various tasks. Due to the immense size of LLMs, saving and loading checkpoints often incur intolerable minute-level stalls, signif… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

  11. arXiv:2407.16214  [pdf, other

    cs.CV

    Diff-Shadow: Global-guided Diffusion Model for Shadow Removal

    Authors: Jinting Luo, Ru Li, Chengzhi Jiang, Mingyan Han, Xiaoming Zhang, Ting Jiang, Haoqiang Fan, Shuaicheng Liu

    Abstract: We propose Diff-Shadow, a global-guided diffusion model for high-quality shadow removal. Previous transformer-based approaches can utilize global information to relate shadow and non-shadow regions but are limited in their synthesis ability and recover images with obvious boundaries. In contrast, diffusion-based methods can generate better content but ignore global information, resulting in incons… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

  12. arXiv:2407.16205  [pdf, other

    cs.CR cs.AI cs.CL cs.LG

    Figure it Out: Analyzing-based Jailbreak Attack on Large Language Models

    Authors: Shi Lin, Rongchang Li, Xun Wang, Changting Lin, Wenpeng Xing, Meng Han

    Abstract: The rapid development of Large Language Models (LLMs) has brought remarkable generative capabilities across diverse tasks. However, despite the impressive achievements, these LLMs still have numerous inherent vulnerabilities, particularly when faced with jailbreak attacks. By investigating jailbreak attacks, we can uncover hidden weaknesses in LLMs and inform the development of more robust defense… ▽ More

    Submitted 13 August, 2024; v1 submitted 23 July, 2024; originally announced July 2024.

  13. arXiv:2407.15268  [pdf, other

    cs.CL

    Fact-Aware Multimodal Retrieval Augmentation for Accurate Medical Radiology Report Generation

    Authors: Liwen Sun, James Zhao, Megan Han, Chenyan Xiong

    Abstract: Multimodal foundation models hold significant potential for automating radiology report generation, thereby assisting clinicians in diagnosing cardiac diseases. However, generated reports often suffer from serious factual inaccuracy. In this paper, we introduce a fact-aware multimodal retrieval-augmented pipeline in generating accurate radiology reports (FactMM-RAG). We first leverage RadGraph to… ▽ More

    Submitted 21 July, 2024; originally announced July 2024.

  14. Atomic-Layer-Controlled Magnetic Orders in MnBi2Te4-Bi2Te3 Topological Heterostructures

    Authors: Xiong Yao, Qirui Cui, Zengle Huang, Xiaoyu Yuan, Hee Taek Yi, Deepti Jain, Kim Kisslinger, Myung-Geun Han, Weida Wu, Hongxin Yang, Seongshik Oh

    Abstract: The natural van der Waals superlattice MnBi2Te4-(Bi2Te3)m provides an optimal platform to combine topology and magnetism in one system with minimal structural disorder. Here, we show that this system can harbor both ferromagnetic (FM) and antiferromagnetic (AFM) orders and that these magnetic orders can be controlled in two different ways by either varying the Mn-Mn distance while keeping the Bi2T… ▽ More

    Submitted 20 July, 2024; originally announced July 2024.

    Comments: 25 pages, 5 figures, accepted to Nano Letters

  15. arXiv:2407.14829  [pdf, other

    cs.CL

    Overview of AI-Debater 2023: The Challenges of Argument Generation Tasks

    Authors: Jiayu Lin, Guanrong Chen, Bojun Jin, Chenyang Li, Shutong Jia, Wancong Lin, Yang Sun, Yuhang He, Caihua Yang, Jianzhu Bao, Jipeng Wu, Wen Su, Jinglu Chen, Xinyi Li, Tianyu Chen, Mingjie Han, Shuaiwen Du, Zijian Wang, Jiyin Li, Fuzhong Suo, Hao Wang, Nuanchen Lin, Xuanjing Huang, Changjian Jiang, RuiFeng Xu , et al. (4 additional authors not shown)

    Abstract: In this paper we present the results of the AI-Debater 2023 Challenge held by the Chinese Conference on Affect Computing (CCAC 2023), and introduce the related datasets. We organize two tracks to handle the argumentative generation tasks in different scenarios, namely, Counter-Argument Generation (Track 1) and Claim-based Argument Generation (Track 2). Each track is equipped with its distinct data… ▽ More

    Submitted 24 July, 2024; v1 submitted 20 July, 2024; originally announced July 2024.

  16. arXiv:2407.12184  [pdf

    eess.IV cs.CV

    The object detection method aids in image reconstruction evaluation and clinical interpretation of meniscal abnormalities

    Authors: Natalia Konovalova, Aniket Tolpadi, Felix Liu, Zehra Akkaya, Felix Gassert, Paula Giesler, Johanna Luitjens, Misung Han, Emma Bahroos, Sharmila Majumdar, Valentina Pedoia

    Abstract: This study investigates the relationship between deep learning (DL) image reconstruction quality and anomaly detection performance, and evaluates the efficacy of an artificial intelligence (AI) assistant in enhancing radiologists' interpretation of meniscal anomalies on reconstructed images. A retrospective study was conducted using an in-house reconstruction and anomaly detection pipeline to asse… ▽ More

    Submitted 16 July, 2024; originally announced July 2024.

  17. arXiv:2407.09662  [pdf, other

    physics.atom-ph math-ph

    Analytical Expression for Continuum-continuum Transition Amplitude of Hydrogen-like Atoms with Angular-momentum Dependence

    Authors: Jia-Bao Ji, Kiyoshi Ueda, Meng Han, Hans Jakob Wörner

    Abstract: Attosecond chronoscopy typically utilises interfering two-photon transitions to access the phase information. Simulating these two-photon transitions is challenging due to the continuum-continuum transition term. The hydrogenic approximation within second-order perturbation theory has been widely used due to the existence of analytical expressions of the wave functions. So far, only (partially) as… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

  18. arXiv:2407.04675  [pdf, other

    eess.AS cs.SD

    Seed-ASR: Understanding Diverse Speech and Contexts with LLM-based Speech Recognition

    Authors: Ye Bai, Jingping Chen, Jitong Chen, Wei Chen, Zhuo Chen, Chuang Ding, Linhao Dong, Qianqian Dong, Yujiao Du, Kepan Gao, Lu Gao, Yi Guo, Minglun Han, Ting Han, Wenchao Hu, Xinying Hu, Yuxiang Hu, Deyu Hua, Lu Huang, Mingkun Huang, Youjia Huang, Jishuo Jin, Fanliu Kong, Zongwei Lan, Tianyu Li , et al. (30 additional authors not shown)

    Abstract: Modern automatic speech recognition (ASR) model is required to accurately transcribe diverse speech signals (from different domains, languages, accents, etc) given the specific contextual information in various application scenarios. Classic end-to-end models fused with extra language models perform well, but mainly in data matching scenarios and are gradually approaching a bottleneck. In this wor… ▽ More

    Submitted 10 July, 2024; v1 submitted 5 July, 2024; originally announced July 2024.

  19. arXiv:2406.10655  [pdf, ps, other

    cs.CR

    E-SAGE: Explainability-based Defense Against Backdoor Attacks on Graph Neural Networks

    Authors: Dingqiang Yuan, Xiaohua Xu, Lei Yu, Tongchang Han, Rongchang Li, Meng Han

    Abstract: Graph Neural Networks (GNNs) have recently been widely adopted in multiple domains. Yet, they are notably vulnerable to adversarial and backdoor attacks. In particular, backdoor attacks based on subgraph insertion have been shown to be effective in graph classification tasks while being stealthy, successfully circumventing various existing defense methods. In this paper, we propose E-SAGE, a novel… ▽ More

    Submitted 15 June, 2024; originally announced June 2024.

  20. arXiv:2406.04474  [pdf

    cond-mat.mtrl-sci cond-mat.other

    Stoichiometry-induced ferromagnetism in altermagnetic candidate MnTe

    Authors: Michael Chilcote, Alessandro R. Mazza, Qiangsheng Lu, Isaiah Gray, Qi Tian, Qinwen Deng, Duncan Moseley, An-Hsi Chen, Jason Lapano, Jason S. Gardner, Gyula Eres, T. Zac Ward, Erxi Feng, Huibo Cao, Valeria Lauter, Michael A. McGuire, Raphael Hermann, David Parker, Myung-Geun Han, Asghar Kayani, Gaurab Rimal, Liang Wu, Timothy R. Charlton, Robert G. Moore, Matthew Brahlek

    Abstract: The field of spintronics has seen a surge of interest in altermagnetism due to novel predictions and many possible applications. MnTe is a leading altermagnetic candidate that is of significant interest across spintronics due to its layered antiferromagnetic structure, high Neel temperature (TN ~ 310 K) and semiconducting properties. We present results on molecular beam epitaxy (MBE) grown MnTe/In… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted in Advanced Functional Materials

  21. arXiv:2406.00785  [pdf

    cond-mat.mes-hall cond-mat.other physics.app-ph

    Electric-Field Control of Magnetic Skyrmion Chirality in a Centrosymmetric 2D van der Waals Magnet

    Authors: Myung-Geun Han, Joachim Dahl Thomsen, John P. Philbin, Junsik Mun, Eugene Park, Fernando Camino, Lukáš Děkanovský, Chuhang Liu, Zdenek Sofer, Prineha Narang, Frances M. Ross, Yimei Zhu

    Abstract: Two-dimensional van der Waals magnets hosting topological magnetic textures, such as skyrmions, show promise for applications in spintronics and quantum computing. Electrical control of these topological spin textures would enable novel devices with enhanced performance and functionality. Here, using electron microscopy combined with in situ electric and magnetic biasing, we show that the skyrmion… ▽ More

    Submitted 2 June, 2024; originally announced June 2024.

  22. arXiv:2405.19758  [pdf, other

    cs.RO

    InterPreT: Interactive Predicate Learning from Language Feedback for Generalizable Task Planning

    Authors: Muzhi Han, Yifeng Zhu, Song-Chun Zhu, Ying Nian Wu, Yuke Zhu

    Abstract: Learning abstract state representations and knowledge is crucial for long-horizon robot planning. We present InterPreT, an LLM-powered framework for robots to learn symbolic predicates from language feedback of human non-experts during embodied interaction. The learned predicates provide relational abstractions of the environment state, facilitating the learning of symbolic operators that capture… ▽ More

    Submitted 30 May, 2024; originally announced May 2024.

    Comments: RSS 2024; https://interpret-robot.github.io

  23. arXiv:2405.18923  [pdf, other

    astro-ph.IM

    The BlackGEM telescope array I: Overview

    Authors: Paul J. Groot, S. Bloemen, P. Vreeswijk, J. van Roestel, P. G. Jonker, G. Nelemans, M. Klein-Wolt, R. Le Poole, D. Pieterse, M. Rodenhuis, W. Boland, M. Haverkorn, C. Aerts, R. Bakker, H. Balster, M. Bekema, E. Dijkstra, P. Dolron, E. Elswijk, A. van Elteren, A. Engels, M. Fokker, M. de Haan, F. Hahn, R. ter Horst , et al. (49 additional authors not shown)

    Abstract: The main science aim of the BlackGEM array is to detect optical counterparts to gravitational wave mergers. Additionally, the array will perform a set of synoptic surveys to detect Local Universe transients and short time-scale variability in stars and binaries, as well as a six-filter all-sky survey down to ~22nd mag. The BlackGEM Phase-I array consists of three optical wide-field unit telescopes… ▽ More

    Submitted 30 May, 2024; v1 submitted 29 May, 2024; originally announced May 2024.

    Comments: 14 pages, submitted to Astronomy & Astrophysics

  24. arXiv:2405.15322  [pdf, other

    cs.CR cs.AR

    Dishonest Approximate Computing: A Coming Crisis for Cloud Clients

    Authors: Ye Wang, Jian Dong, Ming Han, Jin Wu, Gang Qu

    Abstract: Approximate Computing (AC) has emerged as a promising technique for achieving energy-efficient architectures and is expected to become an effective technique for reducing the electricity cost for cloud service providers (CSP). However, the potential misuse of AC has not received adequate attention, which is a coming crisis behind the blueprint of AC. Driven by the pursuit of illegal financial prof… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

    Comments: 12 pages, 9 figures

  25. arXiv:2405.12478  [pdf, other

    eess.SY

    Efficient Economic Model Predictive Control of Water Treatment Process with Learning-based Koopman Operator

    Authors: Minghao Han, Jingshi Yao, Adrian Wing-Keung Law, Xunyuan Yin

    Abstract: Used water treatment plays a pivotal role in advancing environmental sustainability. Economic model predictive control holds the promise of enhancing the overall operational performance of the water treatment facilities. In this study, we propose a data-driven economic predictive control approach within the Koopman modeling framework. First, we propose a deep learning-enabled input-output Koopman… ▽ More

    Submitted 14 July, 2024; v1 submitted 20 May, 2024; originally announced May 2024.

  26. arXiv:2405.12079  [pdf, other

    cs.DC cs.OS

    PARALLELGPUOS: A Concurrent OS-level GPU Checkpoint and Restore System using Validated Speculation

    Authors: Zhuobin Huang, Xingda Wei, Yingyi Hao, Rong Chen, Mingcong Han, Jinyu Gu, Haibo Chen

    Abstract: Checkpointing (C) and restoring (R) are key components for GPU tasks. POS is an OS-level GPU C/R system: It can transparently checkpoint or restore processes that use the GPU, without requiring any cooperation from the application, a key feature required by modern systems like the cloud. Moreover, POS is the first OS-level C/R system that can concurrently execute C/R with the application execution… ▽ More

    Submitted 20 May, 2024; originally announced May 2024.

  27. arXiv:2405.08318  [pdf, other

    cs.LG

    No-Regret Learning of Nash Equilibrium for Black-Box Games via Gaussian Processes

    Authors: Minbiao Han, Fengxue Zhang, Yuxin Chen

    Abstract: This paper investigates the challenge of learning in black-box games, where the underlying utility function is unknown to any of the agents. While there is an extensive body of literature on the theoretical analysis of algorithms for computing the Nash equilibrium with complete information about the game, studies on Nash equilibrium in black-box games are less common. In this paper, we focus on le… ▽ More

    Submitted 14 May, 2024; originally announced May 2024.

    Comments: 40th Conference on Uncertainty in Artificial Intelligence (UAI 2024)

  28. arXiv:2405.04752  [pdf, other

    eess.AS cs.SD

    HILCodec: High Fidelity and Lightweight Neural Audio Codec

    Authors: Sunghwan Ahn, Beom Jun Woo, Min Hyun Han, Chanyeong Moon, Nam Soo Kim

    Abstract: The recent advancement of end-to-end neural audio codecs enables compressing audio at very low bitrates while reconstructing the output audio with high fidelity. Nonetheless, such improvements often come at the cost of increased model complexity. In this paper, we identify and address the problems of existing neural audio codecs. We show that the performance of Wave-U-Net does not increase consist… ▽ More

    Submitted 7 May, 2024; originally announced May 2024.

  29. arXiv:2404.19334  [pdf, other

    cs.CV

    Multi-Scale Heterogeneity-Aware Hypergraph Representation for Histopathology Whole Slide Images

    Authors: Minghao Han, Xukun Zhang, Dingkang Yang, Tao Liu, Haopeng Kuang, Jinghui Feng, Lihua Zhang

    Abstract: Survival prediction is a complex ordinal regression task that aims to predict the survival coefficient ranking among a cohort of patients, typically achieved by analyzing patients' whole slide images. Existing deep learning approaches mainly adopt multiple instance learning or graph neural networks under weak supervision. Most of them are unable to uncover the diverse interactions between differen… ▽ More

    Submitted 30 April, 2024; originally announced April 2024.

    Comments: 9 pages, 6 figures, accepted by ICME2024

  30. arXiv:2404.17521  [pdf, other

    cs.RO cs.CV

    Ag2Manip: Learning Novel Manipulation Skills with Agent-Agnostic Visual and Action Representations

    Authors: Puhao Li, Tengyu Liu, Yuyang Li, Muzhi Han, Haoran Geng, Shu Wang, Yixin Zhu, Song-Chun Zhu, Siyuan Huang

    Abstract: Autonomous robotic systems capable of learning novel manipulation tasks are poised to transform industries from manufacturing to service automation. However, modern methods (e.g., VIP and R3M) still face significant hurdles, notably the domain gap among robotic embodiments and the sparsity of successful task executions within specific action spaces, resulting in misaligned and ambiguous task repre… ▽ More

    Submitted 26 April, 2024; originally announced April 2024.

    Comments: Project website and open-source code: https://xiaoyao-li.github.io/research/ag2manip

  31. arXiv:2404.10563  [pdf, other

    gr-qc

    A Mathematica program for numerically computing real and complex critical points in 4-dimensional Lorentzian spinfoam amplitude

    Authors: Muxin Han, Hongguang Liu, Dongxue Qu

    Abstract: This work develops a comprehensive algorithm and a Mathematica program to construct boundary data and compute real and complex critical points in spinfoam amplitudes. Our approach covers both spacelike tetrahedra and triangles in the EPRL model and timelike tetrahedra and triangles in the Conrady-Hnybida extension, aiming at addressing a wide range of physical scenarios such as cosmology and black… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: 34 pages, 36 figures

  32. arXiv:2404.10358  [pdf, other

    cs.CV

    Improving Bracket Image Restoration and Enhancement with Flow-guided Alignment and Enhanced Feature Aggregation

    Authors: Wenjie Lin, Zhen Liu, Chengzhi Jiang, Mingyan Han, Ting Jiang, Shuaicheng Liu

    Abstract: In this paper, we address the Bracket Image Restoration and Enhancement (BracketIRE) task using a novel framework, which requires restoring a high-quality high dynamic range (HDR) image from a sequence of noisy, blurred, and low dynamic range (LDR) multi-exposure RAW inputs. To overcome this challenge, we present the IREANet, which improves the multiple exposure alignment and aggregation with a Fl… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

  33. arXiv:2404.10220  [pdf, other

    cs.RO cs.AI cs.CV cs.LG

    Closed-Loop Open-Vocabulary Mobile Manipulation with GPT-4V

    Authors: Peiyuan Zhi, Zhiyuan Zhang, Muzhi Han, Zeyu Zhang, Zhitian Li, Ziyuan Jiao, Baoxiong Jia, Siyuan Huang

    Abstract: Autonomous robot navigation and manipulation in open environments require reasoning and replanning with closed-loop feedback. We present COME-robot, the first closed-loop framework utilizing the GPT-4V vision-language foundation model for open-ended reasoning and adaptive planning in real-world scenarios. We meticulously construct a library of action primitives for robot exploration, navigation, a… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

  34. arXiv:2404.09757  [pdf, other

    physics.atom-ph

    Ultra-Wide Dual-band Rydberg Atomic Receiver Based on Space Division Multiplexing RF-Chip Modules

    Authors: Li-Hua Zhang, Bang Liu, Zong-Kai Liu, Zheng-Yuan Zhang, Shi-Yao Shao, Qi-Feng Wang, Ma YuTian-Yu Han, Guang-Can Guo, Dong-Sheng Ding, Bao-Sen Shi

    Abstract: Detecting microwave signals over a wide frequency range has numerous advantages as it enables simultaneous transmission of a large amount of information and access to more spectrum resources. This capability is crucial for applications such as microwave communication, remote sensing, and radar. However, conventional microwave receiving systems are limited by amplifiers and band-pass filters that c… ▽ More

    Submitted 16 April, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: 11 pages, 5 figures

  35. arXiv:2404.09563  [pdf, other

    astro-ph.HE nucl-th

    Upper Limit of Sound Speed in Nuclear Matter: A Harmonious Interplay of Transport Calculation and Perturbative QCD Constraint

    Authors: Shao-Peng Tang, Yong-Jia Huang, Ming-Zhe Han, Yi-Zhong Fan

    Abstract: Very recently, it has been shown that there is an upper bound on the squared sound speed of nuclear matter from the transport, which reads $c_{\rm s}^2 \leq 0.781$. In this work, we demonstrate that this upper bound is corroborated by the reconstructed equation of state (EOS; modeled with a nonparametric method) for ultra-dense matter. The reconstruction integrates multi-messenger observation for… ▽ More

    Submitted 22 July, 2024; v1 submitted 15 April, 2024; originally announced April 2024.

    Comments: 9 pages, 9 figures, updated with the most recent data release from NICER

    Report number: RIKEN-iTHEMS-Report-24

  36. arXiv:2404.03384  [pdf, other

    cs.CV

    LongVLM: Efficient Long Video Understanding via Large Language Models

    Authors: Yuetian Weng, Mingfei Han, Haoyu He, Xiaojun Chang, Bohan Zhuang

    Abstract: Empowered by Large Language Models (LLMs), recent advancements in Video-based LLMs (VideoLLMs) have driven progress in various video understanding tasks. These models encode video representations through pooling or query aggregation over a vast number of visual tokens, making computational and memory costs affordable. Despite successfully providing an overall comprehension of video content, existi… ▽ More

    Submitted 20 July, 2024; v1 submitted 4 April, 2024; originally announced April 2024.

    Comments: Accepted by ECCV 2024

  37. arXiv:2404.03310  [pdf

    physics.ao-ph cs.LG

    Site-specific Deterministic Temperature and Humidity Forecasts with Explainable and Reliable Machine Learning

    Authors: MengMeng Han, Tennessee Leeuwenburg, Brad Murphy

    Abstract: Site-specific weather forecasts are essential to accurate prediction of power demand and are consequently of great interest to energy operators. However, weather forecasts from current numerical weather prediction (NWP) models lack the fine-scale detail to capture all important characteristics of localised real-world sites. Instead they provide weather information representing a rectangular gridbo… ▽ More

    Submitted 4 April, 2024; originally announced April 2024.

    Comments: 27 Pages, 16 Figures, 11 Tables

    Journal ref: Applied Sciences, Volume 14, Issue 14, Article Number 6314, July 2024

  38. arXiv:2404.02796  [pdf, other

    gr-qc hep-th

    Spin foam amplitude of the black-to-white hole transition

    Authors: Muxin Han, Dongxue Qu, Cong Zhang

    Abstract: It has been conjectured that quantum gravity effects may cause the black-to-white hole transition due to quantum tunneling. The transition amplitude of this process is explored within the framework of the spin foam model on a 2-complex containing 56 vertices. We develop a systematic way to construct the bulk triangulation from the boundary triangulation to obtain the 2-complex. By using Thiemann's… ▽ More

    Submitted 3 April, 2024; originally announced April 2024.

    Comments: 17+15 Pages

  39. arXiv:2404.00963  [pdf, other

    cond-mat.supr-con cond-mat.str-el

    Inversion and Tunability of Van Hove Singularities in $A$V$_{3}$Sb$_{5}$ ($A$ = K, Rb, and Cs) kagome metals

    Authors: Sangjun Sim, Min Yong Jeong, Hyunggeun Lee, Dong Hyun David Lee, Myung Joon Han

    Abstract: To understand the alkali-metal-dependent material properties of recently discovered $A$V$_{3}$Sb$_{5}$ ($A$ = K, Rb, and Cs), we conducted a detailed electronic structure analysis based on first-principles density functional theory calculations. Contrary to the case of $A$ = K and Rb, the energetic positions of the low-lying Van Hove singularities are reversed in CsV$_{3}$Sb$_{5}$, and the charact… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: Physical Chemistry Chemical Physics (PCCP) in press

  40. Reduced-order Koopman modeling and predictive control of nonlinear processes

    Authors: Xuewen Zhang, Minghao Han, Xunyuan Yin

    Abstract: In this paper, we propose an efficient data-driven predictive control approach for general nonlinear processes based on a reduced-order Koopman operator. A Kalman-based sparse identification of nonlinear dynamics method is employed to select lifting functions for Koopman identification. The selected lifting functions are used to project the original nonlinear state-space into a higher-dimensional… ▽ More

    Submitted 30 March, 2024; originally announced April 2024.

    Comments: 29 pages, 8 figures

    Journal ref: Computers & Chemical Engineering, 2023, 179, p.108440

  41. arXiv:2403.17801  [pdf, other

    cs.CV eess.IV

    Towards 3D Vision with Low-Cost Single-Photon Cameras

    Authors: Fangzhou Mu, Carter Sifferman, Sacha Jungerman, Yiquan Li, Mark Han, Michael Gleicher, Mohit Gupta, Yin Li

    Abstract: We present a method for reconstructing 3D shape of arbitrary Lambertian objects based on measurements by miniature, energy-efficient, low-cost single-photon cameras. These cameras, operating as time resolved image sensors, illuminate the scene with a very fast pulse of diffuse light and record the shape of that pulse as it returns back from the scene at a high temporal resolution. We propose to mo… ▽ More

    Submitted 29 March, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  42. arXiv:2403.14105  [pdf, other

    astro-ph.HE

    Bulk properties of PSR J0030+0451 inferred with the compactness measurement of NICER

    Authors: Chuan-Ning Luo, Shao-Peng Tang, Ming-Zhe Han, Jin-Liang Jiang, Wei-Hong Gao, Da-Ming Wei

    Abstract: In 2019, Neutron star Interior Composition ExploreR (NICER) mission released its findings on the mass and radius of the isolated neutron star (INS) PSR J0030+0451, revealing a mass of approximately 1.4 solar masses ($M_{\odot}$) and a radius near 13 kilometers. However, the recent re-analysis by the NICER collaboration \citep{vinciguerra2024updated} suggests that the available data primarily yield… ▽ More

    Submitted 2 April, 2024; v1 submitted 20 March, 2024; originally announced March 2024.

    Comments: 10 pages,6 figures, Accepted for publication in ApJ

  43. arXiv:2403.11552  [pdf, other

    cs.RO cs.AI

    LLM3:Large Language Model-based Task and Motion Planning with Motion Failure Reasoning

    Authors: Shu Wang, Muzhi Han, Ziyuan Jiao, Zeyu Zhang, Ying Nian Wu, Song-Chun Zhu, Hangxin Liu

    Abstract: Conventional Task and Motion Planning (TAMP) approaches rely on manually crafted interfaces connecting symbolic task planning with continuous motion generation. These domain-specific and labor-intensive modules are limited in addressing emerging tasks in real-world settings. Here, we present LLM^3, a novel Large Language Model (LLM)-based TAMP framework featuring a domain-independent interface. Sp… ▽ More

    Submitted 21 August, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

    Comments: IROS 2024. Codes available: https://github.com/AssassinWS/LLM-TAMP

  44. arXiv:2403.08010  [pdf, other

    cs.CL

    Debatrix: Multi-dimensional Debate Judge with Iterative Chronological Analysis Based on LLM

    Authors: Jingcong Liang, Rong Ye, Meng Han, Ruofei Lai, Xinyu Zhang, Xuanjing Huang, Zhongyu Wei

    Abstract: How can we construct an automated debate judge to evaluate an extensive, vibrant, multi-turn debate? This task is challenging, as judging a debate involves grappling with lengthy texts, intricate argument relationships, and multi-dimensional assessments. At the same time, current research mainly focuses on short dialogues, rarely touching upon the evaluation of an entire debate. In this paper, by… ▽ More

    Submitted 19 June, 2024; v1 submitted 12 March, 2024; originally announced March 2024.

  45. arXiv:2403.02763  [pdf, other

    quant-ph cond-mat.str-el

    Quantum Zeno Monte Carlo for computing observables

    Authors: Mancheon Han, Hyowon Park, Sangkook Choi

    Abstract: The recent development of logical quantum processors signifies a pivotal moment in the progression from the noisy intermediate-scale quantum (NISQ) era to the fault-tolerant quantum computing (FTQC) era. These advanced devices are poised to alter the approach to problems that challenge classical computation methods. By transforming such problems into Hamiltonian frameworks and exploiting quantum m… ▽ More

    Submitted 9 May, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

    Comments: 15 figures

  46. arXiv:2403.02075  [pdf, other

    cs.CV

    DiffMOT: A Real-time Diffusion-based Multiple Object Tracker with Non-linear Prediction

    Authors: Weiyi Lv, Yuhang Huang, Ning Zhang, Ruei-Sung Lin, Mei Han, Dan Zeng

    Abstract: In Multiple Object Tracking, objects often exhibit non-linear motion of acceleration and deceleration, with irregular direction changes. Tacking-by-detection (TBD) trackers with Kalman Filter motion prediction work well in pedestrian-dominant scenarios but fall short in complex situations when multiple objects perform non-linear and diverse motion simultaneously. To tackle the complex non-linear m… ▽ More

    Submitted 20 March, 2024; v1 submitted 4 March, 2024; originally announced March 2024.

    Comments: CVPR2024

  47. arXiv:2403.00257  [pdf

    cs.CV cs.LG

    Robust deep labeling of radiological emphysema subtypes using squeeze and excitation convolutional neural networks: The MESA Lung and SPIROMICS Studies

    Authors: Artur Wysoczanski, Nabil Ettehadi, Soroush Arabshahi, Yifei Sun, Karen Hinkley Stukovsky, Karol E. Watson, MeiLan K. Han, Erin D Michos, Alejandro P. Comellas, Eric A. Hoffman, Andrew F. Laine, R. Graham Barr, Elsa D. Angelini

    Abstract: Pulmonary emphysema, the progressive, irreversible loss of lung tissue, is conventionally categorized into three subtypes identifiable on pathology and on lung computed tomography (CT) images. Recent work has led to the unsupervised learning of ten spatially-informed lung texture patterns (sLTPs) on lung CT, representing distinct patterns of emphysematous lung parenchyma based on both textural app… ▽ More

    Submitted 29 February, 2024; originally announced March 2024.

  48. arXiv:2402.17685  [pdf, other

    physics.chem-ph

    Attosecond X-ray Chronoscopy of Core-level Photoemission

    Authors: Jia-Bao Ji, Zhaoheng Guo, Taran Driver, Cynthia S. Trevisan, David Cesar, Xinxin Cheng, Joseph Duris, Paris L. Franz, James Glownia, Xiaochun Gong, Daniel Hammerland, Meng Han, Saijoscha Heck, Matthias Hoffmann, Andrei Kamalov, Kirk A. Larsen, Xiang Li, Ming-Fu Lin, Yuchen Liu, C. William McCurdy, Razib Obaid, Jordan T. ONeal, Thomas N. Rescigno, River R. Robles, Nicholas Sudar , et al. (10 additional authors not shown)

    Abstract: Attosecond photoemission or photoionization delays are a unique probe of the structure and the electronic dynamics of matter. However, spectral congestion and spatial delocalization of valence electron wave functions set fundamental limits to the complexity of systems that can be studied and the information that can be retrieved, respectively. Using attosecond X-ray pulses from LCLS, we demonstrat… ▽ More

    Submitted 8 April, 2024; v1 submitted 27 February, 2024; originally announced February 2024.

  49. arXiv:2402.12070  [pdf, other

    math.FA

    M-ideals of compact operators and Norm attaining operators

    Authors: Manwook Han, Sun Kwang Kim

    Abstract: We investigate M-ideals of compact operators and two distinct properties in norm-attaining operator theory related with M-ideals of compact operators called the weak maximizing property and the compact perturbation property. For Banach spaces $X$ and $Y$, it is previously known that if $\mathcal{K}(X,Y)$ is an M-ideal or $(X,Y)$ has the weak maximizing property, then $(X,Y)$ has the adjoint compac… ▽ More

    Submitted 19 February, 2024; originally announced February 2024.

    Comments: 25 pages

  50. arXiv:2402.08176  [pdf, other

    hep-th gr-qc math-ph math.GT math.QA

    Representations of a quantum-deformed Lorentz algebra, Clebsch-Gordan map, and Fenchel-Nielsen representation of quantum complex flat connections at level-$k$

    Authors: Muxin Han

    Abstract: A family of infinite-dimensional irreducible $\star$-representations on $\mathcal{H}\simeq L^2(\mathbb{R})\otimes\mathbb{C}^k$ is defined for a quantum-deformed Lorentz algebra $U_\mathbf{q}(sl_2)\otimes U_{\tilde{\mathbf{q}}}(sl_2)$, where $\mathbf{q}=\exp[\frac{2πi}{k}(1+b^2)]$ and $\tilde{\mathbf{q}}=\exp[\frac{2πi}{k}(1+b^{-2})]$ with $k\in\mathbb{Z}_+$ and $|b|=1$. The representations are con… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

    Comments: 28 pages, 4 figures