Zum Hauptinhalt springen

Showing 1–50 of 619 results for author: Cheung, C

.
  1. arXiv:2408.14318  [pdf, other

    quant-ph physics.app-ph

    Blueprint for NV center ensemble based magnetometer: precise diamond sensor material characterization

    Authors: Jixing Zhang, Michael Kuebler, Cheuk Kit Cheung, Magnus Benke, Andrej Denisenko, Jens Anders, Emilio Corcione, Cristina Tarín Sauer, Junichi Isoya, Chen Zhang, Joerg Wrachtrup

    Abstract: The nitrogen-vacancy (NV) center in diamond is a promising candidate for various quantum applications, such as quantum sensing. High sensitivity in NV-based magnetic sensing requires a diamond sample with a high density of NV centers and a long electron spin dephasing time. In this work, we propose a systematic measurement method for determining the electron spin dephasing time of NV center ensemb… ▽ More

    Submitted 26 August, 2024; originally announced August 2024.

  2. arXiv:2408.04720  [pdf, other

    hep-th cs.LG hep-ph

    Learning the Simplicity of Scattering Amplitudes

    Authors: Clifford Cheung, Aurélien Dersy, Matthew D. Schwartz

    Abstract: The simplification and reorganization of complex expressions lies at the core of scientific progress, particularly in theoretical high-energy physics. This work explores the application of machine learning to a particular facet of this challenge: the task of simplifying scattering amplitudes expressed in terms of spinor-helicity variables. We demonstrate that an encoder-decoder transformer archite… ▽ More

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: 25+15 pages, 9+6 figures

    Report number: CALT-TH 2024-031

  3. arXiv:2408.03362  [pdf, other

    hep-th gr-qc hep-ph

    Uniqueness Criteria for the Virasoro-Shapiro Amplitude

    Authors: Clifford Cheung, Aaron Hillman, Grant N. Remmen

    Abstract: The Veneziano amplitude has recently been uniquely bootstrapped from crossing symmetry, faster than power-law falloff at high energies, and a property dubbed level truncation. In this paper we apply this bootstrap approach to fully permutation invariant amplitudes, deriving new deformations of the Virasoro-Shapiro amplitude for graviton scattering in string theory. Superpolynomially soft Regge beh… ▽ More

    Submitted 6 August, 2024; originally announced August 2024.

    Comments: 6 pages, 2 figures

    Report number: CALT-TH 2024-030

  4. arXiv:2408.00477  [pdf, other

    physics.atom-ph

    A neural network approach to running high-precision atomic computations

    Authors: Pavlo Bilous, Charles Cheung, Marianna Safronova

    Abstract: Modern applications of atomic physics, including the determination of frequency standards, and the analysis of astrophysical spectra, require prediction of atomic properties with exquisite accuracy. For complex atomic systems, high-precision calculations are a major challenge due to the exponential scaling of the involved electronic configuration sets. This exacerbates the problem of required comp… ▽ More

    Submitted 1 August, 2024; originally announced August 2024.

    Comments: 10 pages, 3 figures, 4 tables

  5. arXiv:2407.17610  [pdf, ps, other

    physics.atom-ph

    Pr10+ as a candidate for a high-accuracy optical clock for tests of fundamental physics

    Authors: S. G. Porsev, C. Cheung, M. S. Safronova, H. Bekker, N. -H. Rehbehn, J. R. Crespo Lopez-Urrutia, S. M. Brewer

    Abstract: We propose In-like Pr10+ as a candidate for the development of a high-accuracy optical clock with high sensitivity to a time variation of the fine-structure constant, (\dot alpha}/alpha, as well as favorable experimental systematics. We calculate its low-lying energy levels by combining the configuration interaction and the coupled cluster method, achieving uncertainties as low as 0.1%, and improv… ▽ More

    Submitted 24 July, 2024; originally announced July 2024.

    Comments: 9 pages, 2 figures

  6. arXiv:2407.13390  [pdf, other

    cs.CV

    GeometrySticker: Enabling Ownership Claim of Recolorized Neural Radiance Fields

    Authors: Xiufeng Huang, Ka Chun Cheung, Simon See, Renjie Wan

    Abstract: Remarkable advancements in the recolorization of Neural Radiance Fields (NeRF) have simplified the process of modifying NeRF's color attributes. Yet, with the potential of NeRF to serve as shareable digital assets, there's a concern that malicious users might alter the color of NeRF models and falsely claim the recolorized version as their own. To safeguard against such breaches of ownership, enab… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  7. arXiv:2407.10510  [pdf, other

    cs.CL cs.AI cs.CE

    TCM-FTP: Fine-Tuning Large Language Models for Herbal Prescription Prediction

    Authors: Xingzhi Zhou, Xin Dong, Chunhao Li, Yuning Bai, Yulong Xu, Ka Chun Cheung, Simon See, Xinpeng Song, Runshun Zhang, Xuezhong Zhou, Nevin L. Zhang

    Abstract: Traditional Chinese medicine (TCM) relies on specific combinations of herbs in prescriptions to treat symptoms and signs, a practice that spans thousands of years. Predicting TCM prescriptions presents a fascinating technical challenge with practical implications. However, this task faces limitations due to the scarcity of high-quality clinical datasets and the intricate relationship between sympt… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  8. arXiv:2407.07735  [pdf, other

    cs.CV

    Protecting NeRFs' Copyright via Plug-And-Play Watermarking Base Model

    Authors: Qi Song, Ziyuan Luo, Ka Chun Cheung, Simon See, Renjie Wan

    Abstract: Neural Radiance Fields (NeRFs) have become a key method for 3D scene representation. With the rising prominence and influence of NeRF, safeguarding its intellectual property has become increasingly important. In this paper, we propose \textbf{NeRFProtector}, which adopts a plug-and-play strategy to protect NeRF's copyright during its creation. NeRFProtector utilizes a pre-trained watermarking base… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Accepted by ECCV2024

  9. arXiv:2406.17245  [pdf, other

    cs.LG cs.AI cs.CL

    Unlocking Continual Learning Abilities in Language Models

    Authors: Wenyu Du, Shuang Cheng, Tongxu Luo, Zihan Qiu, Zeyu Huang, Ka Chun Cheung, Reynold Cheng, Jie Fu

    Abstract: Language models (LMs) exhibit impressive performance and generalization capabilities. However, LMs struggle with the persistent challenge of catastrophic forgetting, which undermines their long-term sustainability in continual learning (CL). Existing approaches usually address the issue by incorporating old task data or task-wise inductive bias into LMs. However, old data and accurate task informa… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: preprint, 19 pages

  10. arXiv:2406.14770  [pdf, other

    hep-th gr-qc hep-ph

    Gravitational Scattering and Beyond from Extreme Mass Ratio Effective Field Theory

    Authors: Clifford Cheung, Julio Parra-Martinez, Ira Z. Rothstein, Nabha Shah, Jordan Wilson-Gerow

    Abstract: We explore a recently proposed effective field theory describing electromagnetically or gravitationally interacting massive particles in an expansion about their mass ratio, also known as the self-force (SF) expansion. By integrating out the deviation of the heavy particle about its inertial trajectory, we obtain an effective action whose only degrees of freedom are the lighter particle together w… ▽ More

    Submitted 20 June, 2024; originally announced June 2024.

    Comments: 77 pages, 10 figures

    Report number: CALT-TH 2024-023

  11. arXiv:2406.12018  [pdf, other

    cs.CL

    CItruS: Chunked Instruction-aware State Eviction for Long Sequence Modeling

    Authors: Yu Bai, Xiyuan Zou, Heyan Huang, Sanxing Chen, Marc-Antoine Rondeau, Yang Gao, Jackie Chi Kit Cheung

    Abstract: Long sequence modeling has gained broad interest as large language models (LLMs) continue to advance. Recent research has identified that a large portion of hidden states within the key-value caches of Transformer models can be discarded (also termed evicted) without affecting the perplexity performance in generating long sequences. However, we show that these methods, despite preserving perplexit… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Work in progress

  12. arXiv:2406.11536  [pdf, other

    cs.DC cs.CV

    RO-SVD: A Reconfigurable Hardware Copyright Protection Framework for AIGC Applications

    Authors: Zhuoheng Ran, Muhammad A. A. Abdelgawad, Zekai Zhang, Ray C. C. Cheung, Hong Yan

    Abstract: The dramatic surge in the utilisation of generative artificial intelligence (GenAI) underscores the need for a secure and efficient mechanism to responsibly manage, use and disseminate multi-dimensional data generated by artificial intelligence (AI). In this paper, we propose a blockchain-based copyright traceability framework called ring oscillator-singular value decomposition (RO-SVD), which int… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Accepted on 20 May 2024 as a full paper at ASAP 2024

  13. arXiv:2406.09317  [pdf, other

    eess.IV cs.CV

    Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases

    Authors: Meng Wang, Tian Lin, Aidi Lin, Kai Yu, Yuanyuan Peng, Lianyu Wang, Cheng Chen, Ke Zou, Huiyu Liang, Man Chen, Xue Yao, Meiqin Zhang, Binwei Huang, Chaoxin Zheng, Peixin Zhang, Wei Chen, Yilong Luo, Yifan Chen, Honghe Xia, Tingkun Shi, Qi Zhang, Jinming Guo, Xiaolin Chen, Jingcheng Wang, Yih Chung Tham , et al. (24 additional authors not shown)

    Abstract: Previous foundation models for retinal images were pre-trained with limited disease categories and knowledge base. Here we introduce RetiZero, a vision-language foundation model that leverages knowledge from over 400 fundus diseases. To RetiZero's pre-training, we compiled 341,896 fundus images paired with text descriptions, sourced from public datasets, ophthalmic literature, and online resources… ▽ More

    Submitted 30 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

  14. arXiv:2406.08723  [pdf, other

    cs.CL

    ECBD: Evidence-Centered Benchmark Design for NLP

    Authors: Yu Lu Liu, Su Lin Blodgett, Jackie Chi Kit Cheung, Q. Vera Liao, Alexandra Olteanu, Ziang Xiao

    Abstract: Benchmarking is seen as critical to assessing progress in NLP. However, creating a benchmark involves many design decisions (e.g., which datasets to include, which metrics to use) that often rely on tacit, untested assumptions about what the benchmark is intended to measure or is actually measuring. There is currently no principled way of analyzing these decisions and how they impact the validity… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  15. arXiv:2406.07640  [pdf, other

    cs.LG cs.AI

    When is an Embedding Model More Promising than Another?

    Authors: Maxime Darrin, Philippe Formont, Ismail Ben Ayed, Jackie CK Cheung, Pablo Piantanida

    Abstract: Embedders play a central role in machine learning, projecting any object into numerical representations that can, in turn, be leveraged to perform various downstream tasks. The evaluation of embedding models typically depends on domain-specific empirical approaches utilizing downstream tasks, primarily because of the lack of a standardized framework for comparison. However, acquiring adequately la… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  16. arXiv:2406.07359  [pdf, other

    cs.CL

    GLIMPSE: Pragmatically Informative Multi-Document Summarization for Scholarly Reviews

    Authors: Maxime Darrin, Ines Arous, Pablo Piantanida, Jackie CK Cheung

    Abstract: Scientific peer review is essential for the quality of academic publications. However, the increasing number of paper submissions to conferences has strained the reviewing process. This surge poses a burden on area chairs who have to carefully read an ever-growing volume of reviews and discern each reviewer's main arguments as part of their decision process. In this paper, we introduce \sys, a sum… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  17. arXiv:2406.02665  [pdf, other

    hep-th hep-ph

    A Bootstrap Principle for the Spectrum and Scattering of Strings

    Authors: Clifford Cheung, Aaron Hillman, Grant N. Remmen

    Abstract: We show that the Veneziano amplitude of string theory is the unique solution to an analytically solvable bootstrap problem. Uniqueness follows from two assumptions: faster than power-law falloff in high-energy scattering and the existence of some infinite sequence in momentum transfer at which higher-spin exchanges cancel. The string amplitude$\unicode{x2013}$including the mass spectrum… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 5 pages (+ 2 pages supplementary), 1 figure

    Report number: CALT-TH 2024-022

  18. arXiv:2405.20913  [pdf, other

    cond-mat.mtrl-sci

    Coexisting charge density waves in twisted bilayer NbSe2

    Authors: Christopher T. S. Cheung, Zachary A. H. Goodwin, Yixuan Han, Jiong Lu, Arash A. Mostofi, Johannes Lischner

    Abstract: Twisted bilayers of two-dimensional materials have emerged as a highly tunable platform for studying broken symmetry phases. While most interest has been focused on emergent states in systems whose constituent monolayers do not feature broken symmetry states, assembling monolayers that exhibit ordered states into twisted bilayers can also give rise to interesting phenomena. Here, we use large-scal… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

  19. arXiv:2405.17757  [pdf, other

    cs.CE

    NASPrecision: Neural Architecture Search-Driven Multi-Stage Learning for Surface Roughness Prediction in Ultra-Precision Machining

    Authors: Penghui Ruan, Divya Saxena, Jiannong Cao, Xiaoyun Liu, Ruoxin Wang, Chi Fai Cheung

    Abstract: Accurate surface roughness prediction is critical for ensuring high product quality, especially in areas like manufacturing and aerospace, where the smallest imperfections can compromise performance or safety. However, this is challenging due to complex, non-linear interactions among variables, which is further exacerbated with limited and imbalanced datasets. Existing methods using traditional ma… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  20. arXiv:2405.15724  [pdf, other

    cs.CG cs.DM cs.RO

    Reconfiguration Algorithms for Cubic Modular Robots with Realistic Movement Constraints

    Authors: MIT--NASA Space Robots Team, Josh Brunner, Kenneth C. Cheung, Erik D. Demaine, Jenny Diomidova, Christine Gregg, Della H. Hendrickson, Irina Kostitsyna

    Abstract: We introduce and analyze a model for self-reconfigurable robots made up of unit-cube modules. Compared to past models, our model aims to newly capture two important practical aspects of real-world robots. First, modules often do not occupy an exact unit cube, but rather have features like bumps extending outside the allotted space so that modules can interlock. Thus, for example, our model forbids… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  21. arXiv:2405.02594  [pdf, other

    cs.LG stat.ML

    Leveraging (Biased) Information: Multi-armed Bandits with Offline Data

    Authors: Wang Chi Cheung, Lixing Lyu

    Abstract: We leverage offline data to facilitate online learning in stochastic multi-armed bandits. The probability distributions that govern the offline data and the online rewards can be different. Without any non-trivial upper bound on their difference, we show that no non-anticipatory policy can outperform the UCB policy by (Auer et al. 2002), even in the presence of offline data. In complement, we prop… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: 24 pages, 5 figures. Accepted to ICML 2024

  22. arXiv:2405.01189  [pdf, other

    cs.LG cs.AI

    Gradient-Congruity Guided Federated Sparse Training

    Authors: Chris Xing Tian, Yibing Liu, Haoliang Li, Ray C. C. Cheung, Shiqi Wang

    Abstract: Edge computing allows artificial intelligence and machine learning models to be deployed on edge devices, where they can learn from local data and collaborate to form a global model. Federated learning (FL) is a distributed machine learning technique that facilitates this process while preserving data privacy. However, FL also faces challenges such as high computational and communication costs reg… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  23. arXiv:2404.18416  [pdf, other

    cs.AI cs.CL cs.CV cs.LG

    Capabilities of Gemini Models in Medicine

    Authors: Khaled Saab, Tao Tu, Wei-Hung Weng, Ryutaro Tanno, David Stutz, Ellery Wulczyn, Fan Zhang, Tim Strother, Chunjong Park, Elahe Vedadi, Juanma Zambrano Chaves, Szu-Yeu Hu, Mike Schaekermann, Aishwarya Kamath, Yong Cheng, David G. T. Barrett, Cathy Cheung, Basil Mustafa, Anil Palepu, Daniel McDuff, Le Hou, Tomer Golany, Luyang Liu, Jean-baptiste Alayrac, Neil Houlsby , et al. (42 additional authors not shown)

    Abstract: Excellence in a wide variety of medical applications poses considerable challenges for AI, requiring advanced reasoning, access to up-to-date medical knowledge and understanding of complex multimodal data. Gemini models, with strong general capabilities in multimodal and long-context reasoning, offer exciting possibilities in medicine. Building on these core strengths of Gemini, we introduce Med-G… ▽ More

    Submitted 1 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  24. arXiv:2404.14589  [pdf, other

    physics.atom-ph astro-ph.HE astro-ph.IM astro-ph.SR physics.plasm-ph

    Natural-linewidth measurements of the 3C and 3D soft-x-ray transitions in Ni XIX

    Authors: Chintan Shah, Steffen Kühn, Sonja Bernitt, René Steinbrügge, Moto Togawa, Lukas Berger, Jens Buck, Moritz Hoesch, Jörn Seltmann, Mikhail G. Kozlov, Sergey G. Porsev, Ming Feng Gu, F. Scott Porter, Thomas Pfeifer, Maurice A. Leutenegger, Charles Cheung, Marianna S. Safronova, José R. Crespo López-Urrutia

    Abstract: We used the monochromatic soft-x-ray beamline P04 at the synchrotron-radiation facility PETRA III to resonantly excite the strongest $2p-3d$ transitions in neon-like Ni XIX ions, $[2p^6]_{J=0} \rightarrow [(2p^5)_{1/2}\,3d_{3/2}]_{J=1}$ and $[2p^6]_{J=0} \rightarrow [(2p^5)_{3/2}\,3d_{5/2}]_{J=1}$, respectively dubbed 3C and 3D, achieving a resolving power of 15\,000 and signal-to-background ratio… ▽ More

    Submitted 17 June, 2024; v1 submitted 22 April, 2024; originally announced April 2024.

    Comments: 10 pages, 3 figures, 3 tables, published

    Journal ref: Physical Review A 109, 063108 (2024)

  25. arXiv:2404.10487  [pdf, other

    astro-ph.HE astro-ph.GA astro-ph.SR

    Early-time gamma-ray constraints on cosmic-ray acceleration in the core-collapse SN 2023ixf with the Fermi Large Area Telescope

    Authors: G. Martí-Devesa, C. C. Cheung, N. Di Lalla, M. Renaud, G. Principe, N. Omodei, F. Acero

    Abstract: While SNRs have been considered the most relevant Galactic CR accelerators for decades, CCSNe could accelerate particles during the earliest stages of their evolution and hence contribute to the CR energy budget in the Galaxy. Some SNRs have indeed been associated with TeV gamma-rays, yet proton acceleration efficiency during the early stages of an SN expansion remains mostly unconstrained. The mu… ▽ More

    Submitted 16 April, 2024; originally announced April 2024.

    Comments: Accepted in A&A. 13 pages, 12 figures, 4 tables

  26. arXiv:2404.00727  [pdf, other

    cs.CL

    A Controlled Reevaluation of Coreference Resolution Models

    Authors: Ian Porada, Xiyuan Zou, Jackie Chi Kit Cheung

    Abstract: All state-of-the-art coreference resolution (CR) models involve finetuning a pretrained language model. Whether the superior performance of one CR model over another is due to the choice of language model or other factors, such as the task-specific architecture, is difficult or impossible to determine due to lack of a standardized experimental setup. To resolve this ambiguity, we systematically ev… ▽ More

    Submitted 22 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Comments: LREC-COLING 2024

  27. arXiv:2403.18167  [pdf, other

    cs.CL cs.AI

    Mechanistic Understanding and Mitigation of Language Model Non-Factual Hallucinations

    Authors: Lei Yu, Meng Cao, Jackie Chi Kit Cheung, Yue Dong

    Abstract: State-of-the-art language models (LMs) sometimes generate non-factual hallucinations that misalign with world knowledge. To explore the mechanistic causes of these hallucinations, we create diagnostic datasets with subject-relation queries and adapt interpretability methods to trace hallucinations through internal model representations. We discover two general and distinct mechanistic causes of ha… ▽ More

    Submitted 17 June, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  28. arXiv:2403.13213  [pdf, other

    cs.LG cs.CL cs.CY

    From Representational Harms to Quality-of-Service Harms: A Case Study on Llama 2 Safety Safeguards

    Authors: Khaoula Chehbouni, Megha Roshan, Emmanuel Ma, Futian Andrew Wei, Afaf Taik, Jackie CK Cheung, Golnoosh Farnadi

    Abstract: Recent progress in large language models (LLMs) has led to their widespread adoption in various domains. However, these advancements have also introduced additional safety risks and raised concerns regarding their detrimental impact on already marginalized populations. Despite growing mitigation efforts to develop safety safeguards, such as supervised safety-oriented fine-tuning and leveraging saf… ▽ More

    Submitted 5 July, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: 9 pages, 4 figures. Accepted to Findings of the Association for Computational Linguistics: ACL 2024

  29. arXiv:2403.02330  [pdf, other

    cs.CV

    RegionGPT: Towards Region Understanding Vision Language Model

    Authors: Qiushan Guo, Shalini De Mello, Hongxu Yin, Wonmin Byeon, Ka Chun Cheung, Yizhou Yu, Ping Luo, Sifei Liu

    Abstract: Vision language models (VLMs) have experienced rapid advancements through the integration of large language models (LLMs) with image-text pairs, yet they struggle with detailed regional visual understanding due to limited spatial awareness of the vision encoder, and the use of coarse-grained training data that lacks detailed, region-specific captions. To address this, we introduce RegionGPT (short… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  30. arXiv:2403.01837  [pdf, other

    hep-th gr-qc hep-ph

    Generalized Symmetry in Dynamical Gravity

    Authors: Clifford Cheung, Maria Derda, Joon-Hwi Kim, Vinicius Nevoa, Ira Rothstein, Nabha Shah

    Abstract: We explore generalized symmetry in the context of nonlinear dynamical gravity. Our basic strategy is to transcribe known results from Yang-Mills theory directly to gravity via the tetrad formalism, which recasts general relativity as a gauge theory of the local Lorentz group. By analogy, we deduce that gravity exhibits a one-form symmetry implemented by an operator $U_α$ labeled by a center elemen… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: 60 pages, 13 figures

    Report number: CALT-TH 2024-009

  31. arXiv:2402.19457  [pdf, other

    cs.CL cs.AI

    $\texttt{COSMIC}$: Mutual Information for Task-Agnostic Summarization Evaluation

    Authors: Maxime Darrin, Philippe Formont, Jackie Chi Kit Cheung, Pablo Piantanida

    Abstract: Assessing the quality of summarizers poses significant challenges. In response, we propose a novel task-oriented evaluation approach that assesses summarizers based on their capacity to produce summaries that are useful for downstream tasks, while preserving task outcomes. We theoretically establish a direct relationship between the resulting error probability of these tasks and the mutual informa… ▽ More

    Submitted 14 August, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

    Comments: ACL 2024

  32. arXiv:2402.19090  [pdf, ps, other

    cs.LG

    Best Arm Identification with Resource Constraints

    Authors: Zitian Li, Wang Chi Cheung

    Abstract: Motivated by the cost heterogeneity in experimentation across different alternatives, we study the Best Arm Identification with Resource Constraints (BAIwRC) problem. The agent aims to identify the best arm under resource constraints, where resources are consumed for each arm pull. We make two novel contributions. We design and analyze the Successive Halving with Resource Rationing algorithm (SH-R… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  33. General characterisation of Hamiltonians generating velocity-independent forces

    Authors: Fredy Yip, A. C. H. Cheung

    Abstract: Dynamics generated from Hamiltonians enjoy potential pathways to quantisation, but standard Hamiltonians are only capable of generating conservative forces. Classes of Hamiltonians have been proposed in Berry et al. capable of generating non-conservative velocity-independent forces. Such Hamiltonians have been classified in the past, under the strict assumption that they are polynomial in momentum… ▽ More

    Submitted 26 June, 2024; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: 27 pages

    Journal ref: J. Phys. A: Math. Theor. 57 275203 (2024)

  34. arXiv:2401.15977  [pdf, other

    cs.CV

    Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling

    Authors: Xiaoyu Shi, Zhaoyang Huang, Fu-Yun Wang, Weikang Bian, Dasong Li, Yi Zhang, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li

    Abstract: We introduce Motion-I2V, a novel framework for consistent and controllable image-to-video generation (I2V). In contrast to previous methods that directly learn the complicated image-to-video mapping, Motion-I2V factorizes I2V into two stages with explicit motion modeling. For the first stage, we propose a diffusion-based motion field predictor, which focuses on deducing the trajectories of the ref… ▽ More

    Submitted 31 January, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: Project page: https://xiaoyushi97.github.io/Motion-I2V/

  35. arXiv:2401.14619  [pdf, other

    cs.LG

    Resilient Practical Test-Time Adaptation: Soft Batch Normalization Alignment and Entropy-driven Memory Bank

    Authors: Xingzhi Zhou, Zhiliang Tian, Ka Chun Cheung, Simon See, Nevin L. Zhang

    Abstract: Test-time domain adaptation effectively adjusts the source domain model to accommodate unseen domain shifts in a target domain during inference. However, the model performance can be significantly impaired by continuous distribution changes in the target domain and non-independent and identically distributed (non-i.i.d.) test samples often encountered in practical scenarios. While existing memory… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  36. arXiv:2401.11323  [pdf, other

    cs.CL

    Identifying and Analyzing Task-Encoding Tokens in Large Language Models

    Authors: Yu Bai, Heyan Huang, Cesare Spinoso-Di Piano, Marc-Antoine Rondeau, Sanxing Chen, Yang Gao, Jackie Chi Kit Cheung

    Abstract: In-context learning (ICL) has become an effective solution for few-shot learning in natural language processing. However, our understanding of ICL's working mechanisms is limited, specifically regarding how models learn to perform tasks from ICL demonstrations. For example, unexpectedly large changes in performance can arise from small changes in the prompt, leaving prompt design a largely empiric… ▽ More

    Submitted 16 February, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

    Comments: Work in progress

  37. arXiv:2401.08395  [pdf, other

    physics.atom-ph astro-ph.HE astro-ph.IM astro-ph.SR physics.plasm-ph

    High-Precision Transition Energy Measurements of Neon-like Fe XVII Ions

    Authors: Chintan Shah, Moto Togawa, Marc Botz, Jonas Danisch, Joschka J. Goes, Sonja Bernitt, Marleen Maxton, Kai Köbnick, Jen Buck, Jörn Seltmann, Moritz Hoesch, Ming Feng Gu, F. Scott Porter, Thomas Pfeifer, Maurice A. Leutenegger, Charles Cheung, Marianna S. Safronova, José R. Crespo López-Urrutia

    Abstract: We improve by a factor of 4-20 the energy accuracy of the strongest soft X-ray transitions of Fe XVII ions by resonantly exciting them in an electron beam ion trap with a monochromatic beam at the P04 beamline of the PETRA III synchrotron facility. By simultaneously tracking instantaneous photon-energy fluctuations with a high-resolution photoelectron spectrometer, we minimize systematic uncertain… ▽ More

    Submitted 15 July, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: 14 pages, 2 figures, 4 tables, published version

    Journal ref: The Astrophysical Journal, 969, 52 (2024)

  38. Characterizing the Gamma-ray Emission Properties of the Globular Cluster M5 with the Fermi-LAT

    Authors: X. Hou, W. Zhang, P. C. C. Freire, D. F. Torres, J. Ballet, D. A. Smith, T. J. Johnson, M. Kerr, C. C. Cheung, L. Guillemot, J. Li, L. Zhang, A. Ridolfi, P. Wang, D. Li, J. Yuan, N. Wang

    Abstract: We analyzed the globular cluster M5 (NGC 5904) using 15 years of gamma-ray data from the Fermi Large Area Telescope (LAT). Using rotation ephemerides generated from Arecibo and FAST radio telescope observations, we searched for gamma-ray pulsations from the seven millisecond pulsars (MSPs) identified in M5. We detected no significant pulsations from any of the individual pulsars. Also, we searched… ▽ More

    Submitted 23 March, 2024; v1 submitted 16 January, 2024; originally announced January 2024.

    Comments: 14 pages, 5 figures, 3 tables, published in ApJ

  39. arXiv:2401.05914  [pdf, other

    cs.CL cs.AI

    How Teachers Can Use Large Language Models and Bloom's Taxonomy to Create Educational Quizzes

    Authors: Sabina Elkins, Ekaterina Kochmar, Jackie C. K. Cheung, Iulian Serban

    Abstract: Question generation (QG) is a natural language processing task with an abundance of potential benefits and use cases in the educational domain. In order for this potential to be realized, QG systems must be designed and validated with pedagogical needs in mind. However, little research has assessed or designed QG approaches with the input from real teachers or students. This paper applies a large… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: 8 pages, 8 figures. Accepted to the main track of the EAAI-24: The 14th Symposium on Educational Advances in Artificial Intelligence

  40. Multiparticle Factorization and the Rigidity of String Theory

    Authors: Nima Arkani-Hamed, Clifford Cheung, Carolina Figueiredo, Grant N. Remmen

    Abstract: Is string theory uniquely determined by self-consistency? Causality and unitarity seemingly permit a multitude of putative deformations, at least at the level of two-to-two scattering. Motivated by this question, we initiate a systematic exploration of the constraints on scattering from higher-point factorization, which imposes extraordinarily restrictive sum rules on the residues and spectra defi… ▽ More

    Submitted 18 March, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

    Comments: 5 pages (+ 4 pages supplementary), 2 figures

    Report number: CALT-TH 2023-051

    Journal ref: Phys. Rev. Lett. 132, 091601 (2024)

  41. arXiv:2312.01858  [pdf, other

    cs.CL

    Evaluating Dependencies in Fact Editing for Language Models: Specificity and Implication Awareness

    Authors: Zichao Li, Ines Arous, Siva Reddy, Jackie C. K. Cheung

    Abstract: The potential of using a large language model (LLM) as a knowledge base (KB) has sparked significant interest. To manage the knowledge acquired by LLMs, we need to ensure that the editing of learned facts respects internal logical constraints, which are known as dependency of knowledge. Existing work on editing LLMs has partially addressed the issue of dependency, when the editing of a fact should… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Findings of EMNLP2023

  42. arXiv:2311.11103  [pdf, other

    cs.CL

    Responsible AI Considerations in Text Summarization Research: A Review of Current Practices

    Authors: Yu Lu Liu, Meng Cao, Su Lin Blodgett, Jackie Chi Kit Cheung, Alexandra Olteanu, Adam Trischler

    Abstract: AI and NLP publication venues have increasingly encouraged researchers to reflect on possible ethical considerations, adverse impacts, and other responsible AI issues their work might engender. However, for specific NLP tasks our understanding of how prevalent such issues are, or when and why these issues are likely to arise, remains limited. Focusing on text summarization -- a common NLP task lar… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

  43. arXiv:2311.04921  [pdf, other

    cs.CL cs.AI

    Successor Features for Efficient Multisubject Controlled Text Generation

    Authors: Meng Cao, Mehdi Fatemi, Jackie Chi Kit Cheung, Samira Shabanian

    Abstract: While large language models (LLMs) have achieved impressive performance in generating fluent and realistic text, controlling the generated text so that it exhibits properties such as safety, factuality, and non-toxicity remains challenging. % such as DExperts, GeDi, and rectification Existing decoding-based methods are static in terms of the dimension of control; if the target subject is changed,… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  44. arXiv:2311.00390  [pdf, other

    cs.RO

    A Modular Pneumatic Soft Gripper Design for Aerial Grasping and Landing

    Authors: Hiu Ching Cheung, Ching-Wei Chang, Bailun Jiang, Chih-Yung Wen, Henry K. Chu

    Abstract: Aerial robots have garnered significant attention due to their potential applications in various industries, such as inspection, search and rescue, and drone delivery. Successful missions often depend on the ability of these robots to grasp and land effectively. This paper presents a novel modular soft gripper design tailored explicitly for aerial grasping and landing operations. The proposed modu… ▽ More

    Submitted 25 March, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: 7 pages, 13 figures, accepted by IEEE RoboSoft 2024

  45. arXiv:2310.03886  [pdf

    physics.app-ph cond-mat.mtrl-sci

    Demonstration of a monocrystalline GaAs-$β$-Ga$_2$O$_3$ p-n heterojunction

    Authors: Jie Zhou, Moheb Sheikhi, Ashok Dheenan, Haris Abbasi, Jiarui Gong, Yang Liu, Carolina Adamo, Patrick Marshall, Nathan Wriedt, Clincy Cheung, Shuoyang Qiu, Tien Khee Ng, Qiaoqiang Gan, Vincent Gambin, Boon S. Ooi, Siddharth Rajan, Zhenqiang Ma

    Abstract: In this work, we report the fabrication and characterizations of a monocrystalline GaAs/$β$-Ga$_2$O$_3$ p-n heterojunction by employing semiconductor grafting technology. The heterojunction was created by lifting off and transfer printing a p-type GaAs single crystal nanomembrane to an Al$_2$O$_3$-coated n-type$β$-Ga$_2$O$_3$ epitaxial substrate. The resultant heterojunction diodes exhibit remarka… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

    Comments: 14 pages, 5 figures

  46. arXiv:2310.01717  [pdf, other

    cs.CL cs.AI cs.LG

    Ensemble Distillation for Unsupervised Constituency Parsing

    Authors: Behzad Shayegh, Yanshuai Cao, Xiaodan Zhu, Jackie C. K. Cheung, Lili Mou

    Abstract: We investigate the unsupervised constituency parsing task, which organizes words and phrases of a sentence into a hierarchical structure without using linguistically annotated data. We observe that existing unsupervised parsers capture differing aspects of parsing structures, which can be leveraged to enhance unsupervised parsing performance. To this end, we propose a notion of "tree averaging," b… ▽ More

    Submitted 25 April, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted by International Conference on Learning Representations (ICLR) 2024

  47. arXiv:2310.01642  [pdf, other

    cs.SE cs.AI

    Naming Practices of Pre-Trained Models in Hugging Face

    Authors: Wenxin Jiang, Chingwo Cheung, Mingyu Kim, Heesoo Kim, George K. Thiruvathukal, James C. Davis

    Abstract: As innovation in deep learning continues, many engineers seek to adopt Pre-Trained Models (PTMs) as components in computer systems. Researchers publish PTMs, which engineers adapt for quality or performance prior to deployment. PTM authors should choose appropriate names for their PTMs, which would facilitate model discovery and reuse. However, prior research has reported that model names are not… ▽ More

    Submitted 28 March, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 21 pages

  48. arXiv:2309.17269  [pdf, ps, other

    eess.IV cs.CV

    Unpaired Optical Coherence Tomography Angiography Image Super-Resolution via Frequency-Aware Inverse-Consistency GAN

    Authors: Weiwen Zhang, Dawei Yang, Haoxuan Che, An Ran Ran, Carol Y. Cheung, Hao Chen

    Abstract: For optical coherence tomography angiography (OCTA) images, a limited scanning rate leads to a trade-off between field-of-view (FOV) and imaging resolution. Although larger FOV images may reveal more parafoveal vascular lesions, their application is greatly hampered due to lower resolution. To increase the resolution, previous works only achieved satisfactory performance by using paired data for t… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

    Comments: 10 pages, 9 figures

  49. arXiv:2309.10956  [pdf, other

    astro-ph.GA astro-ph.HE

    Powerful Radio Sources in the Southern Sky. II. A SWIFT X-Ray Perspective

    Authors: F. Massaro, S. V. White, A. Paggi, A. Jimenez-Gallardo, J. P. Madrid, C. Mazzucchelli, W. R. Forman, A. Capetti, C. Leto, A. Garcia-Perez, C. C. Cheung, V. Chavushyan, N. P. H. Nesvadba, I. Andruchow, H. A. Pena-Herazo, E. Sani, R. Grossova, V. Reynaldi, R. P. Kraft, B. Balmaverde, S. Cellone

    Abstract: We recently constructed the G4Jy-3CRE, a catalog of extragalactic radio sources based on the GLEAM 4-Jy (G4Jy) sample, with the aim of increasing the number of powerful radio galaxies and quasars with similar selection criteria to those of the revised release of the Third Cambridge catalog (3CR). The G4Jy-3CRE consists of a total of 264 radio sources mainly visible from the Southern Hemisphere. He… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: 35 pages, 17 figures, 2 tables; second paper of a series, pre-proof version

    Journal ref: The Astrophysical Journal Supplement Series, 268, 32 (2023)

  50. arXiv:2309.10349  [pdf, other

    astro-ph.SR astro-ph.EP physics.space-ph

    Resolving moving heliospheric structures using interplanetary scintillation observations with the Murchison Widefield Array

    Authors: A. Waszewski, J. S. Morgan, R. Chhetri, R. Ekers, M. C. M. Cheung, N. D. R Bhat, M. Johnston-Hollitt

    Abstract: We have conducted a blind search in 49 consecutive days of interplanetary scintillation observations made by the Murchison Widefield Array from mid-2019, with overlapping daily observations approximately East and South-East of the Sun at an elongation of $\sim$30 degrees and a field of view of 30 degrees. These observations detect an unprecedented density of sources. In spite of these observations… ▽ More

    Submitted 19 September, 2023; originally announced September 2023.

    Comments: 14 pages, 5 figures