Skip to main content

Showing 1–50 of 160 results for author: Cheung, C

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.13390  [pdf, other

    cs.CV

    GeometrySticker: Enabling Ownership Claim of Recolorized Neural Radiance Fields

    Authors: Xiufeng Huang, Ka Chun Cheung, Simon See, Renjie Wan

    Abstract: Remarkable advancements in the recolorization of Neural Radiance Fields (NeRF) have simplified the process of modifying NeRF's color attributes. Yet, with the potential of NeRF to serve as shareable digital assets, there's a concern that malicious users might alter the color of NeRF models and falsely claim the recolorized version as their own. To safeguard against such breaches of ownership, enab… ▽ More

    Submitted 18 July, 2024; originally announced July 2024.

  2. arXiv:2407.10510  [pdf, other

    cs.CL cs.AI cs.CE

    TCM-FTP: Fine-Tuning Large Language Models for Herbal Prescription Prediction

    Authors: Xingzhi Zhou, Xin Dong, Chunhao Li, Yuning Bai, Yulong Xu, Ka Chun Cheung, Simon See, Xinpeng Song, Runshun Zhang, Xuezhong Zhou, Nevin L. Zhang

    Abstract: Traditional Chinese medicine (TCM) relies on specific combinations of herbs in prescriptions to treat symptoms and signs, a practice that spans thousands of years. Predicting TCM prescriptions presents a fascinating technical challenge with practical implications. However, this task faces limitations due to the scarcity of high-quality clinical datasets and the intricate relationship between sympt… ▽ More

    Submitted 15 July, 2024; originally announced July 2024.

  3. arXiv:2407.07735  [pdf, other

    cs.CV

    Protecting NeRFs' Copyright via Plug-And-Play Watermarking Base Model

    Authors: Qi Song, Ziyuan Luo, Ka Chun Cheung, Simon See, Renjie Wan

    Abstract: Neural Radiance Fields (NeRFs) have become a key method for 3D scene representation. With the rising prominence and influence of NeRF, safeguarding its intellectual property has become increasingly important. In this paper, we propose \textbf{NeRFProtector}, which adopts a plug-and-play strategy to protect NeRF's copyright during its creation. NeRFProtector utilizes a pre-trained watermarking base… ▽ More

    Submitted 10 July, 2024; originally announced July 2024.

    Comments: Accepted by ECCV2024

  4. arXiv:2406.17245  [pdf, other

    cs.LG cs.AI cs.CL

    Unlocking Continual Learning Abilities in Language Models

    Authors: Wenyu Du, Shuang Cheng, Tongxu Luo, Zihan Qiu, Zeyu Huang, Ka Chun Cheung, Reynold Cheng, Jie Fu

    Abstract: Language models (LMs) exhibit impressive performance and generalization capabilities. However, LMs struggle with the persistent challenge of catastrophic forgetting, which undermines their long-term sustainability in continual learning (CL). Existing approaches usually address the issue by incorporating old task data or task-wise inductive bias into LMs. However, old data and accurate task informa… ▽ More

    Submitted 24 June, 2024; originally announced June 2024.

    Comments: preprint, 19 pages

  5. arXiv:2406.12018  [pdf, other

    cs.CL

    CItruS: Chunked Instruction-aware State Eviction for Long Sequence Modeling

    Authors: Yu Bai, Xiyuan Zou, Heyan Huang, Sanxing Chen, Marc-Antoine Rondeau, Yang Gao, Jackie Chi Kit Cheung

    Abstract: Long sequence modeling has gained broad interest as large language models (LLMs) continue to advance. Recent research has identified that a large portion of hidden states within the key-value caches of Transformer models can be discarded (also termed evicted) without affecting the perplexity performance in generating long sequences. However, we show that these methods, despite preserving perplexit… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Work in progress

  6. arXiv:2406.11536  [pdf, other

    cs.DC cs.CV

    RO-SVD: A Reconfigurable Hardware Copyright Protection Framework for AIGC Applications

    Authors: Zhuoheng Ran, Muhammad A. A. Abdelgawad, Zekai Zhang, Ray C. C. Cheung, Hong Yan

    Abstract: The dramatic surge in the utilisation of generative artificial intelligence (GenAI) underscores the need for a secure and efficient mechanism to responsibly manage, use and disseminate multi-dimensional data generated by artificial intelligence (AI). In this paper, we propose a blockchain-based copyright traceability framework called ring oscillator-singular value decomposition (RO-SVD), which int… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: Accepted on 20 May 2024 as a full paper at ASAP 2024

  7. arXiv:2406.09317  [pdf, other

    eess.IV cs.CV

    Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases

    Authors: Meng Wang, Tian Lin, Aidi Lin, Kai Yu, Yuanyuan Peng, Lianyu Wang, Cheng Chen, Ke Zou, Huiyu Liang, Man Chen, Xue Yao, Meiqin Zhang, Binwei Huang, Chaoxin Zheng, Peixin Zhang, Wei Chen, Yilong Luo, Yifan Chen, Honghe Xia, Tingkun Shi, Qi Zhang, Jinming Guo, Xiaolin Chen, Jingcheng Wang, Yih Chung Tham , et al. (24 additional authors not shown)

    Abstract: Previous foundation models for retinal images were pre-trained with limited disease categories and knowledge base. Here we introduce RetiZero, a vision-language foundation model that leverages knowledge from over 400 fundus diseases. To RetiZero's pre-training, we compiled 341,896 fundus images paired with text descriptions, sourced from public datasets, ophthalmic literature, and online resources… ▽ More

    Submitted 30 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

  8. arXiv:2406.08723  [pdf, other

    cs.CL

    ECBD: Evidence-Centered Benchmark Design for NLP

    Authors: Yu Lu Liu, Su Lin Blodgett, Jackie Chi Kit Cheung, Q. Vera Liao, Alexandra Olteanu, Ziang Xiao

    Abstract: Benchmarking is seen as critical to assessing progress in NLP. However, creating a benchmark involves many design decisions (e.g., which datasets to include, which metrics to use) that often rely on tacit, untested assumptions about what the benchmark is intended to measure or is actually measuring. There is currently no principled way of analyzing these decisions and how they impact the validity… ▽ More

    Submitted 12 June, 2024; originally announced June 2024.

  9. arXiv:2406.07640  [pdf, other

    cs.LG cs.AI

    When is an Embedding Model More Promising than Another?

    Authors: Maxime Darrin, Philippe Formont, Ismail Ben Ayed, Jackie CK Cheung, Pablo Piantanida

    Abstract: Embedders play a central role in machine learning, projecting any object into numerical representations that can, in turn, be leveraged to perform various downstream tasks. The evaluation of embedding models typically depends on domain-specific empirical approaches utilizing downstream tasks, primarily because of the lack of a standardized framework for comparison. However, acquiring adequately la… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  10. arXiv:2406.07359  [pdf, other

    cs.CL

    GLIMPSE: Pragmatically Informative Multi-Document Summarization for Scholarly Reviews

    Authors: Maxime Darrin, Ines Arous, Pablo Piantanida, Jackie CK Cheung

    Abstract: Scientific peer review is essential for the quality of academic publications. However, the increasing number of paper submissions to conferences has strained the reviewing process. This surge poses a burden on area chairs who have to carefully read an ever-growing volume of reviews and discern each reviewer's main arguments as part of their decision process. In this paper, we introduce \sys, a sum… ▽ More

    Submitted 11 June, 2024; originally announced June 2024.

  11. arXiv:2405.17757  [pdf, other

    cs.CE

    NASPrecision: Neural Architecture Search-Driven Multi-Stage Learning for Surface Roughness Prediction in Ultra-Precision Machining

    Authors: Penghui Ruan, Divya Saxena, Jiannong Cao, Xiaoyun Liu, Ruoxin Wang, Chi Fai Cheung

    Abstract: Accurate surface roughness prediction is critical for ensuring high product quality, especially in areas like manufacturing and aerospace, where the smallest imperfections can compromise performance or safety. However, this is challenging due to complex, non-linear interactions among variables, which is further exacerbated with limited and imbalanced datasets. Existing methods using traditional ma… ▽ More

    Submitted 27 May, 2024; originally announced May 2024.

  12. arXiv:2405.15724  [pdf, other

    cs.CG cs.DM cs.RO

    Reconfiguration Algorithms for Cubic Modular Robots with Realistic Movement Constraints

    Authors: MIT--NASA Space Robots Team, Josh Brunner, Kenneth C. Cheung, Erik D. Demaine, Jenny Diomidova, Christine Gregg, Della H. Hendrickson, Irina Kostitsyna

    Abstract: We introduce and analyze a model for self-reconfigurable robots made up of unit-cube modules. Compared to past models, our model aims to newly capture two important practical aspects of real-world robots. First, modules often do not occupy an exact unit cube, but rather have features like bumps extending outside the allotted space so that modules can interlock. Thus, for example, our model forbids… ▽ More

    Submitted 24 May, 2024; originally announced May 2024.

  13. arXiv:2405.02594  [pdf, other

    cs.LG stat.ML

    Leveraging (Biased) Information: Multi-armed Bandits with Offline Data

    Authors: Wang Chi Cheung, Lixing Lyu

    Abstract: We leverage offline data to facilitate online learning in stochastic multi-armed bandits. The probability distributions that govern the offline data and the online rewards can be different. Without any non-trivial upper bound on their difference, we show that no non-anticipatory policy can outperform the UCB policy by (Auer et al. 2002), even in the presence of offline data. In complement, we prop… ▽ More

    Submitted 4 May, 2024; originally announced May 2024.

    Comments: 24 pages, 5 figures. Accepted to ICML 2024

  14. arXiv:2405.01189  [pdf, other

    cs.LG cs.AI

    Gradient-Congruity Guided Federated Sparse Training

    Authors: Chris Xing Tian, Yibing Liu, Haoliang Li, Ray C. C. Cheung, Shiqi Wang

    Abstract: Edge computing allows artificial intelligence and machine learning models to be deployed on edge devices, where they can learn from local data and collaborate to form a global model. Federated learning (FL) is a distributed machine learning technique that facilitates this process while preserving data privacy. However, FL also faces challenges such as high computational and communication costs reg… ▽ More

    Submitted 2 May, 2024; originally announced May 2024.

  15. arXiv:2404.18416  [pdf, other

    cs.AI cs.CL cs.CV cs.LG

    Capabilities of Gemini Models in Medicine

    Authors: Khaled Saab, Tao Tu, Wei-Hung Weng, Ryutaro Tanno, David Stutz, Ellery Wulczyn, Fan Zhang, Tim Strother, Chunjong Park, Elahe Vedadi, Juanma Zambrano Chaves, Szu-Yeu Hu, Mike Schaekermann, Aishwarya Kamath, Yong Cheng, David G. T. Barrett, Cathy Cheung, Basil Mustafa, Anil Palepu, Daniel McDuff, Le Hou, Tomer Golany, Luyang Liu, Jean-baptiste Alayrac, Neil Houlsby , et al. (42 additional authors not shown)

    Abstract: Excellence in a wide variety of medical applications poses considerable challenges for AI, requiring advanced reasoning, access to up-to-date medical knowledge and understanding of complex multimodal data. Gemini models, with strong general capabilities in multimodal and long-context reasoning, offer exciting possibilities in medicine. Building on these core strengths of Gemini, we introduce Med-G… ▽ More

    Submitted 1 May, 2024; v1 submitted 29 April, 2024; originally announced April 2024.

  16. arXiv:2404.00727  [pdf, other

    cs.CL

    A Controlled Reevaluation of Coreference Resolution Models

    Authors: Ian Porada, Xiyuan Zou, Jackie Chi Kit Cheung

    Abstract: All state-of-the-art coreference resolution (CR) models involve finetuning a pretrained language model. Whether the superior performance of one CR model over another is due to the choice of language model or other factors, such as the task-specific architecture, is difficult or impossible to determine due to lack of a standardized experimental setup. To resolve this ambiguity, we systematically ev… ▽ More

    Submitted 22 April, 2024; v1 submitted 31 March, 2024; originally announced April 2024.

    Comments: LREC-COLING 2024

  17. arXiv:2403.18167  [pdf, other

    cs.CL cs.AI

    Mechanistic Understanding and Mitigation of Language Model Non-Factual Hallucinations

    Authors: Lei Yu, Meng Cao, Jackie Chi Kit Cheung, Yue Dong

    Abstract: State-of-the-art language models (LMs) sometimes generate non-factual hallucinations that misalign with world knowledge. To explore the mechanistic causes of these hallucinations, we create diagnostic datasets with subject-relation queries and adapt interpretability methods to trace hallucinations through internal model representations. We discover two general and distinct mechanistic causes of ha… ▽ More

    Submitted 17 June, 2024; v1 submitted 26 March, 2024; originally announced March 2024.

  18. arXiv:2403.13213  [pdf, other

    cs.LG cs.CL cs.CY

    From Representational Harms to Quality-of-Service Harms: A Case Study on Llama 2 Safety Safeguards

    Authors: Khaoula Chehbouni, Megha Roshan, Emmanuel Ma, Futian Andrew Wei, Afaf Taik, Jackie CK Cheung, Golnoosh Farnadi

    Abstract: Recent progress in large language models (LLMs) has led to their widespread adoption in various domains. However, these advancements have also introduced additional safety risks and raised concerns regarding their detrimental impact on already marginalized populations. Despite growing mitigation efforts to develop safety safeguards, such as supervised safety-oriented fine-tuning and leveraging saf… ▽ More

    Submitted 5 July, 2024; v1 submitted 19 March, 2024; originally announced March 2024.

    Comments: 9 pages, 4 figures. Accepted to Findings of the Association for Computational Linguistics: ACL 2024

  19. arXiv:2403.02330  [pdf, other

    cs.CV

    RegionGPT: Towards Region Understanding Vision Language Model

    Authors: Qiushan Guo, Shalini De Mello, Hongxu Yin, Wonmin Byeon, Ka Chun Cheung, Yizhou Yu, Ping Luo, Sifei Liu

    Abstract: Vision language models (VLMs) have experienced rapid advancements through the integration of large language models (LLMs) with image-text pairs, yet they struggle with detailed regional visual understanding due to limited spatial awareness of the vision encoder, and the use of coarse-grained training data that lacks detailed, region-specific captions. To address this, we introduce RegionGPT (short… ▽ More

    Submitted 4 March, 2024; originally announced March 2024.

    Comments: Accepted by CVPR 2024

  20. arXiv:2402.19457  [pdf, other

    cs.CL cs.AI

    $\texttt{COSMIC}$: Mutual Information for Task-Agnostic Summarization Evaluation

    Authors: Maxime Darrin, Philippe Formont, Jackie Chi Kit Cheung, Pablo Piantanida

    Abstract: Assessing the quality of summarizers poses significant challenges. In response, we propose a novel task-oriented evaluation approach that assesses summarizers based on their capacity to produce summaries that are useful for downstream tasks, while preserving task outcomes. We theoretically establish a direct relationship between the resulting error probability of these tasks and the mutual informa… ▽ More

    Submitted 1 March, 2024; v1 submitted 29 February, 2024; originally announced February 2024.

  21. arXiv:2402.19090  [pdf, ps, other

    cs.LG

    Best Arm Identification with Resource Constraints

    Authors: Zitian Li, Wang Chi Cheung

    Abstract: Motivated by the cost heterogeneity in experimentation across different alternatives, we study the Best Arm Identification with Resource Constraints (BAIwRC) problem. The agent aims to identify the best arm under resource constraints, where resources are consumed for each arm pull. We make two novel contributions. We design and analyze the Successive Halving with Resource Rationing algorithm (SH-R… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  22. arXiv:2401.15977  [pdf, other

    cs.CV

    Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling

    Authors: Xiaoyu Shi, Zhaoyang Huang, Fu-Yun Wang, Weikang Bian, Dasong Li, Yi Zhang, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li

    Abstract: We introduce Motion-I2V, a novel framework for consistent and controllable image-to-video generation (I2V). In contrast to previous methods that directly learn the complicated image-to-video mapping, Motion-I2V factorizes I2V into two stages with explicit motion modeling. For the first stage, we propose a diffusion-based motion field predictor, which focuses on deducing the trajectories of the ref… ▽ More

    Submitted 31 January, 2024; v1 submitted 29 January, 2024; originally announced January 2024.

    Comments: Project page: https://xiaoyushi97.github.io/Motion-I2V/

  23. arXiv:2401.14619  [pdf, other

    cs.LG

    Resilient Practical Test-Time Adaptation: Soft Batch Normalization Alignment and Entropy-driven Memory Bank

    Authors: Xingzhi Zhou, Zhiliang Tian, Ka Chun Cheung, Simon See, Nevin L. Zhang

    Abstract: Test-time domain adaptation effectively adjusts the source domain model to accommodate unseen domain shifts in a target domain during inference. However, the model performance can be significantly impaired by continuous distribution changes in the target domain and non-independent and identically distributed (non-i.i.d.) test samples often encountered in practical scenarios. While existing memory… ▽ More

    Submitted 25 January, 2024; originally announced January 2024.

  24. arXiv:2401.11323  [pdf, other

    cs.CL

    Identifying and Analyzing Task-Encoding Tokens in Large Language Models

    Authors: Yu Bai, Heyan Huang, Cesare Spinoso-Di Piano, Marc-Antoine Rondeau, Sanxing Chen, Yang Gao, Jackie Chi Kit Cheung

    Abstract: In-context learning (ICL) has become an effective solution for few-shot learning in natural language processing. However, our understanding of ICL's working mechanisms is limited, specifically regarding how models learn to perform tasks from ICL demonstrations. For example, unexpectedly large changes in performance can arise from small changes in the prompt, leaving prompt design a largely empiric… ▽ More

    Submitted 16 February, 2024; v1 submitted 20 January, 2024; originally announced January 2024.

    Comments: Work in progress

  25. arXiv:2401.05914  [pdf, other

    cs.CL cs.AI

    How Teachers Can Use Large Language Models and Bloom's Taxonomy to Create Educational Quizzes

    Authors: Sabina Elkins, Ekaterina Kochmar, Jackie C. K. Cheung, Iulian Serban

    Abstract: Question generation (QG) is a natural language processing task with an abundance of potential benefits and use cases in the educational domain. In order for this potential to be realized, QG systems must be designed and validated with pedagogical needs in mind. However, little research has assessed or designed QG approaches with the input from real teachers or students. This paper applies a large… ▽ More

    Submitted 11 January, 2024; originally announced January 2024.

    Comments: 8 pages, 8 figures. Accepted to the main track of the EAAI-24: The 14th Symposium on Educational Advances in Artificial Intelligence

  26. arXiv:2312.01858  [pdf, other

    cs.CL

    Evaluating Dependencies in Fact Editing for Language Models: Specificity and Implication Awareness

    Authors: Zichao Li, Ines Arous, Siva Reddy, Jackie C. K. Cheung

    Abstract: The potential of using a large language model (LLM) as a knowledge base (KB) has sparked significant interest. To manage the knowledge acquired by LLMs, we need to ensure that the editing of learned facts respects internal logical constraints, which are known as dependency of knowledge. Existing work on editing LLMs has partially addressed the issue of dependency, when the editing of a fact should… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

    Comments: Findings of EMNLP2023

  27. arXiv:2311.11103  [pdf, other

    cs.CL

    Responsible AI Considerations in Text Summarization Research: A Review of Current Practices

    Authors: Yu Lu Liu, Meng Cao, Su Lin Blodgett, Jackie Chi Kit Cheung, Alexandra Olteanu, Adam Trischler

    Abstract: AI and NLP publication venues have increasingly encouraged researchers to reflect on possible ethical considerations, adverse impacts, and other responsible AI issues their work might engender. However, for specific NLP tasks our understanding of how prevalent such issues are, or when and why these issues are likely to arise, remains limited. Focusing on text summarization -- a common NLP task lar… ▽ More

    Submitted 18 November, 2023; originally announced November 2023.

  28. arXiv:2311.04921  [pdf, other

    cs.CL cs.AI

    Successor Features for Efficient Multisubject Controlled Text Generation

    Authors: Meng Cao, Mehdi Fatemi, Jackie Chi Kit Cheung, Samira Shabanian

    Abstract: While large language models (LLMs) have achieved impressive performance in generating fluent and realistic text, controlling the generated text so that it exhibits properties such as safety, factuality, and non-toxicity remains challenging. % such as DExperts, GeDi, and rectification Existing decoding-based methods are static in terms of the dimension of control; if the target subject is changed,… ▽ More

    Submitted 2 November, 2023; originally announced November 2023.

  29. arXiv:2311.00390  [pdf, other

    cs.RO

    A Modular Pneumatic Soft Gripper Design for Aerial Grasping and Landing

    Authors: Hiu Ching Cheung, Ching-Wei Chang, Bailun Jiang, Chih-Yung Wen, Henry K. Chu

    Abstract: Aerial robots have garnered significant attention due to their potential applications in various industries, such as inspection, search and rescue, and drone delivery. Successful missions often depend on the ability of these robots to grasp and land effectively. This paper presents a novel modular soft gripper design tailored explicitly for aerial grasping and landing operations. The proposed modu… ▽ More

    Submitted 25 March, 2024; v1 submitted 1 November, 2023; originally announced November 2023.

    Comments: 7 pages, 13 figures, accepted by IEEE RoboSoft 2024

  30. arXiv:2310.01717  [pdf, other

    cs.CL cs.AI cs.LG

    Ensemble Distillation for Unsupervised Constituency Parsing

    Authors: Behzad Shayegh, Yanshuai Cao, Xiaodan Zhu, Jackie C. K. Cheung, Lili Mou

    Abstract: We investigate the unsupervised constituency parsing task, which organizes words and phrases of a sentence into a hierarchical structure without using linguistically annotated data. We observe that existing unsupervised parsers capture differing aspects of parsing structures, which can be leveraged to enhance unsupervised parsing performance. To this end, we propose a notion of "tree averaging," b… ▽ More

    Submitted 25 April, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: Accepted by International Conference on Learning Representations (ICLR) 2024

  31. arXiv:2310.01642  [pdf, other

    cs.SE cs.AI

    Naming Practices of Pre-Trained Models in Hugging Face

    Authors: Wenxin Jiang, Chingwo Cheung, Mingyu Kim, Heesoo Kim, George K. Thiruvathukal, James C. Davis

    Abstract: As innovation in deep learning continues, many engineers seek to adopt Pre-Trained Models (PTMs) as components in computer systems. Researchers publish PTMs, which engineers adapt for quality or performance prior to deployment. PTM authors should choose appropriate names for their PTMs, which would facilitate model discovery and reuse. However, prior research has reported that model names are not… ▽ More

    Submitted 28 March, 2024; v1 submitted 2 October, 2023; originally announced October 2023.

    Comments: 21 pages

  32. arXiv:2309.17269  [pdf, ps, other

    eess.IV cs.CV

    Unpaired Optical Coherence Tomography Angiography Image Super-Resolution via Frequency-Aware Inverse-Consistency GAN

    Authors: Weiwen Zhang, Dawei Yang, Haoxuan Che, An Ran Ran, Carol Y. Cheung, Hao Chen

    Abstract: For optical coherence tomography angiography (OCTA) images, a limited scanning rate leads to a trade-off between field-of-view (FOV) and imaging resolution. Although larger FOV images may reveal more parafoveal vascular lesions, their application is greatly hampered due to lower resolution. To increase the resolution, previous works only achieved satisfactory performance by using paired data for t… ▽ More

    Submitted 29 September, 2023; originally announced September 2023.

    Comments: 10 pages, 9 figures

  33. arXiv:2309.00440  [pdf, other

    cs.CY

    Yet another Improvement of Plantard Arithmetic for Faster Kyber on Low-end 32-bit IoT Devices

    Authors: Junhao Huang, Haosong Zhao, Jipeng Zhang, Wangchen Dai, Lu Zhou, Ray C. C. Cheung, Cetin Kaya Koc, Donglong Chen

    Abstract: This paper presents another improved version of Plantard arithmetic that could speed up Kyber implementations on two low-end 32-bit IoT platforms (ARM Cortex-M3 and RISC-V) without SIMD extensions. Specifically, we further enlarge the input range of the Plantard arithmetic without modifying its computation steps. After tailoring the Plantard arithmetic for Kyber's modulus, we show that the input r… ▽ More

    Submitted 18 February, 2024; v1 submitted 1 September, 2023; originally announced September 2023.

  34. Low-count Time Series Anomaly Detection

    Authors: Philipp Renz, Kurt Cutajar, Niall Twomey, Gavin K. C. Cheung, Hanting Xie

    Abstract: Low-count time series describe sparse or intermittent events, which are prevalent in large-scale online platforms that capture and monitor diverse data types. Several distinct challenges surface when modelling low-count time series, particularly low signal-to-noise ratios (when anomaly signatures are provably undetectable), and non-uniform performance (when average metrics are not representative o… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

    Comments: 6 pages, 7 figures, to be published in IEEE 2023 Workshop on Machine Learning for Signal Processing (MLSP)

    Journal ref: 2023 IEEE 33rd International Workshop on Machine Learning for Signal Processing (MLSP)

  35. arXiv:2307.11526  [pdf, other

    cs.CV

    CopyRNeRF: Protecting the CopyRight of Neural Radiance Fields

    Authors: Ziyuan Luo, Qing Guo, Ka Chun Cheung, Simon See, Renjie Wan

    Abstract: Neural Radiance Fields (NeRF) have the potential to be a major representation of media. Since training a NeRF has never been an easy task, the protection of its model copyright should be a priority. In this paper, by analyzing the pros and cons of possible copyright protection solutions, we propose to protect the copyright of NeRF models by replacing the original color representation in NeRF with… ▽ More

    Submitted 29 July, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

    Comments: 11 pages, 6 figures, accepted by ICCV 2023 non-camera-ready version

  36. arXiv:2307.07754  [pdf, other

    cs.CV cs.AI

    Bidirectionally Deformable Motion Modulation For Video-based Human Pose Transfer

    Authors: Wing-Yin Yu, Lai-Man Po, Ray C. C. Cheung, Yuzhi Zhao, Yu Xue, Kun Li

    Abstract: Video-based human pose transfer is a video-to-video generation task that animates a plain source human image based on a series of target human poses. Considering the difficulties in transferring highly structural patterns on the garments and discontinuous poses, existing methods often generate unsatisfactory results such as distorted textures and flickering artifacts. To address these issues, we p… ▽ More

    Submitted 18 July, 2023; v1 submitted 15 July, 2023; originally announced July 2023.

    Comments: ICCV 2023

  37. arXiv:2306.15308  [pdf, other

    astro-ph.SR astro-ph.IM cs.CV

    Machine learning in solar physics

    Authors: A. Asensio Ramos, M. C. M. Cheung, I. Chifu, R. Gafeira

    Abstract: The application of machine learning in solar physics has the potential to greatly enhance our understanding of the complex processes that take place in the atmosphere of the Sun. By using techniques such as deep learning, we are now in the position to analyze large amounts of data from solar observations and identify patterns and trends that may not have been apparent using traditional methods. Th… ▽ More

    Submitted 27 June, 2023; originally announced June 2023.

    Comments: 100 pages, 13 figures, 286 references, accepted for publication as a Living Review in Solar Physics (LRSP)

  38. arXiv:2306.11351  [pdf, other

    cs.AR

    A Versatility-Performance Balanced Hardware Architecture for Scene Text Detection

    Authors: Yao Xin, Guoming Tang, Donglong Chen, Rumin Zhang, Teng Liang, Ray C. C. Cheung, Cetin Kaya Koc

    Abstract: Detecting and extracting textual information from natural scene images needs Scene Text Detection (STD) algorithms. Fully Convolutional Neural Networks (FCNs) are usually utilized as the backbone model to extract features in these instance segmentation based STD algorithms. FCNs naturally come with high computational complexity. Furthermore, to keep up with the growing variety of models, flexible… ▽ More

    Submitted 20 June, 2023; originally announced June 2023.

  39. arXiv:2306.05888  [pdf, other

    cs.CV

    TrajectoryFormer: 3D Object Tracking Transformer with Predictive Trajectory Hypotheses

    Authors: Xuesong Chen, Shaoshuai Shi, Chao Zhang, Benjin Zhu, Qiang Wang, Ka Chun Cheung, Simon See, Hongsheng Li

    Abstract: 3D multi-object tracking (MOT) is vital for many applications including autonomous driving vehicles and service robots. With the commonly used tracking-by-detection paradigm, 3D MOT has made important progress in recent years. However, these methods only use the detection boxes of the current frame to obtain trajectory-box association results, which makes it impossible for the tracker to recover o… ▽ More

    Submitted 18 August, 2023; v1 submitted 9 June, 2023; originally announced June 2023.

    Comments: Accepted by ICCV 2023

  40. arXiv:2305.05858  [pdf, other

    cs.CL

    Vārta: A Large-Scale Headline-Generation Dataset for Indic Languages

    Authors: Rahul Aralikatte, Ziling Cheng, Sumanth Doddapaneni, Jackie Chi Kit Cheung

    Abstract: We present Vārta, a large-scale multilingual dataset for headline generation in Indic languages. This dataset includes 41.8 million news articles in 14 different Indic languages (and English), which come from a variety of high-quality sources. To the best of our knowledge, this is the largest collection of curated articles for Indic languages currently available. We use the data collected in a ser… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: Findings of ACL 2023

  41. arXiv:2305.05835  [pdf, other

    eess.IV cs.CV cs.LG

    Reference-based OCT Angiogram Super-resolution with Learnable Texture Generation

    Authors: Yuyan Ruan, Dawei Yang, Ziqi Tang, An Ran Ran, Carol Y. Cheung, Hao Chen

    Abstract: Optical coherence tomography angiography (OCTA) is a new imaging modality to visualize retinal microvasculature and has been readily adopted in clinics. High-resolution OCT angiograms are important to qualitatively and quantitatively identify potential biomarkers for different retinal diseases accurately. However, one significant problem of OCTA is the inevitable decrease in resolution when increa… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

    Comments: 12 pages, 11 figures

    MSC Class: 68T07 ACM Class: I.2; I.4

  42. arXiv:2304.06638  [pdf, other

    cs.CL cs.AI cs.CY cs.LG

    How Useful are Educational Questions Generated by Large Language Models?

    Authors: Sabina Elkins, Ekaterina Kochmar, Jackie C. K. Cheung, Iulian Serban

    Abstract: Controllable text generation (CTG) by large language models has a huge potential to transform education for teachers and students alike. Specifically, high quality and diverse question generation can dramatically reduce the load on teachers and improve the quality of their educational content. Recent work in this domain has made progress with generation, but fails to show that real teachers judge… ▽ More

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: Accepted to AIED Late Breaking Results 2023 - to be published in their proceedings

  43. arXiv:2303.09092  [pdf, other

    cs.CL

    Challenges to Evaluating the Generalization of Coreference Resolution Models: A Measurement Modeling Perspective

    Authors: Ian Porada, Alexandra Olteanu, Kaheer Suleman, Adam Trischler, Jackie Chi Kit Cheung

    Abstract: It is increasingly common to evaluate the same coreference resolution (CR) model on multiple datasets. Do these multi-dataset evaluations allow us to draw meaningful conclusions about model generalization? Or, do they rather reflect the idiosyncrasies of a particular experimental setup (e.g., the specific datasets used)? To study this, we view evaluation through the lens of measurement modeling, a… ▽ More

    Submitted 18 June, 2024; v1 submitted 16 March, 2023; originally announced March 2023.

    Comments: ACL Findings 2024

  44. arXiv:2303.08340  [pdf, other

    cs.CV

    VideoFlow: Exploiting Temporal Cues for Multi-frame Optical Flow Estimation

    Authors: Xiaoyu Shi, Zhaoyang Huang, Weikang Bian, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li

    Abstract: We introduce VideoFlow, a novel optical flow estimation framework for videos. In contrast to previous methods that learn to estimate optical flow from two frames, VideoFlow concurrently estimates bi-directional optical flows for multiple frames that are available in videos by sufficiently exploiting temporal cues. We first propose a TRi-frame Optical Flow (TROF) module that estimates bi-directiona… ▽ More

    Submitted 20 August, 2023; v1 submitted 14 March, 2023; originally announced March 2023.

  45. arXiv:2303.01237  [pdf, other

    cs.CV

    FlowFormer++: Masked Cost Volume Autoencoding for Pretraining Optical Flow Estimation

    Authors: Xiaoyu Shi, Zhaoyang Huang, Dasong Li, Manyuan Zhang, Ka Chun Cheung, Simon See, Hongwei Qin, Jifeng Dai, Hongsheng Li

    Abstract: FlowFormer introduces a transformer architecture into optical flow estimation and achieves state-of-the-art performance. The core component of FlowFormer is the transformer-based cost-volume encoder. Inspired by the recent success of masked autoencoding (MAE) pretraining in unleashing transformers' capacity of encoding visual representation, we propose Masked Cost Volume Autoencoding (MCVA) to enh… ▽ More

    Submitted 2 March, 2023; originally announced March 2023.

  46. arXiv:2302.14003  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Systematic Rectification of Language Models via Dead-end Analysis

    Authors: Meng Cao, Mehdi Fatemi, Jackie Chi Kit Cheung, Samira Shabanian

    Abstract: With adversarial or otherwise normal prompts, existing large language models (LLM) can be pushed to generate toxic discourses. One way to reduce the risk of LLMs generating undesired discourses is to alter the training of the LLM. This can be very restrictive due to demanding computation requirements. Other methods rely on rule-based or prompt-based token elimination, which are limited as they dis… ▽ More

    Submitted 27 February, 2023; originally announced February 2023.

    Comments: The Eleventh International Conference on Learning Representations, ICLR'23

    Journal ref: ICLR 2023

  47. arXiv:2302.09852  [pdf, other

    cs.CL cs.AI

    Unsupervised Layer-wise Score Aggregation for Textual OOD Detection

    Authors: Maxime Darrin, Guillaume Staerman, Eduardo Dadalto Câmara Gomes, Jackie CK Cheung, Pablo Piantanida, Pierre Colombo

    Abstract: Out-of-distribution (OOD) detection is a rapidly growing field due to new robustness and security requirements driven by an increased number of AI-based systems. Existing OOD textual detectors often rely on an anomaly score (e.g., Mahalanobis distance) computed on the embedding output of the last layer of the encoder. In this work, we observe that OOD detection performance varies greatly depending… ▽ More

    Submitted 21 February, 2024; v1 submitted 20 February, 2023; originally announced February 2023.

  48. arXiv:2302.08531  [pdf, other

    cs.CL

    Learning with Rejection for Abstractive Text Summarization

    Authors: Meng Cao, Yue Dong, Jingyi He, Jackie Chi Kit Cheung

    Abstract: State-of-the-art abstractive summarization systems frequently hallucinate content that is not supported by the source document, mainly due to noise in the training dataset. Existing methods opt to drop the noisy samples or tokens from the training set entirely, reducing the effective training set size and creating an artificial propensity to copy words from the source. In this work, we propose a t… ▽ More

    Submitted 16 February, 2023; originally announced February 2023.

  49. arXiv:2302.06784  [pdf, other

    cs.CL

    The Stable Entropy Hypothesis and Entropy-Aware Decoding: An Analysis and Algorithm for Robust Natural Language Generation

    Authors: Kushal Arora, Timothy J. O'Donnell, Doina Precup, Jason Weston, Jackie C. K. Cheung

    Abstract: State-of-the-art language generation models can degenerate when applied to open-ended generation problems such as text completion, story generation, or dialog modeling. This degeneration usually shows up in the form of incoherence, lack of vocabulary diversity, and self-repetition or copying from the context. In this paper, we postulate that ``human-like'' generations usually lie in a narrow and n… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

  50. arXiv:2302.04182  [pdf, other

    cs.LG math.OC

    Online Resource Allocation: Bandits feedback and Advice on Time-varying Demands

    Authors: Lixing Lyu, Wang Chi Cheung

    Abstract: We consider a general online resource allocation model with bandit feedback and time-varying demands. While online resource allocation has been well studied in the literature, most existing works make the strong assumption that the demand arrival process is stationary. In practical applications, such as online advertisement and revenue management, however, this process may be exogenous and non-sta… ▽ More

    Submitted 12 June, 2023; v1 submitted 8 February, 2023; originally announced February 2023.

    Comments: 74 pages. A preliminary short version entitled "Non-Stationary Bandits with Knapsack Problems with Advice" is accepted to ICML 2023