Zum Hauptinhalt springen

Showing 1–50 of 106 results for author: Chan, R

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.11847  [pdf, other

    cs.CL

    Prompto: An open source library for asynchronous querying of LLM endpoints

    Authors: Ryan Sze-Yin Chan, Federico Nanni, Edwin Brown, Ed Chapman, Angus R. Williams, Jonathan Bright, Evelina Gabasova

    Abstract: Recent surge in Large Language Model (LLM) availability has opened exciting avenues for research. However, efficiently interacting with these models presents a significant hurdle since LLMs often reside on proprietary or self-hosted API endpoints, each requiring custom code for interaction. Conducting comparative studies between different models can therefore be time-consuming and necessitate sign… ▽ More

    Submitted 12 August, 2024; originally announced August 2024.

  2. arXiv:2408.06731  [pdf, other

    cs.CY cs.AI cs.CL

    Large language models can consistently generate high-quality content for election disinformation operations

    Authors: Angus R. Williams, Liam Burke-Moore, Ryan Sze-Yin Chan, Florence E. Enock, Federico Nanni, Tvesha Sippy, Yi-Ling Chung, Evelina Gabasova, Kobi Hackenburg, Jonathan Bright

    Abstract: Advances in large language models have raised concerns about their potential use in generating compelling election disinformation at scale. This study presents a two-part investigation into the capabilities of LLMs to automate stages of an election disinformation operation. First, we introduce DisElect, a novel evaluation dataset designed to measure LLM compliance with instructions to generate con… ▽ More

    Submitted 13 August, 2024; originally announced August 2024.

  3. arXiv:2407.18449  [pdf, other

    eess.IV cs.CV cs.LG

    Towards A Generalizable Pathology Foundation Model via Unified Knowledge Distillation

    Authors: Jiabo Ma, Zhengrui Guo, Fengtao Zhou, Yihui Wang, Yingxue Xu, Yu Cai, Zhengjie Zhu, Cheng Jin, Yi Lin, Xinrui Jiang, Anjia Han, Li Liang, Ronald Cheong Kin Chan, Jiguang Wang, Kwang-Ting Cheng, Hao Chen

    Abstract: Foundation models pretrained on large-scale datasets are revolutionizing the field of computational pathology (CPath). The generalization ability of foundation models is crucial for the success in various downstream clinical tasks. However, current foundation models have only been evaluated on a limited type and number of tasks, leaving their generalization ability and overall performance unclear.… ▽ More

    Submitted 3 August, 2024; v1 submitted 25 July, 2024; originally announced July 2024.

    Report number: I.2.10

  4. arXiv:2407.15362  [pdf, other

    cs.CV cs.AI

    A Multimodal Knowledge-enhanced Whole-slide Pathology Foundation Model

    Authors: Yingxue Xu, Yihui Wang, Fengtao Zhou, Jiabo Ma, Shu Yang, Huangjing Lin, Xin Wang, Jiguang Wang, Li Liang, Anjia Han, Ronald Cheong Kin Chan, Hao Chen

    Abstract: Remarkable strides in computational pathology have been made in the task-agnostic foundation model that advances the performance of a wide array of downstream clinical tasks. Despite the promising performance, there are still several challenges. First, prior works have resorted to either vision-only or vision-captions data, disregarding invaluable pathology reports and gene expression profiles whi… ▽ More

    Submitted 5 August, 2024; v1 submitted 22 July, 2024; originally announced July 2024.

    Comments: 45 pages, 9 figures

  5. arXiv:2407.06614  [pdf, other

    eess.IV cs.CV

    Implicit Regression in Subspace for High-Sensitivity CEST Imaging

    Authors: Chu Chen, Yang Liu, Se Weon Park, Jizhou Li, Kannie W. Y. Chan, Raymond H. F. Chan

    Abstract: Chemical Exchange Saturation Transfer (CEST) MRI demonstrates its capability in significantly enhancing the detection of proteins and metabolites with low concentrations through exchangeable protons. The clinical application of CEST, however, is constrained by its low contrast and low signal-to-noise ratio (SNR) in the acquired data. Denoising, as one of the post-processing stages for CEST data, c… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  6. arXiv:2406.04331  [pdf, other

    cs.CL cs.AI cs.IR cs.LG

    PaCE: Parsimonious Concept Engineering for Large Language Models

    Authors: Jinqi Luo, Tianjiao Ding, Kwan Ho Ryan Chan, Darshan Thaker, Aditya Chattopadhyay, Chris Callison-Burch, René Vidal

    Abstract: Large Language Models (LLMs) are being used for a wide variety of tasks. While they are capable of generating human-like responses, they can also produce undesirable output including potentially harmful information, racist or sexist language, and hallucinations. Alignment methods are designed to reduce such undesirable output, via techniques such as fine-tuning, prompt engineering, and representat… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

    Comments: 26 pages, 17 figures, 5 tables, dataset and code at https://github.com/peterljq/Parsimonious-Concept-Engineering

  7. arXiv:2406.04289  [pdf, other

    cs.CL

    What Languages are Easy to Language-Model? A Perspective from Learning Probabilistic Regular Languages

    Authors: Nadav Borenstein, Anej Svete, Robin Chan, Josef Valvoda, Franz Nowak, Isabelle Augenstein, Eleanor Chodroff, Ryan Cotterell

    Abstract: What can large language models learn? By definition, language models (LM) are distributions over strings. Therefore, an intuitive way of addressing the above question is to formalize it as a matter of learnability of classes of distributions over strings. While prior work in this direction focused on assessing the theoretical limits, in contrast, we seek to understand the empirical learnability. U… ▽ More

    Submitted 10 June, 2024; v1 submitted 6 June, 2024; originally announced June 2024.

    Comments: Accepted to ACL 2024

  8. arXiv:2406.04239  [pdf, other

    cs.LG

    Solving Inverse Problems in Protein Space Using Diffusion-Based Priors

    Authors: Axel Levy, Eric R. Chan, Sara Fridovich-Keil, Frédéric Poitevin, Ellen D. Zhong, Gordon Wetzstein

    Abstract: The interaction of a protein with its environment can be understood and controlled via its 3D structure. Experimental methods for protein structure determination, such as X-ray crystallography or cryogenic electron microscopy, shed light on biological processes but introduce challenging inverse problems. Learning-based approaches have emerged as accurate and efficient methods to solve these invers… ▽ More

    Submitted 6 June, 2024; originally announced June 2024.

  9. arXiv:2406.02329  [pdf, other

    cs.CL cs.LG

    On Affine Homotopy between Language Encoders

    Authors: Robin SM Chan, Reda Boumasmoud, Anej Svete, Yuxin Ren, Qipeng Guo, Zhijing Jin, Shauli Ravfogel, Mrinmaya Sachan, Bernhard Schölkopf, Mennatallah El-Assady, Ryan Cotterell

    Abstract: Pre-trained language encoders -- functions that represent text as vectors -- are an integral component of many NLP tasks. We tackle a natural question in language encoder analysis: What does it mean for two encoders to be similar? We contend that a faithful measure of similarity needs to be \emph{intrinsic}, that is, task-independent, yet still be informative of \emph{extrinsic} similarity -- the… ▽ More

    Submitted 4 June, 2024; originally announced June 2024.

    Comments: 10 pages

  10. arXiv:2405.05923  [pdf

    cs.HC

    Darkverse -- A New DarkWeb?

    Authors: Raymond Chan, Benjamin W. J. Kwok, Adriel Yeo, Kan Chen, Jeannie S. Lee

    Abstract: The "Darkverse" could be the negative harmful area of the Metaverse; a new virtual immersive environment for the facilitation of illicit activity such as misinformation, fraud, harassment, and illegal marketplaces. This paper explores the potential for inappropriate activities within the Metaverse, and the similarities between the Darkverse and the Dark Web. Challenges and future directions for in… ▽ More

    Submitted 23 April, 2024; originally announced May 2024.

    Comments: This is an accepted position statement of CHI 2024 Workshop (Novel Approaches for Understanding and Mitigating Emerging New Harms in Immersive and Embodied Virtual Spaces: A Workshop at CHI 2024)

  11. arXiv:2405.00708  [pdf, other

    cs.CL cs.AI cs.HC cs.LG

    Interactive Analysis of LLMs using Meaningful Counterfactuals

    Authors: Furui Cheng, Vilém Zouhar, Robin Shing Moon Chan, Daniel Fürst, Hendrik Strobelt, Mennatallah El-Assady

    Abstract: Counterfactual examples are useful for exploring the decision boundaries of machine learning models and determining feature attributions. How can we apply counterfactual-based methods to analyze and explain LLMs? We identify the following key challenges. First, the generated textual counterfactuals should be meaningful and readable to users and thus can be mentally compared to draw conclusions. Se… ▽ More

    Submitted 23 April, 2024; originally announced May 2024.

    ACM Class: I.2.7; H.5.2

  12. arXiv:2404.08582  [pdf, other

    cs.CV cs.AI

    FashionFail: Addressing Failure Cases in Fashion Object Detection and Segmentation

    Authors: Riza Velioglu, Robin Chan, Barbara Hammer

    Abstract: In the realm of fashion object detection and segmentation for online shopping images, existing state-of-the-art fashion parsing models encounter limitations, particularly when exposed to non-model-worn apparel and close-up shots. To address these failures, we introduce FashionFail; a new fashion dataset with e-commerce images for object detection and segmentation. The dataset is efficiently curate… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: to be published in 2024 International Joint Conference on Neural Networks (IJCNN)

  13. arXiv:2404.01192  [pdf, other

    eess.IV cs.CV

    iMD4GC: Incomplete Multimodal Data Integration to Advance Precise Treatment Response Prediction and Survival Analysis for Gastric Cancer

    Authors: Fengtao Zhou, Yingxue Xu, Yanfen Cui, Shenyan Zhang, Yun Zhu, Weiyang He, Jiguang Wang, Xin Wang, Ronald Chan, Louis Ho Shing Lau, Chu Han, Dafu Zhang, Zhenhui Li, Hao Chen

    Abstract: Gastric cancer (GC) is a prevalent malignancy worldwide, ranking as the fifth most common cancer with over 1 million new cases and 700 thousand deaths in 2020. Locally advanced gastric cancer (LAGC) accounts for approximately two-thirds of GC diagnoses, and neoadjuvant chemotherapy (NACT) has emerged as the standard treatment for LAGC. However, the effectiveness of NACT varies significantly among… ▽ More

    Submitted 1 April, 2024; originally announced April 2024.

    Comments: 27 pages, 9 figures, 3 tables (under review)

  14. arXiv:2402.18886  [pdf, other

    cs.LG physics.med-ph

    BP-DeepONet: A new method for cuffless blood pressure estimation using the physcis-informed DeepONet

    Authors: Lingfeng Li, Xue-Cheng Tai, Raymond Chan

    Abstract: Cardiovascular diseases (CVDs) are the leading cause of death worldwide, with blood pressure serving as a crucial indicator. Arterial blood pressure (ABP) waveforms provide continuous pressure measurements throughout the cardiac cycle and offer valuable diagnostic insights. Consequently, there is a significant demand for non-invasive and cuff-less methods to measure ABP waveforms continuously. Acc… ▽ More

    Submitted 29 February, 2024; originally announced February 2024.

  15. arXiv:2402.15814  [pdf, other

    cs.CL cs.CC cs.LG

    On Efficiently Representing Regular Languages as RNNs

    Authors: Anej Svete, Robin Shing Moon Chan, Ryan Cotterell

    Abstract: Recent work by Hewitt et al. (2020) provides an interpretation of the empirical success of recurrent neural networks (RNNs) as language models (LMs). It shows that RNNs can efficiently represent bounded hierarchical structures that are prevalent in human language. This suggests that RNNs' success might be linked to their ability to model hierarchy. However, a closer inspection of Hewitt et al.'s (… ▽ More

    Submitted 18 June, 2024; v1 submitted 24 February, 2024; originally announced February 2024.

  16. arXiv:2401.14754  [pdf, other

    cs.CV

    VJT: A Video Transformer on Joint Tasks of Deblurring, Low-light Enhancement and Denoising

    Authors: Yuxiang Hui, Yang Liu, Yaofang Liu, Fan Jia, Jinshan Pan, Raymond Chan, Tieyong Zeng

    Abstract: Video restoration task aims to recover high-quality videos from low-quality observations. This contains various important sub-tasks, such as video denoising, deblurring and low-light enhancement, since video often faces different types of degradation, such as blur, low light, and noise. Even worse, these kinds of degradation could happen simultaneously when taking videos in extreme environments. T… ▽ More

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: 12 pages,8 figures

  17. arXiv:2401.00456  [pdf, other

    cs.CV

    Double-well Net for Image Segmentation

    Authors: Hao Liu, Jun Liu, Raymond H. Chan, Xue-Cheng Tai

    Abstract: In this study, our goal is to integrate classical mathematical models with deep neural networks by introducing two novel deep neural network models for image segmentation known as Double-well Nets. Drawing inspirations from the Potts model, our models leverage neural networks to represent a region force functional. We extend the well-know MBO (Merriman-Bence-Osher) scheme to solve the Potts model.… ▽ More

    Submitted 28 July, 2024; v1 submitted 31 December, 2023; originally announced January 2024.

    MSC Class: 68U10; 94A08

  18. arXiv:2312.15447  [pdf, other

    cs.CV cs.LG stat.AP

    Superpixel-based and Spatially-regularized Diffusion Learning for Unsupervised Hyperspectral Image Clustering

    Authors: Kangning Cui, Ruoning Li, Sam L. Polk, Yinyi Lin, Hongsheng Zhang, James M. Murphy, Robert J. Plemmons, Raymond H. Chan

    Abstract: Hyperspectral images (HSIs) provide exceptional spatial and spectral resolution of a scene, crucial for various remote sensing applications. However, the high dimensionality, presence of noise and outliers, and the need for precise labels of HSIs present significant challenges to HSIs analysis, motivating the development of performant HSI clustering algorithms. This paper introduces a novel unsupe… ▽ More

    Submitted 24 December, 2023; originally announced December 2023.

    Comments: 27 pages, 9 figures, and 2 tables

  19. arXiv:2312.11548  [pdf, other

    cs.CV

    Learning Interpretable Queries for Explainable Image Classification with Information Pursuit

    Authors: Stefan Kolek, Aditya Chattopadhyay, Kwan Ho Ryan Chan, Hector Andrade-Loarca, Gitta Kutyniok, Réne Vidal

    Abstract: Information Pursuit (IP) is an explainable prediction algorithm that greedily selects a sequence of interpretable queries about the data in order of information gain, updating its posterior at each step based on observed query-answer pairs. The standard paradigm uses hand-crafted dictionaries of potential data queries curated by a domain expert or a large language model after a human prompt. Howev… ▽ More

    Submitted 16 December, 2023; originally announced December 2023.

  20. arXiv:2312.03523  [pdf, other

    cs.CL

    Sig-Networks Toolkit: Signature Networks for Longitudinal Language Modelling

    Authors: Talia Tseriotou, Ryan Sze-Yin Chan, Adam Tsakalidis, Iman Munire Bilal, Elena Kochkina, Terry Lyons, Maria Liakata

    Abstract: We present an open-source, pip installable toolkit, Sig-Networks, the first of its kind for longitudinal language modelling. A central focus is the incorporation of Signature-based Neural Network models, which have recently shown success in temporal tasks. We apply and extend published research providing a full suite of signature-based models. Their components can be used as PyTorch building block… ▽ More

    Submitted 6 February, 2024; v1 submitted 6 December, 2023; originally announced December 2023.

    Comments: To appear in EACL 2024: System Demonstrations

  21. arXiv:2311.17898  [pdf, other

    cs.CV cs.CL cs.LG

    Knowledge Pursuit Prompting for Zero-Shot Multimodal Synthesis

    Authors: Jinqi Luo, Kwan Ho Ryan Chan, Dimitris Dimos, René Vidal

    Abstract: Hallucinations and unfaithful synthesis due to inaccurate prompts with insufficient semantic details are widely observed in multimodal generative models. A prevalent strategy to align multiple modalities is to fine-tune the generator with a large number of annotated text-image pairs. However, such a procedure is labor-consuming and resource-draining. The key question we ask is: can we enhance the… ▽ More

    Submitted 30 November, 2023; v1 submitted 29 November, 2023; originally announced November 2023.

  22. arXiv:2311.13682  [pdf, other

    cs.CV eess.IV

    Single-Shot Plug-and-Play Methods for Inverse Problems

    Authors: Yanqi Cheng, Lipei Zhang, Zhenda Shen, Shujun Wang, Lequan Yu, Raymond H. Chan, Carola-Bibiane Schönlieb, Angelica I Aviles-Rivero

    Abstract: The utilisation of Plug-and-Play (PnP) priors in inverse problems has become increasingly prominent in recent years. This preference is based on the mathematical equivalence between the general proximal operator and the regularised denoiser, facilitating the adaptation of various off-the-shelf denoiser priors to a wide range of inverse problems. However, existing PnP models predominantly rely on p… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

  23. arXiv:2311.13610  [pdf, other

    cs.CV eess.IV

    TRIDENT: The Nonlinear Trilogy for Implicit Neural Representations

    Authors: Zhenda Shen, Yanqi Cheng, Raymond H. Chan, Pietro Liò, Carola-Bibiane Schönlieb, Angelica I Aviles-Rivero

    Abstract: Implicit neural representations (INRs) have garnered significant interest recently for their ability to model complex, high-dimensional data without explicit parameterisation. In this work, we introduce TRIDENT, a novel function for implicit neural representations characterised by a trilogy of nonlinearities. Firstly, it is designed to represent high-order features through order compactness. Secon… ▽ More

    Submitted 21 November, 2023; originally announced November 2023.

  24. arXiv:2310.17994  [pdf, other

    cs.CV cs.GR

    ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Image

    Authors: Kyle Sargent, Zizhang Li, Tanmay Shah, Charles Herrmann, Hong-Xing Yu, Yunzhi Zhang, Eric Ryan Chan, Dmitry Lagun, Li Fei-Fei, Deqing Sun, Jiajun Wu

    Abstract: We introduce a 3D-aware diffusion model, ZeroNVS, for single-image novel view synthesis for in-the-wild scenes. While existing methods are designed for single objects with masked backgrounds, we propose new techniques to address challenges introduced by in-the-wild multi-object scenes with complex backgrounds. Specifically, we train a generative prior on a mixture of data sources that capture obje… ▽ More

    Submitted 23 April, 2024; v1 submitted 27 October, 2023; originally announced October 2023.

    Comments: Accepted to CVPR 2024. 12 pages

  25. arXiv:2310.11440  [pdf, other

    cs.CV

    EvalCrafter: Benchmarking and Evaluating Large Video Generation Models

    Authors: Yaofang Liu, Xiaodong Cun, Xuebo Liu, Xintao Wang, Yong Zhang, Haoxin Chen, Yang Liu, Tieyong Zeng, Raymond Chan, Ying Shan

    Abstract: The vision and language generative models have been overgrown in recent years. For video generation, various open-sourced models and public-available services have been developed to generate high-quality videos. However, these methods often use a few metrics, e.g., FVD or IS, to evaluate the performance. We argue that it is hard to judge the large conditional generative models from the simple metr… ▽ More

    Submitted 23 March, 2024; v1 submitted 17 October, 2023; originally announced October 2023.

    Comments: Technical Report, Project page: https://evalcrafter.github.io/

  26. arXiv:2310.07204  [pdf, other

    cs.AI cs.CV cs.GR cs.LG

    State of the Art on Diffusion Models for Visual Computing

    Authors: Ryan Po, Wang Yifan, Vladislav Golyanik, Kfir Aberman, Jonathan T. Barron, Amit H. Bermano, Eric Ryan Chan, Tali Dekel, Aleksander Holynski, Angjoo Kanazawa, C. Karen Liu, Lingjie Liu, Ben Mildenhall, Matthias Nießner, Björn Ommer, Christian Theobalt, Peter Wonka, Gordon Wetzstein

    Abstract: The field of visual computing is rapidly advancing due to the emergence of generative artificial intelligence (AI), which unlocks unprecedented capabilities for the generation, editing, and reconstruction of images, videos, and 3D scenes. In these domains, diffusion models are the generative AI architecture of choice. Within the last year alone, the literature on diffusion-based tools and applicat… ▽ More

    Submitted 11 October, 2023; originally announced October 2023.

  27. arXiv:2309.09770  [pdf, other

    cs.AI

    How to Data in Datathons

    Authors: Carlos Mougan, Richard Plant, Clare Teng, Marya Bazzi, Alvaro Cabrejas-Egea, Ryan Sze-Yin Chan, David Salvador Jasin, Martin Stoffel, Kirstie Jane Whitaker, Jules Manser

    Abstract: The rise of datathons, also known as data or data science hackathons, has provided a platform to collaborate, learn, and innovate in a short timeframe. Despite their significant potential benefits, organizations often struggle to effectively work with data due to a lack of clear guidelines and best practices for potential issues that might arise. Drawing on our own experiences and insights from or… ▽ More

    Submitted 25 October, 2023; v1 submitted 18 September, 2023; originally announced September 2023.

    Comments: 37th Conference on Neural Information Processing Systems (NeurIPS 2023) Track on Datasets and Benchmark

  28. arXiv:2309.04302  [pdf, other

    cs.CV

    Have We Ever Encountered This Before? Retrieving Out-of-Distribution Road Obstacles from Driving Scenes

    Authors: Youssef Shoeb, Robin Chan, Gesina Schwalbe, Azarm Nowzard, Fatma Güney, Hanno Gottschalk

    Abstract: In the life cycle of highly automated systems operating in an open and dynamic environment, the ability to adjust to emerging challenges is crucial. For systems integrating data-driven AI-based components, rapid responses to deployment issues require fast access to related data for testing and reconfiguration. In the context of automated driving, this especially applies to road obstacles that were… ▽ More

    Submitted 8 September, 2023; originally announced September 2023.

    Comments: 11 pages, 7 figures, and 3 tables

  29. arXiv:2308.13021  [pdf

    cs.HC

    Augmenting a Firefighters PPE -- Gas Mask SCBA

    Authors: Kunal Aneja, Tejaswini Ramkumar Babu, Rachel Chan

    Abstract: PPE (Personal Protective Equipment) has allowed firefighters to perform their everyday tasks without getting harmed since the mid 1800s. Now, the advancement of technology has given rise to the improvements of PPE. PPE can now include sensors to detect any number of environmental hazards (chemical, biological, temperature etc.). As the GT class of CS3750, we have decided to create a version of an… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

  30. arXiv:2308.12562  [pdf, other

    cs.LG stat.ML

    Variational Information Pursuit with Large Language and Multimodal Models for Interpretable Predictions

    Authors: Kwan Ho Ryan Chan, Aditya Chattopadhyay, Benjamin David Haeffele, Rene Vidal

    Abstract: Variational Information Pursuit (V-IP) is a framework for making interpretable predictions by design by sequentially selecting a short chain of task-relevant, user-defined and interpretable queries about the data that are most informative for the task. While this allows for built-in interpretability in predictive models, applying V-IP to any task requires data samples with dense concept-labeling b… ▽ More

    Submitted 24 August, 2023; originally announced August 2023.

  31. arXiv:2307.09052  [pdf, other

    cs.CV

    Connections between Operator-splitting Methods and Deep Neural Networks with Applications in Image Segmentation

    Authors: Hao Liu, Xue-Cheng Tai, Raymond Chan

    Abstract: Deep neural network is a powerful tool for many tasks. Understanding why it is so successful and providing a mathematical explanation is an important problem and has been one popular research direction in past years. In the literature of mathematical analysis of deep neural networks, a lot of works is dedicated to establishing representation theories. How to make connections between deep neural ne… ▽ More

    Submitted 17 October, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

    MSC Class: 68U10; 94A08

  32. arXiv:2307.09039  [pdf, other

    cs.CV

    PottsMGNet: A Mathematical Explanation of Encoder-Decoder Based Neural Networks

    Authors: Xue-Cheng Tai, Hao Liu, Raymond Chan

    Abstract: For problems in image processing and many other fields, a large class of effective neural networks has encoder-decoder-based architectures. Although these networks have made impressive performances, mathematical explanations of their architectures are still underdeveloped. In this paper, we study the encoder-decoder-based network architecture from the algorithmic perspective and provide a mathemat… ▽ More

    Submitted 15 September, 2023; v1 submitted 18 July, 2023; originally announced July 2023.

    MSC Class: 68U10; 94A08

  33. arXiv:2306.16309  [pdf, other

    cs.SI

    Raphtory: The temporal graph engine for Rust and Python

    Authors: Ben Steer, Naomi Arnold, Cheick Tidiane Ba, Renaud Lambiotte, Haaroon Yousaf, Lucas Jeub, Fabian Murariu, Shivam Kapoor, Pedro Rico, Rachel Chan, Louis Chan, James Alford, Richard G. Clegg, Felix Cuadrado, Matthew Russell Barnes, Peijie Zhong, John N. Pougué Biyong, Alhamza Alnaimi

    Abstract: Raphtory is a platform for building and analysing temporal networks. The library includes methods for creating networks from a variety of data sources; algorithms to explore their structure and evolution; and an extensible GraphQL server for deployment of applications built on top. Raphtory's core engine is built in Rust, for efficiency, with Python interfaces, for ease of use. Raphtory is develop… ▽ More

    Submitted 3 January, 2024; v1 submitted 28 June, 2023; originally announced June 2023.

  34. arXiv:2306.12146  [pdf, other

    cs.CL cs.HC

    Which Spurious Correlations Impact Reasoning in NLI Models? A Visual Interactive Diagnosis through Data-Constrained Counterfactuals

    Authors: Robin Chan, Afra Amini, Mennatallah El-Assady

    Abstract: We present a human-in-the-loop dashboard tailored to diagnosing potential spurious features that NLI models rely on for predictions. The dashboard enables users to generate diverse and challenging examples by drawing inspiration from GPT-3 suggestions. Additionally, users can receive feedback from a trained NLI model on how challenging the newly created example is and make refinements based on the… ▽ More

    Submitted 21 June, 2023; originally announced June 2023.

    Comments: 7 pages, Accepted at ACL 2023: System Demonstrations

  35. arXiv:2306.09668  [pdf, other

    cs.LG cs.AI

    Multi-Classification using One-versus-One Deep Learning Strategy with Joint Probability Estimates

    Authors: Anthony Hei-Long Chan, Raymond HonFu Chan, Lingjia Dai

    Abstract: The One-versus-One (OvO) strategy is an approach of multi-classification models which focuses on training binary classifiers between each pair of classes. While the OvO strategy takes advantage of balanced training data, the classification accuracy is usually hindered by the voting mechanism to combine all binary classifiers. In this paper, a novel OvO multi-classification model incorporating a jo… ▽ More

    Submitted 16 June, 2023; originally announced June 2023.

  36. Single-Shot Implicit Morphable Faces with Consistent Texture Parameterization

    Authors: Connor Z. Lin, Koki Nagano, Jan Kautz, Eric R. Chan, Umar Iqbal, Leonidas Guibas, Gordon Wetzstein, Sameh Khamis

    Abstract: There is a growing demand for the accessible creation of high-quality 3D avatars that are animatable and customizable. Although 3D morphable models provide intuitive control for editing and animation, and robustness for single-view face reconstruction, they cannot easily capture geometric and appearance details. Methods based on neural implicit representations, such as signed distance functions (S… ▽ More

    Submitted 4 May, 2023; originally announced May 2023.

    Comments: SIGGRAPH 2023, Project Page: https://research.nvidia.com/labs/toronto-ai/ssif

  37. arXiv:2305.02310  [pdf, other

    cs.CV cs.AI cs.GR cs.LG

    Real-Time Radiance Fields for Single-Image Portrait View Synthesis

    Authors: Alex Trevithick, Matthew Chan, Michael Stengel, Eric R. Chan, Chao Liu, Zhiding Yu, Sameh Khamis, Manmohan Chandraker, Ravi Ramamoorthi, Koki Nagano

    Abstract: We present a one-shot method to infer and render a photorealistic 3D representation from a single unposed image (e.g., face portrait) in real-time. Given a single RGB input, our image encoder directly predicts a canonical triplane representation of a neural radiance field for 3D-aware novel view synthesis via volume rendering. Our method is fast (24 fps) on consumer hardware, and produces higher q… ▽ More

    Submitted 3 May, 2023; originally announced May 2023.

    Comments: Project page: https://research.nvidia.com/labs/nxp/lp3d/

  38. arXiv:2304.06662  [pdf, other

    eess.IV cs.CV

    Deep Learning in Breast Cancer Imaging: A Decade of Progress and Future Directions

    Authors: Luyang Luo, Xi Wang, Yi Lin, Xiaoqi Ma, Andong Tan, Ronald Chan, Varut Vardhanabhuti, Winnie CW Chu, Kwang-Ting Cheng, Hao Chen

    Abstract: Breast cancer has reached the highest incidence rate worldwide among all malignancies since 2020. Breast imaging plays a significant role in early diagnosis and intervention to improve the outcome of breast cancer patients. In the past decade, deep learning has shown remarkable progress in breast cancer imaging analysis, holding great promise in interpreting the rich information and complex contex… ▽ More

    Submitted 20 January, 2024; v1 submitted 13 April, 2023; originally announced April 2023.

    Comments: IEEE RBME 2024

  39. arXiv:2304.02602  [pdf, other

    cs.CV cs.AI cs.GR

    Generative Novel View Synthesis with 3D-Aware Diffusion Models

    Authors: Eric R. Chan, Koki Nagano, Matthew A. Chan, Alexander W. Bergman, Jeong Joon Park, Axel Levy, Miika Aittala, Shalini De Mello, Tero Karras, Gordon Wetzstein

    Abstract: We present a diffusion-based model for 3D-aware generative novel view synthesis from as few as a single input image. Our model samples from the distribution of possible renderings consistent with the input and, even in the presence of ambiguity, is capable of rendering diverse and plausible novel views. To achieve this, our method makes use of existing 2D diffusion backbones but, crucially, incorp… ▽ More

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: Project page: https://nvlabs.github.io/genvs

  40. arXiv:2303.06138  [pdf, other

    cs.CV cs.GR

    Learning Object-Centric Neural Scattering Functions for Free-Viewpoint Relighting and Scene Composition

    Authors: Hong-Xing Yu, Michelle Guo, Alireza Fathi, Yen-Yu Chang, Eric Ryan Chan, Ruohan Gao, Thomas Funkhouser, Jiajun Wu

    Abstract: Photorealistic object appearance modeling from 2D images is a constant topic in vision and graphics. While neural implicit methods (such as Neural Radiance Fields) have shown high-fidelity view synthesis results, they cannot relight the captured objects. More recent neural inverse rendering approaches have enabled object relighting, but they represent surface properties as simple BRDFs, and theref… ▽ More

    Submitted 3 October, 2023; v1 submitted 10 March, 2023; originally announced March 2023.

    Comments: Journal extension of arXiv:2012.08503 (TMLR 2023). The first two authors contributed equally to this work. Project page: https://kovenyu.com/osf/

    Journal ref: Transactions on Machine Learning Research (TMLR), 2023

  41. arXiv:2303.04291  [pdf, other

    eess.IV cs.CV

    Diffusion in the Dark: A Diffusion Model for Low-Light Text Recognition

    Authors: Cindy M. Nguyen, Eric R. Chan, Alexander W. Bergman, Gordon Wetzstein

    Abstract: Capturing images is a key part of automation for high-level tasks such as scene text recognition. Low-light conditions pose a challenge for high-level perception stacks, which are often optimized on well-lit, artifact-free images. Reconstruction methods for low-light images can produce well-lit counterparts, but typically at the cost of high-frequency details critical for downstream tasks. We prop… ▽ More

    Submitted 30 October, 2023; v1 submitted 7 March, 2023; originally announced March 2023.

    Comments: WACV 2024. Project website: https://ccnguyen.github.io/diffusion-in-the-dark/

  42. arXiv:2302.14430  [pdf, other

    cs.CV

    Tracking Fast by Learning Slow: An Event-based Speed Adaptive Hand Tracker Leveraging Knowledge in RGB Domain

    Authors: Chuanlin Lan, Ziyuan Yin, Arindam Basu, Rosa H. M. Chan

    Abstract: 3D hand tracking methods based on monocular RGB videos are easily affected by motion blur, while event camera, a sensor with high temporal resolution and dynamic range, is naturally suitable for this task with sparse output and low power consumption. However, obtaining 3D annotations of fast-moving hands is difficult for constructing event-based hand-tracking datasets. In this paper, we provided a… ▽ More

    Submitted 28 February, 2023; originally announced February 2023.

  43. arXiv:2302.11517  [pdf, other

    eess.IV cs.CV cs.LG

    A Global and Patch-wise Contrastive Loss for Accurate Automated Exudate Detection

    Authors: Wei Tang, Kangning Cui, Raymond H. Chan

    Abstract: Diabetic retinopathy (DR) is a leading global cause of blindness. Early detection of hard exudates plays a crucial role in identifying DR, which aids in treating diabetes and preventing vision loss. However, the unique characteristics of hard exudates, ranging from their inconsistent shapes to indistinct boundaries, pose significant challenges to existing segmentation techniques. To address these… ▽ More

    Submitted 2 March, 2024; v1 submitted 22 February, 2023; originally announced February 2023.

    Comments: 8 pages, 3 figures, 2 tables. To appear in ISBI 2024

  44. arXiv:2302.10524  [pdf, other

    cs.LG cs.NE

    LU-Net: Invertible Neural Networks Based on Matrix Factorization

    Authors: Robin Chan, Sarina Penquitt, Hanno Gottschalk

    Abstract: LU-Net is a simple and fast architecture for invertible neural networks (INN) that is based on the factorization of quadratic weight matrices $\mathsf{A=LU}$, where $\mathsf{L}$ is a lower triangular matrix with ones on the diagonal and $\mathsf{U}$ an upper triangular matrix. Instead of learning a fully occupied matrix $\mathsf{A}$, we learn $\mathsf{L}$ and $\mathsf{U}$ separately. If combined w… ▽ More

    Submitted 21 February, 2023; originally announced February 2023.

  45. arXiv:2302.09884  [pdf, other

    cs.CV

    GlocalFuse-Depth: Fusing Transformers and CNNs for All-day Self-supervised Monocular Depth Estimation

    Authors: Zezheng Zhang, Ryan K. Y. Chan, Kenneth K. Y. Wong

    Abstract: In recent years, self-supervised monocular depth estimation has drawn much attention since it frees of depth annotations and achieved remarkable results on standard benchmarks. However, most of existing methods only focus on either daytime or nighttime images, thus their performance degrades on the other domain because of the large domain shift between daytime and nighttime images. To address this… ▽ More

    Submitted 20 February, 2023; originally announced February 2023.

  46. arXiv:2302.07045  [pdf, other

    cs.LG

    Multi-Prototypes Convex Merging Based K-Means Clustering Algorithm

    Authors: Dong Li, Shuisheng Zhou, Tieyong Zeng, Raymond H. Chan

    Abstract: K-Means algorithm is a popular clustering method. However, it has two limitations: 1) it gets stuck easily in spurious local minima, and 2) the number of clusters k has to be given a priori. To solve these two issues, a multi-prototypes convex merging based K-Means clustering algorithm (MCKM) is presented. First, based on the structure of the spurious local minima of the K-Means problem, a multi-p… ▽ More

    Submitted 14 February, 2023; originally announced February 2023.

  47. arXiv:2302.02876  [pdf, other

    cs.LG cs.AI stat.ML

    Variational Information Pursuit for Interpretable Predictions

    Authors: Aditya Chattopadhyay, Kwan Ho Ryan Chan, Benjamin D. Haeffele, Donald Geman, René Vidal

    Abstract: There is a growing interest in the machine learning community in developing predictive algorithms that are "interpretable by design". Towards this end, recent work proposes to make interpretable decisions by sequentially asking interpretable queries about data until a prediction can be made with high confidence based on the answers obtained (the history). To promote short query-answer chains, a gr… ▽ More

    Submitted 15 February, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

    Comments: Code is available at https://github.com/ryanchankh/VariationalInformationPursuit

    Report number: https://openreview.net/forum?id=77lSWa-Tm3Z

  48. arXiv:2302.00626  [pdf, other

    cs.CV eess.IV

    Continuous U-Net: Faster, Greater and Noiseless

    Authors: Chun-Wun Cheng, Christina Runkel, Lihao Liu, Raymond H Chan, Carola-Bibiane Schönlieb, Angelica I Aviles-Rivero

    Abstract: Image segmentation is a fundamental task in image analysis and clinical practice. The current state-of-the-art techniques are based on U-shape type encoder-decoder networks with skip connections, called U-Net. Despite the powerful performance reported by existing U-Net type networks, they suffer from several major limitations. Issues include the hard coding of the receptive field size, compromisin… ▽ More

    Submitted 1 February, 2023; originally announced February 2023.

  49. arXiv:2301.11499  [pdf

    cs.CV cs.AI

    Dual-View Selective Instance Segmentation Network for Unstained Live Adherent Cells in Differential Interference Contrast Images

    Authors: Fei Pan, Yutong Wu, Kangning Cui, Shuxun Chen, Yanfang Li, Yaofang Liu, Adnan Shakoor, Han Zhao, Beijia Lu, Shaohua Zhi, Raymond Chan, Dong Sun

    Abstract: Despite recent advances in data-independent and deep-learning algorithms, unstained live adherent cell instance segmentation remains a long-standing challenge in cell image processing. Adherent cells' inherent visual characteristics, such as low contrast structures, fading edges, and irregular morphology, have made it difficult to distinguish from one another, even by human experts, let alone comp… ▽ More

    Submitted 26 January, 2023; originally announced January 2023.

    Comments: 13 pages, 5 figures, 3 tables

  50. arXiv:2301.01805  [pdf, other

    cs.LG cs.CV

    Unsupervised Manifold Linearizing and Clustering

    Authors: Tianjiao Ding, Shengbang Tong, Kwan Ho Ryan Chan, Xili Dai, Yi Ma, Benjamin D. Haeffele

    Abstract: We consider the problem of simultaneously clustering and learning a linear representation of data lying close to a union of low-dimensional manifolds, a fundamental task in machine learning and computer vision. When the manifolds are assumed to be linear subspaces, this reduces to the classical problem of subspace clustering, which has been studied extensively over the past two decades. Unfortunat… ▽ More

    Submitted 24 August, 2023; v1 submitted 4 January, 2023; originally announced January 2023.