Skip to main content

Showing 1–50 of 120 results for author: Lin, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.06766  [pdf, other

    cs.DB

    Relational Perspective on Graph Query Languages

    Authors: Diego Figueira, Anthony W. Lin, Liat Peterfreund

    Abstract: We study a relational perspective of graph database querying. Such a perspective underlies various graph database systems but very few theoretical investigations have been conducted on it. This perspective offers a powerful and unified framework to study graph database querying, by which algorithms and complexity follow from classical results. We provide two concrete applications. The first is q… ▽ More

    Submitted 9 July, 2024; originally announced July 2024.

  2. arXiv:2406.17871  [pdf, other

    cs.DB

    Revisiting the Expressiveness Landscape of Data Graph Queries

    Authors: Michael Benedikt, Anthony Widjaja Lin, Di-De Yen

    Abstract: The study of graph queries in database theory has spanned more than three decades, resulting in a multitude of proposals for graph query languages. These languages differ in the mechanisms. We can identify three main families of languages, with the canonical representatives being: (1) regular path queries, (2) walk logic, and (3) first-order logic with transitive closure operators. This paper prov… ▽ More

    Submitted 25 June, 2024; originally announced June 2024.

  3. arXiv:2406.16942  [pdf, other

    eess.IV cs.AI cs.CV

    Enhancing Diagnostic Reliability of Foundation Model with Uncertainty Estimation in OCT Images

    Authors: Yuanyuan Peng, Aidi Lin, Meng Wang, Tian Lin, Ke Zou, Yinglin Cheng, Tingkun Shi, Xulong Liao, Lixia Feng, Zhen Liang, Xinjian Chen, Huazhu Fu, Haoyu Chen

    Abstract: Inability to express the confidence level and detect unseen classes has limited the clinical implementation of artificial intelligence in the real-world. We developed a foundation model with uncertainty estimation (FMUE) to detect 11 retinal conditions on optical coherence tomography (OCT). In the internal test set, FMUE achieved a higher F1 score of 96.76% than two state-of-the-art algorithms, RE… ▽ More

    Submitted 17 June, 2024; originally announced June 2024.

    Comments: All codes are available at https://github.com/yuanyuanpeng0129/FMUE

  4. arXiv:2406.09317  [pdf, other

    eess.IV cs.CV

    Common and Rare Fundus Diseases Identification Using Vision-Language Foundation Model with Knowledge of Over 400 Diseases

    Authors: Meng Wang, Tian Lin, Aidi Lin, Kai Yu, Yuanyuan Peng, Lianyu Wang, Cheng Chen, Ke Zou, Huiyu Liang, Man Chen, Xue Yao, Meiqin Zhang, Binwei Huang, Chaoxin Zheng, Peixin Zhang, Wei Chen, Yilong Luo, Yifan Chen, Honghe Xia, Tingkun Shi, Qi Zhang, Jinming Guo, Xiaolin Chen, Jingcheng Wang, Yih Chung Tham , et al. (24 additional authors not shown)

    Abstract: Previous foundation models for retinal images were pre-trained with limited disease categories and knowledge base. Here we introduce RetiZero, a vision-language foundation model that leverages knowledge from over 400 fundus diseases. To RetiZero's pre-training, we compiled 341,896 fundus images paired with text descriptions, sourced from public datasets, ophthalmic literature, and online resources… ▽ More

    Submitted 30 June, 2024; v1 submitted 13 June, 2024; originally announced June 2024.

  5. arXiv:2406.06038  [pdf, other

    cs.RO

    Navigation and 3D Surface Reconstruction from Passive Whisker Sensing

    Authors: Michael A. Lin, Hao Li, Chengyi Xing, Mark R. Cutkosky

    Abstract: Whiskers provide a way to sense surfaces in the immediate environment without disturbing it. In this paper we present a method for using highly flexible, curved, passive whiskers mounted along a robot arm to gather sensory data as they brush past objects during normal robot motion. The information is useful both for guiding the robot in cluttered spaces and for reconstructing the exposed faces of… ▽ More

    Submitted 10 June, 2024; originally announced June 2024.

    Comments: arXiv admin note: text overlap with arXiv:2210.12387

  6. arXiv:2406.02778  [pdf, other

    cs.LG

    MS-IMAP -- A Multi-Scale Graph Embedding Approach for Interpretable Manifold Learning

    Authors: Shay Deutsch, Lionel Yelibi, Alex Tong Lin, Arjun Ravi Kannan

    Abstract: Deriving meaningful representations from complex, high-dimensional data in unsupervised settings is crucial across diverse machine learning applications. This paper introduces a framework for multi-scale graph network embedding based on spectral graph wavelets that employs a contrastive learning approach. A significant feature of the proposed embedding is its capacity to establish a correspondence… ▽ More

    Submitted 5 June, 2024; v1 submitted 4 June, 2024; originally announced June 2024.

  7. arXiv:2405.18457  [pdf, other

    cs.LG stat.ML

    Improving Linear System Solvers for Hyperparameter Optimisation in Iterative Gaussian Processes

    Authors: Jihao Andreas Lin, Shreyas Padhy, Bruno Mlodozeniec, Javier Antorán, José Miguel Hernández-Lobato

    Abstract: Scaling hyperparameter optimisation to very large datasets remains an open problem in the Gaussian process community. This paper focuses on iterative methods, which use linear system solvers, like conjugate gradients, alternating projections or stochastic gradient descent, to construct an estimate of the marginal likelihood gradient. We discuss three key improvements which are applicable across so… ▽ More

    Submitted 6 June, 2024; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: Preprint. arXiv admin note: text overlap with arXiv:2405.18328

  8. arXiv:2405.18328  [pdf, other

    cs.LG stat.ML

    Warm Start Marginal Likelihood Optimisation for Iterative Gaussian Processes

    Authors: Jihao Andreas Lin, Shreyas Padhy, Bruno Mlodozeniec, José Miguel Hernández-Lobato

    Abstract: Gaussian processes are a versatile probabilistic machine learning model whose effectiveness often depends on good hyperparameters, which are typically learned by maximising the marginal likelihood. In this work, we consider iterative methods, which use iterative linear system solvers to approximate marginal likelihood gradients up to a specified numerical precision, allowing a trade-off between co… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

    Comments: Advances in Approximate Bayesian Inference 2024

  9. arXiv:2405.16166  [pdf, other

    cs.FL

    The Power of Hard Attention Transformers on Data Sequences: A Formal Language Theoretic Perspective

    Authors: Pascal Bergsträßer, Chris Köcher, Anthony Widjaja Lin, Georg Zetzsche

    Abstract: Formal language theory has recently been successfully employed to unravel the power of transformer encoders. This setting is primarily applicable in Natural Languange Processing (NLP), as a token embedding function (where a bounded number of tokens is admitted) is first applied before feeding the input to the transformer. On certain kinds of data (e.g. time series), we want our transformers to be… ▽ More

    Submitted 25 May, 2024; originally announced May 2024.

  10. arXiv:2405.06945  [pdf, other

    cs.CV

    Direct Learning of Mesh and Appearance via 3D Gaussian Splatting

    Authors: Ancheng Lin, Jun Li

    Abstract: Accurately reconstructing a 3D scene including explicit geometry information is both attractive and challenging. Geometry reconstruction can benefit from incorporating differentiable appearance models, such as Neural Radiance Fields and 3D Gaussian Splatting (3DGS). In this work, we propose a learnable scene model that incorporates 3DGS with an explicit geometry representation, namely a mesh. Our… ▽ More

    Submitted 11 May, 2024; originally announced May 2024.

  11. Countering Mainstream Bias via End-to-End Adaptive Local Learning

    Authors: Jinhao Pan, Ziwei Zhu, Jianling Wang, Allen Lin, James Caverlee

    Abstract: Collaborative filtering (CF) based recommendations suffer from mainstream bias -- where mainstream users are favored over niche users, leading to poor recommendation quality for many long-tail users. In this paper, we identify two root causes of this mainstream bias: (i) discrepancy modeling, whereby CF algorithms focus on modeling mainstream users while neglecting niche users with unique preferen… ▽ More

    Submitted 12 April, 2024; originally announced April 2024.

    Comments: ECIR 2024

    Journal ref: In European Conference on Information Retrieval 2024, vol 14612 (pp. 75-89)

  12. arXiv:2402.14817  [pdf, other

    cs.CV cs.LG

    Cameras as Rays: Pose Estimation via Ray Diffusion

    Authors: Jason Y. Zhang, Amy Lin, Moneish Kumar, Tzu-Hsuan Yang, Deva Ramanan, Shubham Tulsiani

    Abstract: Estimating camera poses is a fundamental task for 3D reconstruction and remains challenging given sparsely sampled views (<10). In contrast to existing approaches that pursue top-down prediction of global parametrizations of camera extrinsics, we propose a distributed representation of camera pose that treats a camera as a bundle of rays. This representation allows for a tight coupling with spatia… ▽ More

    Submitted 4 April, 2024; v1 submitted 22 February, 2024; originally announced February 2024.

    Comments: In ICLR 2024 (oral). v2-3: updated references. Project webpage: https://jasonyzhang.com/RayDiffusion

  13. arXiv:2402.09430  [pdf, other

    eess.SP cs.AI cs.CV cs.MM

    WiMANS: A Benchmark Dataset for WiFi-based Multi-user Activity Sensing

    Authors: Shuokang Huang, Kaihan Li, Di You, Yichong Chen, Arvin Lin, Siying Liu, Xiaohui Li, Julie A. McCann

    Abstract: WiFi-based human sensing has exhibited remarkable potential to analyze user behaviors in a non-intrusive and device-free manner, benefiting applications as diverse as smart homes and healthcare. However, most previous works focus on single-user sensing, which has limited practicability in scenarios involving multiple users. Although recent studies have begun to investigate WiFi-based multi-user se… ▽ More

    Submitted 12 March, 2024; v1 submitted 24 January, 2024; originally announced February 2024.

    Comments: We present WiMANS, to our knowledge, the first dataset for multi-user activity sensing based on WiFi

  14. arXiv:2402.01695  [pdf, other

    cs.CL cs.AI cs.LG

    Language-Guided World Models: A Model-Based Approach to AI Control

    Authors: Alex Zhang, Khanh Nguyen, Jens Tuyls, Albert Lin, Karthik Narasimhan

    Abstract: This paper introduces the concept of Language-Guided World Models (LWMs) -- probabilistic models that can simulate environments by reading texts. Agents equipped with these models provide humans with more extensive and efficient control, allowing them to simultaneously alter agent behaviors in multiple tasks via natural verbal communication. In this work, we take initial steps in developing robust… ▽ More

    Submitted 4 July, 2024; v1 submitted 23 January, 2024; originally announced February 2024.

    Comments: SpLU-RoboNLP workshop at ACL 2024

  15. arXiv:2401.02618  [pdf, ps, other

    cs.SE cs.LO

    Regular Abstractions for Array Systems

    Authors: Chih-Duo Hong, Anthony W. Lin

    Abstract: Verifying safety and liveness over array systems is a highly challenging problem. Array systems naturally capture parameterized systems such as distributed protocols with an unbounded number of processes. Such distributed protocols often exploit process IDs during their computation, resulting in array systems whose element values range over an infinite domain. In this paper, we develop a novel fra… ▽ More

    Submitted 4 January, 2024; originally announced January 2024.

  16. arXiv:2312.10074  [pdf

    cs.HC

    STAGER checklist: Standardized Testing and Assessment Guidelines for Evaluating Generative AI Reliability

    Authors: Jinghong Chen, Lingxuan Zhu, Weiming Mou, Zaoqu Liu, Quan Cheng, Anqi Lin, Jian Zhang, Peng Luo

    Abstract: Generative Artificial Intelligence (AI) holds immense potential in medical applications. Numerous studies have explored the efficacy of various generative AI models within healthcare contexts, but there is a lack of a comprehensive and systematic evaluation framework. Given that some studies evaluating the ability of generative AI for medical applications have deficiencies in their methodological… ▽ More

    Submitted 7 December, 2023; originally announced December 2023.

    Comments: 11 pages, 0 figure, 2 tables

  17. arXiv:2312.08604  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Verification of Neural Reachable Tubes via Scenario Optimization and Conformal Prediction

    Authors: Albert Lin, Somil Bansal

    Abstract: Learning-based approaches for controlling safety-critical systems are rapidly growing in popularity; thus, it is important to assure their performance and safety. Hamilton-Jacobi (HJ) reachability analysis is a popular formal verification tool for providing such guarantees, since it can handle general nonlinear system dynamics, bounded adversarial system disturbances, and state and input constrain… ▽ More

    Submitted 9 April, 2024; v1 submitted 13 December, 2023; originally announced December 2023.

    Comments: Accepted to 6th Annual Learning for Dynamics & Control Conference. arXiv admin note: text overlap with arXiv:2209.12336

  18. arXiv:2311.17037  [pdf, other

    cs.GT cs.FL

    Concurrent Stochastic Lossy Channel Games

    Authors: Daniel Stan, Muhammad Najib, Anthony Widjaja Lin, Parosh Aziz Abdulla

    Abstract: Concurrent stochastic games are an important formalism for the rational verification of probabilistic multi-agent systems, which involves verifying whether a temporal logic property is satisfied in some or all game-theoretic equilibria of such systems. In this work, we study the rational verification of probabilistic multi-agent systems where agents can cooperate by communicating over unbounded lo… ▽ More

    Submitted 28 November, 2023; originally announced November 2023.

    Comments: To appear at CSL 2024. Extended version

  19. arXiv:2311.15883  [pdf, other

    cs.GT cs.FL cs.LO cs.MA

    Characterising and Verifying the Core in Concurrent Multi-Player Mean-Payoff Games (Full Version)

    Authors: Julian Gutierrez, Anthony W. Lin, Muhammad Najib, Thomas Steeples, Michael Wooldridge

    Abstract: Concurrent multi-player mean-payoff games are important models for systems of agents with individual, non-dichotomous preferences. Whilst these games have been extensively studied in terms of their equilibria in non-cooperative settings, this paper explores an alternative solution concept: the core from cooperative game theory. This concept is particularly relevant for cooperative AI systems, as i… ▽ More

    Submitted 27 November, 2023; originally announced November 2023.

    Comments: This is the full version of the paper with the same title that appears in the CSL'24 proceedings

  20. arXiv:2311.04031  [pdf, other

    cs.LO cs.FL

    Ramsey Quantifiers in Linear Arithmetics

    Authors: Pascal Bergsträßer, Moses Ganardi, Anthony W. Lin, Georg Zetzsche

    Abstract: We study Satisfiability Modulo Theories (SMT) enriched with the so-called Ramsey quantifiers, which assert the existence of cliques (complete graphs) in the graph induced by some formulas. The extended framework is known to have applications in proving program termination (in particular, whether a transitive binary predicate is well-founded), and monadic decomposability of SMT formulas. Our main r… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

  21. arXiv:2311.03901  [pdf, ps, other

    cs.FL cs.LO

    Parikh's Theorem Made Symbolic

    Authors: Matthew Hague, Artur Jeż, Anthony W. Lin

    Abstract: Parikh's Theorem is a fundamental result in automata theory with numerous applications in computer science: software verification (e.g. infinite-state verification, string constraints, and theory of arrays), verification of cryptographic protocols (e.g. using Horn clauses modulo equational theories) and database querying (e.g. evaluating path-queries in graph databases). Parikh's Theorem states th… ▽ More

    Submitted 7 November, 2023; originally announced November 2023.

    Comments: Accepted tp POPL '24

  22. arXiv:2310.20581  [pdf, other

    cs.LG stat.ML

    Stochastic Gradient Descent for Gaussian Processes Done Right

    Authors: Jihao Andreas Lin, Shreyas Padhy, Javier Antorán, Austin Tripp, Alexander Terenin, Csaba Szepesvári, José Miguel Hernández-Lobato, David Janz

    Abstract: As is well known, both sampling from the posterior and computing the mean of the posterior in Gaussian process regression reduces to solving a large linear system of equations. We study the use of stochastic gradient descent for solving this linear system, and show that when \emph{done right} -- by which we mean using specific insights from the optimisation and kernel communities -- stochastic gra… ▽ More

    Submitted 28 April, 2024; v1 submitted 31 October, 2023; originally announced October 2023.

  23. arXiv:2310.09974  [pdf, other

    cs.GT

    Algorithmic Contract Design for Crowdsourced Ranking

    Authors: Kiriaki Frangias, Andrew Lin, Ellen Vitercik, Manolis Zampetakis

    Abstract: Ranking is fundamental to many areas, such as search engine optimization, human feedback for language models, as well as peer grading. Crowdsourcing, which is often used for these tasks, requires proper incentivization to ensure accurate inputs. In this work, we draw on the field of \emph{contract theory} from Economics to propose a novel mechanism that enables a \emph{principal} to accurately ran… ▽ More

    Submitted 24 January, 2024; v1 submitted 15 October, 2023; originally announced October 2023.

  24. arXiv:2310.08873  [pdf, other

    cs.RO cs.AI

    Interactive Navigation in Environments with Traversable Obstacles Using Large Language and Vision-Language Models

    Authors: Zhen Zhang, Anran Lin, Chun Wai Wong, Xiangyu Chu, Qi Dou, K. W. Samuel Au

    Abstract: This paper proposes an interactive navigation framework by using large language and vision-language models, allowing robots to navigate in environments with traversable obstacles. We utilize the large language model (GPT-3.5) and the open-set Vision-language Model (Grounding DINO) to create an action-aware costmap to perform effective path planning without fine-tuning. With the large models, we ca… ▽ More

    Submitted 12 March, 2024; v1 submitted 13 October, 2023; originally announced October 2023.

    Comments: Accepted by 2024 IEEE International Conference on Robotics and Automation (ICRA), 7 pages, 8 figures

  25. arXiv:2310.07916  [pdf, other

    cs.CV

    Dynamic Appearance Particle Neural Radiance Field

    Authors: Ancheng Lin, Jun Li

    Abstract: Neural Radiance Fields (NeRFs) have shown great potential in modelling 3D scenes. Dynamic NeRFs extend this model by capturing time-varying elements, typically using deformation fields. The existing dynamic NeRFs employ a similar Eulerian representation for both light radiance and deformation fields. This leads to a close coupling of appearance and motion and lacks a physical interpretation. In th… ▽ More

    Submitted 10 December, 2023; v1 submitted 11 October, 2023; originally announced October 2023.

  26. arXiv:2310.05126  [pdf, other

    cs.CV cs.AI

    UReader: Universal OCR-free Visually-situated Language Understanding with Multimodal Large Language Model

    Authors: Jiabo Ye, Anwen Hu, Haiyang Xu, Qinghao Ye, Ming Yan, Guohai Xu, Chenliang Li, Junfeng Tian, Qi Qian, Ji Zhang, Qin Jin, Liang He, Xin Alex Lin, Fei Huang

    Abstract: Text is ubiquitous in our visual world, conveying crucial information, such as in documents, websites, and everyday photographs. In this work, we propose UReader, a first exploration of universal OCR-free visually-situated language understanding based on the Multimodal Large Language Model (MLLM). By leveraging the shallow text recognition ability of the MLLM, we only finetuned 1.2% parameters and… ▽ More

    Submitted 8 October, 2023; originally announced October 2023.

  27. arXiv:2310.03817  [pdf, ps, other

    cs.FL cs.LG

    Logical Languages Accepted by Transformer Encoders with Hard Attention

    Authors: Pablo Barcelo, Alexander Kozachinskiy, Anthony Widjaja Lin, Vladimir Podolskii

    Abstract: We contribute to the study of formal languages that can be recognized by transformer encoders. We focus on two self-attention mechanisms: (1) UHAT (Unique Hard Attention Transformers) and (2) AHAT (Average Hard Attention Transformers). UHAT encoders are known to recognize only languages inside the circuit complexity class ${\sf AC}^0$, i.e., accepted by a family of poly-sized and depth-bounded boo… ▽ More

    Submitted 5 October, 2023; originally announced October 2023.

  28. arXiv:2310.00420  [pdf, other

    eess.SP cs.LG stat.ML

    An Efficient Algorithm for Clustered Multi-Task Compressive Sensing

    Authors: Alexander Lin, Demba Ba

    Abstract: This paper considers clustered multi-task compressive sensing, a hierarchical model that solves multiple compressive sensing tasks by finding clusters of tasks that leverage shared information to mutually improve signal reconstruction. The existing inference algorithm for this model is computationally expensive and does not scale well in high dimensions. The main bottleneck involves repeated matri… ▽ More

    Submitted 30 September, 2023; originally announced October 2023.

  29. MMEAD: MS MARCO Entity Annotations and Disambiguations

    Authors: Chris Kamphuis, Aileen Lin, Siwen Yang, Jimmy Lin, Arjen P. de Vries, Faegheh Hasibi

    Abstract: MMEAD, or MS MARCO Entity Annotations and Disambiguations, is a resource for entity links for the MS MARCO datasets. We specify a format to store and share links for both document and passage collections of MS MARCO. Following this specification, we release entity links to Wikipedia for documents and passages in both MS MARCO collections (v1 and v2). Entity links have been produced by the REL and… ▽ More

    Submitted 14 September, 2023; originally announced September 2023.

  30. arXiv:2308.00175  [pdf, ps, other

    cs.LO

    Decision Procedures for Sequence Theories (Technical Report)

    Authors: Artur Jeż, Anthony W. Lin, Oliver Markgraf, Philipp Rümmer

    Abstract: Sequence theories are an extension of theories of strings with an infinite alphabet of letters, together with a corresponding alphabet theory (e.g. linear integer arithmetic). Sequences are natural abstractions of extendable arrays, which permit a wealth of operations including append, map, split, and concatenation. In spite of the growing amount of tool support for theories of sequences by leadin… ▽ More

    Submitted 31 July, 2023; originally announced August 2023.

  31. arXiv:2307.09729  [pdf, other

    cs.CV cs.MM eess.IV

    NTIRE 2023 Quality Assessment of Video Enhancement Challenge

    Authors: Xiaohong Liu, Xiongkuo Min, Wei Sun, Yulun Zhang, Kai Zhang, Radu Timofte, Guangtao Zhai, Yixuan Gao, Yuqin Cao, Tengchuan Kou, Yunlong Dong, Ziheng Jia, Yilin Li, Wei Wu, Shuming Hu, Sibin Deng, Pengxiang Xiao, Ying Chen, Kai Li, Kai Zhao, Kun Yuan, Ming Sun, Heng Cong, Hao Wang, Lingzhi Fu , et al. (47 additional authors not shown)

    Abstract: This paper reports on the NTIRE 2023 Quality Assessment of Video Enhancement Challenge, which will be held in conjunction with the New Trends in Image Restoration and Enhancement Workshop (NTIRE) at CVPR 2023. This challenge is to address a major challenge in the field of video processing, namely, video quality assessment (VQA) for enhanced videos. The challenge uses the VQA Dataset for Perceptual… ▽ More

    Submitted 18 July, 2023; originally announced July 2023.

  32. arXiv:2307.07816  [pdf, other

    cs.LG stat.ML

    Minimal Random Code Learning with Mean-KL Parameterization

    Authors: Jihao Andreas Lin, Gergely Flamich, José Miguel Hernández-Lobato

    Abstract: This paper studies the qualitative behavior and robustness of two variants of Minimal Random Code Learning (MIRACLE) used to compress variational Bayesian neural networks. MIRACLE implements a powerful, conditionally Gaussian variational approximation for the weight posterior $Q_{\mathbf{w}}$ and uses relative entropy coding to compress a weight sample from the posterior using a Gaussian coding di… ▽ More

    Submitted 4 December, 2023; v1 submitted 15 July, 2023; originally announced July 2023.

    Comments: ICML Neural Compression Workshop 2023

  33. arXiv:2307.06093  [pdf, other

    cs.LG stat.ML

    Online Laplace Model Selection Revisited

    Authors: Jihao Andreas Lin, Javier Antorán, José Miguel Hernández-Lobato

    Abstract: The Laplace approximation provides a closed-form model selection objective for neural networks (NN). Online variants, which optimise NN parameters jointly with hyperparameters, like weight decay strength, have seen renewed interest in the Bayesian deep learning community. However, these methods violate Laplace's method's critical assumption that the approximation is performed around a mode of the… ▽ More

    Submitted 9 January, 2024; v1 submitted 12 July, 2023; originally announced July 2023.

    Comments: Advances in Approximate Bayesian Inference 2023

  34. arXiv:2307.06055  [pdf, other

    cs.LG stat.ML

    Function-Space Regularization for Deep Bayesian Classification

    Authors: Jihao Andreas Lin, Joe Watson, Pascal Klink, Jan Peters

    Abstract: Bayesian deep learning approaches assume model parameters to be latent random variables and infer posterior distributions to quantify uncertainty, increase safety and trust, and prevent overconfident and unpredictable behavior. However, weight-space priors are model-specific, can be difficult to interpret and are hard to specify. Instead, we apply a Dirichlet prior in predictive space and perform… ▽ More

    Submitted 12 July, 2023; originally announced July 2023.

    Comments: Advances in Approximate Bayesian Inference 2023

  35. arXiv:2307.03093  [pdf, other

    cs.LG stat.ML

    Beyond Intuition, a Framework for Applying GPs to Real-World Data

    Authors: Kenza Tazi, Jihao Andreas Lin, Ross Viljoen, Alex Gardner, ST John, Hong Ge, Richard E. Turner

    Abstract: Gaussian Processes (GPs) offer an attractive method for regression over small, structured and correlated datasets. However, their deployment is hindered by computational costs and limited guidelines on how to apply GPs beyond simple low-dimensional datasets. We propose a framework to identify the suitability of GPs to a given problem and how to set up a robust and well-specified GP model. The guid… ▽ More

    Submitted 17 July, 2023; v1 submitted 6 July, 2023; originally announced July 2023.

    Comments: Accepted at the ICML Workshop on Structured Probabilistic Inference and Generative Modelling (2023)

  36. arXiv:2306.11589  [pdf, other

    cs.LG stat.ML

    Sampling from Gaussian Process Posteriors using Stochastic Gradient Descent

    Authors: Jihao Andreas Lin, Javier Antorán, Shreyas Padhy, David Janz, José Miguel Hernández-Lobato, Alexander Terenin

    Abstract: Gaussian processes are a powerful framework for quantifying uncertainty and for sequential decision-making but are limited by the requirement of solving linear systems. In general, this has a cubic cost in dataset size and is sensitive to conditioning. We explore stochastic gradient algorithms as a computationally efficient method of approximately solving these linear systems: we develop low-varia… ▽ More

    Submitted 15 January, 2024; v1 submitted 20 June, 2023; originally announced June 2023.

    Journal ref: Advances in Neural Information Processing Systems, 2023

  37. arXiv:2306.05500  [pdf, other

    cs.CL cs.AI cs.CV cs.LG

    Word-Level Explanations for Analyzing Bias in Text-to-Image Models

    Authors: Alexander Lin, Lucas Monteiro Paes, Sree Harsha Tanneru, Suraj Srinivas, Himabindu Lakkaraju

    Abstract: Text-to-image models take a sentence (i.e., prompt) and generate images associated with this input prompt. These models have created award wining-art, videos, and even synthetic datasets. However, text-to-image (T2I) models can generate images that underrepresent minorities based on race and sex. This paper investigates which word in the input prompt is responsible for bias in generated images. We… ▽ More

    Submitted 3 June, 2023; originally announced June 2023.

    Comments: 5 main pages, 3 pages in appendix, and 3 figures

  38. arXiv:2306.03249  [pdf, other

    cs.LG eess.SP stat.CO

    Probabilistic Unrolling: Scalable, Inverse-Free Maximum Likelihood Estimation for Latent Gaussian Models

    Authors: Alexander Lin, Bahareh Tolooshams, Yves Atchadé, Demba Ba

    Abstract: Latent Gaussian models have a rich history in statistics and machine learning, with applications ranging from factor analysis to compressed sensing to time series analysis. The classical method for maximizing the likelihood of these models is the expectation-maximization (EM) algorithm. For problems with high-dimensional latent variables and large datasets, EM scales poorly because it needs to inv… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

    Comments: 29 pages, 4 figures

    Journal ref: International Conference on Machine Learning, 2023

  39. arXiv:2305.17110  [pdf, other

    cs.RO

    IndustReal: Transferring Contact-Rich Assembly Tasks from Simulation to Reality

    Authors: Bingjie Tang, Michael A. Lin, Iretiayo Akinola, Ankur Handa, Gaurav S. Sukhatme, Fabio Ramos, Dieter Fox, Yashraj Narang

    Abstract: Robotic assembly is a longstanding challenge, requiring contact-rich interaction and high precision and accuracy. Many applications also require adaptivity to diverse parts, poses, and environments, as well as low cycle times. In other areas of robotics, simulation is a powerful tool to develop algorithms, generate datasets, and train agents. However, simulation has had a more limited impact on as… ▽ More

    Submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted to Robotics: Science and Systems (RSS) 2023

  40. arXiv:2305.04926  [pdf, other

    cs.CV

    RelPose++: Recovering 6D Poses from Sparse-view Observations

    Authors: Amy Lin, Jason Y. Zhang, Deva Ramanan, Shubham Tulsiani

    Abstract: We address the task of estimating 6D camera poses from sparse-view image sets (2-8 images). This task is a vital pre-processing stage for nearly all contemporary (neural) reconstruction algorithms but remains challenging given sparse views, especially for objects with visual symmetries and texture-less surfaces. We build on the recent RelPose framework which learns a network that infers distributi… ▽ More

    Submitted 18 December, 2023; v1 submitted 8 May, 2023; originally announced May 2023.

    Comments: Project webpage: https://amyxlase.github.io/relpose-plus-plus (Accepted to 3DV 2024)

  41. FashionTex: Controllable Virtual Try-on with Text and Texture

    Authors: Anran Lin, Nanxuan Zhao, Shuliang Ning, Yuda Qiu, Baoyuan Wang, Xiaoguang Han

    Abstract: Virtual try-on attracts increasing research attention as a promising way for enhancing the user experience for online cloth shopping. Though existing methods can generate impressive results, users need to provide a well-designed reference image containing the target fashion clothes that often do not exist. To support user-friendly fashion customization in full-body portraits, we propose a multi-mo… ▽ More

    Submitted 8 May, 2023; originally announced May 2023.

    Comments: Accepted to SIGGRAPH 2023 (Conference Proceedings)

  42. arXiv:2304.03981  [pdf, other

    cs.LG cs.CV

    Uncertainty-inspired Open Set Learning for Retinal Anomaly Identification

    Authors: Meng Wang, Tian Lin, Lianyu Wang, Aidi Lin, Ke Zou, Xinxing Xu, Yi Zhou, Yuanyuan Peng, Qingquan Meng, Yiming Qian, Guoyao Deng, Zhiqun Wu, Junhong Chen, Jianhong Lin, Mingzhi Zhang, Weifang Zhu, Changqing Zhang, Daoqiang Zhang, Rick Siow Mong Goh, Yong Liu, Chi Pui Pang, Xinjian Chen, Haoyu Chen, Huazhu Fu

    Abstract: Failure to recognize samples from the classes unseen during training is a major limitation of artificial intelligence in the real-world implementation for recognition and classification of retinal anomalies. We established an uncertainty-inspired open-set (UIOS) model, which was trained with fundus images of 9 retinal conditions. Besides assessing the probability of each category, UIOS also calcul… ▽ More

    Submitted 29 August, 2023; v1 submitted 8 April, 2023; originally announced April 2023.

  43. arXiv:2302.06656  [pdf, other

    cs.IR

    Enhancing User Personalization in Conversational Recommenders

    Authors: Allen Lin, Ziwei Zhu, Jianling Wang, James Caverlee

    Abstract: Conversational recommenders are emerging as a powerful tool to personalize a user's recommendation experience. Through a back-and-forth dialogue, users can quickly hone in on just the right items. Many approaches to conversational recommendation, however, only partially explore the user preference space and make limiting assumptions about how user feedback can be best incorporated, resulting in lo… ▽ More

    Submitted 13 February, 2023; originally announced February 2023.

    Comments: To Appear On TheWebConf (WWW) 2023

  44. arXiv:2211.10580  [pdf, other

    cs.CV

    Normal Transformer: Extracting Surface Geometry from LiDAR Points Enhanced by Visual Semantics

    Authors: Ancheng Lin, Jun Li

    Abstract: High-quality estimation of surface normal can help reduce ambiguity in many geometry understanding problems, such as collision avoidance and occlusion inference. This paper presents a technique for estimating the normal from 3D point clouds and 2D colour images. We have developed a transformer neural network that learns to utilise the hybrid information of visual semantic and 3D geometric data, as… ▽ More

    Submitted 6 July, 2023; v1 submitted 18 November, 2022; originally announced November 2022.

  45. arXiv:2210.12387  [pdf, other

    cs.RO

    Whisker-Inspired Tactile Sensing for Contact Localization on Robot Manipulators

    Authors: Michael A. Lin, Emilio Reyes, Jeannette Bohg, Mark R. Cutkosky

    Abstract: Perceiving the environment through touch is important for robots to reach in cluttered environments, but devising a way to sense without disturbing objects is challenging. This work presents the design and modelling of whisker-inspired sensors that attach to the surface of a robot manipulator to sense its surrounding through light contacts. We obtain a sensor model using a calibration process that… ▽ More

    Submitted 22 October, 2022; originally announced October 2022.

    Comments: 8 pages, 7 figures, conference

  46. arXiv:2210.00135  [pdf, other

    cs.RO cs.HC

    Deep Learning Classification of Touch Gestures Using Distributed Normal and Shear Force

    Authors: Hojung Choi, Dane Brouwer, Michael A. Lin, Kyle T. Yoshida, Carine Rognon, Benjamin Stephens-Fripp, Allison M. Okamura, Mark R. Cutkosky

    Abstract: When humans socially interact with another agent (e.g., human, pet, or robot) through touch, they do so by applying varying amounts of force with different directions, locations, contact areas, and durations. While previous work on touch gesture recognition has focused on the spatio-temporal distribution of normal forces, we hypothesize that the addition of shear forces will permit more reliable c… ▽ More

    Submitted 30 September, 2022; originally announced October 2022.

  47. arXiv:2209.12336  [pdf, other

    cs.RO cs.AI cs.LG eess.SY

    Generating Formal Safety Assurances for High-Dimensional Reachability

    Authors: Albert Lin, Somil Bansal

    Abstract: Providing formal safety and performance guarantees for autonomous systems is becoming increasingly important. Hamilton-Jacobi (HJ) reachability analysis is a popular formal verification tool for providing these guarantees, since it can handle general nonlinear system dynamics, bounded adversarial system disturbances, and state and input constraints. However, it involves solving a PDE, whose comput… ▽ More

    Submitted 10 June, 2023; v1 submitted 25 September, 2022; originally announced September 2022.

    Comments: Accepted to ICRA 2023

    ACM Class: I.2.9; I.2.8

  48. arXiv:2209.04732  [pdf

    cs.DB cs.AI

    Ontologizing Health Systems Data at Scale: Making Translational Discovery a Reality

    Authors: Tiffany J. Callahan, Adrianne L. Stefanski, Jordan M. Wyrwa, Chenjie Zeng, Anna Ostropolets, Juan M. Banda, William A. Baumgartner Jr., Richard D. Boyce, Elena Casiraghi, Ben D. Coleman, Janine H. Collins, Sara J. Deakyne-Davies, James A. Feinstein, Melissa A. Haendel, Asiyah Y. Lin, Blake Martin, Nicolas A. Matentzoglu, Daniella Meeker, Justin Reese, Jessica Sinclair, Sanya B. Taneja, Katy E. Trinkley, Nicole A. Vasilevsky, Andrew Williams, Xingman A. Zhang , et al. (7 additional authors not shown)

    Abstract: Background: Common data models solve many challenges of standardizing electronic health record (EHR) data, but are unable to semantically integrate all the resources needed for deep phenotyping. Open Biological and Biomedical Ontology (OBO) Foundry ontologies provide computable representations of biological knowledge and enable the integration of heterogeneous data. However, mapping EHR data to OB… ▽ More

    Submitted 30 January, 2023; v1 submitted 10 September, 2022; originally announced September 2022.

    Comments: Supplementary Material is included at the end of the manuscript

    ACM Class: J.3

  49. arXiv:2208.03854  [pdf, other

    cs.IR

    Towards Fair Conversational Recommender Systems

    Authors: Allen Lin, Ziwei Zhu, Jianling Wang, James Caverlee

    Abstract: Conversational recommender systems have demonstrated great success. They can accurately capture a user's current detailed preference -- through a multi-round interaction cycle -- to effectively guide users to a more personalized recommendation. Alas, conversational recommender systems can be plagued by the adverse effects of bias, much like traditional recommenders. In this work, we argue for incr… ▽ More

    Submitted 19 August, 2022; v1 submitted 7 August, 2022; originally announced August 2022.

    Comments: arXiv admin note: substantial text overlap with arXiv:2208.03298

  50. Quantifying and Mitigating Popularity Bias in Conversational Recommender Systems

    Authors: Allen Lin, Jianling Wang, Ziwei Zhu, James Caverlee

    Abstract: Conversational recommender systems (CRS) have shown great success in accurately capturing a user's current and detailed preference through the multi-round interaction cycle while effectively guiding users to a more personalized recommendation. Perhaps surprisingly, conversational recommender systems can be plagued by popularity bias, much like traditional recommender systems. In this paper, we sys… ▽ More

    Submitted 19 August, 2022; v1 submitted 5 August, 2022; originally announced August 2022.

    Comments: to appear in CIKM22