Zum Hauptinhalt springen

Showing 1–50 of 97 results for author: Ho, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2407.09688  [pdf, other

    cs.CL

    Large Language Models for Integrating Social Determinant of Health Data: A Case Study on Heart Failure 30-Day Readmission Prediction

    Authors: Chase Fensore, Rodrigo M. Carrillo-Larco, Shivani A. Patel, Alanna A. Morris, Joyce C. Ho

    Abstract: Social determinants of health (SDOH) $-$ the myriad of circumstances in which people live, grow, and age $-$ play an important role in health outcomes. However, existing outcome prediction models often only use proxies of SDOH as features. Recent open data initiatives present an opportunity to construct a more comprehensive view of SDOH, but manually integrating the most relevant data for individu… ▽ More

    Submitted 12 July, 2024; originally announced July 2024.

    Comments: 36 pages including references and appendix. This is a work in progress

  2. TACCO: Task-guided Co-clustering of Clinical Concepts and Patient Visits for Disease Subtyping based on EHR Data

    Authors: Ziyang Zhang, Hejie Cui, Ran Xu, Yuzhang Xie, Joyce C. Ho, Carl Yang

    Abstract: The growing availability of well-organized Electronic Health Records (EHR) data has enabled the development of various machine learning models towards disease risk prediction. However, existing risk prediction methods overlook the heterogeneity of complex diseases, failing to model the potential disease subtypes regarding their corresponding patient visits and clinical concept subgroups. In this w… ▽ More

    Submitted 14 June, 2024; originally announced June 2024.

    Comments: 11 pages, 5 figures, to be published in Proceedings of the 30th ACM SIGKDD Conference on Knowledge Discovery and Data Mining

  3. arXiv:2406.05682  [pdf, other

    cs.LG cs.AI

    From Basic to Extra Features: Hypergraph Transformer Pretrain-then-Finetuning for Balanced Clinical Predictions on EHR

    Authors: Ran Xu, Yiwen Lu, Chang Liu, Yong Chen, Yan Sun, Xiao Hu, Joyce C Ho, Carl Yang

    Abstract: Electronic Health Records (EHRs) contain rich patient information and are crucial for clinical research and practice. In recent years, deep learning models have been applied to EHRs, but they often rely on massive features, which may not be readily available for all patients. We propose HTP-Star, which leverages hypergraph structures with a pretrain-then-finetune framework for modeling EHR data, e… ▽ More

    Submitted 9 June, 2024; originally announced June 2024.

    Comments: CHIL 2024

  4. arXiv:2405.07500  [pdf, other

    cs.IR cs.AI cs.CL

    PromptLink: Leveraging Large Language Models for Cross-Source Biomedical Concept Linking

    Authors: Yuzhang Xie, Jiaying Lu, Joyce Ho, Fadi Nahab, Xiao Hu, Carl Yang

    Abstract: Linking (aligning) biomedical concepts across diverse data sources enables various integrative analyses, but it is challenging due to the discrepancies in concept naming conventions. Various strategies have been developed to overcome this challenge, such as those based on string-matching rules, manually crafted thesauri, and machine learning models. However, these methods are constrained by limite… ▽ More

    Submitted 13 May, 2024; originally announced May 2024.

    Journal ref: Proceedings of the 47th International ACM SIGIR Conference on Research and Development in Information Retrieval (Short-Paper Track), 2024

  5. arXiv:2404.18443  [pdf, other

    cs.CL cs.AI cs.IR q-bio.QM

    BMRetriever: Tuning Large Language Models as Better Biomedical Text Retrievers

    Authors: Ran Xu, Wenqi Shi, Yue Yu, Yuchen Zhuang, Yanqiao Zhu, May D. Wang, Joyce C. Ho, Chao Zhang, Carl Yang

    Abstract: Developing effective biomedical retrieval models is important for excelling at knowledge-intensive biomedical tasks but still challenging due to the deficiency of sufficient publicly annotated biomedical data and computational resources. We present BMRetriever, a series of dense retrievers for enhancing biomedical retrieval via unsupervised pre-training on large biomedical corpora, followed by ins… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Work in progress. The model and data will be uploaded to \url{https://github.com/ritaranx/BMRetriever}

  6. arXiv:2403.15464  [pdf, other

    cs.CL cs.AI cs.LG cs.MA

    LLMs-based Few-Shot Disease Predictions using EHR: A Novel Approach Combining Predictive Agent Reasoning and Critical Agent Instruction

    Authors: Hejie Cui, Zhuocheng Shen, Jieyu Zhang, Hui Shao, Lianhui Qin, Joyce C. Ho, Carl Yang

    Abstract: Electronic health records (EHRs) contain valuable patient data for health-related prediction tasks, such as disease prediction. Traditional approaches rely on supervised learning methods that require large labeled datasets, which can be expensive and challenging to obtain. In this study, we investigate the feasibility of applying Large Language Models (LLMs) to convert structured patient visit dat… ▽ More

    Submitted 19 March, 2024; originally announced March 2024.

    ACM Class: J.3; I.2.7

  7. arXiv:2403.11517  [pdf, other

    q-bio.NC cs.HC

    Inter-individual and inter-site neural code conversion without shared stimuli

    Authors: Haibao Wang, Jun Kai Ho, Fan L. Cheng, Shuntaro C. Aoki, Yusuke Muraki, Misato Tanaka, Yukiyasu Kamitani

    Abstract: Inter-individual variability in fine-grained functional brain organization poses challenges for scalable data analysis and modeling. Functional alignment techniques can help mitigate these individual differences but typically require paired brain data with the same stimuli between individuals, which is often unavailable. We present a neural code conversion method that overcomes this constraint by… ▽ More

    Submitted 1 August, 2024; v1 submitted 18 March, 2024; originally announced March 2024.

  8. arXiv:2403.08818  [pdf, other

    cs.LG cs.AI cs.CL

    Multimodal Fusion of EHR in Structures and Semantics: Integrating Clinical Records and Notes with Hypergraph and LLM

    Authors: Hejie Cui, Xinyu Fang, Ran Xu, Xuan Kan, Joyce C. Ho, Carl Yang

    Abstract: Electronic Health Records (EHRs) have become increasingly popular to support clinical decision-making and healthcare in recent decades. EHRs usually contain heterogeneous information, such as structural data in tabular form and unstructured data in textual notes. Different types of information in EHRs can complement each other and provide a more complete picture of the health status of a patient.… ▽ More

    Submitted 19 February, 2024; originally announced March 2024.

  9. arXiv:2403.00815  [pdf, other

    cs.CL cs.AI cs.IR q-bio.OT

    RAM-EHR: Retrieval Augmentation Meets Clinical Predictions on Electronic Health Records

    Authors: Ran Xu, Wenqi Shi, Yue Yu, Yuchen Zhuang, Bowen Jin, May D. Wang, Joyce C. Ho, Carl Yang

    Abstract: We present RAM-EHR, a Retrieval AugMentation pipeline to improve clinical predictions on Electronic Health Records (EHRs). RAM-EHR first collects multiple knowledge sources, converts them into text format, and uses dense retrieval to obtain information related to medical concepts. This strategy addresses the difficulties associated with complex names for the concepts. RAM-EHR then augments the loc… ▽ More

    Submitted 26 July, 2024; v1 submitted 25 February, 2024; originally announced March 2024.

    Comments: ACL 2024 (Oral)

    Journal ref: ACL 2024

  10. arXiv:2402.09609  [pdf, other

    cs.CL cs.AI

    LogicPrpBank: A Corpus for Logical Implication and Equivalence

    Authors: Zhexiong Liu, Jing Zhang, Jiaying Lu, Wenjing Ma, Joyce C Ho

    Abstract: Logic reasoning has been critically needed in problem-solving and decision-making. Although Language Models (LMs) have demonstrated capabilities of handling multiple reasoning tasks (e.g., commonsense reasoning), their ability to reason complex mathematical problems, specifically propositional logic, remains largely underexplored. This lack of exploration can be attributed to the limited availabil… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

    Comments: In the 5th AI4ED Workshop, held in conjunction with The 38th AAAI Conference on Artificial Intelligence, February 2024

  11. arXiv:2401.07128  [pdf, other

    cs.CL cs.AI

    EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records

    Authors: Wenqi Shi, Ran Xu, Yuchen Zhuang, Yue Yu, Jieyu Zhang, Hang Wu, Yuanda Zhu, Joyce Ho, Carl Yang, May D. Wang

    Abstract: Large language models (LLMs) have demonstrated exceptional capabilities in planning and tool utilization as autonomous agents, but few have been developed for medical problem-solving. We propose EHRAgent, an LLM agent empowered with a code interface, to autonomously generate and execute code for multi-tabular reasoning within electronic health records (EHRs). First, we formulate an EHR question-an… ▽ More

    Submitted 19 February, 2024; v1 submitted 13 January, 2024; originally announced January 2024.

    Comments: Work in Progress

  12. arXiv:2312.05849  [pdf, other

    cs.CV cs.GR cs.MM

    InteractDiffusion: Interaction Control in Text-to-Image Diffusion Models

    Authors: Jiun Tian Hoe, Xudong Jiang, Chee Seng Chan, Yap-Peng Tan, Weipeng Hu

    Abstract: Large-scale text-to-image (T2I) diffusion models have showcased incredible capabilities in generating coherent images based on textual descriptions, enabling vast applications in content generation. While recent advancements have introduced control over factors such as object localization, posture, and image contours, a crucial gap remains in our ability to control the interactions between objects… ▽ More

    Submitted 26 February, 2024; v1 submitted 10 December, 2023; originally announced December 2023.

    Comments: Website: https://jiuntian.github.io/interactdiffusion. Accepted at CVPR2024

  13. arXiv:2312.02309  [pdf, other

    cs.AI cs.HC cs.LG

    Training Reinforcement Learning Agents and Humans With Difficulty-Conditioned Generators

    Authors: Sidney Tio, Jimmy Ho, Pradeep Varakantham

    Abstract: We adapt Parameterized Environment Response Model (PERM), a method for training both Reinforcement Learning (RL) Agents and human learners in parameterized environments by directly modeling difficulty and ability. Inspired by Item Response Theory (IRT), PERM aligns environment difficulty with individual ability, creating a Zone of Proximal Development-based curriculum. Remarkably, PERM operates wi… ▽ More

    Submitted 4 December, 2023; originally announced December 2023.

  14. arXiv:2311.00287  [pdf, other

    cs.CL cs.AI cs.LG q-bio.QM

    Knowledge-Infused Prompting: Assessing and Advancing Clinical Text Data Generation with Large Language Models

    Authors: Ran Xu, Hejie Cui, Yue Yu, Xuan Kan, Wenqi Shi, Yuchen Zhuang, Wei Jin, Joyce Ho, Carl Yang

    Abstract: Clinical natural language processing requires methods that can address domain-specific challenges, such as complex medical terminology and clinical contexts. Recently, large language models (LLMs) have shown promise in this domain. Yet, their direct deployment can lead to privacy issues and are constrained by resources. To address this challenge, we delve into synthetic clinical text generation us… ▽ More

    Submitted 1 November, 2023; originally announced November 2023.

  15. arXiv:2310.03043  [pdf, other

    cs.LG cs.AI cs.HC cs.IR

    A Deep Reinforcement Learning Approach for Interactive Search with Sentence-level Feedback

    Authors: Jianghong Zhou, Joyce C. Ho, Chen Lin, Eugene Agichtein

    Abstract: Interactive search can provide a better experience by incorporating interaction feedback from the users. This can significantly improve search accuracy as it helps avoid irrelevant information and captures the users' search intents. Existing state-of-the-art (SOTA) systems use reinforcement learning (RL) models to incorporate the interactions but focus on item-level feedback, ignoring the fine-gra… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

    Comments: 9 pages, 7 figures, DRL4IR@CIKM

  16. arXiv:2308.12325  [pdf, other

    q-bio.QM cs.LG

    Predicting Drug Solubility Using Different Machine Learning Methods -- Linear Regression Model with Extracted Chemical Features vs Graph Convolutional Neural Network

    Authors: John Ho, Zhao-Heng Yin, Colin Zhang, Nicole Guo, Yang Ha

    Abstract: Predicting the solubility of given molecules remains crucial in the pharmaceutical industry. In this study, we revisited this extensively studied topic, leveraging the capabilities of contemporary computing resources. We employed two machine learning models: a linear regression model and a graph convolutional neural network (GCNN) model, using various experimental datasets. Both methods yielded re… ▽ More

    Submitted 4 January, 2024; v1 submitted 23 August, 2023; originally announced August 2023.

    Comments: 7 pages, 4 figures, 2 tables

  17. arXiv:2308.07012  [pdf, other

    eess.SP cs.LG stat.ML

    Greedy online change point detection

    Authors: Jou-Hui Ho, Felipe Tobar

    Abstract: Standard online change point detection (CPD) methods tend to have large false discovery rates as their detections are sensitive to outliers. To overcome this drawback, we propose Greedy Online Change Point Detection (GOCPD), a computationally appealing method which finds change points by maximizing the probability of the data coming from the (temporal) concatenation of two independent models. We s… ▽ More

    Submitted 14 August, 2023; originally announced August 2023.

    Comments: Accepted at IEEE MLSP 2023

  18. arXiv:2308.02976  [pdf, ps, other

    cs.CL cs.AI cs.LG

    Spanish Pre-trained BERT Model and Evaluation Data

    Authors: José Cañete, Gabriel Chaperon, Rodrigo Fuentes, Jou-Hui Ho, Hojin Kang, Jorge Pérez

    Abstract: The Spanish language is one of the top 5 spoken languages in the world. Nevertheless, finding resources to train or evaluate Spanish language models is not an easy task. In this paper we help bridge this gap by presenting a BERT-based language model pre-trained exclusively on Spanish data. As a second contribution, we also compiled several tasks specifically for the Spanish language in a single re… ▽ More

    Submitted 5 August, 2023; originally announced August 2023.

    Comments: Published as workshop paper at Practical ML for Developing Countries Workshop @ ICLR 2020

  19. Weakly-Supervised Scientific Document Classification via Retrieval-Augmented Multi-Stage Training

    Authors: Ran Xu, Yue Yu, Joyce C. Ho, Carl Yang

    Abstract: Scientific document classification is a critical task for a wide range of applications, but the cost of obtaining massive amounts of human-labeled data can be prohibitive. To address this challenge, we propose a weakly-supervised approach for scientific document classification using label names only. In scientific domains, label names often include domain-specific concepts that may not appear in t… ▽ More

    Submitted 12 June, 2023; originally announced June 2023.

    Comments: SIGIR 2023. The code and data will be published to https://github.com/ritaranx/wander

    Journal ref: The 46th International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2023)

  20. arXiv:2306.04802  [pdf, other

    cs.AI cs.CL cs.LG cs.SI

    A Review on Knowledge Graphs for Healthcare: Resources, Applications, and Promises

    Authors: Carl Yang, Hejie Cui, Jiaying Lu, Shiyu Wang, Ran Xu, Wenjing Ma, Yue Yu, Shaojun Yu, Xuan Kan, Chen Ling, Tianfan Fu, Liang Zhao, Joyce Ho, Fei Wang

    Abstract: Healthcare knowledge graphs (HKGs) are valuable tools for organizing biomedical concepts and their relationships with interpretable structures. The recent advent of large language models (LLMs) has paved the way for building more comprehensive and accurate HKGs. This, in turn, can improve the reliability of generated content and enable better evaluation of LLMs. However, the challenges of HKGs suc… ▽ More

    Submitted 4 August, 2024; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: 21 pages, preprint submitted to ACM

  21. arXiv:2305.02691  [pdf, ps, other

    cs.LG cs.SI

    PGB: A PubMed Graph Benchmark for Heterogeneous Network Representation Learning

    Authors: Eric W Lee, Joyce C Ho

    Abstract: There has been rapid growth in biomedical literature, yet capturing the heterogeneity of the bibliographic information of these articles remains relatively understudied. Although graph mining research via heterogeneous graph neural networks has taken center stage, it remains unclear whether these approaches capture the heterogeneity of the PubMed database, a vast digital repository containing over… ▽ More

    Submitted 25 August, 2023; v1 submitted 4 May, 2023; originally announced May 2023.

  22. arXiv:2304.03748  [pdf, other

    cs.LG cs.AI physics.comp-ph physics.data-an

    Perspectives on AI Architectures and Co-design for Earth System Predictability

    Authors: Maruti K. Mudunuru, James A. Ang, Mahantesh Halappanavar, Simon D. Hammond, Maya B. Gokhale, James C. Hoe, Tushar Krishna, Sarat S. Sreepathi, Matthew R. Norman, Ivy B. Peng, Philip W. Jones

    Abstract: Recently, the U.S. Department of Energy (DOE), Office of Science, Biological and Environmental Research (BER), and Advanced Scientific Computing Research (ASCR) programs organized and held the Artificial Intelligence for Earth System Predictability (AI4ESP) workshop series. From this workshop, a critical conclusion that the DOE BER and ASCR community came to is the requirement to develop a new par… ▽ More

    Submitted 7 April, 2023; originally announced April 2023.

    Comments: 23 pages, 1 figure

  23. arXiv:2302.07669  [pdf, other

    cs.CV cs.IR

    Unsupervised Hashing with Similarity Distribution Calibration

    Authors: Kam Woh Ng, Xiatian Zhu, Jiun Tian Hoe, Chee Seng Chan, Tianyu Zhang, Yi-Zhe Song, Tao Xiang

    Abstract: Unsupervised hashing methods typically aim to preserve the similarity between data points in a feature space by mapping them to binary hash codes. However, these methods often overlook the fact that the similarity between data points in the continuous feature space may not be preserved in the discrete hash code space, due to the limited similarity range of hash codes. The similarity range is bound… ▽ More

    Submitted 31 August, 2023; v1 submitted 15 February, 2023; originally announced February 2023.

    Comments: BMVC 2023

  24. arXiv:2302.04355  [pdf, other

    cs.LG cs.AI cs.CR

    MedDiff: Generating Electronic Health Records using Accelerated Denoising Diffusion Model

    Authors: Huan He, Shifan Zhao, Yuanzhe Xi, Joyce C Ho

    Abstract: Due to patient privacy protection concerns, machine learning research in healthcare has been undeniably slower and limited than in other application domains. High-quality, realistic, synthetic electronic health records (EHRs) can be leveraged to accelerate methodological developments for research purposes while mitigating privacy concerns associated with data sharing. The current state-of-the-art… ▽ More

    Submitted 8 February, 2023; originally announced February 2023.

    Comments: 12 pages

  25. arXiv:2301.09443  [pdf, other

    cs.CE

    Probabilistic Machine Learning to Improve Generalisation of Data-Driven Turbulence Modelling

    Authors: Joel Ho, Nick Pepper, Tim Dodwell

    Abstract: A probabilistic machine learning model is introduced to augment the $k-ω SST$ turbulence model in order to improve the modelling of separated flows and the generalisability of learnt corrections. Increasingly, machine learning methods have been used to leverage experimental and high-fidelity data, improving the accuracy of the Reynolds Averaged Navier Stokes (RANS) turbulence models widely used in… ▽ More

    Submitted 23 January, 2023; originally announced January 2023.

  26. arXiv:2301.03726  [pdf, other

    cs.LG cs.CL

    Neighborhood-Regularized Self-Training for Learning with Few Labels

    Authors: Ran Xu, Yue Yu, Hejie Cui, Xuan Kan, Yanqiao Zhu, Joyce Ho, Chao Zhang, Carl Yang

    Abstract: Training deep neural networks (DNNs) with limited supervision has been a popular research topic as it can significantly alleviate the annotation burden. Self-training has been successfully applied in semi-supervised learning tasks, but one drawback of self-training is that it is vulnerable to the label noise from incorrect pseudo labels. Inspired by the fact that samples with similar labels tend t… ▽ More

    Submitted 15 February, 2023; v1 submitted 9 January, 2023; originally announced January 2023.

    Comments: Accepted to AAAI 2023

  27. arXiv:2211.07590  [pdf, other

    cs.CV

    Stain-invariant self supervised learning for histopathology image analysis

    Authors: Alexandre Tiard, Alex Wong, David Joon Ho, Yangchao Wu, Eliram Nof, Alvin C. Goh, Stefano Soatto, Saad Nadeem

    Abstract: We present a self-supervised algorithm for several classification tasks within hematoxylin and eosin (H&E) stained images of breast cancer. Our method is robust to stain variations inherent to the histology images acquisition process, which has limited the applicability of automated analysis tools. We address this problem by imposing constraints a learnt latent space which leverages stain normaliz… ▽ More

    Submitted 7 September, 2023; v1 submitted 14 November, 2022; originally announced November 2022.

  28. arXiv:2210.12573  [pdf, other

    cs.LG math.OC

    An Efficient Nonlinear Acceleration method that Exploits Symmetry of the Hessian

    Authors: Huan He, Shifan Zhao, Ziyuan Tang, Joyce C Ho, Yousef Saad, Yuanzhe Xi

    Abstract: Nonlinear acceleration methods are powerful techniques to speed up fixed-point iterations. However, many acceleration methods require storing a large number of previous iterates and this can become impractical if computational resources are limited. In this paper, we propose a nonlinear Truncated Generalized Conjugate Residual method (nlTGCR) whose goal is to exploit the symmetry of the Hessian to… ▽ More

    Submitted 22 October, 2022; originally announced October 2022.

    Comments: Optimization, Short-term recurrence method by exploiting Hessian, Numerical Analysis, Iterative Method, Quasi-Newton, Anderson Acceleration, 31 pages

  29. arXiv:2210.09773  [pdf, other

    cs.CL cs.AI

    Retrofitting Multilingual Sentence Embeddings with Abstract Meaning Representation

    Authors: Deng Cai, Xin Li, Jackie Chun-Sing Ho, Lidong Bing, Wai Lam

    Abstract: We introduce a new method to improve existing multilingual sentence embeddings with Abstract Meaning Representation (AMR). Compared with the original textual input, AMR is a structured semantic representation that presents the core concepts and relations in a sentence explicitly and unambiguously. It also helps reduce surface variations across different expressions and languages. Unlike most prior… ▽ More

    Submitted 18 October, 2022; originally announced October 2022.

    Comments: EMNLP2022

  30. arXiv:2210.04628  [pdf, other

    cs.CV cs.GR cs.LG

    Novel View Synthesis with Diffusion Models

    Authors: Daniel Watson, William Chan, Ricardo Martin-Brualla, Jonathan Ho, Andrea Tagliasacchi, Mohammad Norouzi

    Abstract: We present 3DiM, a diffusion model for 3D novel view synthesis, which is able to translate a single input view into consistent and sharp completions across many views. The core component of 3DiM is a pose-conditional image-to-image diffusion model, which takes a source view and its pose as inputs, and generates a novel view for a target pose as output. 3DiM can generate multiple views that are 3D… ▽ More

    Submitted 6 October, 2022; originally announced October 2022.

  31. arXiv:2210.03142  [pdf, other

    cs.CV cs.AI cs.LG

    On Distillation of Guided Diffusion Models

    Authors: Chenlin Meng, Robin Rombach, Ruiqi Gao, Diederik P. Kingma, Stefano Ermon, Jonathan Ho, Tim Salimans

    Abstract: Classifier-free guided diffusion models have recently been shown to be highly effective at high-resolution image generation, and they have been widely used in large-scale diffusion frameworks including DALLE-2, Stable Diffusion and Imagen. However, a downside of classifier-free guided diffusion models is that they are computationally expensive at inference time since they require evaluating two di… ▽ More

    Submitted 12 April, 2023; v1 submitted 6 October, 2022; originally announced October 2022.

    Comments: CVPR 2023, Award candidate

  32. arXiv:2210.02303  [pdf, other

    cs.CV cs.LG

    Imagen Video: High Definition Video Generation with Diffusion Models

    Authors: Jonathan Ho, William Chan, Chitwan Saharia, Jay Whang, Ruiqi Gao, Alexey Gritsenko, Diederik P. Kingma, Ben Poole, Mohammad Norouzi, David J. Fleet, Tim Salimans

    Abstract: We present Imagen Video, a text-conditional video generation system based on a cascade of video diffusion models. Given a text prompt, Imagen Video generates high definition videos using a base video generation model and a sequence of interleaved spatial and temporal video super-resolution models. We describe how we scale up the system as a high definition text-to-video model including design deci… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: See accompanying website: https://imagen.research.google/video/

  33. arXiv:2208.04910  [pdf

    cs.CV

    Deep Learning-Based Objective and Reproducible Osteosarcoma Chemotherapy Response Assessment and Outcome Prediction

    Authors: David Joon Ho, Narasimhan P. Agaram, Marc-Henri Jean, Stephanie D. Suser, Cynthia Chu, Chad M. Vanderbilt, Paul A. Meyers, Leonard H. Wexler, John H. Healey, Thomas J. Fuchs, Meera R. Hameed

    Abstract: Osteosarcoma is the most common primary bone cancer whose standard treatment includes pre-operative chemotherapy followed by resection. Chemotherapy response is used for predicting prognosis and further management of patients. Necrosis is routinely assessed post-chemotherapy from histology slides on resection specimens where necrosis ratio is defined as the ratio of necrotic tumor to overall tumor… ▽ More

    Submitted 9 August, 2022; originally announced August 2022.

  34. arXiv:2208.00993  [pdf, ps, other

    cs.LG

    MULTIPAR: Supervised Irregular Tensor Factorization with Multi-task Learning

    Authors: Yifei Ren, Jian Lou, Li Xiong, Joyce C Ho, Xiaoqian Jiang, Sivasubramanium Bhavani

    Abstract: Tensor factorization has received increasing interest due to its intrinsic ability to capture latent factors in multi-dimensional data with many applications such as recommender systems and Electronic Health Records (EHR) mining. PARAFAC2 and its variants have been proposed to address irregular tensors where one of the tensor modes is not aligned, e.g., different users in recommender systems or pa… ▽ More

    Submitted 9 August, 2022; v1 submitted 1 August, 2022; originally announced August 2022.

  35. arXiv:2207.12598  [pdf, other

    cs.LG cs.AI

    Classifier-Free Diffusion Guidance

    Authors: Jonathan Ho, Tim Salimans

    Abstract: Classifier guidance is a recently introduced method to trade off mode coverage and sample fidelity in conditional diffusion models post training, in the same spirit as low temperature sampling or truncation in other types of generative models. Classifier guidance combines the score estimate of a diffusion model with the gradient of an image classifier and thereby requires training an image classif… ▽ More

    Submitted 25 July, 2022; originally announced July 2022.

    Comments: A short version of this paper appeared in the NeurIPS 2021 Workshop on Deep Generative Models and Downstream Applications: https://openreview.net/pdf?id=qw8AKxfYbI

  36. arXiv:2207.09370  [pdf, other

    cs.LG cs.CY cs.DB q-bio.QM stat.AP

    Data-Centric Epidemic Forecasting: A Survey

    Authors: Alexander Rodríguez, Harshavardhan Kamarthi, Pulak Agarwal, Javen Ho, Mira Patel, Suchet Sapre, B. Aditya Prakash

    Abstract: The COVID-19 pandemic has brought forth the importance of epidemic forecasting for decision makers in multiple domains, ranging from public health to the economy as a whole. While forecasting epidemic progression is frequently conceptualized as being analogous to weather forecasting, however it has some key differences and remains a non-trivial task. The spread of diseases is subject to multiple c… ▽ More

    Submitted 20 July, 2022; v1 submitted 19 July, 2022; originally announced July 2022.

    Comments: 67 pages, 12 figures

  37. arXiv:2207.05381  [pdf, ps, other

    cs.IT

    Deriving RIP sensing matrices for sparsifying dictionaries

    Authors: Jinn Ho, Wen-Liang Hwang

    Abstract: Compressive sensing involves the inversion of a mapping $SD \in \mathbb{R}^{m \times n}$, where $m < n$, $S$ is a sensing matrix, and $D$ is a sparisfying dictionary. The restricted isometry property is a powerful sufficient condition for the inversion that guarantees the recovery of high-dimensional sparse vectors from their low-dimensional embedding into a Euclidean space via convex optimization… ▽ More

    Submitted 12 July, 2022; originally announced July 2022.

  38. arXiv:2205.11487  [pdf, other

    cs.CV cs.LG

    Photorealistic Text-to-Image Diffusion Models with Deep Language Understanding

    Authors: Chitwan Saharia, William Chan, Saurabh Saxena, Lala Li, Jay Whang, Emily Denton, Seyed Kamyar Seyed Ghasemipour, Burcu Karagol Ayan, S. Sara Mahdavi, Rapha Gontijo Lopes, Tim Salimans, Jonathan Ho, David J Fleet, Mohammad Norouzi

    Abstract: We present Imagen, a text-to-image diffusion model with an unprecedented degree of photorealism and a deep level of language understanding. Imagen builds on the power of large transformer language models in understanding text and hinges on the strength of diffusion models in high-fidelity image generation. Our key discovery is that generic large language models (e.g. T5), pretrained on text-only c… ▽ More

    Submitted 23 May, 2022; originally announced May 2022.

  39. arXiv:2205.02752   

    cs.LG

    A collection of invited non-archival papers for the Conference on Health, Inference, and Learning (CHIL) 2022

    Authors: Gerardo Flores, George H. Chen, Tom Pollard, Joyce C. Ho, Tristan Naumann

    Abstract: A collection of invited non-archival papers for the Conference on Health, Inference, and Learning (CHIL) 2022. This index is incomplete as some authors of invited non-archival presentations opted not to include their papers in this index.

    Submitted 28 March, 2022; originally announced May 2022.

  40. arXiv:2204.03458  [pdf, other

    cs.CV cs.AI cs.LG

    Video Diffusion Models

    Authors: Jonathan Ho, Tim Salimans, Alexey Gritsenko, William Chan, Mohammad Norouzi, David J. Fleet

    Abstract: Generating temporally coherent high fidelity video is an important milestone in generative modeling research. We make progress towards this milestone by proposing a diffusion model for video generation that shows very promising initial results. Our model is a natural extension of the standard image diffusion architecture, and it enables jointly training from image and video data, which we find to… ▽ More

    Submitted 22 June, 2022; v1 submitted 7 April, 2022; originally announced April 2022.

  41. arXiv:2203.15015  [pdf, other

    eess.IV cs.CV

    Deep Interactive Learning-based ovarian cancer segmentation of H&E-stained whole slide images to study morphological patterns of BRCA mutation

    Authors: David Joon Ho, M. Herman Chui, Chad M. Vanderbilt, Jiwon Jung, Mark E. Robson, Chan-Sik Park, Jin Roh, Thomas J. Fuchs

    Abstract: Deep learning has been widely used to analyze digitized hematoxylin and eosin (H&E)-stained histopathology whole slide images. Automated cancer segmentation using deep learning can be used to diagnose malignancy and to find novel morphological patterns to predict molecular subtypes. To train pixel-wise cancer segmentation models, manual annotation from pathologists is generally a bottleneck due to… ▽ More

    Submitted 28 March, 2022; originally announced March 2022.

  42. arXiv:2202.05830  [pdf, other

    cs.LG

    Learning Fast Samplers for Diffusion Models by Differentiating Through Sample Quality

    Authors: Daniel Watson, William Chan, Jonathan Ho, Mohammad Norouzi

    Abstract: Diffusion models have emerged as an expressive family of generative models rivaling GANs in sample quality and autoregressive models in likelihood scores. Standard diffusion models typically require hundreds of forward passes through the model to generate a single high-fidelity sample. We introduce Differentiable Diffusion Sampler Search (DDSS): a method that optimizes fast samplers for any pre-tr… ▽ More

    Submitted 11 February, 2022; originally announced February 2022.

    Comments: Published as a conference paper at ICLR 2022

  43. arXiv:2202.00512  [pdf, other

    cs.LG cs.AI stat.ML

    Progressive Distillation for Fast Sampling of Diffusion Models

    Authors: Tim Salimans, Jonathan Ho

    Abstract: Diffusion models have recently shown great promise for generative modeling, outperforming GANs on perceptual quality and autoregressive models at density estimation. A remaining downside is their slow sampling time: generating high quality samples takes many hundreds or thousands of model evaluations. Here we make two contributions to help eliminate this downside: First, we present new parameteriz… ▽ More

    Submitted 7 June, 2022; v1 submitted 1 February, 2022; originally announced February 2022.

    Comments: Published as a conference paper at ICLR 2022

  44. arXiv:2111.05826  [pdf, other

    cs.CV cs.LG

    Palette: Image-to-Image Diffusion Models

    Authors: Chitwan Saharia, William Chan, Huiwen Chang, Chris A. Lee, Jonathan Ho, Tim Salimans, David J. Fleet, Mohammad Norouzi

    Abstract: This paper develops a unified framework for image-to-image translation based on conditional diffusion models and evaluates this framework on four challenging image-to-image translation tasks, namely colorization, inpainting, uncropping, and JPEG restoration. Our simple implementation of image-to-image diffusion models outperforms strong GAN and regression baselines on all tasks, without task-speci… ▽ More

    Submitted 3 May, 2022; v1 submitted 10 November, 2021; originally announced November 2021.

  45. arXiv:2111.04563  [pdf, other

    cs.NI cs.AR

    A Roadmap for Enabling a Future-Proof In-Network Computing Data Plane Ecosystem

    Authors: Daehyeok Kim, Nikita Lazarev, Tommy Tracy, Farzana Siddique, Hun Namkung, James C. Hoe, Vyas Sekar, Kevin Skadron, Zhiru Zhang, Srinivasan Seshan

    Abstract: As the vision of in-network computing becomes more mature, we see two parallel evolutionary trends. First, we see the evolution of richer, more demanding applications that require capabilities beyond programmable switching ASICs. Second, we see the evolution of diverse data plane technologies with many other future capabilities on the horizon. While some point solutions exist to tackle the interse… ▽ More

    Submitted 8 November, 2021; originally announced November 2021.

    Comments: 6 pages, 3 figures

  46. arXiv:2110.02457  [pdf, other

    cs.LG math.NA

    GDA-AM: On the effectiveness of solving minimax optimization via Anderson Acceleration

    Authors: Huan He, Shifan Zhao, Yuanzhe Xi, Joyce C Ho, Yousef Saad

    Abstract: Many modern machine learning algorithms such as generative adversarial networks (GANs) and adversarial training can be formulated as minimax optimization. Gradient descent ascent (GDA) is the most commonly used algorithm due to its simplicity. However, GDA can converge to non-optimal minimax points. We propose a new minimax optimization framework, GDA-AM, that views the GDAdynamics as a fixed-poin… ▽ More

    Submitted 29 June, 2022; v1 submitted 5 October, 2021; originally announced October 2021.

    Comments: 31 Pages, ICLR, minimax, Anderson Acceleration

  47. arXiv:2109.15196  [pdf, other

    cs.CL cs.AI

    Multilingual AMR Parsing with Noisy Knowledge Distillation

    Authors: Deng Cai, Xin Li, Jackie Chun-Sing Ho, Lidong Bing, Wai Lam

    Abstract: We study multilingual AMR parsing from the perspective of knowledge distillation, where the aim is to learn and improve a multilingual AMR parser by using an existing English parser as its teacher. We constrain our exploration in a strict multilingual setting: there is but one model to parse all different languages including English. We identify that noisy input and precise output are the key to s… ▽ More

    Submitted 13 October, 2021; v1 submitted 30 September, 2021; originally announced September 2021.

    Comments: EMNLP21 (findings)

  48. arXiv:2109.14449  [pdf, other

    cs.CV cs.LG

    One Loss for All: Deep Hashing with a Single Cosine Similarity based Learning Objective

    Authors: Jiun Tian Hoe, Kam Woh Ng, Tianyu Zhang, Chee Seng Chan, Yi-Zhe Song, Tao Xiang

    Abstract: A deep hashing model typically has two main learning objectives: to make the learned binary hash codes discriminative and to minimize a quantization error. With further constraints such as bit balance and code orthogonality, it is not uncommon for existing models to employ a large number (>4) of losses. This leads to difficulties in model training and subsequently impedes their effectiveness. In t… ▽ More

    Submitted 29 September, 2021; originally announced September 2021.

    Comments: Accepted at NeurIPS 2021

  49. arXiv:2109.01718  [pdf, ps, other

    cs.LG cs.DC

    Communication Efficient Generalized Tensor Factorization for Decentralized Healthcare Networks

    Authors: Jing Ma, Qiuchen Zhang, Jian Lou, Li Xiong, Sivasubramanium Bhavani, Joyce C. Ho

    Abstract: Tensor factorization has been proved as an efficient unsupervised learning approach for health data analysis, especially for computational phenotyping, where the high-dimensional Electronic Health Records (EHRs) with patients' history of medical procedures, medications, diagnosis, lab tests, etc., are converted to meaningful and interpretable medical concepts. Federated tensor factorization distri… ▽ More

    Submitted 3 November, 2022; v1 submitted 3 September, 2021; originally announced September 2021.

    Comments: Short version accepted to IEEE ICDM 2021

  50. Temporal Network Embedding via Tensor Factorization

    Authors: Jing Ma, Qiuchen Zhang, Jian Lou, Li Xiong, Joyce C. Ho

    Abstract: Representation learning on static graph-structured data has shown a significant impact on many real-world applications. However, less attention has been paid to the evolving nature of temporal networks, in which the edges are often changing over time. The embeddings of such temporal networks should encode both graph-structured information and the temporally evolving pattern. Existing approaches in… ▽ More

    Submitted 22 August, 2021; originally announced August 2021.

    Comments: To appear in CIKM 2021