Showing 1–50 of 55 results for author: Clifton, A

Searching in archive cs.
  1. FE-Adapter: Adapting Image-based Emotion Classifiers to Videos

    Authors: Shreyank N Gowda, Boyan Gao, David A. Clifton

    Abstract: Utilizing large pre-trained models for specific tasks has yielded impressive results. However, fully fine-tuning these increasingly large models is becoming prohibitively resource-intensive. This has led to a focus on more parameter-efficient transfer learning, primarily within the same modality. But this approach has limitations, particularly in video understanding where suitable pre-trained mode…

    Submitted 5 August, 2024; originally announced August 2024.

  2. arXiv:2408.00181  [pdf, other]

    cs.CV

    CC-SAM: SAM with Cross-feature Attention and Context for Ultrasound Image Segmentation

    Authors: Shreyank N Gowda, David A. Clifton

    Abstract: The Segment Anything Model (SAM) has achieved remarkable successes in the realm of natural image segmentation, but its deployment in the medical imaging sphere has encountered challenges. Specifically, the model struggles with medical images that feature low contrast, faint boundaries, intricate morphologies, and small-sized objects. To address these challenges and enhance SAM's performance in the…

    Submitted 31 July, 2024; originally announced August 2024.

    Comments: Accepted to ECCV 2024

  3. arXiv:2407.16264  [pdf, other]

    cs.CV

    Masks and Manuscripts: Advancing Medical Pre-training with End-to-End Masking and Narrative Structuring

    Authors: Shreyank N Gowda, David A. Clifton

    Abstract: Contemporary medical contrastive learning faces challenges from inconsistent semantics and sample pair morphology, leading to dispersed and converging semantic shifts. The variability in text reports, due to multiple authors, complicates semantic consistency. To tackle these issues, we propose a two-step approach. Initially, text reports are converted into a standardized triplet format, laying the…

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: Accepted in MICCAI-24

  4. arXiv:2407.10086  [pdf, other]

    cs.CL cs.AI

    Rapid Biomedical Research Classification: The Pandemic PACT Advanced Categorisation Engine

    Authors: Omid Rohanian, Mohammadmahdi Nouriborji, Olena Seminog, Rodrigo Furst, Thomas Mendy, Shanthi Levanita, Zaharat Kadri-Alabi, Nusrat Jabin, Daniela Toale, Georgina Humphreys, Emilia Antonio, Adrian Bucher, Alice Norton, David A. Clifton

    Abstract: This paper introduces the Pandemic PACT Advanced Categorisation Engine (PPACE) along with its associated dataset. PPACE is a fine-tuned model developed to automatically classify research abstracts from funded biomedical projects according to WHO-aligned research priorities. This task is crucial for monitoring research trends and identifying gaps in global health preparedness and response. Our appr…

    Submitted 19 July, 2024; v1 submitted 14 July, 2024; originally announced July 2024.

    MSC Class: 68T50; ACM Class: I.2.7

  5. arXiv:2407.04752  [pdf, other]

    cs.LG cs.CL cs.NE

    SpikeLLM: Scaling up Spiking Neural Network to Large Language Models via Saliency-based Spiking

    Authors: Xingrun Xing, Boyan Gao, Zheng Zhang, David A. Clifton, Shitao Xiao, Li Du, Guoqi Li, Jiajun Zhang

    Abstract: The recent advancements in large language models (LLMs) with billions of parameters have significantly boosted their performance across various real-world applications. However, the inference processes for these models require substantial energy and computational resources, presenting considerable deployment challenges. In contrast, human brains, which contain approximately 86 billion biological n…

    Submitted 5 July, 2024; originally announced July 2024.

  6. arXiv:2406.14377  [pdf, other]

    cs.LG cs.AI

    Computation-Efficient Semi-Supervised Learning for ECG-based Cardiovascular Diseases Detection

    Authors: Rushuang Zhou, Zijun Liu, Lei Clifton, David A. Clifton, Kannie W. Y. Chan, Yuan-Ting Zhang, Yining Dong

    Abstract: The label scarcity problem is the main challenge that hinders the wide application of deep learning systems in automatic cardiovascular disease (CVD) detection using electrocardiography (ECG). Tuning pre-trained models alleviates this problem by transferring knowledge learned from large datasets to downstream small datasets. However, bottlenecks in computational efficiency and CVDs detection perform…

    Submitted 20 June, 2024; originally announced June 2024.

  7. arXiv:2405.07841  [pdf, other]

    cs.LG

    Sample Selection Bias in Machine Learning for Healthcare

    Authors: Vinod Kumar Chauhan, Lei Clifton, Achille Salaün, Huiqi Yvonne Lu, Kim Branson, Patrick Schwab, Gaurav Nigam, David A. Clifton

    Abstract: While machine learning algorithms hold promise for personalised medicine, their clinical adoption remains limited. One critical factor contributing to this restraint is sample selection bias (SSB) which refers to the study population being less representative of the target population, leading to biased and potentially harmful decisions. Despite being well-known in the literature, SSB remains scarc…

    Submitted 13 May, 2024; originally announced May 2024.

    Comments: 20 pages and 11 figures (under review)

  8. arXiv:2405.05195  [pdf, ps, other]

    math.CO cs.DM

    Trail Trap: a variant of Partizan Edge Geography

    Authors: Calum Buchanan, MacKenzie Carr, Alexander Clifton, Stephen G. Hartke, Vesna Iršič, Nicholas Sieger, Rebecca Whitman

    Abstract: We study a two-player game played on undirected graphs called Trail Trap, which is a variant of a game known as Partizan Edge Geography. One player starts by choosing any edge and moving a token from one endpoint to the other; the other player then chooses a different edge and does the same. Alternating turns, each player moves their token along an unused edge from its current vertex to an adjacen…

    Submitted 9 May, 2024; v1 submitted 8 May, 2024; originally announced May 2024.

    Comments: 21 pages, 8 figures, 1 table

    MSC Class: 91A43 (05C57; 68Q17)

  9. arXiv:2405.00716  [pdf, other]

    cs.CL cs.AI

    Large Language Models in the Clinic: A Comprehensive Benchmark

    Authors: Andrew Liu, Hongjian Zhou, Yining Hua, Omid Rohanian, Anshul Thakur, Lei Clifton, David A. Clifton

    Abstract: The adoption of large language models (LLMs) to assist clinicians has attracted remarkable attention. Existing works mainly adopt the close-ended question-answering (QA) task with answer options for evaluation. However, many clinical decisions involve answering open-ended questions without pre-set options. To better understand LLMs in the clinic, we construct a benchmark ClinicBench. We first coll…

    Submitted 26 June, 2024; v1 submitted 25 April, 2024; originally announced May 2024.

  10. arXiv:2401.00579  [pdf, other]

    cs.CL cs.AI cs.LG

    Exploring the Effectiveness of Instruction Tuning in Biomedical Language Processing

    Authors: Omid Rohanian, Mohammadmahdi Nouriborji, David A. Clifton

    Abstract: Large Language Models (LLMs), particularly those similar to ChatGPT, have significantly influenced the field of Natural Language Processing (NLP). While these models excel in general language tasks, their performance in domain-specific downstream tasks such as biomedical and clinical Named Entity Recognition (NER), Relation Extraction (RE), and Medical Natural Language Inference (NLI) is still evo…

    Submitted 31 December, 2023; originally announced January 2024.

    MSC Class: 68T50; ACM Class: I.2.7

  11. arXiv:2311.05112  [pdf]

    cs.CL cs.AI

    A Survey of Large Language Models in Medicine: Progress, Application, and Challenge

    Authors: Hongjian Zhou, Fenglin Liu, Boyang Gu, Xinyu Zou, Jinfa Huang, Jinge Wu, Yiru Li, Sam S. Chen, Peilin Zhou, Junling Liu, Yining Hua, Chengfeng Mao, Chenyu You, Xian Wu, Yefeng Zheng, Lei Clifton, Zheng Li, Jiebo Luo, David A. Clifton

    Abstract: Large language models (LLMs), such as ChatGPT, have received substantial attention due to their capabilities for understanding and generating human language. While there has been a burgeoning trend in research focusing on the employment of LLMs in supporting different medical tasks (e.g., enhancing clinical diagnostics and providing medical education), a review of these efforts, particularly their…

    Submitted 22 July, 2024; v1 submitted 8 November, 2023; originally announced November 2023.

    Comments: Preprint. Version 6. Update Figures 1-5; Tables 2-3; 31 pages

  12. arXiv:2309.00810  [pdf, other]

    cs.CV cs.AI

    RenAIssance: A Survey into AI Text-to-Image Generation in the Era of Large Model

    Authors: Fengxiang Bie, Yibo Yang, Zhongzhu Zhou, Adam Ghanem, Minjia Zhang, Zhewei Yao, Xiaoxia Wu, Connor Holmes, Pareesa Golnari, David A. Clifton, Yuxiong He, Dacheng Tao, Shuaiwen Leon Song

    Abstract: Text-to-image generation (TTI) refers to the use of models that can process text input and generate high-fidelity images based on text descriptions. Text-to-image generation using neural networks can be traced back to the emergence of the Generative Adversarial Network (GAN), followed by the autoregressive Transformer. Diffusion models are one prominent type of generative model used for the genera…

    Submitted 1 September, 2023; originally announced September 2023.

  13. arXiv:2306.10494  [pdf, other]

    eess.SP cs.AI

    Semi-Supervised Learning for Multi-Label Cardiovascular Diseases Prediction: A Multi-Dataset Study

    Authors: Rushuang Zhou, Lei Lu, Zijun Liu, Ting Xiang, Zhen Liang, David A. Clifton, Yining Dong, Yuan-Ting Zhang

    Abstract: Electrocardiography (ECG) is a non-invasive tool for predicting cardiovascular diseases (CVDs). Current ECG-based diagnosis systems show promising performance owing to the rapid development of deep learning techniques. However, the label scarcity problem, the co-occurrence of multiple CVDs and the poor performance on unseen datasets greatly hinder the widespread application of deep learning-based…

    Submitted 18 June, 2023; originally announced June 2023.

  14. A Brief Review of Hypernetworks in Deep Learning

    Authors: Vinod Kumar Chauhan, Jiandong Zhou, Ping Lu, Soheila Molaei, David A. Clifton

    Abstract: Hypernetworks, or hypernets for short, are neural networks that generate weights for another neural network, known as the target network. They have emerged as a powerful deep learning technique that allows for greater flexibility, adaptability, dynamism, faster training, information sharing, and model compression. Hypernets have shown promising results in a variety of deep learning problems, inclu…

    Submitted 13 July, 2024; v1 submitted 12 June, 2023; originally announced June 2023.

    Comments: 2 figures and 2 tables -- Accepted to Artificial Intelligence Review

  15. arXiv:2305.15984  [pdf, other]

    cs.LG stat.ME

    Dynamic Inter-treatment Information Sharing for Individualized Treatment Effects Estimation

    Authors: Vinod Kumar Chauhan, Jiandong Zhou, Ghadeer Ghosheh, Soheila Molaei, David A. Clifton

    Abstract: Estimation of individualized treatment effects (ITE) from observational studies is a fundamental problem in causal inference and holds significant importance across domains, including healthcare. However, limited observational datasets pose challenges in reliable ITE estimation as data have to be split among treatment groups to train an ITE learner. While information sharing among treatment groups…

    Submitted 12 February, 2024; v1 submitted 25 May, 2023; originally announced May 2023.

    Comments: accepted to The 27th International Conference on Artificial Intelligence and Statistics (AISTATS) 2024

  16. arXiv:2305.03711  [pdf, other]

    cs.LG cs.CY

    Medical records condensation: a roadmap towards healthcare data democratisation

    Authors: Yujiang Wang, Anshul Thakur, Mingzhi Dong, Pingchuan Ma, Stavros Petridis, Li Shang, Tingting Zhu, David A. Clifton

    Abstract: The prevalence of artificial intelligence (AI) has envisioned an era of healthcare democratisation that promises every stakeholder a new and better way of life. However, the advancement of clinical AI research is significantly hurdled by the dearth of data democratisation in healthcare. To truly democratise data for AI studies, challenges are two-fold: 1. the sensitive information in clinical data…

    Submitted 8 January, 2024; v1 submitted 5 May, 2023; originally announced May 2023.

  17. arXiv:2305.03710  [pdf, other]

    cs.LG cs.CR

    Data Encoding For Healthcare Data Democratisation and Information Leakage Prevention

    Authors: Anshul Thakur, Tingting Zhu, Vinayak Abrol, Jacob Armstrong, Yujiang Wang, David A. Clifton

    Abstract: The lack of data democratization and information leakage from trained models hinder the development and acceptance of robust deep learning-based healthcare solutions. This paper argues that irreversible data encoding can provide an effective solution to achieve data democratization without violating the privacy constraints imposed on healthcare data and clinical models. An ideal encoding framework…

    Submitted 5 May, 2023; originally announced May 2023.

  18. arXiv:2303.06458  [pdf, other]

    cs.CL cs.AI cs.CV

    ZeroNLG: Aligning and Autoencoding Domains for Zero-Shot Multimodal and Multilingual Natural Language Generation

    Authors: Bang Yang, Fenglin Liu, Yuexian Zou, Xian Wu, Yaowei Wang, David A. Clifton

    Abstract: Natural Language Generation (NLG) accepts input data in the form of images, videos, or text and generates corresponding natural language text as output. Existing NLG methods mainly adopt a supervised approach and rely heavily on coupled data-to-text pairs. However, for many targeted scenarios and for non-English languages, sufficient quantities of labeled data are often not available. To relax the…

    Submitted 3 June, 2024; v1 submitted 11 March, 2023; originally announced March 2023.

    Comments: Accepted by TPAMI (Our code and data are available at https://github.com/yangbang18/ZeroNLG)

  19. arXiv:2302.14679  [pdf, other]

    cs.LG cs.CL

    Synthesizing Mixed-type Electronic Health Records using Diffusion Models

    Authors: Taha Ceritli, Ghadeer O. Ghosheh, Vinod Kumar Chauhan, Tingting Zhu, Andrew P. Creagh, David A. Clifton

    Abstract: Electronic Health Records (EHRs) contain sensitive patient information, which presents privacy concerns when sharing such data. Synthetic data generation is a promising solution to mitigate these risks, often relying on deep generative models such as Generative Adversarial Networks (GANs). However, recent studies have shown that diffusion models offer several advantages over GANs, such as generati…

    Submitted 10 August, 2023; v1 submitted 28 February, 2023; originally announced February 2023.

    Comments: Page 2, Figure 1 is updated

  20. arXiv:2302.04725  [pdf, other]

    cs.CL cs.AI cs.LG

    Lightweight Transformers for Clinical Natural Language Processing

    Authors: Omid Rohanian, Mohammadmahdi Nouriborji, Hannah Jauncey, Samaneh Kouchaki, ISARIC Clinical Characterisation Group, Lei Clifton, Laura Merson, David A. Clifton

    Abstract: Specialised pre-trained language models are becoming more frequent in NLP since they can potentially outperform models trained on generic texts. BioBERT and BioClinicalBERT are two examples of such models that have shown promise in medical NLP tasks. Many of these models are overparametrised and resource-intensive, but thanks to techniques like Knowledge Distillation (KD), it is possible to create…

    Submitted 9 February, 2023; originally announced February 2023.

    MSC Class: 68T50; ACM Class: I.2.7

  21. arXiv:2302.01735  [pdf, other]

    cs.CV cs.AI cs.LG eess.IV

    Rethinking Semi-Supervised Medical Image Segmentation: A Variance-Reduction Perspective

    Authors: Chenyu You, Weicheng Dai, Yifei Min, Fenglin Liu, David A. Clifton, S Kevin Zhou, Lawrence Hamilton Staib, James S Duncan

    Abstract: For medical image segmentation, contrastive learning is the dominant practice to improve the quality of visual representations by contrasting semantically similar and dissimilar pairs of samples. This is enabled by the observation that without accessing ground truth labels, negative examples with truly dissimilar anatomical features, if sampled, can significantly improve the performance. In realit…

    Submitted 23 October, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: Accepted by Advances in Neural Information Processing Systems (NeurIPS 2023)

  22. arXiv:2211.11427  [pdf, other]

    cs.CV

    Expectation-Maximization Contrastive Learning for Compact Video-and-Language Representations

    Authors: Peng Jin, Jinfa Huang, Fenglin Liu, Xian Wu, Shen Ge, Guoli Song, David A. Clifton, Jie Chen

    Abstract: Most video-and-language representation learning approaches employ contrastive learning, e.g., CLIP, to project the video and text features into a common latent space according to the semantic similarities of text-video pairs. However, such learned shared latent spaces are often not optimal, and the modality gap between visual and textual representation cannot be fully eliminated. In this paper, w…

    Submitted 21 November, 2022; originally announced November 2022.

    Comments: Accepted to NeurIPS 2022

  23. arXiv:2210.12777  [pdf, other]

    cs.CL cs.LG

    Retrieval-Augmented and Knowledge-Grounded Language Models for Faithful Clinical Medicine

    Authors: Fenglin Liu, Bang Yang, Chenyu You, Xian Wu, Shen Ge, Zhangdaihong Liu, Xu Sun, Yang Yang, David A. Clifton

    Abstract: Language models (LMs), including large language models (such as ChatGPT), have the potential to assist clinicians in generating various clinical notes. However, LMs are prone to produce "hallucinations", i.e., generated content that is not aligned with facts and knowledge. In this paper, we propose the Re$^3$Writer method with retrieval-augmented generation and knowledge-grounded reasoning to en…

    Submitted 21 July, 2024; v1 submitted 23 October, 2022; originally announced October 2022.

  24. arXiv:2210.10530  [pdf, other]

    cs.LG cs.AI stat.ME

    Adversarial De-confounding in Individualised Treatment Effects Estimation

    Authors: Vinod Kumar Chauhan, Soheila Molaei, Marzia Hoque Tania, Anshul Thakur, Tingting Zhu, David A. Clifton

    Abstract: Observational studies have recently received significant attention from the machine learning community due to the increasingly available non-experimental observational data and the limitations of the experimental studies, such as considerable cost, impracticality, small and less representative sample sizes, etc. In observational studies, de-confounding is a fundamental problem of individualised tr…

    Submitted 24 January, 2023; v1 submitted 19 October, 2022; originally announced October 2022.

    Comments: accepted to AISTATS 2023

  25. arXiv:2210.06425  [pdf, other]

    cs.CL cs.LG

    MiniALBERT: Model Distillation via Parameter-Efficient Recursive Transformers

    Authors: Mohammadmahdi Nouriborji, Omid Rohanian, Samaneh Kouchaki, David A. Clifton

    Abstract: Pre-trained Language Models (LMs) have become an integral part of Natural Language Processing (NLP) in recent years, due to their superior performance in downstream applications. In spite of this resounding success, the usability of LMs is constrained by computational and time complexity, along with their increasing size; an issue that has been referred to as 'overparameterisation'. Different stra…

    Submitted 30 April, 2023; v1 submitted 12 October, 2022; originally announced October 2022.

    MSC Class: 68T50; ACM Class: I.2.7

  26. arXiv:2209.13476  [pdf, other]

    eess.IV cs.AI cs.CV cs.LG

    Mine yOur owN Anatomy: Revisiting Medical Image Segmentation with Extremely Limited Labels

    Authors: Chenyu You, Weicheng Dai, Fenglin Liu, Yifei Min, Haoran Su, Xiaoran Zhang, Xiaoxiao Li, David A. Clifton, Lawrence Staib, James S. Duncan

    Abstract: Recent studies on contrastive learning have achieved remarkable performance solely by leveraging few labels in the context of medical image segmentation. Existing methods mainly focus on instance discrimination and invariant mapping. However, they face three common pitfalls: (1) tailness: medical image data usually follows an implicit long-tail class distribution. Blindly leveraging all pixels in…

    Submitted 16 March, 2023; v1 submitted 27 September, 2022; originally announced September 2022.

    Comments: In this version: Add theoretical analysis and correct some typos

  27. Cem Mil Podcasts: A Spoken Portuguese Document Corpus For Multi-modal, Multi-lingual and Multi-Dialect Information Access Research

    Authors: Ekaterina Garmash, Edgar Tanaka, Ann Clifton, Joana Correia, Sharmistha Jat, Winstead Zhu, Rosie Jones, Jussi Karlgren

    Abstract: In this paper we describe the Portuguese-language podcast dataset we have released for academic research purposes. We give an overview of how the data was sampled, descriptive statistics over the collection, as well as information about the distribution over Brazilian and Portuguese dialects. We give results from experiments on multi-lingual summarization, showing that summarizing podcast transcri…

    Submitted 13 December, 2023; v1 submitted 23 September, 2022; originally announced September 2022.

    Comments: 12 pages, 1 figure

    Journal ref: Volume 14163 of Lecture Notes in Computer Science, pages 48-59, Springer, 2023

  28. arXiv:2209.03182  [pdf, ps, other]

    cs.CL cs.LG

    On the Effectiveness of Compact Biomedical Transformers

    Authors: Omid Rohanian, Mohammadmahdi Nouriborji, Samaneh Kouchaki, David A. Clifton

    Abstract: Language models pre-trained on biomedical corpora, such as BioBERT, have recently shown promising results on downstream biomedical tasks. Many existing pre-trained models, on the other hand, are resource-intensive and computationally heavy owing to factors such as embedding size, hidden dimension, and number of layers. The natural language processing (NLP) community has developed numerous strategi…

    Submitted 7 September, 2022; originally announced September 2022.

    MSC Class: 68T50

  29. COPER: Continuous Patient State Perceiver

    Authors: Vinod Kumar Chauhan, Anshul Thakur, Odhran O'Donoghue, David A. Clifton

    Abstract: In electronic health records (EHRs), irregular time-series (ITS) occur naturally due to patient health dynamics, reflected by irregular hospital visits, diseases/conditions, and the necessity to measure different vital signs at each visit, etc. ITS present challenges in training machine learning algorithms, which are mostly built on the assumption of a coherent, fixed-dimensional feature space. In this pap…

    Submitted 24 November, 2022; v1 submitted 5 August, 2022; originally announced August 2022.

    Comments: 2 figures; presented in IEEE International Conference on Biomedical and Health Informatics (IEEE BHI-2022)

  30. arXiv:2207.11846  [pdf, other]

    cs.LG cs.AI

    Mixture of Input-Output Hidden Markov Models for Heterogeneous Disease Progression Modeling

    Authors: Taha Ceritli, Andrew P. Creagh, David A. Clifton

    Abstract: A particular challenge for disease progression modeling is the heterogeneity of a disease and its manifestations in the patients. Existing approaches often assume the presence of a single disease progression characteristic, which is unlikely for neurodegenerative disorders such as Parkinson's disease. In this paper, we propose a hierarchical time-series model that can discover multiple disease pro…

    Submitted 24 July, 2022; originally announced July 2022.

  31. arXiv:2207.00118  [pdf, other]

    cs.LG cs.AI cs.CV

    ProSelfLC: Progressive Self Label Correction Towards A Low-Temperature Entropy State

    Authors: Xinshao Wang, Yang Hua, Elyor Kodirov, Sankha Subhra Mukherjee, David A. Clifton, Neil M. Robertson

    Abstract: There is a family of label modification approaches including self and non-self label correction (LC), and output regularisation. They are widely used for training robust deep neural networks (DNNs), but have not been mathematically and thoroughly analysed together. We study them and discover three key issues: (1) We are more interested in adopting Self LC as it leverages its own knowledge and requ…

    Submitted 6 September, 2022; v1 submitted 30 June, 2022; originally announced July 2022.

    Comments: To ease the reading, a summary of changes is put in the beginning. Our source code is available at https://github.com/XinshaoAmosWang/ProSelfLC-AT

  32. arXiv:2206.06488  [pdf, other]

    cs.CV cs.LG

    Multimodal Learning with Transformers: A Survey

    Authors: Peng Xu, Xiatian Zhu, David A. Clifton

    Abstract: Transformer is a promising neural network learner, and has achieved great success in various machine learning tasks. Thanks to the recent prevalence of multimodal applications and big data, Transformer-based multimodal learning has become a hot topic in AI research. This paper presents a comprehensive survey of Transformer techniques oriented at multimodal data. The main contents of this survey in…

    Submitted 9 May, 2023; v1 submitted 13 June, 2022; originally announced June 2022.

    Comments: This paper is accepted by IEEE TPAMI

  33. arXiv:2206.02909  [pdf, other]

    eess.SP cs.AI cs.LG

    Self-supervised Learning for Human Activity Recognition Using 700,000 Person-days of Wearable Data

    Authors: Hang Yuan, Shing Chan, Andrew P. Creagh, Catherine Tong, Aidan Acquah, David A. Clifton, Aiden Doherty

    Abstract: Advances in deep learning for human activity recognition have been relatively limited due to the lack of large labelled datasets. In this study, we leverage self-supervised learning techniques on the UK-Biobank activity tracker dataset--the largest of its kind to date--containing more than 700,000 person-days of unlabelled wearable sensor data. Our resulting activity recognition model consistently…

    Submitted 20 June, 2024; v1 submitted 6 June, 2022; originally announced June 2022.

    Journal ref: npj Digit. Med. 7, 91 (2024)

  34. arXiv:2205.12070  [pdf, other]

    cs.LG cs.AI

    Deep Reinforcement Learning for Multi-class Imbalanced Training

    Authors: Jenny Yang, Rasheed El-Bouri, Odhran O'Donoghue, Alexander S. Lachapelle, Andrew A. S. Soltan, David A. Clifton

    Abstract: With the rapid growth of memory and computing power, datasets are becoming increasingly complex and imbalanced. This is especially severe in the context of clinical data, where there may be one rare event for many cases in the majority class. We introduce an imbalanced classification framework, based on reinforcement learning, for training extremely imbalanced data sets, and extend it for use in m…

    Submitted 24 May, 2022; originally announced May 2022.

  35. arXiv:2202.03670  [pdf, other]

    cs.CV cs.LG

    How to Understand Masked Autoencoders

    Authors: Shuhao Cao, Peng Xu, David A. Clifton

    Abstract: "Masked Autoencoders (MAE) Are Scalable Vision Learners" revolutionizes the self-supervised learning method in that it not only achieves the state-of-the-art for image pre-training, but is also a milestone that bridges the gap between visual and linguistic masked autoencoding (BERT-style) pre-trainings. However, to our knowledge, to date there are no theoretical perspectives to explain the powerfu…

    Submitted 9 February, 2022; v1 submitted 8 February, 2022; originally announced February 2022.

  36. Podcast Metadata and Content: Episode Relevance and Attractiveness in Ad Hoc Search

    Authors: Ben Carterette, Rosie Jones, Gareth F. Jones, Maria Eskevich, Sravana Reddy, Ann Clifton, Yongze Yu, Jussi Karlgren, Ian Soboroff

    Abstract: Rapidly growing online podcast archives contain diverse content on a wide range of topics. These archives form an important resource for entertainment and professional use, but their value can only be realized if users can rapidly and reliably locate content of interest. Search for relevant content can be based on metadata provided by content creators, but also on transcripts of the spoken content…

    Submitted 25 August, 2021; originally announced August 2021.

  37. arXiv:2107.01707  [pdf, other]

    cs.LG cs.CR cs.DC

    Towards Scheduling Federated Deep Learning using Meta-Gradients for Inter-Hospital Learning

    Authors: Rasheed el-Bouri, Tingting Zhu, David A. Clifton

    Abstract: Given the abundance and ease of access of personal data today, individual privacy has become of paramount importance, particularly in the healthcare domain. In this work, we aim to utilise patient data extracted from multiple hospital data centres to train a machine learning model without sacrificing patient privacy. We develop a scheduling algorithm in conjunction with a student-teacher algorithm…

    Submitted 4 July, 2021; originally announced July 2021.

    Comments: 11 pages, 8 figures

  38. arXiv:2106.09227  [pdf, other]

    cs.IR

    Current Challenges and Future Directions in Podcast Information Access

    Authors: Rosie Jones, Hamed Zamani, Markus Schedl, Ching-Wei Chen, Sravana Reddy, Ann Clifton, Jussi Karlgren, Helia Hashemi, Aasish Pappu, Zahra Nazari, Longqi Yang, Oguz Semerci, Hugues Bouchard, Ben Carterette

    Abstract: Podcasts are spoken documents across a wide range of genres and styles, with growing listenership across the world, and a rapidly lowering barrier to entry for both listeners and creators. The great strides in search and recommendation in research and industry have yet to see impact in the podcast space, where recommendations are still largely driven by word of mouth. In this perspective paper, we…

    Submitted 16 June, 2021; originally announced June 2021.

    Comments: SIGIR 2021

  39. arXiv:2106.01489  [pdf, other]

    cs.LG cs.AI cs.CV

    Not All Knowledge Is Created Equal: Mutual Distillation of Confident Knowledge

    Authors: Ziyun Li, Xinshao Wang, Di Hu, Neil M. Robertson, David A. Clifton, Christoph Meinel, Haojin Yang

    Abstract: Mutual knowledge distillation (MKD) improves a model by distilling knowledge from another model. However, not all knowledge is certain and correct, especially under adverse conditions. For example, label noise usually leads to less reliable models due to undesired memorization \cite{zhang2017understanding,arpit2017closer}. Wrong knowledge misleads the learning rather than helps. This prob…

    Submitted 16 November, 2022; v1 submitted 2 June, 2021; originally announced June 2021.

    Comments: NeurIPS 2022 Workshop(Trustworthy and Socially Responsible Machine Learning) paper

  40. arXiv:2104.03343  [pdf, other]

    cs.CL

    Spotify at TREC 2020: Genre-Aware Abstractive Podcast Summarization

    Authors: Rezvaneh Rezapour, Sravana Reddy, Ann Clifton, Rosie Jones

    Abstract: This paper contains the description of our submissions to the summarization task of the Podcast Track in TREC (the Text REtrieval Conference) 2020. The goal of this challenge was to generate short, informative summaries that contain the key information present in a podcast episode using automatically generated transcripts of the podcast audio. Since podcasts vary with respect to their genre, topic…

    Submitted 7 April, 2021; originally announced April 2021.

    Comments: The Twenty-Ninth Text REtrieval Conference (TREC 2020) Proceedings

  41. arXiv:2103.15953  [pdf, other]

    cs.IR cs.CL

    TREC 2020 Podcasts Track Overview

    Authors: Rosie Jones, Ben Carterette, Ann Clifton, Maria Eskevich, Gareth J. F. Jones, Jussi Karlgren, Aasish Pappu, Sravana Reddy, Yongze Yu

    Abstract: The Podcast Track is new at the Text Retrieval Conference (TREC) in 2020. The podcast track was designed to encourage research into podcasts in the information retrieval and NLP research communities. The track consisted of two shared tasks: segment retrieval and summarization, both based on a dataset of over 100,000 podcast episodes (metadata, audio, and automatic transcripts) which was released c…

    Submitted 29 March, 2021; originally announced March 2021.

    Journal ref: The Proceedings of the Twenty-Ninth Text REtrieval Conference Proceedings (TREC 2020)

  42. arXiv:2011.14230  [pdf, other]

    eess.SP cs.LG

    CROCS: Clustering and Retrieval of Cardiac Signals Based on Patient Disease Class, Sex, and Age

    Authors: Dani Kiyasseh, Tingting Zhu, David A. Clifton

    Abstract: The process of manually searching for relevant instances in, and extracting information from, clinical databases underpins a multitude of clinical tasks. Such tasks include disease diagnosis, clinical trial recruitment, and continuing medical education. This manual search-and-extract process, however, has been hampered by the growth of large-scale clinical databases and the increased prevalence of…

    Submitted 3 October, 2021; v1 submitted 28 November, 2020; originally announced November 2020.

    Comments: Accepted at Advances in Neural Information Processing Systems (NeurIPS) 2021

  43. arXiv:2011.14227  [pdf, other]

    eess.SP cs.LG

    PCPs: Patient Cardiac Prototypes

    Authors: Dani Kiyasseh, Tingting Zhu, David A. Clifton

    Abstract: Many clinical deep learning algorithms are population-based and difficult to interpret. Such properties limit their clinical utility as population-based findings may not generalize to individual patients and physicians are reluctant to incorporate opaque models into their clinical workflow. To overcome these obstacles, we propose to learn patient-specific embeddings, entitled patient cardiac proto…

    Submitted 28 November, 2020; originally announced November 2020.

  44. arXiv:2005.13249  [pdf, other]

    cs.LG eess.SP stat.ML

    CLOCS: Contrastive Learning of Cardiac Signals Across Space, Time, and Patients

    Authors: Dani Kiyasseh, Tingting Zhu, David A. Clifton

    Abstract: The healthcare industry generates troves of unlabelled physiological data. This data can be exploited via contrastive learning, a self-supervised pre-training method that encourages representations of instances to be similar to one another. We propose a family of contrastive learning methods, CLOCS, that encourages representations across space, time, and patients to be similar to one anot…

    Submitted 16 May, 2021; v1 submitted 27 May, 2020; originally announced May 2020.

    Comments: Accepted to ICML 2021

  45. arXiv:2005.03788  [pdf, other]

    cs.LG cs.CV stat.ML

    ProSelfLC: Progressive Self Label Correction for Training Robust Deep Neural Networks

    Authors: Xinshao Wang, Yang Hua, Elyor Kodirov, David A. Clifton, Neil M. Robertson

    Abstract: To train robust deep neural networks (DNNs), we systematically study several target modification approaches, which include output regularisation, self and non-self label correction (LC). Two key issues are discovered: (1) Self LC is the most appealing as it exploits its own knowledge and requires no extra models. However, how to automatically decide the trust degree of a learner as training goes i…

    Submitted 2 June, 2021; v1 submitted 7 May, 2020; originally announced May 2020.

    Comments: ProSelfLC is the first method to trust self knowledge progressively and adaptively. ProSelfLC redirects and promotes entropy minimisation, which is in marked contrast to recent practices of confidence penalty [42, 33, 6]

    Journal ref: CVPR 2021

  46. arXiv:2004.10468  [pdf, other]

    cs.LG stat.ML

    SoQal: Selective Oracle Questioning in Active Learning

    Authors: Dani Kiyasseh, Tingting Zhu, David A. Clifton

    Abstract: Large sets of unlabelled data within the healthcare domain remain underutilized. Active learning offers a way to exploit these datasets by iteratively requesting an oracle (e.g. medical professional) to label instances. This process, which can be costly and time-consuming, is overly dependent upon an oracle. To alleviate this burden, we propose SoQal, a questioning strategy that dynamically determi…

    Submitted 22 April, 2020; originally announced April 2020.

  47. arXiv:2004.09578  [pdf, other]

    cs.LG stat.ML

    CLOPS: Continual Learning of Physiological Signals

    Authors: Dani Kiyasseh, Tingting Zhu, David A. Clifton

    Abstract: Deep learning algorithms are known to experience destructive interference when instances violate the assumption of being independent and identically distributed (i.i.d.). This violation, however, is ubiquitous in clinical settings where data are streamed temporally and from a multitude of physiological sensors. To overcome this obstacle, we propose CLOPS, a replay-based continual learning strategy.…

    Submitted 28 November, 2020; v1 submitted 20 April, 2020; originally announced April 2020.

  48. arXiv:2004.09557  [pdf, other]

    cs.LG stat.ML

    SoQal: Selective Oracle Questioning for Consistency Based Active Learning of Cardiac Signals

    Authors: Dani Kiyasseh, Tingting Zhu, David A. Clifton

    Abstract: Clinical settings are often characterized by abundant unlabelled data and limited labelled data. This is typically driven by the high burden placed on oracles (e.g., physicians) to provide annotations. One way to mitigate this burden is via active learning (AL) which involves the (a) acquisition and (b) annotation of informative unlabelled instances. Whereas previous work addresses either one of t…

    Submitted 18 May, 2022; v1 submitted 20 April, 2020; originally announced April 2020.

    Comments: ICML 2022

  49. arXiv:2004.04270  [pdf, other]

    cs.CL

    The Spotify Podcast Dataset

    Authors: Ann Clifton, Aasish Pappu, Sravana Reddy, Yongze Yu, Jussi Karlgren, Ben Carterette, Rosie Jones

    Abstract: Podcasts are a relatively new form of audio media. Episodes appear on a regular cadence, and come in many different formats and levels of formality. They can be formal news journalism or conversational chat; fiction or non-fiction. They are rapidly growing in popularity and yet have been relatively little studied. As an audio format, podcasts are more varied in style and production types than, say…

    Submitted 5 December, 2020; v1 submitted 8 April, 2020; originally announced April 2020.

    Comments: 4 pages, 3 figures

  50. arXiv:1912.05345  [pdf, other]

    eess.SP cs.CV cs.LG

    Severity Detection Tool for Patients with Infectious Disease

    Authors: Girmaw Abebe Tadesse, Tingting Zhu, Nhan Le Nguyen Thanh, Nguyen Thanh Hung, Ha Thi Hai Duong, Truong Huu Khanh, Pham Van Quang, Duc Duong Tran, Lam Minh Yen, H Rogier Van Doorn, Nguyen Van Hao, John Prince, Hamza Javed, Dani Kiyasseh, Le Van Tan, Louise Thwaites, David A. Clifton

    Abstract: Hand, foot and mouth disease (HFMD) and tetanus are serious infectious diseases in low- and middle-income countries. Tetanus in particular has a high mortality rate and its treatment is resource-demanding. Furthermore, HFMD often affects a large number of infants and young children. As a result, its treatment consumes enormous healthcare resources, especially when outbreaks occur. Autonomic nervous…

    Submitted 10 December, 2019; originally announced December 2019.