Showing 1–50 of 68 results for author: Denman, S

Searching in archive cs.
  1. arXiv:2405.13264

    cs.LG cs.AI cs.CV

    Part-based Quantitative Analysis for Heatmaps

    Authors: Osman Tursun, Sinan Kalkan, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: Heatmaps have been instrumental in helping understand deep network decisions, and are a common approach for Explainable AI (XAI). While significant progress has been made in enhancing the informativeness and accessibility of heatmaps, heatmap analysis is typically very subjective and limited to domain experts. As such, developing automatic, scalable, and numerical analysis methods to make heatmap-…

    Submitted 21 May, 2024; originally announced May 2024.

  2. arXiv:2312.10930

    cs.CV

    Deep Learning Approaches for Seizure Video Analysis: A Review

    Authors: David Ahmedt-Aristizabal, Mohammad Ali Armin, Zeeshan Hayder, Norberto Garcia-Cairasco, Lars Petersson, Clinton Fookes, Simon Denman, Aileen McGonigal

    Abstract: Seizure events can manifest as transient disruptions in the control of movements which may be organized in distinct behavioral sequences, accompanied or not by other observable features such as altered facial expressions. The analysis of these clinical signs, referred to as semiology, is subject to observer variations when specialists evaluate video-recorded events in the clinical setting. To enha…

    Submitted 4 March, 2024; v1 submitted 17 December, 2023; originally announced December 2023.

    Comments: Accepted in Epilepsy & Behavior

  3. arXiv:2312.09489

    cs.LG eess.SP

    Multi-stage Learning for Radar Pulse Activity Segmentation

    Authors: Zi Huang, Akila Pemasiri, Simon Denman, Clinton Fookes, Terrence Martin

    Abstract: Radio signal recognition is a crucial function in electronic warfare. Precise identification and localisation of radar pulse activities are required by electronic warfare systems to produce effective countermeasures. Despite the importance of these tasks, deep learning-based radar pulse activity recognition methods have remained largely underexplored. While deep learning for radar modulation recog…

    Submitted 14 December, 2023; originally announced December 2023.

    Comments: 5 pages, 8 figures

  4. Multi-task Learning for Radar Signal Characterisation

    Authors: Zi Huang, Akila Pemasiri, Simon Denman, Clinton Fookes, Terrence Martin

    Abstract: Radio signal recognition is a crucial task in both civilian and military applications, as accurate and timely identification of unknown signals is an essential part of spectrum management and electronic warfare. The majority of research in this field has focused on applying deep learning for modulation classification, leaving the task of signal characterisation as an understudied area. This paper…

    Submitted 30 April, 2024; v1 submitted 19 June, 2023; originally announced June 2023.

    Comments: 5 pages, 3 figures

  5. arXiv:2305.11394

    cs.CV

    Remembering What Is Important: A Factorised Multi-Head Retrieval and Auxiliary Memory Stabilisation Scheme for Human Motion Prediction

    Authors: Tharindu Fernando, Harshala Gammulle, Sridha Sridharan, Simon Denman, Clinton Fookes

    Abstract: Humans exhibit complex motions that vary depending on the task that they are performing, the interactions they engage in, as well as subject-specific preferences. Therefore, forecasting future poses based on the history of the previous motions is a challenging task. This paper presents an innovative auxiliary-memory-powered deep neural network framework for the improved modelling of historical kno…

    Submitted 18 May, 2023; originally announced May 2023.

  6. arXiv:2304.02202

    cs.CV cs.HC cs.LG

    Towards Self-Explainability of Deep Neural Networks with Heatmap Captioning and Large-Language Models

    Authors: Osman Tursun, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: Heatmaps are widely used to interpret deep neural networks, particularly for computer vision tasks, and the heatmap-based explainable AI (XAI) techniques are a well-researched topic. However, most studies concentrate on enhancing the quality of the generated heatmap or discovering alternate heatmap generation techniques, and little effort has been devoted to making heatmap-based XAI automatic, int…

    Submitted 4 April, 2023; originally announced April 2023.

  7. arXiv:2212.00386

    eess.IV cs.CV q-bio.TO

    Automated Coronary Arteries Labeling Via Geometric Deep Learning

    Authors: Yadan Li, Mohammad Ali Armin, Simon Denman, David Ahmedt-Aristizabal

    Abstract: Automatic labelling of anatomical structures, such as coronary arteries, is critical for diagnosis, yet existing (non-deep learning) methods are limited by a reliance on prior topological knowledge of the expected tree-like structures. As the structure of such vascular systems is often difficult to conceptualize, graph-based representations have become popular due to their ability to capture the geom…

    Submitted 1 December, 2022; originally announced December 2022.

    Journal ref: 2023 IEEE International Symposium on Biomedical Imaging (ISBI)

  8. Vision-Based Activity Recognition in Children with Autism-Related Behaviors

    Authors: Pengbo Wei, David Ahmedt-Aristizabal, Harshala Gammulle, Simon Denman, Mohammad Ali Armin

    Abstract: Advances in machine learning and contactless sensors have enabled the understanding of complex human behaviors in a healthcare setting. In particular, several deep learning systems have been introduced to enable comprehensive analysis of neuro-developmental conditions such as Autism Spectrum Disorder (ASD). This condition affects children from their early developmental stages onwards, and diagnosis r…

    Submitted 8 August, 2022; originally announced August 2022.

    Journal ref: Heliyon, Volume 9, Issue 6, June 2023, e16763

  9. arXiv:2207.01769

    cs.CV

    SESS: Saliency Enhancing with Scaling and Sliding

    Authors: Osman Tursun, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: High-quality saliency maps are essential in several machine learning application areas including explainable AI and weakly supervised object detection and segmentation. Many techniques have been developed to generate better saliency using neural networks. However, they are often limited to specific saliency visualisation methods or saliency issues. We propose a novel saliency enhancing approach ca…

    Submitted 4 July, 2022; originally announced July 2022.

    Comments: This paper will be presented at ECCV2022

  10. Towards On-Board Panoptic Segmentation of Multispectral Satellite Images

    Authors: Tharindu Fernando, Clinton Fookes, Harshala Gammulle, Simon Denman, Sridha Sridharan

    Abstract: With tremendous advancements in low-power embedded computing devices and remote sensing instruments, the traditional satellite image processing pipeline which includes an expensive data transfer step prior to processing data on the ground is being replaced by on-board processing of captured data. This paradigm shift enables critical and time-sensitive analytic intelligence to be acquired in a time…

    Submitted 4 April, 2022; originally announced April 2022.

  11. arXiv:2202.13096

    cs.CV cs.HC cs.LG

    Continuous Human Action Recognition for Human-Machine Interaction: A Review

    Authors: Harshala Gammulle, David Ahmedt-Aristizabal, Simon Denman, Lachlan Tychsen-Smith, Lars Petersson, Clinton Fookes

    Abstract: With advances in data-driven machine learning research, a wide variety of prediction models have been proposed to capture spatio-temporal features for the analysis of video streams. Recognising actions and detecting action transitions within an input video are challenging but necessary tasks for applications that require real-time human-machine interaction. By reviewing a large body of recent rela…

    Submitted 26 February, 2022; originally announced February 2022.

    Comments: Preprint submitted to ACM Computing Surveys

    Journal ref: 2023, Volume 55, Issue 13s

  12. Privacy-Preserving In-Bed Pose Monitoring: A Fusion and Reconstruction Study

    Authors: Thisun Dayarathna, Thamidu Muthukumarana, Yasiru Rathnayaka, Simon Denman, Chathura de Silva, Akila Pemasiri, David Ahmedt-Aristizabal

    Abstract: Recently, in-bed human pose estimation has attracted the interest of researchers due to its relevance to a wide range of healthcare applications. Compared to the general problem of human pose estimation, in-bed pose estimation has several inherent challenges, the most prominent being frequent and severe occlusions caused by bedding. In this paper we explore the effective use of images from multipl…

    Submitted 22 February, 2022; originally announced February 2022.

    Journal ref: Expert Systems with Applications, Volume 213, Part C, 1 March 2023, 119139

  13. In-Bed Human Pose Estimation from Unseen and Privacy-Preserving Image Domains

    Authors: Ting Cao, Mohammad Ali Armin, Simon Denman, Lars Petersson, David Ahmedt-Aristizabal

    Abstract: Medical applications have benefited greatly from the rapid advancement in computer vision. Considering patient monitoring in particular, in-bed human posture estimation offers important health-related metrics with potential value in medical condition assessments. Despite great progress in this domain, it remains challenging due to substantial ambiguity during occlusions, and the lack of large corp…

    Submitted 24 January, 2022; v1 submitted 29 November, 2021; originally announced November 2021.

    Comments: In the IEEE International Symposium on Biomedical Imaging (ISBI)

    Journal ref: ISBI 2022

  14. arXiv:2108.03786

    eess.IV cs.CV cs.LG

    Multi-Slice Net: A novel light weight framework for COVID-19 Diagnosis

    Authors: Harshala Gammulle, Tharindu Fernando, Sridha Sridharan, Simon Denman, Clinton Fookes

    Abstract: This paper presents a novel lightweight COVID-19 diagnosis framework using CT scans. Our system utilises a novel two-stage approach to generate robust and efficient diagnoses across heterogeneous patient level inputs. We use a powerful backbone network as a feature extractor to capture discriminative slice-level features. These features are aggregated by a lightweight network to obtain a patient l…

    Submitted 8 August, 2021; originally announced August 2021.

    Comments: IEEE International Conference on Autonomous Systems 2021

  15. A Survey on Graph-Based Deep Learning for Computational Histopathology

    Authors: David Ahmedt-Aristizabal, Mohammad Ali Armin, Simon Denman, Clinton Fookes, Lars Petersson

    Abstract: With the remarkable success of representation learning for prediction problems, we have witnessed a rapid expansion of the use of machine learning and deep learning for the analysis of digital pathology and biopsy image patches. However, learning over patch-wise features using convolutional neural networks limits the ability of the model to capture global contextual information and comprehensively…

    Submitted 27 September, 2021; v1 submitted 1 July, 2021; originally announced July 2021.

    Comments: Preprint submitted to Computerized Medical Imaging and Graphics

    Journal ref: Volume 95, January 2022, 102027

  16. arXiv:2106.15835

    cs.SD cs.LG eess.AS

    Robust and Interpretable Temporal Convolution Network for Event Detection in Lung Sound Recordings

    Authors: Tharindu Fernando, Sridha Sridharan, Simon Denman, Houman Ghaemmaghami, Clinton Fookes

    Abstract: This paper proposes a novel framework for lung sound event detection, segmenting continuous lung sound recordings into discrete events and performing recognition on each event. Exploiting the lightweight nature of Temporal Convolution Networks (TCNs) and their superior results compared to their recurrent counterparts, we propose a lightweight, yet robust, and completely interpretable framework for…

    Submitted 30 June, 2021; originally announced June 2021.

    Comments: preprint submitted to JBHI

  17. Video-Based Inpatient Fall Risk Assessment: A Case Study

    Authors: Ziqing Wang, Mohammad Ali Armin, Simon Denman, Lars Petersson, David Ahmedt-Aristizabal

    Abstract: Inpatient falls are a serious safety issue in hospitals and healthcare facilities. Recent advances in video analytics for patient monitoring provide a non-intrusive avenue to reduce this risk through continuous activity monitoring. However, in-bed fall risk assessment systems have received less attention in the literature. The majority of prior studies have focused on fall event detection, and do…

    Submitted 27 May, 2021; originally announced June 2021.

    Journal ref: IEEE Engineering in Medicine & Biology Society (EMBC) 2021

  18. Towards Interpretable Attention Networks for Cervical Cancer Analysis

    Authors: Ruiqi Wang, Mohammad Ali Armin, Simon Denman, Lars Petersson, David Ahmedt-Aristizabal

    Abstract: Recent advances in deep learning have enabled the development of automated frameworks for analysing medical images and signals, including analysis of cervical cancer. Many previous works focus on the analysis of isolated cervical cells, or do not offer sufficient methods to explain and understand how the proposed models reach their classification decisions on multi-cell images. Here, we evaluate v…

    Submitted 27 May, 2021; originally announced June 2021.

    Journal ref: IEEE Engineering in Medicine & Biology Society (EMBC) 2021

  19. arXiv:2105.13137

    cs.LG cs.CV q-bio.QM

    Graph-Based Deep Learning for Medical Diagnosis and Analysis: Past, Present and Future

    Authors: David Ahmedt-Aristizabal, Mohammad Ali Armin, Simon Denman, Clinton Fookes, Lars Petersson

    Abstract: With the advances of data-driven machine learning research, a wide variety of prediction problems have been tackled. It has become critical to explore how machine learning and specifically deep learning methods can be exploited to analyse healthcare data. A major limitation of existing methods has been the focus on grid-like data; however, the structure of physiological recordings is often irregu…

    Submitted 27 May, 2021; originally announced May 2021.

    Journal ref: Sensors 2021, 21, 4758

  20. arXiv:2104.13780

    cs.CV cs.AI cs.LG

    Semantic Consistency and Identity Mapping Multi-Component Generative Adversarial Network for Person Re-Identification

    Authors: Amena Khatun, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: In a real world environment, person re-identification (Re-ID) is a challenging task due to variations in lighting conditions, viewing angles, pose and occlusions. Despite recent performance gains, current person Re-ID algorithms still suffer heavily when encountering these variations. To address this problem, we propose a semantic consistency and identity mapping multi-component generative adversa…

    Submitted 28 April, 2021; originally announced April 2021.

    Comments: Accepted in WACV 2020

    Journal ref: WACV, 2020

  21. arXiv:2104.13773

    cs.CV cs.AI cs.LG

    Pose-driven Attention-guided Image Generation for Person Re-Identification

    Authors: Amena Khatun, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: Person re-identification (re-ID) concerns the matching of subject images across different camera views in a multi camera surveillance system. One of the major challenges in person re-ID is pose variations across the camera network, which significantly affects the appearance of a person. Existing development data lack adequate pose variations to carry out effective training of person re-ID systems.…

    Submitted 28 April, 2021; originally announced April 2021.

    Comments: Submitted to Pattern Recognition

  22. Learning Regional Attention over Multi-resolution Deep Convolutional Features for Trademark Retrieval

    Authors: Osman Tursun, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: Large-scale trademark retrieval is an important content-based image retrieval task. A recent study shows that off-the-shelf deep features aggregated with Regional-Maximum Activation of Convolutions (R-MAC) achieve state-of-the-art results. However, R-MAC suffers in the presence of background clutter/trivial regions and scale variance, and discards important spatial information. We introduce three…

    Submitted 15 April, 2021; originally announced April 2021.

  23. arXiv:2102.04016

    cs.CV

    An Efficient Framework for Zero-Shot Sketch-Based Image Retrieval

    Authors: Osman Tursun, Simon Denman, Sridha Sridharan, Ethan Goan, Clinton Fookes

    Abstract: Recently, Zero-shot Sketch-based Image Retrieval (ZS-SBIR) has attracted the attention of the computer vision community due to its real-world applications, and the more realistic and challenging setting than found in SBIR. ZS-SBIR inherits the main challenges of multiple computer vision problems including content-based Image Retrieval (CBIR), zero-shot learning and domain adaptation. The majority…

    Submitted 8 February, 2021; originally announced February 2021.

  24. arXiv:2012.02364

    cs.LG cs.CV eess.IV stat.ML

    Deep Learning for Medical Anomaly Detection -- A Survey

    Authors: Tharindu Fernando, Harshala Gammulle, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: Machine learning-based medical anomaly detection is an important problem that has been extensively studied. Numerous approaches have been proposed across various medical application domains and we observe several similarities across these distinct applications. Despite this comparability, we observe a lack of structured organisation of these diverse research applications such that their advantages…

    Submitted 13 April, 2021; v1 submitted 3 December, 2020; originally announced December 2020.

    Comments: Preprint submitted to ACM Computing Surveys

  25. arXiv:2012.01170

    cs.CV

    Sparse Convolutions on Continuous Domains for Point Cloud and Event Stream Networks

    Authors: Dominic Jack, Frederic Maire, Simon Denman, Anders Eriksson

    Abstract: Image convolutions have been a cornerstone of a great number of deep learning advances in computer vision. The research community is yet to settle on an equivalent operator for sparse, unstructured continuous data like point clouds and event streams, however. We present an elegant sparse matrix-based interpretation of the convolution operator for these cases, which is consistent with the mathematic…

    Submitted 2 December, 2020; originally announced December 2020.

    Comments: ACCV2020

  26. arXiv:2011.09581

    cs.CV

    Patient-independent Epileptic Seizure Prediction using Deep Learning Models

    Authors: Theekshana Dissanayake, Tharindu Fernando, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: Objective: Epilepsy is one of the most prevalent neurological diseases among humans and can lead to severe brain injuries, strokes, and brain tumors. Early detection of seizures can help to mitigate injuries, and can be used to aid the treatment of patients with epilepsy. The purpose of a seizure prediction system is to successfully identify the pre-ictal brain stage, which occurs before a seizure…

    Submitted 18 November, 2020; originally announced November 2020.

  27. arXiv:2011.06207

    cs.CV

    Domain Generalization in Biosignal Classification

    Authors: Theekshana Dissanayake, Tharindu Fernando, Simon Denman, Houman Ghaemmaghami, Sridha Sridharan, Clinton Fookes

    Abstract: Objective: When training machine learning models, we often assume that the training data and evaluation data are sampled from the same distribution. However, this assumption is violated when the model is evaluated on another unseen but similar database, even if that database contains the same classes. This problem is caused by domain-shift and can be solved using two approaches: domain adaptation…

    Submitted 12 November, 2020; originally announced November 2020.

  28. arXiv:2011.05438

    cs.CV

    Fast & Slow Learning: Incorporating Synthetic Gradients in Neural Memory Controllers

    Authors: Tharindu Fernando, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: Neural Memory Networks (NMNs) have received increased attention in recent years compared to deep architectures that use a constrained memory. Despite their new appeal, the success of NMNs hinges on the ability of the gradient-based optimiser to perform incremental training of the NMN controllers, determining how to leverage their high capacity for knowledge retrieval. This means that while excelle…

    Submitted 10 November, 2020; originally announced November 2020.

  29. Multi-modal Fusion for Single-Stage Continuous Gesture Recognition

    Authors: Harshala Gammulle, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: Gesture recognition is a much studied research area which has myriad real-world applications including robotics and human-machine interaction. Current gesture recognition methods have focused on recognising isolated gestures, and existing continuous gesture recognition methods are limited to two-stage approaches where independent models are required for detection and classification, with the perfo…

    Submitted 24 August, 2021; v1 submitted 10 November, 2020; originally announced November 2020.

    Comments: Accepted for publication in IEEE Transactions on Image Processing

  30. arXiv:2009.10991

    eess.AS cs.HC cs.LG stat.ML

    Attention Driven Fusion for Multi-Modal Emotion Recognition

    Authors: Darshana Priyasad, Tharindu Fernando, Simon Denman, Clinton Fookes, Sridha Sridharan

    Abstract: Deep learning has emerged as a powerful alternative to hand-crafted methods for emotion recognition on combined acoustic and text modalities. Baseline systems model emotion information in text and acoustic modes independently using Deep Convolutional Neural Networks (DCNN) and Recurrent Neural Networks (RNN), followed by applying attention, fusion, and classification. In this paper, we present a d…

    Submitted 10 October, 2020; v1 submitted 23 September, 2020; originally announced September 2020.

    Comments: An updated version of the ICASSP 2020 paper

  31. arXiv:2007.08076

    cs.LG cs.CV stat.ML

    Memory based fusion for multi-modal deep learning

    Authors: Darshana Priyasad, Tharindu Fernando, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: The use of multi-modal data for deep machine learning has shown promise when compared to uni-modal approaches with fusion of multi-modal features resulting in improved performance in several applications. However, most state-of-the-art methods use naive fusion which processes feature streams independently, ignoring possible long-term dependencies within the data during fusion. In this paper, we pr…

    Submitted 23 October, 2020; v1 submitted 15 July, 2020; originally announced July 2020.

    Comments: Pre-print submitted to Information Fusion

  32. arXiv:2007.05914

    cs.CV cs.LG

    Two-Stream Deep Feature Modelling for Automated Video Endoscopy Data Analysis

    Authors: Harshala Gammulle, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: Automating the analysis of imagery of the Gastrointestinal (GI) tract captured during endoscopy procedures has substantial potential benefits for patients, as it can provide diagnostic support to medical practitioners and reduce mistakes via human error. To further the development of such methods, we propose a two-stream model for endoscopic image analysis. Our model fuses two streams of deep feat…

    Submitted 12 July, 2020; originally announced July 2020.

    Comments: Accepted for Publication at MICCAI 2020

  33. arXiv:2006.13211

    cs.CV

    Meta Transfer Learning for Emotion Recognition

    Authors: Dung Nguyen, Sridha Sridharan, Duc Thanh Nguyen, Simon Denman, David Dean, Clinton Fookes

    Abstract: Deep learning has been widely adopted in automatic emotion recognition and has led to significant progress in the field. However, due to insufficient annotated emotion datasets, pre-trained models are limited in their generalization capability and thus lead to poor performance on novel test sets. To mitigate this challenge, transfer learning performing fine-tuning on pre-trained models has been a…

    Submitted 22 June, 2020; originally announced June 2020.

    Comments: Revision under Journal of Pattern Recognition

  34. arXiv:2005.10480

    cs.SD cs.LG eess.AS q-bio.QM

    A Robust Interpretable Deep Learning Classifier for Heart Anomaly Detection Without Segmentation

    Authors: Theekshana Dissanayake, Tharindu Fernando, Simon Denman, Sridha Sridharan, Houman Ghaemmaghami, Clinton Fookes

    Abstract: Traditionally, abnormal heart sound classification is framed as a three-stage process. The first stage involves segmenting the phonocardiogram to detect fundamental heart sounds; after which features are extracted and classification is performed. Some researchers in the field argue the segmentation step is an unwanted computational burden, whereas others embrace it as a prior step to feature extra…

    Submitted 29 September, 2020; v1 submitted 21 May, 2020; originally announced May 2020.

  35. arXiv:2005.03222

    cs.CV

    End-to-End Domain Adaptive Attention Network for Cross-Domain Person Re-Identification

    Authors: Amena Khatun, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: Person re-identification (re-ID) remains challenging in a real-world scenario, as it requires a trained network to generalise to totally unseen target data in the presence of variations across domains. Recently, generative adversarial models have been widely adopted to enhance the diversity of training data. These approaches, however, often fail to generalise to other domains, as existing generati…

    Submitted 6 May, 2020; originally announced May 2020.

    Comments: submitted to IEEE Transactions on Information Forensics and Security

  36. arXiv:2005.03209

    cs.CV

    Hierarchical Attention Network for Action Segmentation

    Authors: Harshala Gammulle, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: The temporal segmentation of events is an essential task and a precursor for the automatic recognition of human actions in the video. Several attempts have been made to capture frame-level salient aspects through attention but they lack the capacity to effectively map the temporal relationships in between the frames as they only capture a limited span of temporal dependencies. To this end we propo…

    Submitted 6 May, 2020; originally announced May 2020.

    Comments: Published in Pattern Recognition Letters

  37. arXiv:2004.03712

    eess.SP cs.LG cs.SD eess.AS q-bio.QM stat.ML

    Heart Sound Segmentation using Bidirectional LSTMs with Attention

    Authors: Tharindu Fernando, Houman Ghaemmaghami, Simon Denman, Sridha Sridharan, Nayyar Hussain, Clinton Fookes

    Abstract: This paper proposes a novel framework for the segmentation of phonocardiogram (PCG) signals into heart states, exploiting the temporal evolution of the PCG as well as considering the salient information that it provides for the detection of the heart state. We propose the use of recurrent neural networks and exploit recent advancements in attention based learning to segment the PCG signal. This al…

    Submitted 1 April, 2020; originally announced April 2020.

    Comments: IEEE Journal of Biomedical and Health Informatics, 25 October 2019

  38. arXiv:2004.01546

    eess.AS cs.LG cs.SD stat.ML

    Temporarily-Aware Context Modelling using Generative Adversarial Networks for Speech Activity Detection

    Authors: Tharindu Fernando, Sridha Sridharan, Mitchell McLaren, Darshana Priyasad, Simon Denman, Clinton Fookes

    Abstract: This paper presents a novel framework for Speech Activity Detection (SAD). Inspired by the recent success of multi-task learning approaches in the speech processing domain, we propose a novel joint learning framework for SAD. We utilise generative adversarial networks to automatically learn a loss function for joint prediction of the frame-wise speech/non-speech classifications together with the…

    Submitted 1 April, 2020; originally announced April 2020.

    Journal ref: IEEE/ACM Transactions on Audio, Speech and Language Processing, 2020

  39. arXiv:2003.11136

    cs.CV

    Joint Deep Cross-Domain Transfer Learning for Emotion Recognition

    Authors: Dung Nguyen, Sridha Sridharan, Duc Thanh Nguyen, Simon Denman, Son N. Tran, Rui Zeng, Clinton Fookes

    Abstract: Deep learning has been applied to achieve significant progress in emotion recognition. Despite such substantial progress, existing approaches are still hindered by insufficient training data, and the resulting models do not generalize well under mismatched conditions. To address this challenge, we propose a learning strategy which jointly transfers the knowledge learned from rich datasets to sourc…

    Submitted 24 March, 2020; originally announced March 2020.

  40. Learning Test-time Augmentation for Content-based Image Retrieval

    Authors: Osman Tursun, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: Off-the-shelf convolutional neural network features achieve outstanding results in many image retrieval tasks. However, their invariance to target data is pre-defined by the network architecture and training data. Existing image retrieval approaches require fine-tuning or modification of pre-trained networks to adapt to variations unique to the target data. In contrast, our method enhances the inv…

    Submitted 5 July, 2022; v1 submitted 5 February, 2020; originally announced February 2020.

  41. MTRNet++: One-stage Mask-based Scene Text Eraser

    Authors: Osman Tursun, Simon Denman, Rui Zeng, Sabesan Sivapalan, Sridha Sridharan, Clinton Fookes

    Abstract: A precise, controllable, interpretable and easily trainable text removal approach is necessary for both user-specific and large-scale text removal applications. To achieve this, we propose a one-stage mask-based text inpainting network, MTRNet++. It has a novel architecture that includes mask-refine, coarse-inpainting and fine-inpainting branches, and attention blocks. With this architecture, MTRN…

    Submitted 4 June, 2020; v1 submitted 15 December, 2019; originally announced December 2019.

    Comments: This paper is under CVIU review (after major revision)

  42. arXiv:1912.07148

    cs.CV

    Predicting the Future: A Jointly Learnt Model for Action Anticipation

    Authors: Harshala Gammulle, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: Inspired by human neurological structures for action anticipation, we present an action anticipation model that enables the prediction of plausible future actions by forecasting both the visual and temporal future. In contrast to current state-of-the-art methods which first learn a model to predict future video features and then perform action anticipation using these features, the proposed framew…

    Submitted 15 December, 2019; originally announced December 2019.

    Comments: ICCV 2019

  43. arXiv:1912.04968  [pdf, other]

    eess.SP cs.LG cs.NE stat.ML

    Neural Memory Networks for Seizure Type Classification

    Authors: David Ahmedt-Aristizabal, Tharindu Fernando, Simon Denman, Lars Petersson, Matthew J. Aburn, Clinton Fookes

    Abstract: Classification of seizure type is a key step in the clinical process for evaluating an individual who presents with seizures. It determines the course of clinical diagnosis and treatment, and its impact stretches beyond the clinical domain to epilepsy research and the development of novel therapies. Automated identification of seizure type may facilitate understanding of the disease, and seizure d…

    Submitted 29 January, 2020; v1 submitted 10 December, 2019; originally announced December 2019.

    Comments: Proceedings of the IEEE International Conference of Engineering in Medicine and Biology Society. 2020

  44. arXiv:1911.07844  [pdf, other]

    cs.CV cs.LG cs.MM stat.ML

    Exploiting Human Social Cognition for the Detection of Fake and Fraudulent Faces via Memory Networks

    Authors: Tharindu Fernando, Clinton Fookes, Simon Denman, Sridha Sridharan

    Abstract: Advances in computer vision have brought us to the point where we have the ability to synthesise realistic fake content. Such approaches are seen as a source of disinformation and mistrust, and pose serious concerns to governments around the world. Convolutional Neural Networks (CNNs) demonstrate encouraging results when detecting fake images that arise from the specific type of manipulation they…

    Submitted 17 November, 2019; originally announced November 2019.

  45. arXiv:1910.05448  [pdf, other]

    cs.NE cs.CV cs.LG stat.ML

    Neural Memory Plasticity for Anomaly Detection

    Authors: Tharindu Fernando, Simon Denman, David Ahmedt-Aristizabal, Sridha Sridharan, Kristin Laurens, Patrick Johnston, Clinton Fookes

    Abstract: In the domain of machine learning, Neural Memory Networks (NMNs) have recently achieved impressive results in a variety of application areas including visual question answering, trajectory prediction, object tracking, and language modelling. However, we observe that the attention based knowledge retrieval mechanisms used in current NMNs restricts them from achieving their full potential as the att…

    Submitted 11 October, 2019; originally announced October 2019.

  46. arXiv:1909.09283  [pdf, other]

    cs.CV

    Coupled Generative Adversarial Network for Continuous Fine-grained Action Segmentation

    Authors: Harshala Gammulle, Tharindu Fernando, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: We propose a novel conditional GAN (cGAN) model for continuous fine-grained human action segmentation, that utilises multi-modal data and learned scene context information. The proposed approach utilises two GANs: termed Action GAN and Auxiliary GAN, where the Action GAN is trained to operate over the current RGB frame while the Auxiliary GAN utilises supplementary information such as depth or opt…

    Submitted 19 September, 2019; originally announced September 2019.

    Comments: WACV 2019

  47. arXiv:1909.09278  [pdf, other]

    cs.CV

    Forecasting Future Action Sequences with Neural Memory Networks

    Authors: Harshala Gammulle, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: We propose a novel neural memory network based framework for future action sequence forecasting. This is a challenging task where we have to consider short-term, within sequence relationships as well as relationships in between sequences, to understand how sequences of actions evolve over time. To capture these relationships effectively, we introduce neural memory networks to our modelling scheme.…

    Submitted 19 September, 2019; originally announced September 2019.

    Comments: BMVC 2019 Oral

  48. arXiv:1909.09269  [pdf, other]

    cs.CV

    Fine-grained Action Segmentation using the Semi-Supervised Action GAN

    Authors: Harshala Gammulle, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: In this paper we address the problem of continuous fine-grained action segmentation, in which multiple actions are present in an unsegmented video stream. The challenge for this task lies in the need to represent the hierarchical nature of the actions and to detect the transitions between actions, allowing us to localise the actions within the video effectively. We propose a novel recurrent semi-s…

    Submitted 19 September, 2019; originally announced September 2019.

    Comments: Published in Pattern Recognition Journal

  49. arXiv:1903.12328  [pdf, other]

    cs.LG stat.ML

    Improved Reinforcement Learning with Curriculum

    Authors: Joseph West, Frederic Maire, Cameron Browne, Simon Denman

    Abstract: Humans tend to learn complex abstract concepts faster if examples are presented in a structured manner. For instance, when learning how to play a board game, usually one of the first concepts learned is how the game ends, i.e. the actions that lead to a terminal state (win, lose or draw). The advantage of learning end-games first is that once the actions which lead to a terminal state are understo…

    Submitted 10 June, 2019; v1 submitted 28 March, 2019; originally announced March 2019.

    Comments: Draft prior to submission to IEEE Trans on Games. Changed paper slightly

  50. arXiv:1903.07916  [pdf, other]

    cs.CV

    Geometry-constrained Car Recognition Using a 3D Perspective Network

    Authors: Rui Zeng, Zongyuan Ge, Simon Denman, Sridha Sridharan, Clinton Fookes

    Abstract: We present a novel learning framework for vehicle recognition from a single RGB image. Unlike existing methods which only use attention mechanisms to locate 2D discriminative information, our work learns a novel 3D perspective feature representation of a vehicle, which is then fused with 2D appearance feature to predict the category. The framework is composed of a global network (GN), a 3D perspec…

    Submitted 17 November, 2019; v1 submitted 19 March, 2019; originally announced March 2019.