Skip to main content

Showing 1–33 of 33 results for author: Moubayed, N A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2406.17911  [pdf, other

    cs.CL

    X-ray Made Simple: Radiology Report Generation and Evaluation with Layman's Terms

    Authors: Kun Zhao, Chenghao Xiao, Chen Tang, Bohao Yang, Kai Ye, Noura Al Moubayed, Liang Zhan, Chenghua Lin

    Abstract: Radiology Report Generation (RRG) has achieved significant progress with the advancements of multimodal generative models. However, the evaluation in the domain suffers from a lack of fair and robust metrics. We reveal that, high performance on RRG with existing lexical-based metrics (e.g. BLEU) might be more of a mirage - a model can get a high BLEU only by learning the template of reports. This… ▽ More

    Submitted 30 June, 2024; v1 submitted 25 June, 2024; originally announced June 2024.

  2. arXiv:2405.17965  [pdf, other

    cs.CV

    AttenCraft: Attention-guided Disentanglement of Multiple Concepts for Text-to-Image Customization

    Authors: Junjie Shentu, Matthew Watson, Noura Al Moubayed

    Abstract: With the unprecedented performance being achieved by text-to-image (T2I) diffusion models, T2I customization further empowers users to tailor the diffusion model to new concepts absent in the pre-training dataset, termed subject-driven generation. Moreover, extracting several new concepts from a single image enables the model to learn multiple concepts, and simultaneously decreases the difficultie… ▽ More

    Submitted 28 May, 2024; originally announced May 2024.

  3. arXiv:2405.17450  [pdf, other

    cs.CV cs.LG

    The Power of Next-Frame Prediction for Learning Physical Laws

    Authors: Thomas Winterbottom, G. Thomas Hudson, Daniel Kluvanec, Dean Slack, Jamie Sterling, Junjie Shentu, Chenghao Xiao, Zheming Zhou, Noura Al Moubayed

    Abstract: Next-frame prediction is a useful and powerful method for modelling and understanding the dynamics of video data. Inspired by the empirical success of causal language modelling and next-token prediction in language modelling, we explore the extent to which next-frame prediction serves as a strong foundational learning strategy (analogous to language modelling) for inducing an understanding of the… ▽ More

    Submitted 21 May, 2024; originally announced May 2024.

    Comments: 7 Figures, 12 Pages, 1 Table

    MSC Class: 68T45 ACM Class: I.2.6; I.2.10

  4. arXiv:2404.06347  [pdf, other

    cs.CL cs.IR

    RAR-b: Reasoning as Retrieval Benchmark

    Authors: Chenghao Xiao, G Thomas Hudson, Noura Al Moubayed

    Abstract: Semantic textual similartiy (STS) and information retrieval tasks (IR) tasks have been the two major avenues to record the progress of embedding models in the past few years. Under the emerging Retrieval-augmented Generation (RAG) paradigm, we envision the need to evaluate next-level language understanding abilities of embedding models, and take a conscious look at the reasoning abilities stored i… ▽ More

    Submitted 12 May, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

    Comments: v2, small typo fixes

  5. arXiv:2403.19897  [pdf, other

    cs.CV cs.LG

    Disentangling Racial Phenotypes: Fine-Grained Control of Race-related Facial Phenotype Characteristics

    Authors: Seyma Yucer, Amir Atapour Abarghouei, Noura Al Moubayed, Toby P. Breckon

    Abstract: Achieving an effective fine-grained appearance variation over 2D facial images, whilst preserving facial identity, is a challenging task due to the high complexity and entanglement of common 2D facial feature encoding spaces. Despite these challenges, such fine-grained control, by way of disentanglement is a crucial enabler for data-driven racial bias mitigation strategies across multiple automate… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

  6. arXiv:2402.09966  [pdf, other

    cs.CV

    Textual Localization: Decomposing Multi-concept Images for Subject-Driven Text-to-Image Generation

    Authors: Junjie Shentu, Matthew Watson, Noura Al Moubayed

    Abstract: Subject-driven text-to-image diffusion models empower users to tailor the model to new concepts absent in the pre-training dataset using a few sample images. However, prevalent subject-driven models primarily rely on single-concept input images, facing challenges in specifying the target concept when dealing with multi-concept input images. To this end, we introduce a textual localized text-to-ima… ▽ More

    Submitted 15 February, 2024; originally announced February 2024.

  7. arXiv:2402.08183  [pdf, other

    cs.CL cs.CV

    Pixel Sentence Representation Learning

    Authors: Chenghao Xiao, Zhuoxu Huang, Danlu Chen, G Thomas Hudson, Yizhi Li, Haoran Duan, Chenghua Lin, Jie Fu, Jungong Han, Noura Al Moubayed

    Abstract: Pretrained language models are long known to be subpar in capturing sentence and document-level semantics. Though heavily investigated, transferring perturbation-based methods from unsupervised visual representation learning to NLP remains an unsolved problem. This is largely due to the discreteness of subword units brought by tokenization of language models, limiting small perturbations of inputs… ▽ More

    Submitted 12 February, 2024; originally announced February 2024.

  8. arXiv:2401.13478  [pdf, other

    cs.IR cs.CL cs.CV cs.MM

    SciMMIR: Benchmarking Scientific Multi-modal Information Retrieval

    Authors: Siwei Wu, Yizhi Li, Kang Zhu, Ge Zhang, Yiming Liang, Kaijing Ma, Chenghao Xiao, Haoran Zhang, Bohao Yang, Wenhu Chen, Wenhao Huang, Noura Al Moubayed, Jie Fu, Chenghua Lin

    Abstract: Multi-modal information retrieval (MMIR) is a rapidly evolving field, where significant progress, particularly in image-text pairing, has been made through advanced representation learning and cross-modality alignment research. However, current benchmarks for evaluating MMIR performance in image-text pairing within the scientific domain show a notable gap, where chart and table images described in… ▽ More

    Submitted 11 June, 2024; v1 submitted 24 January, 2024; originally announced January 2024.

    Comments: camera-ready version for ACL 2024 Findings

  9. arXiv:2310.16193  [pdf, other

    cs.CL cs.AI

    Length is a Curse and a Blessing for Document-level Semantics

    Authors: Chenghao Xiao, Yizhi Li, G Thomas Hudson, Chenghua Lin, Noura Al Moubayed

    Abstract: In recent years, contrastive learning (CL) has been extensively utilized to recover sentence and document-level encoding capability from pre-trained language models. In this work, we question the length generalizability of CL-based models, i.e., their vulnerability towards length-induced semantic shift. We verify not only that length vulnerability is a significant yet overlooked research gap, but… ▽ More

    Submitted 24 October, 2023; originally announced October 2023.

    Comments: Accepted at EMNLP 2023. Our code is publicly available at https://github.com/gowitheflow-1998/LA-SER-cubed

  10. arXiv:2309.11895  [pdf, other

    cs.SD cs.AI cs.CL eess.AS

    Audio Contrastive based Fine-tuning

    Authors: Yang Wang, Qibin Liang, Chenghao Xiao, Yizhi Li, Noura Al Moubayed, Chenghua Lin

    Abstract: Audio classification plays a crucial role in speech and sound processing tasks with a wide range of applications. There still remains a challenge of striking the right balance between fitting the model to the training data (avoiding overfitting) and enabling it to generalise well to a new domain. Leveraging the transferability of contrastive learning, we introduce Audio Contrastive-based Fine-tuni… ▽ More

    Submitted 19 October, 2023; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: Under review

  11. arXiv:2305.00817  [pdf, other

    cs.CV

    Racial Bias within Face Recognition: A Survey

    Authors: Seyma Yucer, Furkan Tektas, Noura Al Moubayed, Toby P. Breckon

    Abstract: Facial recognition is one of the most academically studied and industrially developed areas within computer vision where we readily find associated applications deployed globally. This widespread adoption has uncovered significant performance variation across subjects of different racial profiles leading to focused research attention on racial bias within face recognition spanning both current cau… ▽ More

    Submitted 1 May, 2023; originally announced May 2023.

  12. Language as a Latent Sequence: deep latent variable models for semi-supervised paraphrase generation

    Authors: Jialin Yu, Alexandra I. Cristea, Anoushka Harit, Zhongtian Sun, Olanrewaju Tahir Aduragba, Lei Shi, Noura Al Moubayed

    Abstract: This paper explores deep latent variable models for semi-supervised paraphrase generation, where the missing target pair for unlabelled data is modelled as a latent paraphrase sequence. We present a novel unsupervised model named variational sequence auto-encoding reconstruction (VSAR), which performs latent sequence inference given an observed text. To leverage information from text pairs, we add… ▽ More

    Submitted 8 September, 2023; v1 submitted 5 January, 2023; originally announced January 2023.

  13. arXiv:2212.09170  [pdf, other

    cs.CL cs.AI

    On Isotropy, Contextualization and Learning Dynamics of Contrastive-based Sentence Representation Learning

    Authors: Chenghao Xiao, Yang Long, Noura Al Moubayed

    Abstract: Incorporating contrastive learning objectives in sentence representation learning (SRL) has yielded significant improvements on many sentence-level NLP tasks. However, it is not well understood why contrastive learning works for learning sentence-level semantics. In this paper, we aim to help guide future designs of sentence representation learning methods by taking a closer look at contrastive SR… ▽ More

    Submitted 26 May, 2023; v1 submitted 18 December, 2022; originally announced December 2022.

    Comments: Accepted by ACL 2023 (Findings, long paper)

  14. arXiv:2211.01266  [pdf, other

    cs.LG cs.AI eess.SY

    Knowing the Past to Predict the Future: Reinforcement Virtual Learning

    Authors: Peng Zhang, Yawen Huang, Bingzhang Hu, Shizheng Wang, Haoran Duan, Noura Al Moubayed, Yefeng Zheng, Yang Long

    Abstract: Reinforcement Learning (RL)-based control system has received considerable attention in recent decades. However, in many real-world problems, such as Batch Process Control, the environment is uncertain, which requires expensive interaction to acquire the state and reward values. In this paper, we present a cost-efficient framework, such that the RL model can evolve for itself in a Virtual Space us… ▽ More

    Submitted 2 November, 2022; originally announced November 2022.

  15. arXiv:2209.01061  [pdf, other

    cs.CL cs.AI

    INTERACTION: A Generative XAI Framework for Natural Language Inference Explanations

    Authors: Jialin Yu, Alexandra I. Cristea, Anoushka Harit, Zhongtian Sun, Olanrewaju Tahir Aduragba, Lei Shi, Noura Al Moubayed

    Abstract: XAI with natural language processing aims to produce human-readable explanations as evidence for AI decision-making, which addresses explainability and transparency. However, from an HCI perspective, the current approaches only focus on delivering a single explanation, which fails to account for the diversity of human thoughts and experiences in language. This paper thus addresses this gap, by pro… ▽ More

    Submitted 2 September, 2022; originally announced September 2022.

  16. arXiv:2208.07613  [pdf, other

    cs.CV cs.CY cs.LG

    Does lossy image compression affect racial bias within face recognition?

    Authors: Seyma Yucer, Matt Poyser, Noura Al Moubayed, Toby P. Breckon

    Abstract: Yes - This study investigates the impact of commonplace lossy image compression on face recognition algorithms with regard to the racial characteristics of the subject. We adopt a recently proposed racial phenotype-based bias analysis methodology to measure the effect of varying levels of lossy compression across racial phenotype categories. Additionally, we determine the relationship between chro… ▽ More

    Submitted 16 August, 2022; originally announced August 2022.

  17. arXiv:2208.03824  [pdf, other

    cs.CV cs.LG

    Towards Graph Representation Learning Based Surgical Workflow Anticipation

    Authors: Xiatian Zhang, Noura Al Moubayed, Hubert P. H. Shum

    Abstract: Surgical workflow anticipation can give predictions on what steps to conduct or what instruments to use next, which is an essential part of the computer-assisted intervention system for surgery, e.g. workflow reasoning in robotic surgery. However, current approaches are limited to their insufficient expressive power for relationships between instruments. Hence, we propose a graph representation le… ▽ More

    Submitted 7 August, 2022; originally announced August 2022.

    Comments: Proceedings of the 2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI), 2022

  18. arXiv:2203.13004  [pdf, other

    cs.LG

    Using Orientation to Distinguish Overlapping Chromosomes

    Authors: Daniel Kluvanec, Thomas B. Phillips, Kenneth J. W. McCaffrey, Noura Al Moubayed

    Abstract: A difficult step in the process of karyotyping is segmenting chromosomes that touch or overlap. In an attempt to automate the process, previous studies turned to Deep Learning methods, with some formulating the task as a semantic segmentation problem. These models treat separate chromosome instances as semantic classes, which we show to be problematic, since it is uncertain which chromosome should… ▽ More

    Submitted 24 March, 2022; originally announced March 2022.

    Comments: Conference for Health, Inference, and Learning (CHIL) 2022 - Invited non-archival presentation

  19. arXiv:2202.07362  [pdf, other

    cs.CL cs.AI

    MuLD: The Multitask Long Document Benchmark

    Authors: G Thomas Hudson, Noura Al Moubayed

    Abstract: The impressive progress in NLP techniques has been driven by the development of multi-task benchmarks such as GLUE and SuperGLUE. While these benchmarks focus on tasks for one or two input sentences, there has been exciting work in designing efficient techniques for processing much longer inputs. In this paper, we present MuLD: a new long document benchmark consisting of only documents over 10,000… ▽ More

    Submitted 15 February, 2022; originally announced February 2022.

  20. Measuring Hidden Bias within Face Recognition via Racial Phenotypes

    Authors: Seyma Yucer, Furkan Tektas, Noura Al Moubayed, Toby P. Breckon

    Abstract: Recent work reports disparate performance for intersectional racial groups across face recognition tasks: face verification and identification. However, the definition of those racial groups has a significant impact on the underlying findings of such racial bias analysis. Previous studies define these groups based on either demographic information (e.g. African, Asian etc.) or skin tone (e.g. ligh… ▽ More

    Submitted 19 October, 2021; originally announced October 2021.

    Comments: published in IEEE Winter Conference on Applications of Computer Vision, WACV, 2022

  21. arXiv:2108.01589  [pdf, other

    cs.CL

    ExBERT: An External Knowledge Enhanced BERT for Natural Language Inference

    Authors: Amit Gajbhiye, Noura Al Moubayed, Steven Bradley

    Abstract: Neural language representation models such as BERT, pre-trained on large-scale unstructured corpora lack explicit grounding to real-world commonsense knowledge and are often unable to remember facts required for reasoning and inference. Natural Language Inference (NLI) is a challenging reasoning task that relies on common human understanding of language and real-world commonsense knowledge. We int… ▽ More

    Submitted 3 August, 2021; originally announced August 2021.

  22. arXiv:2106.02183  [pdf, other

    cs.CL cs.AI cs.LG

    Towards Equal Gender Representation in the Annotations of Toxic Language Detection

    Authors: Elizabeth Excell, Noura Al Moubayed

    Abstract: Classifiers tend to propagate biases present in the data on which they are trained. Hence, it is important to understand how the demographic identities of the annotators of comments affect the fairness of the resulting model. In this paper, we focus on the differences in the ways men and women annotate comments for toxicity, investigating how these differences result in models that amplify the opi… ▽ More

    Submitted 3 June, 2021; originally announced June 2021.

    Comments: Paper is accepted at GeBNLP2021 workshop at ACL-IJCNLP 2021

  23. arXiv:2105.06791  [pdf, other

    cs.LG

    Agree to Disagree: When Deep Learning Models With Identical Architectures Produce Distinct Explanations

    Authors: Matthew Watson, Bashar Awwad Shiekh Hasan, Noura Al Moubayed

    Abstract: Deep Learning of neural networks has progressively become more prominent in healthcare with models reaching, or even surpassing, expert accuracy levels. However, these success stories are tainted by concerning reports on the lack of model transparency and bias against some medical conditions or patients' sub-groups. Explainable methods are considered the gateway to alleviate many of these concerns… ▽ More

    Submitted 30 October, 2021; v1 submitted 14 May, 2021; originally announced May 2021.

    Comments: 9 pages, 5 figures, 3 tables

    ACM Class: I.2

  24. arXiv:2105.01959  [pdf, other

    cs.LG cs.CR

    Attack-agnostic Adversarial Detection on Medical Data Using Explainable Machine Learning

    Authors: Matthew Watson, Noura Al Moubayed

    Abstract: Explainable machine learning has become increasingly prevalent, especially in healthcare where explainable models are vital for ethical and trusted automated decision making. Work on the susceptibility of deep learning models to adversarial attacks has shown the ease of designing samples to mislead a model into making incorrect predictions. In this work, we propose a model agnostic explainability-… ▽ More

    Submitted 5 May, 2021; originally announced May 2021.

    Comments: 13 pages, 6 figures, accepted to ICPR 2020

    ACM Class: I.2; I.4

  25. Curvature-based Feature Selection with Application in Classifying Electronic Health Records

    Authors: Zheming Zuo, Jie Li, Han Xu, Noura Al Moubayed

    Abstract: Disruptive technologies provides unparalleled opportunities to contribute to the identifications of many aspects in pervasive healthcare, from the adoption of the Internet of Things through to Machine Learning (ML) techniques. As a powerful tool, ML has been widely applied in patient-centric healthcare solutions. To further improve the quality of patient care, Electronic Health Records (EHRs) are… ▽ More

    Submitted 30 November, 2021; v1 submitted 10 January, 2021; originally announced January 2021.

    Comments: Accepted by Technological Forecasting and Social Change; Source code available

  26. arXiv:2012.10285  [pdf, other

    cs.CV cs.CL

    Trying Bilinear Pooling in Video-QA

    Authors: Thomas Winterbottom, Sarah Xiao, Alistair McLean, Noura Al Moubayed

    Abstract: Bilinear pooling (BLP) refers to a family of operations recently developed for fusing features from different modalities predominantly developed for VQA models. A bilinear (outer-product) expansion is thought to encourage models to learn interactions between two feature spaces and has experimentally outperformed `simpler' vector operations (concatenation and element-wise-addition/multiplication) o… ▽ More

    Submitted 18 December, 2020; originally announced December 2020.

    Comments: 16 Pages, 8 Figures, 4 Tables, +Supp Mats

    MSC Class: 68T99 ACM Class: I.2.10; I.2.7

  27. arXiv:2012.10210  [pdf, other

    cs.CV cs.AI cs.CL

    On Modality Bias in the TVQA Dataset

    Authors: Thomas Winterbottom, Sarah Xiao, Alistair McLean, Noura Al Moubayed

    Abstract: TVQA is a large scale video question answering (video-QA) dataset based on popular TV shows. The questions were specifically designed to require "both vision and language understanding to answer". In this work, we demonstrate an inherent bias in the dataset towards the textual subtitle modality. We infer said bias both directly and indirectly, notably finding that models trained with subtitles lea… ▽ More

    Submitted 18 December, 2020; originally announced December 2020.

    Comments: 10 pages, 4 Figures, 2 Tables, +Supp Mats, BMVC 2020

    MSC Class: 68T99 ACM Class: I.2.10; I.2.7; I.2.4

  28. Bilinear Fusion of Commonsense Knowledge with Attention-Based NLI Models

    Authors: Amit Gajbhiye, Thomas Winterbottom, Noura Al Moubayed, Steven Bradley

    Abstract: We consider the task of incorporating real-world commonsense knowledge into deep Natural Language Inference (NLI) models. Existing external knowledge incorporation methods are limited to lexical level knowledge and lack generalization across NLI models, datasets, and commonsense knowledge sources. To address these issues, we propose a novel NLI model-independent neural framework, BiCAM. BiCAM inco… ▽ More

    Submitted 22 October, 2020; originally announced October 2020.

    Comments: Published in Lecture Notes in Computer Science, Springer International Publishing

  29. Using Variable Natural Environment Brain-Computer Interface Stimuli for Real-time Humanoid Robot Navigation

    Authors: Nik Khadijah Nik Aznan, Jason D. Connolly, Noura Al Moubayed, Toby P. Breckon

    Abstract: This paper addresses the challenge of humanoid robot teleoperation in a natural indoor environment via a Brain-Computer Interface (BCI). We leverage deep Convolutional Neural Network (CNN) based image and signal understanding to facilitate both real-time bject detection and dry-Electroencephalography (EEG) based human cortical brain bio-signals decoding. We employ recent advances in dry-EEG techno… ▽ More

    Submitted 1 March, 2019; v1 submitted 26 November, 2018; originally announced November 2018.

    Comments: Accepted as a full paper at the 2019 International Conference on Robotics and Automation (ICRA)

  30. arXiv:1810.08675  [pdf, other

    cs.DC cs.LG

    Using Machine Learning to reduce the energy wasted in Volunteer Computing Environments

    Authors: A. Stephen McGough, Matthew Forshaw, John Brennan, Noura Al Moubayed, Stephen Bonner

    Abstract: High Throughput Computing (HTC) provides a convenient mechanism for running thousands of tasks. Many HTC systems exploit computers which are provisioned for other purposes by utilising their idle time - volunteer computing. This has great advantages as it gives access to vast quantities of computational power for little or no cost. The downside is that running tasks are sacrificed if the computer… ▽ More

    Submitted 19 October, 2018; originally announced October 2018.

    Comments: Accepted for publication at THE 9th international Green and sustainable computing Conference, Technically Co-sponsored by IEEE Computer Society & STC Sustainable Computing, October 22-24, Pittsburgh, PA, USA

  31. An Exploration of Dropout with RNNs for Natural Language Inference

    Authors: Amit Gajbhiye, Sardar Jaf, Noura Al Moubayed, A. Stephen McGough, Steven Bradley

    Abstract: Dropout is a crucial regularization technique for the Recurrent Neural Network (RNN) models of Natural Language Inference (NLI). However, dropout has not been evaluated for the effectiveness at different layers and dropout rates in NLI models. In this paper, we propose a novel RNN model for NLI and empirically evaluate the effect of applying dropout at different layers in the model. We also invest… ▽ More

    Submitted 22 October, 2018; originally announced October 2018.

    Comments: Accepted in International Conference on Artificial Neural Networks, 2018

  32. arXiv:1805.04157  [pdf, other

    cs.HC eess.SP q-bio.NC

    On the Classification of SSVEP-Based Dry-EEG Signals via Convolutional Neural Networks

    Authors: Nik Khadijah Nik Aznan, Stephen Bonner, Jason D. Connolly, Noura Al Moubayed, Toby P. Breckon

    Abstract: In this paper, we propose a novel Convolutional Neural Network (CNN) approach for the classification of raw dry-EEG signals without any data pre-processing. To illustrate the effectiveness of our approach, we utilise the Steady State Visual Evoked Potential (SSVEP) paradigm as our use case. SSVEP can be utilised to allow people with severe physical disabilities such as Complete Locked-In Syndrome… ▽ More

    Submitted 2 August, 2018; v1 submitted 10 May, 2018; originally announced May 2018.

    Comments: Accepted as a full paper at the 2018 IEEE International Conference on Systems, Man, and Cybernetics (SMC2018)

  33. arXiv:1606.05554  [pdf, other

    cs.CL cs.LG cs.NE

    SMS Spam Filtering using Probabilistic Topic Modelling and Stacked Denoising Autoencoder

    Authors: Noura Al Moubayed, Toby Breckon, Peter Matthews, A. Stephen McGough

    Abstract: In This paper we present a novel approach to spam filtering and demonstrate its applicability with respect to SMS messages. Our approach requires minimum features engineering and a small set of la- belled data samples. Features are extracted using topic modelling based on latent Dirichlet allocation, and then a comprehensive data model is created using a Stacked Denoising Autoencoder (SDA). Topic… ▽ More

    Submitted 17 June, 2016; originally announced June 2016.

    Comments: Paper was accepted to the 25th International Conference on Artificial Neural Networks (ICANN 2016)