Zum Hauptinhalt springen

Showing 1–50 of 80 results for author: O'Connor, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.00006  [pdf, other

    cs.DC cs.LG

    Synthetic Time Series for Anomaly Detection in Cloud Microservices

    Authors: Mohamed Allam, Noureddine Boujnah, Noel E. O'Connor, Mingming Liu

    Abstract: This paper proposes a framework for time series generation built to investigate anomaly detection in cloud microservices. In the field of cloud computing, ensuring the reliability of microservices is of paramount concern and yet a remarkably challenging task. Despite the large amount of research in this area, validation of anomaly detection algorithms in realistic environments is difficult to achi… ▽ More

    Submitted 21 July, 2024; originally announced August 2024.

    Comments: The paper has been accepted by the 10th International Conference on Machine Learning, Optimization and Data Science

  2. arXiv:2407.05528  [pdf, other

    cs.CV

    An accurate detection is not all you need to combat label noise in web-noisy datasets

    Authors: Paul Albert, Jack Valmadre, Eric Arazo, Tarun Krishna, Noel E. O'Connor, Kevin McGuinness

    Abstract: Training a classifier on web-crawled data demands learning algorithms that are robust to annotation errors and irrelevant examples. This paper builds upon the recent empirical observation that applying unsupervised contrastive learning to noisy, web-crawled datasets yields a feature representation under which the in-distribution (ID) and out-of-distribution (OOD) samples are linearly separable. We… ▽ More

    Submitted 7 July, 2024; originally announced July 2024.

    Comments: Accepted in the European Conference on Computer Vision (ECCV) 2024

  3. arXiv:2404.06941  [pdf, other

    eess.IV cs.CV

    Accelerating Cardiac MRI Reconstruction with CMRatt: An Attention-Driven Approach

    Authors: Anam Hashmi, Julia Dietlmeier, Kathleen M. Curran, Noel E. O'Connor

    Abstract: Cine cardiac magnetic resonance (CMR) imaging is recognised as the benchmark modality for the comprehensive assessment of cardiac function. Nevertheless, the acquisition process of cine CMR is considered as an impediment due to its prolonged scanning time. One commonly used strategy to expedite the acquisition process is through k-space undersampling, though it comes with a drawback of introducing… ▽ More

    Submitted 10 April, 2024; originally announced April 2024.

    Comments: This paper has been submitted for the 32nd European Signal Processing Conference EUSIPCO 2024 in Lyon

  4. arXiv:2404.06362  [pdf, other

    cs.CV cs.AI

    Test-Time Adaptation with SaLIP: A Cascade of SAM and CLIP for Zero shot Medical Image Segmentation

    Authors: Sidra Aleem, Fangyijie Wang, Mayug Maniparambil, Eric Arazo, Julia Dietlmeier, Guenole Silvestre, Kathleen Curran, Noel E. O'Connor, Suzanne Little

    Abstract: The Segment Anything Model (SAM) and CLIP are remarkable vision foundation models (VFMs). SAM, a prompt driven segmentation model, excels in segmentation tasks across diverse domains, while CLIP is renowned for its zero shot recognition capabilities. However, their unified potential has not yet been explored in medical image segmentation. To adapt SAM to medical imaging, existing methods primarily… ▽ More

    Submitted 30 April, 2024; v1 submitted 9 April, 2024; originally announced April 2024.

  5. arXiv:2402.09550  [pdf, other

    cs.LG cs.RO

    Dataset Clustering for Improved Offline Policy Learning

    Authors: Qiang Wang, Yixin Deng, Francisco Roldan Sanchez, Keru Wang, Kevin McGuinness, Noel O'Connor, Stephen J. Redmond

    Abstract: Offline policy learning aims to discover decision-making policies from previously-collected datasets without additional online interactions with the environment. As the training dataset is fixed, its quality becomes a crucial determining factor in the performance of the learned policy. This paper studies a dataset characteristic that we refer to as multi-behavior, indicating that the dataset is co… ▽ More

    Submitted 14 February, 2024; originally announced February 2024.

  6. arXiv:2401.05224  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Do Vision and Language Encoders Represent the World Similarly?

    Authors: Mayug Maniparambil, Raiymbek Akshulakov, Yasser Abdelaziz Dahou Djilali, Sanath Narayan, Mohamed El Amine Seddik, Karttikeya Mangalam, Noel E. O'Connor

    Abstract: Aligned text-image encoders such as CLIP have become the de facto model for vision-language tasks. Furthermore, modality-specific encoders achieve impressive performances in their respective domains. This raises a central question: does an alignment exist between uni-modal vision and language encoders since they fundamentally represent the same physical world? Analyzing the latent spaces structure… ▽ More

    Submitted 22 March, 2024; v1 submitted 10 January, 2024; originally announced January 2024.

    Comments: Accepted CVPR 2024

  7. arXiv:2312.07371  [pdf, other

    cs.LG cs.AI cs.CR physics.soc-ph

    Privacy-Aware Energy Consumption Modeling of Connected Battery Electric Vehicles using Federated Learning

    Authors: Sen Yan, Hongyuan Fang, Ji Li, Tomas Ward, Noel O'Connor, Mingming Liu

    Abstract: Battery Electric Vehicles (BEVs) are increasingly significant in modern cities due to their potential to reduce air pollution. Precise and real-time estimation of energy consumption for them is imperative for effective itinerary planning and optimizing vehicle systems, which can reduce driving range anxiety and decrease energy costs. As public awareness of data privacy increases, adopting approach… ▽ More

    Submitted 12 December, 2023; originally announced December 2023.

    Comments: This paper is accepted by IEEE Transactions on Transportation Electrification (TTE) on December 4, 2023. (13 pages, 6 figures, and 6 tables)

  8. arXiv:2311.16514  [pdf, other

    cs.CV cs.AI cs.LG

    Video Anomaly Detection via Spatio-Temporal Pseudo-Anomaly Generation : A Unified Approach

    Authors: Ayush K. Rai, Tarun Krishna, Feiyan Hu, Alexandru Drimbarean, Kevin McGuinness, Alan F. Smeaton, Noel E. O'Connor

    Abstract: Video Anomaly Detection (VAD) is an open-set recognition task, which is usually formulated as a one-class classification (OCC) problem, where training data is comprised of videos with normal instances while test data contains both normal and anomalous instances. Recent works have investigated the creation of pseudo-anomalies (PAs) using only the normal data and making strong assumptions about real… ▽ More

    Submitted 7 April, 2024; v1 submitted 27 November, 2023; originally announced November 2023.

    Comments: Accepted in CVPRW 2024 - VAND Workshop

  9. arXiv:2311.14073  [pdf, other

    cs.CV cs.AI

    Learning Saliency From Fixations

    Authors: Yasser Abdelaziz Dahou Djilali, Kevin McGuiness, Noel O'Connor

    Abstract: We present a novel approach for saliency prediction in images, leveraging parallel decoding in transformers to learn saliency solely from fixation maps. Models typically rely on continuous saliency maps, to overcome the difficulty of optimizing for the discrete fixation map. We attempt to replicate the experimental setup that generates saliency datasets. Our approach treats saliency prediction as… ▽ More

    Submitted 23 November, 2023; originally announced November 2023.

  10. arXiv:2310.01827  [pdf, other

    cs.RO cs.AI

    Learning and reusing primitive behaviours to improve Hindsight Experience Replay sample efficiency

    Authors: Francisco Roldan Sanchez, Qiang Wang, David Cordova Bulens, Kevin McGuinness, Stephen Redmond, Noel O'Connor

    Abstract: Hindsight Experience Replay (HER) is a technique used in reinforcement learning (RL) that has proven to be very efficient for training off-policy RL-based agents to solve goal-based robotic manipulation tasks using sparse rewards. Even though HER improves the sample efficiency of RL-based agents by learning from mistakes made in past experiences, it does not provide any guidance while exploring th… ▽ More

    Submitted 19 November, 2023; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: 6 pages, 2 figures, 1 algorithm, 1 table. Version accepted to ICARA 2024

  11. A Review on AI Algorithms for Energy Management in E-Mobility Services

    Authors: Sen Yan, Maqsood Hussain Shah, Ji Li, Noel O'Connor, Mingming Liu

    Abstract: E-mobility, or electric mobility, has emerged as a pivotal solution to address pressing environmental and sustainability concerns in the transportation sector. The depletion of fossil fuels, escalating greenhouse gas emissions, and the imperative to combat climate change underscore the significance of transitioning to electric vehicles (EVs). This paper seeks to explore the potential of artificial… ▽ More

    Submitted 26 September, 2023; originally announced September 2023.

    Comments: 8 pages, 4 tables, 1 figure

  12. arXiv:2307.12033  [pdf, other

    cs.CV

    Self-Supervised and Semi-Supervised Polyp Segmentation using Synthetic Data

    Authors: Enric Moreu, Eric Arazo, Kevin McGuinness, Noel E. O'Connor

    Abstract: Early detection of colorectal polyps is of utmost importance for their treatment and for colorectal cancer prevention. Computer vision techniques have the potential to aid professionals in the diagnosis stage, where colonoscopies are manually carried out to examine the entirety of the patient's colon. The main challenge in medical imaging is the lack of data, and a further challenge specific to po… ▽ More

    Submitted 22 July, 2023; originally announced July 2023.

  13. arXiv:2307.11661  [pdf, other

    cs.CV cs.AI cs.CL cs.LG

    Enhancing CLIP with GPT-4: Harnessing Visual Descriptions as Prompts

    Authors: Mayug Maniparambil, Chris Vorster, Derek Molloy, Noel Murphy, Kevin McGuinness, Noel E. O'Connor

    Abstract: Contrastive pretrained large Vision-Language Models (VLMs) like CLIP have revolutionized visual representation learning by providing good performance on downstream datasets. VLMs are 0-shot adapted to a downstream dataset by designing prompts that are relevant to the dataset. Such prompt engineering makes use of domain expertise and a validation dataset. Meanwhile, recent developments in generativ… ▽ More

    Submitted 8 August, 2023; v1 submitted 21 July, 2023; originally announced July 2023.

    Comments: Paper accepted at ICCV-W 2023. V2 contains additional comparisons with concurrent works

  14. Joint one-sided synthetic unpaired image translation and segmentation for colorectal cancer prevention

    Authors: Enric Moreu, Eric Arazo, Kevin McGuinness, Noel E. O'Connor

    Abstract: Deep learning has shown excellent performance in analysing medical images. However, datasets are difficult to obtain due privacy issues, standardization problems, and lack of annotations. We address these problems by producing realistic synthetic images using a combination of 3D technologies and generative adversarial networks. We propose CUT-seg, a joint training where a segmentation model and a… ▽ More

    Submitted 20 July, 2023; originally announced July 2023.

    Comments: arXiv admin note: substantial text overlap with arXiv:2202.08680

  15. arXiv:2305.10115  [pdf, other

    eess.IV cs.CV cs.LG

    An Ensemble Deep Learning Approach for COVID-19 Severity Prediction Using Chest CT Scans

    Authors: Sidra Aleem, Mayug Maniparambil, Suzanne Little, Noel O'Connor, Kevin McGuinness

    Abstract: Chest X-rays have been widely used for COVID-19 screening; however, 3D computed tomography (CT) is a more effective modality. We present our findings on COVID-19 severity prediction from chest CT scans using the STOIC dataset. We developed an ensemble deep learning based model that incorporates multiple neural networks to improve predictions. To address data imbalance, we used slicing functions an… ▽ More

    Submitted 17 May, 2023; originally announced May 2023.

  16. Fashion CUT: Unsupervised domain adaptation for visual pattern classification in clothes using synthetic data and pseudo-labels

    Authors: Enric Moreu, Alex Martinelli, Martina Naughton, Philip Kelly, Noel E. O'Connor

    Abstract: Accurate product information is critical for e-commerce stores to allow customers to browse, filter, and search for products. Product data quality is affected by missing or incorrect information resulting in poor customer experience. While machine learning can be used to correct inaccurate or missing information, achieving high performance on fashion image classification tasks requires large amoun… ▽ More

    Submitted 9 May, 2023; originally announced May 2023.

  17. arXiv:2301.13019  [pdf, other

    cs.RO cs.LG

    Identifying Expert Behavior in Offline Training Datasets Improves Behavioral Cloning of Robotic Manipulation Policies

    Authors: Qiang Wang, Robert McCarthy, David Cordova Bulens, Francisco Roldan Sanchez, Kevin McGuinness, Noel E. O'Connor, Stephen J. Redmond

    Abstract: This paper presents our solution for the Real Robot Challenge (RRC) III, a competition featured in the NeurIPS 2022 Competition Track, aimed at addressing dexterous robotic manipulation tasks through learning from pre-collected offline data. Participants were provided with two types of datasets for each task: expert and mixed datasets with varying skill levels. While the simplest offline policy le… ▽ More

    Submitted 21 September, 2023; v1 submitted 30 January, 2023; originally announced January 2023.

  18. arXiv:2301.11734  [pdf, other

    cs.LG cs.RO

    Improving Behavioural Cloning with Positive Unlabeled Learning

    Authors: Qiang Wang, Robert McCarthy, David Cordova Bulens, Kevin McGuinness, Noel E. O'Connor, Nico Gürtler, Felix Widmaier, Francisco Roldan Sanchez, Stephen J. Redmond

    Abstract: Learning control policies offline from pre-recorded datasets is a promising avenue for solving challenging real-world problems. However, available datasets are typically of mixed quality, with a limited number of the trajectories that we would consider as positive examples; i.e., high-quality demonstrations. Therefore, we propose a novel iterative learning algorithm for identifying expert trajecto… ▽ More

    Submitted 21 September, 2023; v1 submitted 27 January, 2023; originally announced January 2023.

  19. arXiv:2301.09164  [pdf, other

    cs.LG cs.CV

    Unifying Synergies between Self-supervised Learning and Dynamic Computation

    Authors: Tarun Krishna, Ayush K Rai, Alexandru Drimbarean, Eric Arazo, Paul Albert, Alan F Smeaton, Kevin McGuinness, Noel E O'Connor

    Abstract: Computationally expensive training strategies make self-supervised learning (SSL) impractical for resource constrained industrial settings. Techniques like knowledge distillation (KD), dynamic computation (DC), and pruning are often used to obtain a lightweightmodel, which usually involves multiple epochs of fine-tuning (or distilling steps) of a large pre-trained model, making it more computation… ▽ More

    Submitted 9 September, 2023; v1 submitted 22 January, 2023; originally announced January 2023.

    Comments: Accepted in BMVC 2023

  20. arXiv:2210.05574  [pdf, other

    cs.CV cs.AI cs.LG

    Motion Aware Self-Supervision for Generic Event Boundary Detection

    Authors: Ayush K. Rai, Tarun Krishna, Julia Dietlmeier, Kevin McGuinness, Alan F. Smeaton, Noel E. O'Connor

    Abstract: The task of Generic Event Boundary Detection (GEBD) aims to detect moments in videos that are naturally perceived by humans as generic and taxonomy-free event boundaries. Modeling the dynamically evolving temporal and spatial changes in a video makes GEBD a difficult problem to solve. Existing approaches involve very complex and sophisticated pipelines in terms of architectural design choices, hen… ▽ More

    Submitted 12 October, 2022; v1 submitted 11 October, 2022; originally announced October 2022.

    Comments: Accepted in IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023

  21. arXiv:2210.04578  [pdf, other

    cs.CV cs.LG

    Is your noise correction noisy? PLS: Robustness to label noise with two stage detection

    Authors: Paul Albert, Eric Arazo, Tarun Krishna, Noel E. O'Connor, Kevin McGuinness

    Abstract: Designing robust algorithms capable of training accurate neural networks on uncurated datasets from the web has been the subject of much research as it reduces the need for time consuming human labor. The focus of many previous research contributions has been on the detection of different types of label noise; however, this paper proposes to improve the correction accuracy of noisy samples once th… ▽ More

    Submitted 15 October, 2022; v1 submitted 10 October, 2022; originally announced October 2022.

    Comments: 9 pages 4 figures. Accepted at WACV 2023

  22. arXiv:2210.02476  [pdf, other

    cs.CV cs.AI cs.LG

    BaseTransformers: Attention over base data-points for One Shot Learning

    Authors: Mayug Maniparambil, Kevin McGuinness, Noel O'Connor

    Abstract: Few shot classification aims to learn to recognize novel categories using only limited samples per category. Most current few shot methods use a base dataset rich in labeled examples to train an encoder that is used for obtaining representations of support instances for novel classes. Since the test instances are from a distribution different to the base distribution, their feature representations… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: Paper accepted at British Machine Vision Conference 2022

  23. arXiv:2210.00795  [pdf, other

    cs.RO

    Hierarchical reinforcement learning for in-hand robotic manipulation using Davenport chained rotations

    Authors: Francisco Roldan Sanchez, Qiang Wang, David Cordova Bulens, Kevin McGuinness, Stephen Redmond, Noel O'Connor

    Abstract: End-to-end reinforcement learning techniques are among the most successful methods for robotic manipulation tasks. However, the training time required to find a good policy capable of solving complex tasks is prohibitively large. Therefore, depending on the computing resources available, it might not be feasible to use such techniques. The use of domain knowledge to decompose manipulation tasks in… ▽ More

    Submitted 15 November, 2022; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: 5 pages, 3 figures, 3 tables, submitted to ICARA 2023

  24. arXiv:2209.09714  [pdf, other

    eess.IV cs.CV

    Cardiac Segmentation using Transfer Learning under Respiratory Motion Artifacts

    Authors: Carles Garcia-Cabrera, Eric Arazo, Kathleen M. Curran, Noel E. O'Connor, Kevin McGuinness

    Abstract: Methods that are resilient to artifacts in the cardiac magnetic resonance imaging (MRI) while performing ventricle segmentation, are crucial for ensuring quality in structural and functional analysis of those tissues. While there has been significant efforts on improving the quality of the algorithms, few works have tackled the harm that the artifacts generate in the predictions. In this work, we… ▽ More

    Submitted 20 September, 2022; originally announced September 2022.

    Comments: accepted for the STACOM2022 workshop @ MICCAI2022

  25. arXiv:2209.08903  [pdf, other

    cs.RO

    Towards advanced robotic manipulation

    Authors: Francisco Roldan Sanchez, Stephen Redmond, Kevin McGuinness, Noel O'Connor

    Abstract: Robotic manipulation and control has increased in importance in recent years. However, state of the art techniques still have limitations when required to operate in real world applications. This paper explores Hindsight Experience Replay both in simulated and real environments, highlighting its weaknesses and proposing reinforcement-learning based alternatives based on reward and goal shaping. Ad… ▽ More

    Submitted 26 September, 2022; v1 submitted 19 September, 2022; originally announced September 2022.

    Comments: 4 pages, 1 figure, Submitted to PhD Workshop submission at IRC 2022. Updated figure 1

  26. arXiv:2207.12065  [pdf, other

    cs.CV

    Dynamic Channel Selection in Self-Supervised Learning

    Authors: Tarun Krishna, Ayush K. Rai, Yasser A. D. Djilali, Alan F. Smeaton, Kevin McGuinness, Noel E. O'Connor

    Abstract: Whilst computer vision models built using self-supervised approaches are now commonplace, some important questions remain. Do self-supervised models learn highly redundant channel features? What if a self-supervised network could dynamically select the important channels and get rid of the unnecessary ones? Currently, convnets pre-trained with self-supervision have obtained comparable performance… ▽ More

    Submitted 16 December, 2022; v1 submitted 25 July, 2022; originally announced July 2022.

    Comments: Accepted in Irish Machine Vision and Image Processing Conference 2022

  27. arXiv:2207.01573  [pdf, other

    cs.CV

    Embedding contrastive unsupervised features to cluster in- and out-of-distribution noise in corrupted image datasets

    Authors: Paul Albert, Eric Arazo, Noel E. O'Connor, Kevin McGuinness

    Abstract: Using search engines for web image retrieval is a tempting alternative to manual curation when creating an image dataset, but their main drawback remains the proportion of incorrect (noisy) samples retrieved. These noisy samples have been evidenced by previous works to be a mixture of in-distribution (ID) samples, assigned to the incorrect category but presenting similar visual semantics to other… ▽ More

    Submitted 18 July, 2022; v1 submitted 4 July, 2022; originally announced July 2022.

    Comments: Accepted at ECCV 2022

  28. Dexterous Robotic Manipulation using Deep Reinforcement Learning and Knowledge Transfer for Complex Sparse Reward-based Tasks

    Authors: Qiang Wang, Francisco Roldan Sanchez, Robert McCarthy, David Cordova Bulens, Kevin McGuinness, Noel O'Connor, Manuel Wüthrich, Felix Widmaier, Stefan Bauer, Stephen J. Redmond

    Abstract: This paper describes a deep reinforcement learning (DRL) approach that won Phase 1 of the Real Robot Challenge (RRC) 2021, and then extends this method to a more difficult manipulation task. The RRC consisted of using a TriFinger robot to manipulate a cube along a specified positional trajectory, but with no requirement for the cube to have any specific orientation. We used a relatively simple rew… ▽ More

    Submitted 27 January, 2023; v1 submitted 19 May, 2022; originally announced May 2022.

    Comments: This paper has been summited to Expert Systems: the Journal of Knowledge Engineering for reviewing. arXiv admin note: text overlap with arXiv:2109.15233

  29. arXiv:2204.09343  [pdf

    cs.CV

    Utilizing unsupervised learning to improve sward content prediction and herbage mass estimation

    Authors: Paul Albert, Mohamed Saadeldin, Badri Narayanan, Brian Mac Namee, Deirdre Hennessy, Aisling H. O'Connor, Noel E. O'Connor, Kevin McGuinness

    Abstract: Sward species composition estimation is a tedious one. Herbage must be collected in the field, manually separated into components, dried and weighed to estimate species composition. Deep learning approaches using neural networks have been used in previous work to propose faster and more cost efficient alternatives to this process by estimating the biomass information from a picture of an area of p… ▽ More

    Submitted 20 April, 2022; originally announced April 2022.

    Comments: 3 pages. Accepted at the 29th EGF General Meeting 2022

  30. arXiv:2204.08271  [pdf, other

    cs.CV

    Unsupervised domain adaptation and super resolution on drone images for autonomous dry herbage biomass estimation

    Authors: Paul Albert, Mohamed Saadeldin, Badri Narayanan, Jaime Fernandez, Brian Mac Namee, Deirdre Hennessey, Noel E. O'Connor, Kevin McGuinness

    Abstract: Herbage mass yield and composition estimation is an important tool for dairy farmers to ensure an adequate supply of high quality herbage for grazing and subsequently milk production. By accurately estimating herbage mass and composition, targeted nitrogen fertiliser application strategies can be deployed to improve localised regions in a herbage field, effectively reducing the negative impacts of… ▽ More

    Submitted 18 April, 2022; originally announced April 2022.

    Comments: 11 pages, 5 figures. Accepted at the Agriculture-Vision CVPR 2022 Workshop

  31. arXiv:2202.08680  [pdf, other

    eess.IV cs.CV

    Synthetic data for unsupervised polyp segmentation

    Authors: Enric Moreu, Kevin McGuinness, Noel E. O'Connor

    Abstract: Deep learning has shown excellent performance in analysing medical images. However, datasets are difficult to obtain due privacy issues, standardization problems, and lack of annotations. We address these problems by producing realistic synthetic images using a combination of 3D technologies and generative adversarial networks. We use zero annotations from medical professionals in our pipeline. Ou… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

  32. arXiv:2202.08670  [pdf, other

    cs.CV cs.AI

    Domain Randomization for Object Counting

    Authors: Enric Moreu, Kevin McGuinness, Diego Ortego, Noel E. O'Connor

    Abstract: Recently, the use of synthetic datasets based on game engines has been shown to improve the performance of several tasks in computer vision. However, these datasets are typically only appropriate for the specific domains depicted in computer games, such as urban scenes involving vehicles and people. In this paper, we present an approach to generate synthetic datasets for object counting for any do… ▽ More

    Submitted 17 February, 2022; originally announced February 2022.

  33. arXiv:2201.10243  [pdf, other

    cs.CV cs.LG

    BERTHA: Video Captioning Evaluation Via Transfer-Learned Human Assessment

    Authors: Luis Lebron, Yvette Graham, Kevin McGuinness, Konstantinos Kouramas, Noel E. O'Connor

    Abstract: Evaluating video captioning systems is a challenging task as there are multiple factors to consider; for instance: the fluency of the caption, multiple actions happening in a single scene, and the human bias of what is considered important. Most metrics try to measure how similar the system generated captions are to a single or a set of human-annotated captions. This paper presents a new method ba… ▽ More

    Submitted 16 May, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: In press in Language Resources and Evaluation Conference(LREC) 2022

  34. arXiv:2111.09056  [pdf, other

    cs.CV cs.CY cs.MM

    Improving Person Re-Identification with Temporal Constraints

    Authors: Julia Dietlmeier, Feiyan Hu, Frances Ryan, Noel E. O'Connor, Kevin McGuinness

    Abstract: In this paper we introduce an image-based person re-identification dataset collected across five non-overlapping camera views in the large and busy airport in Dublin, Ireland. Unlike all publicly available image-based datasets, our dataset contains timestamp information in addition to frame number, and camera and person IDs. Also our dataset has been fully anonymized to comply with modern data pri… ▽ More

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: 10 pages, RWS @ WACV2022

  35. arXiv:2110.14283  [pdf, other

    cs.CV

    How Important is Importance Sampling for Deep Budgeted Training?

    Authors: Eric Arazo, Diego Ortego, Paul Albert, Noel E. O'Connor, Kevin McGuinness

    Abstract: Long iterative training processes for Deep Neural Networks (DNNs) are commonly required to achieve state-of-the-art performance in many computer vision tasks. Importance sampling approaches might play a key role in budgeted training regimes, i.e. when limiting the number of training iterations. These approaches aim at dynamically estimating the importance of each sample to focus on the most releva… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.

    Comments: British Machine Vision Conference (BMVC) 2021, oral presentation

  36. arXiv:2110.13719  [pdf, other

    cs.CV

    Semi-supervised dry herbage mass estimation using automatic data and synthetic images

    Authors: Paul Albert, Mohamed Saadeldin, Badri Narayanan, Brian Mac Namee, Deirdre Hennessy, Aisling O'Connor, Noel O'Connor, Kevin McGuinness

    Abstract: Monitoring species-specific dry herbage biomass is an important aspect of pasture-based milk production systems. Being aware of the herbage biomass in the field enables farmers to manage surpluses and deficits in herbage supply, as well as using targeted nitrogen fertilization when necessary. Deep learning for computer vision is a powerful tool in this context as it can accurately estimate the dry… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

    Comments: Published at CVPPA 2021, ICCVW 2021

  37. arXiv:2110.13699  [pdf, other

    cs.CV

    Addressing out-of-distribution label noise in webly-labelled data

    Authors: Paul Albert, Diego Ortego, Eric Arazo, Noel O'Connor, Kevin McGuinness

    Abstract: A recurring focus of the deep learning community is towards reducing the labeling effort. Data gathering and annotation using a search engine is a simple alternative to generating a fully human-annotated and human-gathered dataset. Although web crawling is very time efficient, some of the retrieved images are unavoidably noisy, i.e. incorrectly labeled. Designing robust algorithms for training on… ▽ More

    Submitted 26 October, 2021; originally announced October 2021.

    Comments: Accepted at WACV 2022

  38. arXiv:2109.15233  [pdf, other

    cs.RO cs.LG

    Solving the Real Robot Challenge using Deep Reinforcement Learning

    Authors: Robert McCarthy, Francisco Roldan Sanchez, Qiang Wang, David Cordova Bulens, Kevin McGuinness, Noel O'Connor, Stephen J. Redmond

    Abstract: This paper details our winning submission to Phase 1 of the 2021 Real Robot Challenge; a challenge in which a three-fingered robot must carry a cube along specified goal trajectories. To solve Phase 1, we use a pure reinforcement learning approach which requires minimal expert knowledge of the robotic system, or of robotic grasping in general. A sparse, goal-based reward is employed in conjunction… ▽ More

    Submitted 27 June, 2022; v1 submitted 30 September, 2021; originally announced September 2021.

    Comments: Published in AICS 2021 (http://ceur-ws.org/Vol-3105/paper41.pdf). Paper updated to clarify procedure used to train the policy

  39. arXiv:2109.10957  [pdf, other

    cs.RO stat.AP

    Real Robot Challenge: A Robotics Competition in the Cloud

    Authors: Stefan Bauer, Felix Widmaier, Manuel Wüthrich, Annika Buchholz, Sebastian Stark, Anirudh Goyal, Thomas Steinbrenner, Joel Akpo, Shruti Joshi, Vincent Berenz, Vaibhav Agrawal, Niklas Funk, Julen Urain De Jesus, Jan Peters, Joe Watson, Claire Chen, Krishnan Srinivasan, Junwu Zhang, Jeffrey Zhang, Matthew R. Walter, Rishabh Madan, Charles Schaff, Takahiro Maeda, Takuma Yoneda, Denis Yarats , et al. (17 additional authors not shown)

    Abstract: Dexterous manipulation remains an open problem in robotics. To coordinate efforts of the research community towards tackling this problem, we propose a shared benchmark. We designed and built robotic platforms that are hosted at MPI for Intelligent Systems and can be accessed remotely. Each platform consists of three robotic fingers that are capable of dexterous object manipulation. Users are able… ▽ More

    Submitted 10 June, 2022; v1 submitted 22 September, 2021; originally announced September 2021.

  40. arXiv:2106.10090  [pdf, other

    cs.CV cs.AI

    Discerning Generic Event Boundaries in Long-Form Wild Videos

    Authors: Ayush K Rai, Tarun Krishna, Julia Dietlmeier, Kevin McGuinness, Alan F Smeaton, Noel E O'Connor

    Abstract: Detecting generic, taxonomy-free event boundaries invideos represents a major stride forward towards holisticvideo understanding. In this paper we present a technique forgeneric event boundary detection based on a two stream in-flated 3D convolutions architecture, which can learn spatio-temporal features from videos. Our work is inspired from theGeneric Event Boundary Detection Challenge (part of… ▽ More

    Submitted 18 June, 2021; originally announced June 2021.

    Comments: Technical Report for Generic Event Boundary Challenge - LOVEU Challenge (CVPR 2021)

  41. arXiv:2105.09460  [pdf, other

    cs.NI eess.SY

    Optimal Distributed Bandwidth Allocation in NB-IoT Networks

    Authors: Hongde Wu, Zhengyong Chen, Noel E. O'Connor, Mingming Liu

    Abstract: In this paper, we investigate a key problem of Narrowband-Internet of Things (NB-IoT) in the context of 5G with Mobile Edge Computing (MEC). We address the challenge that IoT devices may have different priorities when demanding bandwidth for data transmission in specific applications and services. Due to the scarcity of bandwidth in an MEC enabled IoT network, our objective is to optimize bandwidt… ▽ More

    Submitted 5 March, 2021; originally announced May 2021.

    Comments: The paper has been accepted by the 6th ACM/IEEE Conference on Internet of Things Design and Implementation

  42. arXiv:2105.01705  [pdf, other

    eess.IV cs.CV cs.LG cs.MM

    Attention-based Stylisation for Exemplar Image Colourisation

    Authors: Marc Gorriz Blanch, Issa Khalifeh, Alan Smeaton, Noel O'Connor, Marta Mrak

    Abstract: Exemplar-based colourisation aims to add plausible colours to a grayscale image using the guidance of a colour reference image. Most of the existing methods tackle the task as a style transfer problem, using a convolutional neural network (CNN) to obtain deep representations of the content of both inputs. Stylised outputs are then obtained by computing similarities between both feature representat… ▽ More

    Submitted 4 May, 2021; originally announced May 2021.

  43. Evaluating Contrastive Models for Instance-based Image Retrieval

    Authors: Tarun Krishna, Kevin McGuinness, Noel O'Connor

    Abstract: In this work, we evaluate contrastive models for the task of image retrieval. We hypothesise that models that are learned to encode semantic similarity among instances via discriminative learning should perform well on the task of image retrieval, where relevancy is defined in terms of instances of the same object. Through our extensive evaluation, we find that representations from models trained… ▽ More

    Submitted 30 April, 2021; originally announced April 2021.

    Comments: Accepted In Proceedings of the 2021 International Conference on Multimedia Retrieval (ICMR 21)

  44. arXiv:2104.10644  [pdf, other

    cs.LG eess.SY

    A Comparative Study of Using Spatial-Temporal Graph Convolutional Networks for Predicting Availability in Bike Sharing Schemes

    Authors: Zhengyong Chen, Hongde Wu, Noel E. O'Connor, Mingming Liu

    Abstract: Accurately forecasting transportation demand is crucial for efficient urban traffic guidance, control and management. One solution to enhance the level of prediction accuracy is to leverage graph convolutional networks (GCN), a neural network based modelling approach with the ability to process data contained in graph based structures. As a powerful extension of GCN, a spatial-temporal graph convo… ▽ More

    Submitted 6 July, 2021; v1 submitted 21 April, 2021; originally announced April 2021.

    Comments: This manuscript has been accepted at the IEEE ITSC 2021

  45. arXiv:2102.04993  [pdf, other

    eess.IV cs.CC cs.CV cs.LG cs.MM

    Attention-Based Neural Networks for Chroma Intra Prediction in Video Coding

    Authors: Marc Górriz, Saverio Blasi, Alan F. Smeaton, Noel E. O'Connor, Marta Mrak

    Abstract: Neural networks can be successfully used to improve several modules of advanced video coding schemes. In particular, compression of colour components was shown to greatly benefit from usage of machine learning models, thanks to the design of appropriate attention-based architectures that allow the prediction to exploit specific samples in the reference region. However, such architectures tend to b… ▽ More

    Submitted 9 February, 2021; originally announced February 2021.

    Journal ref: IEEE Journal of Selected Topics in Signal Processing, 2020

  46. arXiv:2012.15641  [pdf, other

    cs.MM cs.AI cs.CV

    Investigating Memorability of Dynamic Media

    Authors: Phuc H. Le-Khac, Ayush K. Rai, Graham Healy, Alan F. Smeaton, Noel E. O'Connor

    Abstract: The Predicting Media Memorability task in MediaEval'20 has some challenging aspects compared to previous years. In this paper we identify the high-dynamic content in videos and dataset of limited size as the core challenges for the task, we propose directions to overcome some of these challenges and we present our initial result in these directions.

    Submitted 31 December, 2020; originally announced December 2020.

    Comments: 3 pages, 1 figure. 1 table

    Journal ref: MediaEval Multimedia Benchmark Workshop Working Notes, 14-15 December 2020

  47. arXiv:2012.10283  [pdf, other

    cs.CV cs.LG

    Temporal Bilinear Encoding Network of Audio-Visual Features at Low Sampling Rates

    Authors: Feiyan Hu, Eva Mohedano, Noel O'Connor, Kevin McGuinness

    Abstract: Current deep learning based video classification architectures are typically trained end-to-end on large volumes of data and require extensive computational resources. This paper aims to exploit audio-visual information in video classification with a 1 frame per second sampling rate. We propose Temporal Bilinear Encoding Networks (TBEN) for encoding both audio and visual long range temporal inform… ▽ More

    Submitted 18 December, 2020; originally announced December 2020.

    Comments: 8 pages

  48. arXiv:2012.04462  [pdf, other

    cs.CV

    Multi-Objective Interpolation Training for Robustness to Label Noise

    Authors: Diego Ortego, Eric Arazo, Paul Albert, Noel E. O'Connor, Kevin McGuinness

    Abstract: Deep neural networks trained with standard cross-entropy loss memorize noisy labels, which degrades their performance. Most research to mitigate this memorization proposes new robust classification loss functions. Conversely, we propose a Multi-Objective Interpolation Training (MOIT) approach that jointly exploits contrastive learning and classification to mutually help each other and boost perfor… ▽ More

    Submitted 18 March, 2021; v1 submitted 8 December, 2020; originally announced December 2020.

    Comments: Accepted to CVPR 2021. 10 pages, 1 figure, and 9 tables

  49. arXiv:2011.10600  [pdf, other

    cs.CV cs.LG

    ATSal: An Attention Based Architecture for Saliency Prediction in 360 Videos

    Authors: Yasser Dahou, Marouane Tliba, Kevin McGuinness, Noel O'Connor

    Abstract: The spherical domain representation of 360 video/image presents many challenges related to the storage, processing, transmission and rendering of omnidirectional videos (ODV). Models of human visual attention can be used so that only a single viewport is rendered at a time, which is important when developing systems that allow users to explore ODV with head mounted displays (HMD). Accordingly, res… ▽ More

    Submitted 20 November, 2020; originally announced November 2020.

  50. arXiv:2011.07616  [pdf, other

    cs.SD cs.LG eess.AS

    Unsupervised Contrastive Learning of Sound Event Representations

    Authors: Eduardo Fonseca, Diego Ortego, Kevin McGuinness, Noel E. O'Connor, Xavier Serra

    Abstract: Self-supervised representation learning can mitigate the limitations in recognition tasks with few manually labeled data but abundant unlabeled data---a common scenario in sound event research. In this work, we explore unsupervised contrastive learning as a way to learn sound event representations. To this end, we propose to use the pretext task of contrasting differently augmented views of sound… ▽ More

    Submitted 15 November, 2020; originally announced November 2020.

    Comments: A 4-page version is submitted to ICASSP 2021