Zum Hauptinhalt springen

Showing 1–18 of 18 results for author: Pugeault, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.16701  [pdf, other

    cs.CV

    Detail-Enhanced Intra- and Inter-modal Interaction for Audio-Visual Emotion Recognition

    Authors: Tong Shi, Xuri Ge, Joemon M. Jose, Nicolas Pugeault, Paul Henderson

    Abstract: Capturing complex temporal relationships between video and audio modalities is vital for Audio-Visual Emotion Recognition (AVER). However, existing methods lack attention to local details, such as facial state changes between video frames, which can reduce the discriminability of features and thus lower recognition accuracy. In this paper, we propose a Detail-Enhanced Intra- and Inter-modal Intera… ▽ More

    Submitted 26 May, 2024; originally announced May 2024.

    Comments: Submitted to 27th International Conference of Pattern Recognition (ICPR 2024)

  2. arXiv:2403.19579  [pdf, other

    cs.CV

    The Bad Batches: Enhancing Self-Supervised Learning in Image Classification Through Representative Batch Curation

    Authors: Ozgu Goksu, Nicolas Pugeault

    Abstract: The pursuit of learning robust representations without human supervision is a longstanding challenge. The recent advancements in self-supervised contrastive learning approaches have demonstrated high performance across various representation learning challenges. However, current methods depend on the random transformation of training examples, resulting in some cases of unrepresentative positive p… ▽ More

    Submitted 28 March, 2024; originally announced March 2024.

    Comments: 8 Pages, 4 figures, IEEE WCCI 2024 Conference

  3. arXiv:2301.08330  [pdf, other

    eess.IV cs.CV

    The role of noise in denoising models for anomaly detection in medical images

    Authors: Antanas Kascenas, Pedro Sanchez, Patrick Schrempf, Chaoyang Wang, William Clackett, Shadia S. Mikhael, Jeremy P. Voisey, Keith Goatman, Alexander Weir, Nicolas Pugeault, Sotirios A. Tsaftaris, Alison Q. O'Neil

    Abstract: Pathological brain lesions exhibit diverse appearance in brain images, in terms of intensity, texture, shape, size, and location. Comprehensive sets of data and annotations are difficult to acquire. Therefore, unsupervised anomaly detection approaches have been proposed using only normal data for training, with the aim of detecting outlier anomalous voxels at test time. Denoising methods, for inst… ▽ More

    Submitted 19 January, 2023; originally announced January 2023.

    Comments: Submitted to Medical Image Analysis special issue for MIDL 2022

    MSC Class: 68T99; 92C55; 68U10

  4. arXiv:2201.10544  [pdf, other

    stat.ML cs.AI cs.LG stat.AP

    A deep mixture density network for outlier-corrected interpolation of crowd-sourced weather data

    Authors: Charlie Kirkwood, Theo Economou, Henry Odbert, Nicolas Pugeault

    Abstract: As the costs of sensors and associated IT infrastructure decreases - as exemplified by the Internet of Things - increasing volumes of observational data are becoming available for use by environmental scientists. However, as the number of available observation sites increases, so too does the opportunity for data quality issues to emerge, particularly given that many of these sensors do not have t… ▽ More

    Submitted 25 January, 2022; originally announced January 2022.

    Comments: 20 pages, 12 figures, not yet submitted

    MSC Class: 68T07; 62P12

  5. arXiv:2011.06978  [pdf, other

    cs.CV

    Transformer-Encoder Detector Module: Using Context to Improve Robustness to Adversarial Attacks on Object Detection

    Authors: Faisal Alamri, Sinan Kalkan, Nicolas Pugeault

    Abstract: Deep neural network approaches have demonstrated high performance in object recognition (CNN) and detection (Faster-RCNN) tasks, but experiments have shown that such architectures are vulnerable to adversarial attacks (FFF, UAP): low amplitude perturbations, barely perceptible by the human eye, can lead to a drastic reduction in labeling performance. This article proposes a new context module, cal… ▽ More

    Submitted 13 November, 2020; originally announced November 2020.

    Comments: Accepted for the 25th International Conference on Pattern Recognition (ICPR'2020)

  6. arXiv:2008.07320  [pdf, other

    stat.ML cs.LG stat.AP

    Bayesian deep learning for mapping via auxiliary information: a new era for geostatistics?

    Authors: Charlie Kirkwood, Theo Economou, Nicolas Pugeault

    Abstract: For geospatial modelling and mapping tasks, variants of kriging - the spatial interpolation technique developed by South African mining engineer Danie Krige - have long been regarded as the established geostatistical methods. However, kriging and its variants (such as regression kriging, in which auxiliary variables or derivatives of these are included as covariates) are relatively restrictive mod… ▽ More

    Submitted 8 September, 2020; v1 submitted 17 August, 2020; originally announced August 2020.

    Comments: 10 pages, 5 figures, version submitted to journal

    MSC Class: 86A32 (Primary) 68T45; 68T07 (Secondary)

  7. arXiv:2005.06613  [pdf, other

    stat.AP cs.LG stat.ML

    A framework for probabilistic weather forecast post-processing across models and lead times using machine learning

    Authors: Charlie Kirkwood, Theo Economou, Henry Odbert, Nicolas Pugeault

    Abstract: Forecasting the weather is an increasingly data intensive exercise. Numerical Weather Prediction (NWP) models are becoming more complex, with higher resolutions, and there are increasing numbers of different models in operation. While the forecasting skill of NWP models continues to improve, the number and complexity of these models poses a new challenge for the operational meteorologist: how shou… ▽ More

    Submitted 25 June, 2020; v1 submitted 6 May, 2020; originally announced May 2020.

    Comments: 17 pages, 9 figures, to be published in Philosophical Transactions of the Royal Society A

    MSC Class: 62P12 (primary) 68T37 (secondary)

  8. arXiv:2005.05509  [pdf, other

    cs.CV cs.LG eess.IV

    Real-time Facial Expression Recognition "In The Wild'' by Disentangling 3D Expression from Identity

    Authors: Mohammad Rami Koujan, Luma Alharbawee, Giorgos Giannakakis, Nicolas Pugeault, Anastasios Roussos

    Abstract: Human emotions analysis has been the focus of many studies, especially in the field of Affective Computing, and is important for many applications, e.g. human-computer intelligent interaction, stress analysis, interactive games, animations, etc. Solutions for automatic emotion analysis have also benefited from the development of deep learning approaches and the availability of vast amount of visua… ▽ More

    Submitted 11 May, 2020; originally announced May 2020.

    Comments: to be published in 15th IEEE International Conference on Automatic Face and Gesture Recognition (FG 2020)

  9. arXiv:2004.14231  [pdf, other

    cs.CV

    Image Captioning through Image Transformer

    Authors: Sen He, Wentong Liao, Hamed R. Tavakoli, Michael Yang, Bodo Rosenhahn, Nicolas Pugeault

    Abstract: Automatic captioning of images is a task that combines the challenges of image analysis and text generation. One important aspect in captioning is the notion of attention: How to decide what to describe and in which order. Inspired by the successes in text analysis and translation, previous work have proposed the \textit{transformer} architecture for image captioning. However, the structure betwee… ▽ More

    Submitted 2 October, 2020; v1 submitted 29 April, 2020; originally announced April 2020.

  10. arXiv:1906.02534  [pdf, other

    cs.CV cs.AI

    Contextual Relabelling of Detected Objects

    Authors: Faisal Alamri, Nicolas Pugeault

    Abstract: Contextual information, such as the co-occurrence of objects and the spatial and relative size among objects provides deep and complex information about scenes. It also can play an important role in improving object detection. In this work, we present two contextual models (rescoring and re-labeling models) that leverage contextual information (16 contextual relationships are applied in this paper… ▽ More

    Submitted 6 June, 2019; originally announced June 2019.

    Comments: Presented at the IEEE ICDL-Epirob'2019 conference, Oslo, Norway

  11. arXiv:1903.02501  [pdf, other

    cs.CV

    Understanding and Visualizing Deep Visual Saliency Models

    Authors: Sen He, Hamed R. Tavakoli, Ali Borji, Yang Mi, Nicolas Pugeault

    Abstract: Recently, data-driven deep saliency models have achieved high performance and have outperformed classical saliency models, as demonstrated by results on datasets such as the MIT300 and SALICON. Yet, there remains a large gap between the performance of these models and the inter-human baseline. Some outstanding questions include what have these models learned, how and where they fail, and how they… ▽ More

    Submitted 3 April, 2019; v1 submitted 6 March, 2019; originally announced March 2019.

    Comments: To appear in CVPR2019, camera ready version

  12. arXiv:1903.02499  [pdf, other

    cs.CV

    Human Attention in Image Captioning: Dataset and Analysis

    Authors: Sen He, Hamed R. Tavakoli, Ali Borji, Nicolas Pugeault

    Abstract: In this work, we present a novel dataset consisting of eye movements and verbal descriptions recorded synchronously over images. Using this data, we study the differences in human attention during free-viewing and image captioning tasks. We look into the relationship between human attention and language constructs during perception and sentence articulation. We also analyse attention deployment me… ▽ More

    Submitted 7 August, 2019; v1 submitted 6 March, 2019; originally announced March 2019.

    Comments: To appear at ICCV 2019

    Journal ref: IEEE International Conference on Computer Vision (ICCV 2019)

  13. arXiv:1901.06212  [pdf, other

    cs.LG cs.AI stat.ML

    On-Policy Trust Region Policy Optimisation with Replay Buffers

    Authors: Dmitry Kangin, Nicolas Pugeault

    Abstract: Building upon the recent success of deep reinforcement learning methods, we investigate the possibility of on-policy reinforcement learning improvement by reusing the data from several consecutive policies. On-policy methods bring many benefits, such as ability to evaluate each resulting policy. However, they usually discard all the information about the policies which existed before. In this work… ▽ More

    Submitted 18 January, 2019; originally announced January 2019.

  14. arXiv:1803.05785  [pdf, other

    cs.CV

    Aggregated Sparse Attention for Steering Angle Prediction

    Authors: Sen He, Dmitry Kangin, Yang Mi, Nicolas Pugeault

    Abstract: In this paper, we apply the attention mechanism to autonomous driving for steering angle prediction. We propose the first model, applying the recently introduced sparse attention mechanism to visual domain, as well as the aggregated extension for this model. We show the improvement of the proposed method, comparing to no attention as well as to different types of attention.

    Submitted 15 March, 2018; originally announced March 2018.

  15. arXiv:1803.05759  [pdf, other

    cs.CV

    Salient Region Segmentation

    Authors: Sen He, Nicolas Pugeault

    Abstract: Saliency prediction is a well studied problem in computer vision. Early saliency models were based on low-level hand-crafted feature derived from insights gained in neuroscience and psychophysics. In the wake of deep learning breakthrough, a new cohort of models were proposed based on neural network architectures, allowing significantly higher gaze prediction than previous shallow models, on all m… ▽ More

    Submitted 15 March, 2018; originally announced March 2018.

  16. arXiv:1803.05753  [pdf, other

    cs.CV

    What Catches the Eye? Visualizing and Understanding Deep Saliency Models

    Authors: Sen He, Ali Borji, Yang Mi, Nicolas Pugeault

    Abstract: Deep convolutional neural networks have demonstrated high performances for fixation prediction in recent years. How they achieve this, however, is less explored and they remain to be black box models. Here, we attempt to shed light on the internal structure of deep saliency models and study what features they extract for fixation prediction. Specifically, we use a simple yet powerful architecture,… ▽ More

    Submitted 22 March, 2018; v1 submitted 15 March, 2018; originally announced March 2018.

  17. arXiv:1801.04261  [pdf, other

    cs.CV

    Deep saliency: What is learnt by a deep network about saliency?

    Authors: Sen He, Nicolas Pugeault

    Abstract: Deep convolutional neural networks have achieved impressive performance on a broad range of problems, beating prior art on established benchmarks, but it often remains unclear what are the representations learnt by those systems and how they achieve such performance. This article examines the specific problem of saliency detection, where benchmarks are currently dominated by CNN-based approaches,… ▽ More

    Submitted 22 March, 2018; v1 submitted 12 January, 2018; originally announced January 2018.

    Comments: Accepted paper in 2nd Workshop on Visualisation for Deep Learning in the 34th International Conference On Machine Learning

    Journal ref: 2nd Workshop on Visualisation for Deep Learning, ICML 2017

  18. arXiv:1709.01500  [pdf, other

    cs.RO cs.CV

    SeDAR - Semantic Detection and Ranging: Humans can localise without LiDAR, can robots?

    Authors: Oscar Mendez, Simon Hadfield, Nicolas Pugeault, Richard Bowden

    Abstract: How does a person work out their location using a floorplan? It is probably safe to say that we do not explicitly measure depths to every visible surface and try to match them against different pose estimates in the floorplan. And yet, this is exactly how most robotic scan-matching algorithms operate. Similarly, we do not extrude the 2D geometry present in the floorplan into 3D and try to align it… ▽ More

    Submitted 2 May, 2018; v1 submitted 5 September, 2017; originally announced September 2017.