
Showing 1–17 of 17 results for author: Bull, D R

Searching in archive cs.
  1. arXiv:2407.03535  [pdf, other]

    cs.CV

    BVI-RLV: A Fully Registered Dataset and Benchmarks for Low-Light Video Enhancement

    Authors: Ruirui Lin, Nantheera Anantrasirichai, Guoxi Huang, Joanne Lin, Qi Sun, Alexandra Malyugina, David R Bull

    Abstract: Low-light videos often exhibit spatiotemporal incoherent noise, compromising visibility and performance in computer vision applications. One significant challenge in enhancing such content using deep learning is the scarcity of training data. This paper introduces a novel low-light video dataset, consisting of 40 scenes with various motion scenarios under two distinct low-lighting conditions, inco…

    Submitted 28 July, 2024; v1 submitted 3 July, 2024; originally announced July 2024.

    Comments: arXiv admin note: text overlap with arXiv:2402.01970

  2. arXiv:2405.05039  [pdf, other]

    cs.CV cs.MM

    Reviewing Intelligent Cinematography: AI research for camera-based video production

    Authors: Adrian Azzarelli, Nantheera Anantrasirichai, David R Bull

    Abstract: This paper offers a comprehensive review of artificial intelligence (AI) research in the context of real camera content acquisition for entertainment purposes and is aimed at both researchers and cinematographers. Considering the breadth of computer vision research and the lack of review papers tied to intelligent cinematography (IC), this review introduces a holistic view of the IC landscape whil…

    Submitted 8 May, 2024; originally announced May 2024.

    Comments: For researchers and cinematographers. 43 pages including Table of Contents, List of Figures and Tables. We obtained permission to use Figures 5 and 11. All other Figures have been drawn by us

  3. arXiv:2402.19041  [pdf, other]

    cs.CV cs.AI eess.IV

    Atmospheric Turbulence Removal with Video Sequence Deep Visual Priors

    Authors: P. Hill, N. Anantrasirichai, A. Achim, D. R. Bull

    Abstract: Atmospheric turbulence poses a challenge for the interpretation and visual perception of visual imagery due to its distortion effects. Model-based approaches have been used to address this, but such methods often suffer from artefacts associated with moving content. Conversely, deep learning based methods are dependent on large and diverse datasets that may not effectively represent any specific c…

    Submitted 29 February, 2024; originally announced February 2024.

  4. arXiv:2312.02218  [pdf, other]

    cs.CV cs.GR

    WavePlanes: A compact Wavelet representation for Dynamic Neural Radiance Fields

    Authors: Adrian Azzarelli, Nantheera Anantrasirichai, David R Bull

    Abstract: Dynamic Neural Radiance Fields (Dynamic NeRF) enhance NeRF technology to model moving scenes. However, they are resource intensive and challenging to compress. To address these issues, this paper presents WavePlanes, a fast and more compact explicit model. We propose a multi-scale space and space-time feature plane representation using N-level 2-D wavelet coefficients. The inverse discrete wavelet…

    Submitted 8 May, 2024; v1 submitted 3 December, 2023; originally announced December 2023.

  5. arXiv:2305.18079  [pdf, other]

    cs.CV cs.GR

    Towards a Robust Framework for NeRF Evaluation

    Authors: Adrian Azzarelli, Nantheera Anantrasirichai, David R Bull

    Abstract: Neural Radiance Field (NeRF) research has attracted significant attention recently, with 3D modelling, virtual/augmented reality, and visual effects driving its application. While current NeRF implementations can produce high quality visual results, there is a conspicuous lack of reliable methods for evaluating them. Conventional image quality assessment methods and analytical metrics (e.g. PSNR,…

    Submitted 31 May, 2023; v1 submitted 29 May, 2023; originally announced May 2023.

    Comments: 9 pages, 2 main experiments, 2 additional experiments

  6. arXiv:2110.06740  [pdf, other]

    eess.IV cs.CV cs.LG

    Transform and Bitstream Domain Image Classification

    Authors: P. R. Hill, D. R. Bull

    Abstract: Classification of images within the compressed domain offers significant benefits. These benefits include reduced memory and computational requirements of a classification system. This paper proposes two such methods as a proof of concept: The first classifies within the JPEG image transform domain (i.e. DCT transform data); the second classifies the JPEG compressed binary bitstream directly. Thes…

    Submitted 13 October, 2021; originally announced October 2021.

    Comments: 7 pages, 3 figures, one table

  7. arXiv:2110.06697  [pdf, other]

    cs.CV cs.AI

    Semantic Image Fusion

    Authors: P. R. Hill, D. R. Bull

    Abstract: Image fusion methods and metrics for their evaluation have conventionally used pixel-based or low-level features. However, for many applications, the aim of image fusion is to effectively combine the semantic content of the input images. This paper proposes a novel system for the semantic combination of visual content using pre-trained CNN network architectures. Our proposed semantic fusion is ini…

    Submitted 13 October, 2021; originally announced October 2021.

    Comments: 10 pages, 3 figures and 2 tables. To be submitted to IEEE Transactions on Image Processing

  8. arXiv:2106.08147  [pdf, other]

    eess.IV cs.CV cs.LG

    Perceptually-inspired super-resolution of compressed videos

    Authors: Di Ma, Mariana Afonso, Fan Zhang, David R. Bull

    Abstract: Spatial resolution adaptation is a technique which has often been employed in video compression to enhance coding efficiency. This approach encodes a lower resolution version of the input video and reconstructs the original resolution during decoding. Instead of using conventional up-sampling filters, recent work has employed advanced super-resolution methods based on convolutional neural networks…

    Submitted 15 June, 2021; originally announced June 2021.

  9. arXiv:2011.09190  [pdf, other]

    eess.IV cs.CV

    CVEGAN: A Perceptually-inspired GAN for Compressed Video Enhancement

    Authors: Di Ma, Fan Zhang, David R. Bull

    Abstract: We propose a new Generative Adversarial Network for Compressed Video quality Enhancement (CVEGAN). The CVEGAN generator benefits from the use of a novel Mul2Res block (with multiple levels of residual learning branches), an enhanced residual non-local block (ERNB) and an enhanced convolutional block attention module (ECBAM). The ERNB has also been employed in the discriminator to improve the repre…

    Submitted 26 November, 2020; v1 submitted 18 November, 2020; originally announced November 2020.

  10. Video Compression with CNN-based Post Processing

    Authors: Fan Zhang, Di Ma, Chen Feng, David R. Bull

    Abstract: In recent years, video compression techniques have been significantly challenged by the rapidly increased demands associated with high quality and immersive video content. Among various compression tools, post-processing can be applied on reconstructed video content to mitigate visible compression artefacts and to enhance overall perceptual quality. Inspired by advances in deep learning, we propos…

    Submitted 14 January, 2021; v1 submitted 16 September, 2020; originally announced September 2020.

  11. arXiv:2007.14726  [pdf, other]

    eess.IV cs.CV cs.LG cs.MM

    Video compression with low complexity CNN-based spatial resolution adaptation

    Authors: Di Ma, Fan Zhang, David R. Bull

    Abstract: It has recently been demonstrated that spatial resolution adaptation can be integrated within video compression to improve overall coding performance by spatially down-sampling before encoding and super-resolving at the decoder. Significant improvements have been reported when convolutional neural networks (CNNs) were used to perform the resolution up-sampling. However, this approach suffers from…

    Submitted 29 July, 2020; originally announced July 2020.

  12. arXiv:2007.07099  [pdf, other]

    eess.IV cs.CV cs.LG cs.MM

    MFRNet: A New CNN Architecture for Post-Processing and In-loop Filtering

    Authors: Di Ma, Fan Zhang, David R. Bull

    Abstract: In this paper, we propose a novel convolutional neural network (CNN) architecture, MFRNet, for post-processing (PP) and in-loop filtering (ILF) in the context of video compression. This network consists of four Multi-level Feature review Residual dense Blocks (MFRBs), which are connected using a cascading structure. Each MFRB extracts features from multiple convolutional layers using dense connect…

    Submitted 11 December, 2020; v1 submitted 14 July, 2020; originally announced July 2020.

  13. BVI-DVC: A Training Database for Deep Video Compression

    Authors: Di Ma, Fan Zhang, David R. Bull

    Abstract: Deep learning methods are increasingly being applied in the optimisation of video compression algorithms and can achieve significantly enhanced coding gains, compared to conventional approaches. Such approaches often employ Convolutional Neural Networks (CNNs) which are trained on databases with relatively limited content coverage. In this paper, a new extensive and representative video database,…

    Submitted 8 October, 2020; v1 submitted 30 March, 2020; originally announced March 2020.

  14. arXiv:2003.06637  [pdf, other]

    eess.IV cs.CV

    Fast Depth Estimation for View Synthesis

    Authors: Nantheera Anantrasirichai, Majid Geravand, David Braendler, David R. Bull

    Abstract: Disparity/depth estimation from sequences of stereo images is an important element in 3D vision. Owing to occlusions, imperfect settings and homogeneous luminance, accurate estimate of depth remains a challenging problem. Targeting view synthesis, we propose a novel learning-based framework making use of dilated convolution, densely connected convolutional modules, compact decoder and skip connec…

    Submitted 14 March, 2020; originally announced March 2020.

    Comments: 5 pages

  15. arXiv:1912.02305  [pdf, other]

    cs.LG eess.SP

    HABNet: Machine Learning, Remote Sensing Based Detection and Prediction of Harmful Algal Blooms

    Authors: P. R. Hill, A. Kumar, M. Temimi, D. R. Bull

    Abstract: This paper describes the application of machine learning techniques to develop a state-of-the-art detection and prediction system for spatiotemporal events found within remote sensing data; specifically, Harmful Algal Bloom events (HABs). We propose an HAB detection system based on: a ground truth historical record of HAB events, a novel spatiotemporal datacube representation of each event (from M…

    Submitted 16 April, 2020; v1 submitted 4 December, 2019; originally announced December 2019.

  16. ViSTRA2: Video Coding using Spatial Resolution and Effective Bit Depth Adaptation

    Authors: Fan Zhang, Mariana Afonso, David R. Bull

    Abstract: We present a new video compression framework (ViSTRA2) which exploits adaptation of spatial resolution and effective bit depth, down-sampling these parameters at the encoder based on perceptual criteria, and up-sampling at the decoder using a deep convolution neural network. ViSTRA2 has been integrated with the reference software of both the HEVC (HM 16.20) and VVC (VTM 4.01), and evaluated under…

    Submitted 7 November, 2019; originally announced November 2019.

    Comments: 9 pages

  17. Denoising Imaging Polarimetry by an Adapted BM3D Method

    Authors: Alexander B. Tibbs, Ilse M. Daly, Nicholas W. Roberts, David R. Bull

    Abstract: Imaging polarimetry allows more information to be extracted from a scene than conventional intensity or colour imaging. However, a major challenge of imaging polarimetry is image degradation due to noise. This paper investigates the mitigation of noise through denoising algorithms and compares existing denoising algorithms with a new method, based on BM3D. This algorithm, PBM3D, gives visual quali…

    Submitted 16 November, 2017; v1 submitted 13 November, 2017; originally announced November 2017.