Skip to main content

Showing 1–24 of 24 results for author: Crowley, E J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2405.20838  [pdf, other

    cs.LG cs.AI cs.CV stat.ML

    einspace: Searching for Neural Architectures from Fundamental Operations

    Authors: Linus Ericsson, Miguel Espinosa, Chenhongyi Yang, Antreas Antoniou, Amos Storkey, Shay B. Cohen, Steven McDonagh, Elliot J. Crowley

    Abstract: Neural architecture search (NAS) finds high performing networks for a given task. Yet the results of NAS are fairly prosaic; they did not e.g. create a shift from convolutional structures to transformers. This is not least because the search spaces in NAS often aren't diverse enough to include such transformations a priori. Instead, for NAS to provide greater potential for fundamental design shift… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: Project page at https://linusericsson.github.io/einspace/

  2. arXiv:2404.06466  [pdf, other

    cs.LG stat.ML

    Hyperparameter Selection in Continual Learning

    Authors: Thomas L. Lee, Sigrid Passano Hellan, Linus Ericsson, Elliot J. Crowley, Amos Storkey

    Abstract: In continual learning (CL) -- where a learner trains on a stream of data -- standard hyperparameter optimisation (HPO) cannot be applied, as a learner does not have access to all of the data at the same time. This has prompted the development of CL-specific HPO frameworks. The most popular way to tune hyperparameters in CL is to repeatedly train over the whole data stream with different hyperparam… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

    Comments: Preprint, 9 pages

  3. arXiv:2403.18080  [pdf, other

    cs.CV

    EgoPoseFormer: A Simple Baseline for Egocentric 3D Human Pose Estimation

    Authors: Chenhongyi Yang, Anastasia Tkach, Shreyas Hampali, Linguang Zhang, Elliot J. Crowley, Cem Keskin

    Abstract: We present EgoPoseFormer, a simple yet effective transformer-based model for stereo egocentric human pose estimation. The main challenge in egocentric pose estimation is overcoming joint invisibility, which is caused by self-occlusion or a limited field of view (FOV) of head-mounted cameras. Our approach overcomes this challenge by incorporating a two-stage pose estimation paradigm: in the first s… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Tech Report

  4. arXiv:2403.17695  [pdf, other

    cs.CV cs.LG

    PlainMamba: Improving Non-Hierarchical Mamba in Visual Recognition

    Authors: Chenhongyi Yang, Zehui Chen, Miguel Espinosa, Linus Ericsson, Zhenyu Wang, Jiaming Liu, Elliot J. Crowley

    Abstract: We present PlainMamba: a simple non-hierarchical state space model (SSM) designed for general visual recognition. The recent Mamba model has shown how SSMs can be highly competitive with other architectures on sequential data and initial attempts have been made to apply it to images. In this paper, we further adapt the selective scanning process of Mamba to the visual domain, enhancing its ability… ▽ More

    Submitted 26 March, 2024; originally announced March 2024.

  5. arXiv:2401.03836  [pdf, other

    cs.CV

    WidthFormer: Toward Efficient Transformer-based BEV View Transformation

    Authors: Chenhongyi Yang, Tianwei Lin, Lichao Huang, Elliot J. Crowley

    Abstract: In this work, we present WidthFormer, a novel transformer-based Bird's-Eye-View (BEV) 3D detection method tailored for real-time autonomous-driving applications. WidthFormer is computationally efficient, robust and does not require any special engineering effort to deploy. In this work, we propose a novel 3D positional encoding mechanism capable of accurately encapsulating 3D geometric information… ▽ More

    Submitted 15 January, 2024; v1 submitted 8 January, 2024; originally announced January 2024.

  6. arXiv:2311.08909  [pdf, other

    cs.LG cs.CV cs.PF

    DLAS: An Exploration and Assessment of the Deep Learning Acceleration Stack

    Authors: Perry Gibson, José Cano, Elliot J. Crowley, Amos Storkey, Michael O'Boyle

    Abstract: Deep Neural Networks (DNNs) are extremely computationally demanding, which presents a large barrier to their deployment on resource-constrained devices. Since such devices are where many emerging deep learning applications lie (e.g., drones, vision-based medical technology), significant bodies of work from both the machine learning and systems communities have attempted to provide optimizations to… ▽ More

    Submitted 15 November, 2023; originally announced November 2023.

  7. arXiv:2308.16648  [pdf, other

    cs.CV cs.LG

    Generate Your Own Scotland: Satellite Image Generation Conditioned on Maps

    Authors: Miguel Espinosa, Elliot J. Crowley

    Abstract: Despite recent advancements in image generation, diffusion models still remain largely underexplored in Earth Observation. In this paper we show that state-of-the-art pretrained diffusion models can be conditioned on cartographic data to generate realistic satellite images. We provide two large datasets of paired OpenStreetMap images and satellite views over the region of Mainland Scotland and the… ▽ More

    Submitted 31 August, 2023; originally announced August 2023.

    Comments: 13 pages, 6 figures. preprint

  8. arXiv:2212.06795  [pdf, other

    cs.CV cs.LG

    GPViT: A High Resolution Non-Hierarchical Vision Transformer with Group Propagation

    Authors: Chenhongyi Yang, Jiarui Xu, Shalini De Mello, Elliot J. Crowley, Xiaolong Wang

    Abstract: We present the Group Propagation Vision Transformer (GPViT): a novel nonhierarchical (i.e. non-pyramidal) transformer model designed for general visual recognition with high-resolution features. High-resolution features (or tokens) are a natural fit for tasks that involve perceiving fine-grained details such as detection and segmentation, but exchanging global information between these features is… ▽ More

    Submitted 25 April, 2023; v1 submitted 13 December, 2022; originally announced December 2022.

    Comments: Accepted by ICLR 2023. Project Page: chenhongyiyang.com/projects/GPViT/GPViT. Code: https://github.com/ChenhongyiYang/GPViT

  9. arXiv:2211.11612  [pdf, other

    cs.CV cs.LG

    Plug and Play Active Learning for Object Detection

    Authors: Chenhongyi Yang, Lichao Huang, Elliot J. Crowley

    Abstract: Annotating datasets for object detection is an expensive and time-consuming endeavor. To minimize this burden, active learning (AL) techniques are employed to select the most informative samples for annotation within a constrained "annotation budget". Traditional AL strategies typically rely on model uncertainty or sample diversity for query sampling, while more advanced methods have focused on de… ▽ More

    Submitted 14 March, 2024; v1 submitted 21 November, 2022; originally announced November 2022.

    Comments: CVPR 2024

  10. arXiv:2203.05469  [pdf, other

    cs.CV cs.LG

    Prediction-Guided Distillation for Dense Object Detection

    Authors: Chenhongyi Yang, Mateusz Ochal, Amos Storkey, Elliot J. Crowley

    Abstract: Real-world object detection models should be cheap and accurate. Knowledge distillation (KD) can boost the accuracy of a small, cheap detection model by leveraging useful information from a larger teacher model. However, a key challenge is identifying the most informative features produced by the teacher for distillation. In this work, we show that only a very small fraction of features within a g… ▽ More

    Submitted 18 July, 2022; v1 submitted 10 March, 2022; originally announced March 2022.

    Comments: ECCV 2022

  11. arXiv:2111.13651  [pdf, other

    cs.CV

    Contrastive Object-level Pre-training with Spatial Noise Curriculum Learning

    Authors: Chenhongyi Yang, Lichao Huang, Elliot J. Crowley

    Abstract: The goal of contrastive learning based pre-training is to leverage large quantities of unlabeled data to produce a model that can be readily adapted downstream. Current approaches revolve around solving an image discrimination task: given an anchor image, an augmented counterpart of that image, and some other images, the model must produce representations such that the distance between the anchor… ▽ More

    Submitted 29 November, 2021; v1 submitted 26 November, 2021; originally announced November 2021.

  12. arXiv:2102.06599  [pdf, other

    cs.LG cs.PL

    Neural Architecture Search as Program Transformation Exploration

    Authors: Jack Turner, Elliot J. Crowley, Michael O'Boyle

    Abstract: Improving the performance of deep neural networks (DNNs) is important to both the compiler and neural architecture search (NAS) communities. Compilers apply program transformations in order to exploit hardware parallelism and memory hierarchy. However, legality concerns mean they fail to exploit the natural robustness of neural networks. In contrast, NAS techniques mutate networks by operations su… ▽ More

    Submitted 12 February, 2021; originally announced February 2021.

  13. arXiv:2006.09791  [pdf, other

    cs.LG cs.CV cs.DC stat.ML

    Optimizing Grouped Convolutions on Edge Devices

    Authors: Perry Gibson, José Cano, Jack Turner, Elliot J. Crowley, Michael O'Boyle, Amos Storkey

    Abstract: When deploying a deep neural network on constrained hardware, it is possible to replace the network's standard convolutions with grouped convolutions. This allows for substantial memory savings with minimal loss of accuracy. However, current implementations of grouped convolutions in modern deep learning frameworks are far from performing optimally in terms of speed. In this paper we propose Group… ▽ More

    Submitted 17 June, 2020; originally announced June 2020.

    Comments: Camera ready version to be published at ASAP 2020 - The 31st IEEE International Conference on Application-specific Systems, Architectures and Processors. 8 pages, 6 figures

    ACM Class: I.2.6; D.3.4; C.1.4

  14. arXiv:2006.04647  [pdf, other

    cs.LG cs.CV stat.ML

    Neural Architecture Search without Training

    Authors: Joseph Mellor, Jack Turner, Amos Storkey, Elliot J. Crowley

    Abstract: The time and effort involved in hand-designing deep neural networks is immense. This has prompted the development of Neural Architecture Search (NAS) techniques to automate this design. However, NAS algorithms tend to be slow and expensive; they need to train vast numbers of candidate networks to inform the search process. This could be alleviated if we could partially predict a network's trained… ▽ More

    Submitted 11 June, 2021; v1 submitted 8 June, 2020; originally announced June 2020.

    Comments: Accepted at ICML 2021 for a long presentation

  15. arXiv:2002.08697  [pdf, other

    cs.LG stat.ML

    Performance Aware Convolutional Neural Network Channel Pruning for Embedded GPUs

    Authors: Valentin Radu, Kuba Kaszyk, Yuan Wen, Jack Turner, Jose Cano, Elliot J. Crowley, Bjorn Franke, Amos Storkey, Michael O'Boyle

    Abstract: Convolutional Neural Networks (CNN) are becoming a common presence in many applications and services, due to their superior recognition accuracy. They are increasingly being used on mobile devices, many times just by porting large models designed for server space, although several model compression techniques have been considered. One model compression technique intended to reduce computations is… ▽ More

    Submitted 20 February, 2020; originally announced February 2020.

    Comments: A copy of this was published in IISWC'19

  16. arXiv:1910.05199  [pdf, other

    cs.LG stat.ML

    Bayesian Meta-Learning for the Few-Shot Setting via Deep Kernels

    Authors: Massimiliano Patacchiola, Jack Turner, Elliot J. Crowley, Michael O'Boyle, Amos Storkey

    Abstract: Recently, different machine learning methods have been introduced to tackle the challenging few-shot learning scenario that is, learning from a small labeled dataset related to a specific task. Common approaches have taken the form of meta-learning: learning to learn on the new problem given the old. Following the recognition that meta-learning is implementing learning in a multi-level model, we p… ▽ More

    Submitted 13 October, 2020; v1 submitted 11 October, 2019; originally announced October 2019.

    Comments: Advances in Neural Information Processing Systems (NeurIPS 2020, Spotlight)

  17. arXiv:1906.04113  [pdf, other

    cs.LG stat.ML

    BlockSwap: Fisher-guided Block Substitution for Network Compression on a Budget

    Authors: Jack Turner, Elliot J. Crowley, Michael O'Boyle, Amos Storkey, Gavin Gray

    Abstract: The desire to map neural networks to varying-capacity devices has led to the development of a wealth of compression techniques, many of which involve replacing standard convolutional blocks in a large network with cheap alternative blocks. However, not all blocks are created equally; for a required compute budget there may exist a potent combination of many different cheap blocks, though exhaustiv… ▽ More

    Submitted 23 January, 2020; v1 submitted 10 June, 2019; originally announced June 2019.

    Comments: ICLR 2020

  18. arXiv:1906.00859  [pdf, other

    stat.ML cs.LG

    Separable Layers Enable Structured Efficient Linear Substitutions

    Authors: Gavin Gray, Elliot J. Crowley, Amos Storkey

    Abstract: In response to the development of recent efficient dense layers, this paper shows that something as simple as replacing linear components in pointwise convolutions with structured linear decompositions also produces substantial gains in the efficiency/accuracy tradeoff. Pointwise convolutions are fully connected layers and are thus prepared for replacement by structured transforms. Networks using… ▽ More

    Submitted 3 June, 2019; originally announced June 2019.

  19. arXiv:1811.00410  [pdf, other

    stat.ML cs.LG

    Dilated DenseNets for Relational Reasoning

    Authors: Antreas Antoniou, Agnieszka Słowik, Elliot J. Crowley, Amos Storkey

    Abstract: Despite their impressive performance in many tasks, deep neural networks often struggle at relational reasoning. This has recently been remedied with the introduction of a plug-in relational module that considers relations between pairs of objects. Unfortunately, this is combinatorially expensive. In this extended abstract, we show that a DenseNet incorporating dilated convolutions excels at relat… ▽ More

    Submitted 1 November, 2018; originally announced November 2018.

    Comments: Extended Abstract

  20. arXiv:1810.10460  [pdf, other

    stat.ML cs.LG cs.PF

    Distilling with Performance Enhanced Students

    Authors: Jack Turner, Elliot J. Crowley, Valentin Radu, José Cano, Amos Storkey, Michael O'Boyle

    Abstract: The task of accelerating large neural networks on general purpose hardware has, in recent years, prompted the use of channel pruning to reduce network size. However, the efficacy of pruning based approaches has since been called into question. In this paper, we turn to distillation for model compression---specifically, attention transfer---and develop a simple method for discovering performance en… ▽ More

    Submitted 7 March, 2019; v1 submitted 24 October, 2018; originally announced October 2018.

    Comments: Preprint. Paper title has changed

  21. arXiv:1810.04622  [pdf, other

    stat.ML cs.CV cs.LG

    A Closer Look at Structured Pruning for Neural Network Compression

    Authors: Elliot J. Crowley, Jack Turner, Amos Storkey, Michael O'Boyle

    Abstract: Structured pruning is a popular method for compressing a neural network: given a large trained network, one alternates between removing channel connections and fine-tuning; reducing the overall width of the network. However, the efficacy of structured pruning has largely evaded scrutiny. In this paper, we examine ResNets and DenseNets obtained through structured pruning-and-tuning and make two int… ▽ More

    Submitted 7 June, 2019; v1 submitted 10 October, 2018; originally announced October 2018.

    Comments: Preprint. First two authors contributed equally. Paper title has changed

  22. arXiv:1810.03505  [pdf, other

    cs.CV cs.LG stat.ML

    CINIC-10 is not ImageNet or CIFAR-10

    Authors: Luke N. Darlow, Elliot J. Crowley, Antreas Antoniou, Amos J. Storkey

    Abstract: In this brief technical report we introduce the CINIC-10 dataset as a plug-in extended alternative for CIFAR-10. It was compiled by combining CIFAR-10 with images selected and downsampled from the ImageNet database. We present the approach to compiling the dataset, illustrate the example images for different classes, give pixel distributions for each part of the repository, and give some standard… ▽ More

    Submitted 2 October, 2018; originally announced October 2018.

    Comments: Dataset compilation, 9 pages, 11 figures, technical report

    Report number: EDI-INF-ANC-1802

  23. arXiv:1809.07196  [pdf, other

    stat.ML cs.CV cs.LG cs.PF

    Characterising Across-Stack Optimisations for Deep Convolutional Neural Networks

    Authors: Jack Turner, José Cano, Valentin Radu, Elliot J. Crowley, Michael O'Boyle, Amos Storkey

    Abstract: Convolutional Neural Networks (CNNs) are extremely computationally demanding, presenting a large barrier to their deployment on resource-constrained devices. Since such systems are where some of their most useful applications lie (e.g. obstacle detection for mobile robots, vision-based medical assistive technology), significant bodies of work from both machine learning and systems communities have… ▽ More

    Submitted 19 September, 2018; originally announced September 2018.

    Comments: IISWC 2018

  24. arXiv:1711.02613  [pdf, other

    stat.ML cs.CV cs.LG

    Moonshine: Distilling with Cheap Convolutions

    Authors: Elliot J. Crowley, Gavin Gray, Amos Storkey

    Abstract: Many engineers wish to deploy modern neural networks in memory-limited settings; but the development of flexible methods for reducing memory use is in its infancy, and there is little knowledge of the resulting cost-benefit. We propose structural model distillation for memory reduction using a strategy that produces a student architecture that is a simple transformation of the teacher architecture… ▽ More

    Submitted 17 January, 2019; v1 submitted 7 November, 2017; originally announced November 2017.

    Comments: 32nd Conference on Neural Information Processing Systems (NeurIPS 2018)