Showing 1–16 of 16 results for author: Kaul, C

  1. arXiv:2406.07251  [pdf, other]

    cs.CV cs.AI

    Is One GPU Enough? Pushing Image Generation at Higher-Resolutions with Foundation Models

    Authors: Athanasios Tragakis, Marco Aversa, Chaitanya Kaul, Roderick Murray-Smith, Daniele Faccio

    Abstract: In this work, we introduce Pixelsmith, a zero-shot text-to-image generative framework to sample images at higher resolutions with a single GPU. We are the first to show that it is possible to scale the output of a pre-trained diffusion model by a factor of 1000, opening the road for gigapixel image generation at no additional cost. Our cascading method uses the image generated at the lowest resolu…

    Submitted 12 June, 2024; v1 submitted 11 June, 2024; originally announced June 2024.

  2. GLFNET: Global-Local (frequency) Filter Networks for efficient medical image segmentation

    Authors: Athanasios Tragakis, Qianying Liu, Chaitanya Kaul, Swalpa Kumar Roy, Hang Dai, Fani Deligianni, Roderick Murray-Smith, Daniele Faccio

    Abstract: We propose a novel transformer-style architecture called Global-Local Filter Network (GLFNet) for medical image segmentation and demonstrate its state-of-the-art performance. We replace the self-attention mechanism with a combination of global-local filter blocks to optimize model efficiency. The global filters extract features from the whole feature map whereas the local filters are being adaptiv…

    Submitted 1 March, 2024; originally announced March 2024.

    Journal ref: 2024 IEEE International Symposium on Biomedical Imaging (ISBI)

  3. arXiv:2310.20168  [pdf, other]

    cs.LG physics.ao-ph physics.flu-dyn

    Understanding and Visualizing Droplet Distributions in Simulations of Shallow Clouds

    Authors: Justus C. Will, Andrea M. Jenney, Kara D. Lamb, Michael S. Pritchard, Colleen Kaul, Po-Lun Ma, Kyle Pressel, Jacob Shpund, Marcus van Lier-Walqui, Stephan Mandt

    Abstract: Thorough analysis of local droplet-level interactions is crucial to better understand the microphysical processes in clouds and their effect on the global climate. High-accuracy simulations of relevant droplet size distributions from Large Eddy Simulations (LES) of bin microphysics challenge current analysis techniques due to their high dimensionality involving three spatial dimensions, time, and…

    Submitted 31 October, 2023; originally announced October 2023.

    Comments: 4 pages, 3 figures, accepted at NeurIPS 2023 (Machine Learning and the Physical Sciences Workshop)

  4. arXiv:2309.07096  [pdf]

    q-bio.NC cs.CV eess.IV

    Computational limits to the legibility of the imaged human brain

    Authors: James K Ruffle, Robert J Gray, Samia Mohinta, Guilherme Pombo, Chaitanya Kaul, Harpreet Hyare, Geraint Rees, Parashkev Nachev

    Abstract: Our knowledge of the organisation of the human brain at the population-level is yet to translate into power to predict functional differences at the individual-level, limiting clinical applications, and casting doubt on the generalisability of inferred mechanisms. It remains unknown whether the difficulty arises from the absence of individuating biological patterns within the brain, or from limite…

    Submitted 2 April, 2024; v1 submitted 23 August, 2023; originally announced September 2023.

    Comments: 38 pages, 6 figures, 1 table, 2 supplementary figures, 1 supplementary table

  5. arXiv:2302.14625  [pdf, other]

    cs.LG eess.SP

    mmSense: Detecting Concealed Weapons with a Miniature Radar Sensor

    Authors: Kevin Mitchell, Khaled Kassem, Chaitanya Kaul, Valentin Kapitany, Philip Binner, Andrew Ramsay, Roderick Murray-Smith, Daniele Faccio

    Abstract: For widespread adoption, public security and surveillance systems must be accurate, portable, compact, and real-time, without impeding the privacy of the individuals being observed. Current systems broadly fall into two categories -- image-based which are accurate, but lack privacy, and RF signal-based, which preserve privacy but lack portability, compactness and accuracy. Our paper proposes mmSen…

    Submitted 28 February, 2023; originally announced February 2023.

    Comments: Accepted by ICASSP 2023

  6. arXiv:2302.14566  [pdf, other]

    cs.HC

    Continuous interaction with a smart speaker via low-dimensional embeddings of dynamic hand pose

    Authors: Songpei Xu, Chaitanya Kaul, Xuri Ge, Roderick Murray-Smith

    Abstract: This paper presents a new continuous interaction strategy with visual feedback of hand pose and mid-air gesture recognition and control for a smart music speaker, which utilizes only 2 video frames to recognize gestures. Frame-based hand pose features from MediaPipe Hands, containing 21 landmarks, are embedded into a 2 dimensional pose space by an autoencoder. The corresponding space for interacti…

    Submitted 28 February, 2023; originally announced February 2023.

    Comments: Accepted at ICASSP 2023

  7. Optimizing Vision Transformers for Medical Image Segmentation

    Authors: Qianying Liu, Chaitanya Kaul, Jun Wang, Christos Anagnostopoulos, Roderick Murray-Smith, Fani Deligianni

    Abstract: For medical image semantic segmentation (MISS), Vision Transformers have emerged as strong alternatives to convolutional neural networks thanks to their inherent ability to capture long-range correlations. However, existing research uses off-the-shelf vision Transformer blocks based on linear projections and feature processing which lack spatial and local context to refine organ boundaries. Furthe…

    Submitted 26 October, 2022; v1 submitted 14 October, 2022; originally announced October 2022.

  8. arXiv:2206.00566  [pdf]

    eess.IV cs.CV

    The Fully Convolutional Transformer for Medical Image Segmentation

    Authors: Athanasios Tragakis, Chaitanya Kaul, Roderick Murray-Smith, Dirk Husmeier

    Abstract: We propose a novel transformer model, capable of segmenting medical images of varying modalities. Challenges posed by the fine grained nature of medical image analysis mean that the adaptation of the transformer for their analysis is still at nascent stages. The overwhelming success of the UNet lay in its ability to appreciate the fine-grained nature of the segmentation task, an ability which exis…

    Submitted 29 January, 2023; v1 submitted 1 June, 2022; originally announced June 2022.

    Journal ref: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), 2023, pp. 3660-3669

  9. arXiv:2111.13023  [pdf, other]

    cs.CV cs.LG

    Rotation Equivariant 3D Hand Mesh Generation from a Single RGB Image

    Authors: Joshua Mitton, Chaitanya Kaul, Roderick Murray-Smith

    Abstract: We develop a rotation equivariant model for generating 3D hand meshes from 2D RGB images. This guarantees that as the input image of a hand is rotated the generated mesh undergoes a corresponding rotation. Furthermore, this removes undesirable deformations in the meshes often generated by methods without rotation equivariance. By building a rotation equivariant model, through considering symmetrie…

    Submitted 25 November, 2021; originally announced November 2021.

  10. arXiv:2111.10866  [pdf, other]

    cs.CV

    CpT: Convolutional Point Transformer for 3D Point Cloud Processing

    Authors: Chaitanya Kaul, Joshua Mitton, Hang Dai, Roderick Murray-Smith

    Abstract: We present CpT: Convolutional point Transformer - a novel deep learning architecture for dealing with the unstructured nature of 3D point cloud data. CpT is an improvement over existing attention-based Convolutional Neural Networks as well as previous 3D point cloud processing transformers. It achieves this feat due to its effectiveness in creating a novel and robust attention-based point set embed…

    Submitted 21 November, 2021; originally announced November 2021.

  11. arXiv:2107.01614  [pdf, other]

    cs.LG

    Survey: Leakage and Privacy at Inference Time

    Authors: Marija Jegorova, Chaitanya Kaul, Charlie Mayor, Alison Q. O'Neil, Alexander Weir, Roderick Murray-Smith, Sotirios A. Tsaftaris

    Abstract: Leakage of data from publicly available Machine Learning (ML) models is an area of growing significance as commercial and government applications of ML can draw on multiple sources of data, potentially including users' and clients' sensitive data. We provide a comprehensive survey of contemporary advances on several fronts, covering involuntary data leakage which is natural to ML models, potential…

    Submitted 9 September, 2022; v1 submitted 4 July, 2021; originally announced July 2021.

  12. arXiv:2104.03427  [pdf, other]

    cs.CV

    FatNet: A Feature-attentive Network for 3D Point Cloud Processing

    Authors: Chaitanya Kaul, Nick Pears, Suresh Manandhar

    Abstract: The application of deep learning to 3D point clouds is challenging due to its lack of order. Inspired by the point embeddings of PointNet and the edge embeddings of DGCNNs, we propose three improvements to the task of point cloud analysis. First, we introduce a novel feature-attentive neural network layer, a FAT layer, that combines both global point-based features and local edge-based features in…

    Submitted 7 April, 2021; originally announced April 2021.

    Comments: Published at ICPR 2020 (Oral). arXiv admin note: substantial text overlap with arXiv:1905.07650

  13. arXiv:1912.02079  [pdf, other]

    eess.IV cs.CV

    FocusNet++: Attentive Aggregated Transformations for Efficient and Accurate Medical Image Segmentation

    Authors: Chaitanya Kaul, Nick Pears, Hang Dai, Roderick Murray-Smith, Suresh Manandhar

    Abstract: We propose a new residual block for convolutional neural networks and demonstrate its state-of-the-art performance in medical image segmentation. We combine attention mechanisms with group convolutions to create our group attention mechanism, which forms the fundamental building block of our network, FocusNet++. We employ a hybrid loss based on balanced cross entropy, Tversky loss and the adaptive…

    Submitted 7 April, 2021; v1 submitted 4 December, 2019; originally announced December 2019.

    Comments: Published at ISBI 2021

  14. arXiv:1910.09717  [pdf, other]

    eess.IV cs.CV

    Penalizing small errors using an Adaptive Logarithmic Loss

    Authors: Chaitanya Kaul, Nick Pears, Hang Dai, Roderick Murray-Smith, Suresh Manandhar

    Abstract: Loss functions are error metrics that quantify the difference between a prediction and its corresponding ground truth. Fundamentally, they define a functional landscape for traversal by gradient descent. Although numerous loss functions have been proposed to date in order to handle various machine learning problems, little attention has been given to enhancing these functions to better traverse th…

    Submitted 7 April, 2021; v1 submitted 21 October, 2019; originally announced October 2019.

    Comments: Published at AIHA 2020 (ICPR 2020 Workshop)

  15. arXiv:1905.07650  [pdf, other]

    cs.CV

    SAWNet: A Spatially Aware Deep Neural Network for 3D Point Cloud Processing

    Authors: Chaitanya Kaul, Nick Pears, Suresh Manandhar

    Abstract: Deep neural networks have established themselves as the state-of-the-art methodology in almost all computer vision tasks to date. But their application to processing data lying on non-Euclidean domains is still a very active area of research. One such area is the analysis of point cloud data which poses a challenge due to its lack of order. Many recent techniques have been proposed, spearheaded by…

    Submitted 18 May, 2019; originally announced May 2019.

  16. arXiv:1902.03091  [pdf, other]

    cs.CV

    FocusNet: An attention-based Fully Convolutional Network for Medical Image Segmentation

    Authors: Chaitanya Kaul, Suresh Manandhar, Nick Pears

    Abstract: We propose a novel technique to incorporate attention within convolutional neural networks using feature maps generated by a separate convolutional autoencoder. Our attention architecture is well suited for incorporation with deep convolutional networks. We evaluate our model on benchmark segmentation datasets in skin cancer segmentation and lung lesion segmentation. Results show highly competitiv…

    Submitted 8 February, 2019; originally announced February 2019.