Showing 1–44 of 44 results for author: Gowda, S

Searching in archive cs.
  1. FE-Adapter: Adapting Image-based Emotion Classifiers to Videos

    Authors: Shreyank N Gowda, Boyan Gao, David A. Clifton

    Abstract: Utilizing large pre-trained models for specific tasks has yielded impressive results. However, fully fine-tuning these increasingly large models is becoming prohibitively resource-intensive. This has led to a focus on more parameter-efficient transfer learning, primarily within the same modality. But this approach has limitations, particularly in video understanding where suitable pre-trained mode…

    Submitted 5 August, 2024; originally announced August 2024.

  2. arXiv:2408.00181

    cs.CV

    CC-SAM: SAM with Cross-feature Attention and Context for Ultrasound Image Segmentation

    Authors: Shreyank N Gowda, David A. Clifton

    Abstract: The Segment Anything Model (SAM) has achieved remarkable successes in the realm of natural image segmentation, but its deployment in the medical imaging sphere has encountered challenges. Specifically, the model struggles with medical images that feature low contrast, faint boundaries, intricate morphologies, and small-sized objects. To address these challenges and enhance SAM's performance in the…

    Submitted 31 July, 2024; originally announced August 2024.

    Comments: Accepted to ECCV 2024

  3. arXiv:2407.16264

    cs.CV

    Masks and Manuscripts: Advancing Medical Pre-training with End-to-End Masking and Narrative Structuring

    Authors: Shreyank N Gowda, David A. Clifton

    Abstract: Contemporary medical contrastive learning faces challenges from inconsistent semantics and sample pair morphology, leading to dispersed and converging semantic shifts. The variability in text reports, due to multiple authors, complicates semantic consistency. To tackle these issues, we propose a two-step approach. Initially, text reports are converted into a standardized triplet format, laying the…

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: Accepted in MICCAI-24

  4. arXiv:2406.02929

    cs.CV cs.LG

    Exploring Data Efficiency in Zero-Shot Learning with Diffusion Models

    Authors: Zihan Ye, Shreyank N. Gowda, Xiaobo Jin, Xiaowei Huang, Haotian Xu, Yaochu Jin, Kaizhu Huang

    Abstract: Zero-Shot Learning (ZSL) aims to enable classifiers to identify unseen classes by enhancing data efficiency at the class level. This is achieved by generating image features from pre-defined semantics of unseen classes. However, most current approaches heavily depend on the number of samples from seen classes, i.e. they do not consider instance-level effectiveness. In this paper, we demonstrate th…

    Submitted 5 June, 2024; originally announced June 2024.

  5. arXiv:2404.09752

    cs.CV cs.AI cs.LG

    Can We Break Free from Strong Data Augmentations in Self-Supervised Learning?

    Authors: Shruthi Gowda, Elahe Arani, Bahram Zonooz

    Abstract: Self-supervised learning (SSL) has emerged as a promising solution for addressing the challenge of limited labeled data in deep neural networks (DNNs), offering scalability potential. However, the impact of design dependencies within the SSL framework remains insufficiently investigated. In this study, we comprehensively explore SSL behavior across a spectrum of augmentations, revealing their cruc…

    Submitted 15 April, 2024; originally announced April 2024.

  6. arXiv:2402.10240

    cs.LG cs.AI eess.SY

    A Dynamical View of the Question of Why

    Authors: Mehdi Fatemi, Sindhu Gowda

    Abstract: We address causal reasoning in multivariate time series data generated by stochastic processes. Existing approaches are largely restricted to static settings, ignoring the continuity and emission of variations across time. In contrast, we propose a learning paradigm that directly establishes causation between events in the course of time. We present two key lemmas to compute causal contributions a…

    Submitted 27 February, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

    Comments: Accepted at the Twelfth International Conference on Learning Representations (ICLR'24)

  7. arXiv:2401.17883

    cs.CV

    Reimagining Reality: A Comprehensive Survey of Video Inpainting Techniques

    Authors: Shreyank N Gowda, Yash Thakre, Shashank Narayana Gowda, Xiaobo Jin

    Abstract: This paper offers a comprehensive analysis of recent advancements in video inpainting techniques, a critical subset of computer vision and artificial intelligence. As a process that restores or fills in missing or corrupted portions of video sequences with plausible content, video inpainting has evolved significantly with the advent of deep learning methodologies. Despite the plethora of existing…

    Submitted 31 January, 2024; originally announced January 2024.

  8. arXiv:2401.14948

    cs.LG cs.AI cs.CV

    Conserve-Update-Revise to Cure Generalization and Robustness Trade-off in Adversarial Training

    Authors: Shruthi Gowda, Bahram Zonooz, Elahe Arani

    Abstract: Adversarial training improves the robustness of neural networks against adversarial attacks, albeit at the expense of the trade-off between standard and robust generalization. To unveil the underlying factors driving this phenomenon, we examine the layer-wise learning capabilities of neural networks during the transition from a standard to an adversarial setting. Our empirical findings demonstrate…

    Submitted 26 January, 2024; originally announced January 2024.

    Comments: Accepted as a conference paper at ICLR 2024

  9. arXiv:2401.11406

    cs.CV

    Adversarial Augmentation Training Makes Action Recognition Models More Robust to Realistic Video Distribution Shifts

    Authors: Kiyoon Kim, Shreyank N Gowda, Panagiotis Eustratiadis, Antreas Antoniou, Robert B Fisher

    Abstract: Despite recent advances in video action recognition achieving strong performance on existing benchmarks, these models often lack robustness when faced with natural distribution shifts between training and test data. We propose two novel evaluation methods to assess model resilience to such distribution disparity. One method uses two different datasets collected from different sources and uses one…

    Submitted 21 January, 2024; originally announced January 2024.

  10. arXiv:2310.11341

    cs.CV cs.AI cs.LG

    Dual Cognitive Architecture: Incorporating Biases and Multi-Memory Systems for Lifelong Learning

    Authors: Shruthi Gowda, Bahram Zonooz, Elahe Arani

    Abstract: Artificial neural networks (ANNs) exhibit a narrow scope of expertise on stationary independent data. However, the data in the real world is continuous and dynamic, and ANNs must adapt to novel scenarios while also retaining the learned knowledge to become lifelong learners. The ability of humans to excel at these tasks can be attributed to multiple factors ranging from cognitive computational str…

    Submitted 17 October, 2023; originally announced October 2023.

    Comments: Published in Transactions on Machine Learning Research (TMLR)

  11. arXiv:2310.06522

    cs.LG cs.CV

    Watt For What: Rethinking Deep Learning's Energy-Performance Relationship

    Authors: Shreyank N Gowda, Xinyue Hao, Gen Li, Laura Sevilla-Lara, Shashank Narayana Gowda

    Abstract: Deep learning models have revolutionized various fields, from image recognition to natural language processing, by achieving unprecedented levels of accuracy. However, their increasing energy consumption has raised concerns about their environmental impact, disadvantaging smaller entities in research and exacerbating global energy consumption. In this paper, we explore the trade-off between model…

    Submitted 10 October, 2023; originally announced October 2023.

  12. arXiv:2309.17327

    cs.CV

    Telling Stories for Common Sense Zero-Shot Action Recognition

    Authors: Shreyank N Gowda, Laura Sevilla-Lara

    Abstract: Video understanding has long suffered from reliance on large labeled datasets, motivating research into zero-shot learning. Recent progress in language modeling presents opportunities to advance zero-shot video analysis, but constructing an effective semantic space relating action classes remains challenging. We address this by introducing a novel dataset, Stories, which contains rich textual desc…

    Submitted 29 September, 2023; originally announced September 2023.

  13. arXiv:2309.01390

    cs.CV cs.AI

    Bridging the Projection Gap: Overcoming Projection Bias Through Parameterized Distance Learning

    Authors: Chong Zhang, Mingyu Jin, Qinkai Yu, Haochen Xue, Shreyank N Gowda, Xiaobo Jin

    Abstract: Generalized zero-shot learning (GZSL) aims to recognize samples from both seen and unseen classes using only seen class samples for training. However, GZSL methods are prone to bias towards seen classes during inference due to the projection function being learned from seen classes. Most methods focus on learning an accurate projection, but bias in the projection is inevitable. We address this pro…

    Submitted 2 April, 2024; v1 submitted 4 September, 2023; originally announced September 2023.

    Comments: 18 pages, 9 figures

  14. arXiv:2308.16041

    cs.CV

    From Pixels to Portraits: A Comprehensive Survey of Talking Head Generation Techniques and Applications

    Authors: Shreyank N Gowda, Dheeraj Pandey, Shashank Narayana Gowda

    Abstract: Recent advancements in deep learning and computer vision have led to a surge of interest in generating realistic talking heads. This paper presents a comprehensive survey of state-of-the-art methods for talking head generation. We systematically categorise them into four main approaches: image-driven, audio-driven, video-driven and others (including neural radiance fields (NeRF), and 3D-based met…

    Submitted 30 August, 2023; originally announced August 2023.

  15. arXiv:2306.04822

    cs.CV

    Optimizing ViViT Training: Time and Memory Reduction for Action Recognition

    Authors: Shreyank N Gowda, Anurag Arnab, Jonathan Huang

    Abstract: In this paper, we address the challenges posed by the substantial training time and memory consumption associated with video transformers, focusing on the ViViT (Video Vision Transformer) model, in particular the Factorised Encoder version, as our baseline for action recognition tasks. The factorised encoder variant follows the late-fusion approach that is adopted by many state of the art approach…

    Submitted 7 June, 2023; originally announced June 2023.

  16. arXiv:2304.06935

    cs.MS cs.SC math.AC

    Groebner.jl: A package for Gröbner bases computations in Julia

    Authors: Alexander Demin, Shashi Gowda

    Abstract: We present Groebner.jl, a Julia package for computing Groebner bases with the F4 algorithm. Groebner.jl is an efficient, portable, and open-source software. Groebner.jl works over integers modulo a prime and over the rationals, supports basic multi-threading, and specializes in computation in the degree reverse lexicographical monomial ordering. The implementation incorporates various symbolic com…

    Submitted 12 February, 2024; v1 submitted 14 April, 2023; originally announced April 2023.

    Comments: 10 pages

  17. arXiv:2304.06672

    cs.CV cs.AI

    LSFSL: Leveraging Shape Information in Few-shot Learning

    Authors: Deepan Chakravarthi Padmanabhan, Shruthi Gowda, Elahe Arani, Bahram Zonooz

    Abstract: Few-shot learning (FSL) techniques seek to learn the underlying patterns in data using fewer samples, analogous to how humans learn from limited experience. In this limited-data scenario, the challenges associated with deep neural networks, such as shortcut learning and texture bias behaviors, are further exacerbated. Moreover, the significance of addressing shortcut learning is not yet fully expl…

    Submitted 13 April, 2023; originally announced April 2023.

    Comments: Accepted at CVPR 2023 (2nd Workshop on Learning with Limited Labelled Data for Image and Video Understanding)

  18. arXiv:2304.02846

    cs.CV

    Synthetic Sample Selection for Generalized Zero-Shot Learning

    Authors: Shreyank N Gowda

    Abstract: Generalized Zero-Shot Learning (GZSL) has emerged as a pivotal research domain in computer vision, owing to its capability to recognize objects that have not been seen during training. Despite the significant progress achieved by generative techniques in converting traditional GZSL to fully supervised learning, they tend to generate a large number of synthetic features that are often redundant, th…

    Submitted 5 April, 2023; originally announced April 2023.

    Comments: Paper accepted in CVPRW 2023

  19. arXiv:2211.05229

    cs.CV eess.IV

    Automatic Number Plate Recognition (ANPR) with YOLOv3-CNN

    Authors: Rajdeep Adak, Abhishek Kumbhar, Rajas Pathare, Sagar Gowda

    Abstract: We present a YOLOv3-CNN pipeline for detecting vehicles, segregation of number plates, and local storage of final recognized characters. Vehicle identification is performed under various image correction schemes to determine the effect of environmental factors (angle of perception, luminosity, motion-blurring, and multi-line custom font etc.). A YOLOv3 object detection model was trained to identif…

    Submitted 7 November, 2022; originally announced November 2022.

    Comments: 29 pages, 4 figures, 2 tables

  20. arXiv:2209.15501

    cs.CV

    A Closer Look at Temporal Ordering in the Segmentation of Instructional Videos

    Authors: Anil Batra, Shreyank N Gowda, Frank Keller, Laura Sevilla-Lara

    Abstract: Understanding the steps required to perform a task is an important skill for AI systems. Learning these steps from instructional videos involves two subproblems: (i) identifying the temporal boundary of sequentially occurring segments and (ii) summarizing these steps in natural language. We refer to this task as Procedure Segmentation and Summarization (PSS). In this paper, we take a closer look a…

    Submitted 7 October, 2022; v1 submitted 30 September, 2022; originally announced September 2022.

    Comments: Accepted at BMVC 2022

  21. arXiv:2208.10895

    cs.CV cs.AI

    A Comprehensive Study of Real-Time Object Detection Networks Across Multiple Domains: A Survey

    Authors: Elahe Arani, Shruthi Gowda, Ratnajit Mukherjee, Omar Magdy, Senthilkumar Kathiresan, Bahram Zonooz

    Abstract: Deep neural network based object detectors are continuously evolving and are used in a multitude of applications, each having its own set of requirements. While safety-critical applications need high accuracy and reliability, low-latency tasks need resource and energy-efficient networks. Real-time detectors, which are a necessity in high-impact real-world applications, are continuously proposed, b…

    Submitted 14 February, 2023; v1 submitted 23 August, 2022; originally announced August 2022.

    Comments: Published in Transactions on Machine Learning Research (TMLR) with Survey Certification

    Journal ref: Transactions on Machine Learning Research, 2022

  22. arXiv:2206.05846

    cs.CV cs.AI cs.LG

    InBiaseD: Inductive Bias Distillation to Improve Generalization and Robustness through Shape-awareness

    Authors: Shruthi Gowda, Bahram Zonooz, Elahe Arani

    Abstract: Humans rely less on spurious correlations and trivial cues, such as texture, compared to deep neural networks which lead to better generalization and robustness. It can be attributed to the prior knowledge or the high-level cognitive inductive bias present in the brain. Therefore, introducing meaningful inductive bias to neural networks can help learn more generic and high-level representations an…

    Submitted 12 June, 2022; originally announced June 2022.

    Comments: Accepted at 1st Conference on Lifelong Learning Agents (CoLLAs 2022)

  23. arXiv:2206.04790

    cs.CV

    Learn2Augment: Learning to Composite Videos for Data Augmentation in Action Recognition

    Authors: Shreyank N Gowda, Marcus Rohrbach, Frank Keller, Laura Sevilla-Lara

    Abstract: We address the problem of data augmentation for video action recognition. Standard augmentation strategies in video are hand-designed and sample the space of possible augmented data points either at random, without knowing which augmented points will be better, or through heuristics. We propose to learn what makes a good video for action recognition and select only high-quality samples for augment…

    Submitted 23 July, 2022; v1 submitted 9 June, 2022; originally announced June 2022.

    Comments: Accepted to ECCV-2022

  24. arXiv:2201.12468

    cs.SC

    Symbolic-Numeric Integration of Univariate Expressions based on Sparse Regression

    Authors: Shahriar Iravanian, Carl Julius Martensen, Alessandro Cheli, Shashi Gowda, Anand Jain, Yingbo Ma, Chris Rackauckas

    Abstract: Most computer algebra systems (CAS) support symbolic integration as core functionality. The majority of the integration packages use a combination of heuristic algebraic and rule-based (integration table) methods. In this paper, we present a hybrid (symbolic-numeric) methodology to calculate the indefinite integrals of univariate expressions. The primary motivation for this work is to add symbolic…

    Submitted 6 February, 2022; v1 submitted 28 January, 2022; originally announced January 2022.

    Comments: 8 pages. submitted to ISSAC 2022. Code at https://github.com/SciML/SymbolicNumericIntegration.jl

    ACM Class: I.1.0; I.1.2

  25. arXiv:2201.10394

    cs.CV

    Capturing Temporal Information in a Single Frame: Channel Sampling Strategies for Action Recognition

    Authors: Kiyoon Kim, Shreyank N Gowda, Oisin Mac Aodha, Laura Sevilla-Lara

    Abstract: We address the problem of capturing temporal information for video classification in 2D networks, without increasing their computational cost. Existing approaches focus on modifying the architecture of 2D networks (e.g. by including filters in the temporal dimension to turn them into 3D networks, or using optical flow, etc.), which increases computation cost. Instead, we propose a novel sampling s…

    Submitted 10 October, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: BMVC 2022

  26. arXiv:2111.09243

    cs.HC

    An Investigation into Keystroke Dynamics and Heart Rate Variability as Indicators of Stress

    Authors: Srijith Unni, Sushma Suryanarayana Gowda, Alan F. Smeaton

    Abstract: Lifelogging has become a prominent research topic in recent years. Wearable sensors like Fitbits and smart watches are now increasingly popular for recording one's activities. Some researchers are also exploring keystroke dynamics for lifelogging. Keystroke dynamics refers to the process of measuring and assessing a person's typing rhythm on digital devices. A digital footprint is created when a use…

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: 12 pages. To appear at MMM 2022, 28th International Conference on Multimedia Modeling, 5-8 April 2022, Phu Quoc, Vietnam

  27. arXiv:2111.05191

    cs.CV cs.AI cs.LG eess.IV

    Does Thermal data make the detection systems more reliable?

    Authors: Shruthi Gowda, Bahram Zonooz, Elahe Arani

    Abstract: Deep learning-based detection networks have made remarkable progress in autonomous driving systems (ADS). ADS should have reliable performance across a variety of ambient lighting and adverse weather conditions. However, luminance degradation and visual obstructions (such as glare, fog) result in poor quality images by the visual camera which leads to performance decline. To overcome these challen…

    Submitted 9 November, 2021; originally announced November 2021.

    Comments: Accepted at NeurIPS 2021 - ML4AD workshop (The code for this research is available at: https://github.com/NeurAI-Lab/MMC)

  28. Pulling Up by the Causal Bootstraps: Causal Data Augmentation for Pre-training Debiasing

    Authors: Sindhu C. M. Gowda, Shalmali Joshi, Haoran Zhang, Marzyeh Ghassemi

    Abstract: Machine learning models achieve state-of-the-art performance on many supervised learning tasks. However, prior evidence suggests that these models may learn to rely on shortcut biases or spurious correlations (intuitively, correlations that do not hold in the test as they hold in train) for good predictive performance. Such models cannot be trusted in deployment environments to provide accurate pr…

    Submitted 27 August, 2021; originally announced August 2021.

    Comments: Published in CIKM 2021

  29. arXiv:2107.13029

    cs.CV

    A New Split for Evaluating True Zero-Shot Action Recognition

    Authors: Shreyank N Gowda, Laura Sevilla-Lara, Kiyoon Kim, Frank Keller, Marcus Rohrbach

    Abstract: Zero-shot action recognition is the task of classifying action categories that are not available in the training set. In this setting, the standard evaluation protocol is to use existing action recognition datasets (e.g. UCF101) and randomly split the classes into seen and unseen. However, most recent work builds on representations pre-trained on the Kinetics dataset, where classes largely overlap…

    Submitted 13 September, 2021; v1 submitted 27 July, 2021; originally announced July 2021.

    Comments: Accepted to GCPR 2021

  30. arXiv:2107.00443

    cs.RO

    Test Framework for a Virtual Competition Testbed

    Authors: Liam Wellacott, Emilyann Nault, Ioannis Skottis, Alexandre Colle, Shreyank N Gowda, Pierre Nicolay, Emily Rolley-Parnell

    Abstract: Virtual environments have been utilised in robotics research as a tool to assess systems before deploying them in the field. The COVID-19 pandemic has brought about additional motivation for the development of virtual benchmarks in order to aid in safe and productive development. In-person robotics competitions have also halted, thus limiting the scope of opportunities for students and researchers…

    Submitted 1 July, 2021; originally announced July 2021.

  31. arXiv:2106.02567

    cs.CV cs.AI cs.LG

    AI Driven Road Maintenance Inspection

    Authors: Ratnajit Mukherjee, Haris Iqbal, Shabbir Marzban, Ahmed Badar, Terence Brouns, Shruthi Gowda, Elahe Arani, Bahram Zonooz

    Abstract: Road infrastructure maintenance inspection is typically a labour-intensive and critical task to ensure the safety of all the road users. In this work, we propose a detailed methodology to use state-of-the-art techniques in artificial intelligence and computer vision to automate a sizeable portion of the maintenance inspection subtasks and reduce the labour costs. The proposed methodology uses stat…

    Submitted 4 June, 2021; originally announced June 2021.

    Comments: accepted at 27th ITS World Congress, 2021

  32. arXiv:2105.05946

    cs.CE

    Composing Modeling and Simulation with Machine Learning in Julia

    Authors: Chris Rackauckas, Ranjan Anantharaman, Alan Edelman, Shashi Gowda, Maja Gwozdz, Anand Jain, Chris Laughman, Yingbo Ma, Francesco Martinuzzi, Avik Pal, Utkarsh Rajput, Elliot Saba, Viral B. Shah

    Abstract: In this paper we introduce JuliaSim, a high-performance programming environment designed to blend traditional modeling and simulation with machine learning. JuliaSim can build accelerated surrogates from component-based models, such as those conforming to the FMI standard, using continuous-time echo state networks (CTESN). The foundation of this environment, ModelingToolkit.jl, is an acausal model…

    Submitted 12 May, 2021; originally announced May 2021.

  33. arXiv:2105.03949

    cs.CL cs.MS cs.PL cs.SC

    High-performance symbolic-numerics via multiple dispatch

    Authors: Shashi Gowda, Yingbo Ma, Alessandro Cheli, Maja Gwozdz, Viral B. Shah, Alan Edelman, Christopher Rackauckas

    Abstract: As mathematical computing becomes more democratized in high-level languages, high-performance symbolic-numeric systems are necessary for domain scientists and engineers to get the best performance out of their machine without deep knowledge of code optimization. Naturally, users need different term types either to have different algebraic properties for them, or to use efficient data structures. T…

    Submitted 5 February, 2022; v1 submitted 9 May, 2021; originally announced May 2021.

    ACM Class: D.3.3; I.1.1; I.1.3

  34. arXiv:2103.05244

    cs.MS cs.SC cs.SE

    ModelingToolkit: A Composable Graph Transformation System For Equation-Based Modeling

    Authors: Yingbo Ma, Shashi Gowda, Ranjan Anantharaman, Chris Laughman, Viral Shah, Chris Rackauckas

    Abstract: Getting good performance out of numerical equation solvers requires that the user has provided stable and efficient functions representing their model. However, users should not be trusted to write good code. In this manuscript we describe ModelingToolkit (MTK), a symbolic equation-based modeling system which allows for composable transformations to generate stable, efficient, and parallelized mod…

    Submitted 9 February, 2022; v1 submitted 9 March, 2021; originally announced March 2021.

    Comments: 10 pages, 3 figures, 1 table

  35. arXiv:2101.07042

    cs.CV

    CLASTER: Clustering with Reinforcement Learning for Zero-Shot Action Recognition

    Authors: Shreyank N Gowda, Laura Sevilla-Lara, Frank Keller, Marcus Rohrbach

    Abstract: Zero-shot action recognition is the task of recognizing action classes without visual examples, only with a semantic embedding which relates unseen to seen classes. The problem can be seen as learning a function which generalizes well to instances of unseen classes without losing discrimination between classes. Neural networks can model the complex boundaries between visual classes, which explain…

    Submitted 23 July, 2022; v1 submitted 18 January, 2021; originally announced January 2021.

    Comments: Accepted to ECCV-22

  36. arXiv:2012.10671

    cs.CV

    SMART Frame Selection for Action Recognition

    Authors: Shreyank N Gowda, Marcus Rohrbach, Laura Sevilla-Lara

    Abstract: Action recognition is computationally expensive. In this paper, we address the problem of frame selection to improve the accuracy of action recognition. In particular, we show that selecting good frames helps in action recognition performance even in the trimmed videos domain. Recent work has successfully leveraged frame selection for long, untrimmed videos, where much of the content is not releva…

    Submitted 19 December, 2020; originally announced December 2020.

    Comments: To be published in AAAI-21

  37. arXiv:2010.04004

    cs.LG math.DS

    Accelerating Simulation of Stiff Nonlinear Systems using Continuous-Time Echo State Networks

    Authors: Ranjan Anantharaman, Yingbo Ma, Shashi Gowda, Chris Laughman, Viral Shah, Alan Edelman, Chris Rackauckas

    Abstract: Modern design, control, and optimization often requires simulation of highly nonlinear models, leading to prohibitive computational costs. These costs can be amortized by evaluating a cheap surrogate of the full model. Here we present a general data-driven method, the continuous-time echo state network (CTESN), for generating surrogates of nonlinear ordinary differential equations with dynamics at…

    Submitted 24 March, 2021; v1 submitted 7 October, 2020; originally announced October 2020.

  38. arXiv:2005.13039

    cs.CV

    ALBA : Reinforcement Learning for Video Object Segmentation

    Authors: Shreyank N Gowda, Panagiotis Eustratiadis, Timothy Hospedales, Laura Sevilla-Lara

    Abstract: We consider the challenging problem of zero-shot video object segmentation (VOS). That is, segmenting and tracking multiple moving objects within a video fully automatically, without any manual initialization. We treat this as a grouping problem by exploiting object proposals and making a joint inference about grouping over both space and time. We propose a network architecture for tractably perfo…

    Submitted 14 August, 2020; v1 submitted 26 May, 2020; originally announced May 2020.

  39. arXiv:2003.05005

    cs.CV cs.CR

    Using an ensemble color space model to tackle adversarial examples

    Authors: Shreyank N Gowda, Chun Yuan

    Abstract: Minute pixel changes in an image drastically change the prediction that the deep learning model makes. One of the most significant problems that could arise due to this, for instance, is autonomous driving. Many methods have been proposed to combat this with varying amounts of success. We propose a 3 step method for defending such attacks. First, we denoise the image using statistical methods. Sec…

    Submitted 10 March, 2020; originally announced March 2020.

  40. StegColNet: Steganalysis based on an ensemble colorspace approach

    Authors: Shreyank N Gowda, Chun Yuan

    Abstract: Image steganography refers to the process of hiding information inside images. Steganalysis is the process of detecting a steganographic image. We introduce a steganalysis approach that uses an ensemble color space model to obtain a weighted concatenated feature activation map. The concatenated map helps to obtain certain features explicit to each color space. We use a levy-flight grey wolf optimi…

    Submitted 16 October, 2020; v1 submitted 6 February, 2020; originally announced February 2020.

  41. arXiv:1906.07421

    cs.CV

    Using colorization as a tool for automatic makeup suggestion

    Authors: Shreyank Narayana Gowda

    Abstract: Colorization is the method of converting an image in grayscale to a fully color image. There are multiple methods to do the same. Old school methods used machine learning algorithms and optimization techniques to suggest possible colors to use. With advances in the field of deep learning, colorization results have improved consistently with improvements in deep learning architectures. The latest d…

    Submitted 18 June, 2019; originally announced June 2019.

  42. arXiv:1902.00267

    cs.CV

    ColorNet: Investigating the importance of color spaces for image classification

    Authors: Shreyank N Gowda, Chun Yuan

    Abstract: Image classification is a fundamental application in computer vision. Recently, deeper networks and highly connected networks have shown state of the art performance for image classification tasks. Most datasets these days consist of a finite number of color images. These color images are taken as input in the form of RGB images and classification is done without modifying them. We explore the imp…

    Submitted 1 February, 2019; originally announced February 2019.

    Journal ref: Asian Conference on Computer Vision 2018

  43. Semi-supervised Text Categorization Using Recursive K-means Clustering

    Authors: Harsha S. Gowda, Mahamad Suhil, D. S. Guru, Lavanya Narayana Raju

    Abstract: In this paper, we present a semi-supervised learning algorithm for classification of text documents. A method of labeling unlabeled text documents is presented. The presented method is based on the principle of divide and conquer strategy. It uses recursive K-means algorithm for partitioning both labeled and unlabeled data collection. The K-means algorithm is applied recursively on each partition…

    Submitted 24 June, 2017; originally announced June 2017.

    Comments: 11 Pages, 8 Figures, Conference: RTIP2R

  44. Cluster Based Symbolic Representation for Skewed Text Categorization

    Authors: Lavanya Narayana Raju, Mahamad Suhil, D S Guru, Harsha S Gowda

    Abstract: In this work, a problem associated with imbalanced text corpora is addressed. A method of converting an imbalanced text corpus into a balanced one is presented. The presented method employs a clustering algorithm for conversion. Initially to avoid curse of dimensionality, an effective representation scheme based on term class relevancy measure is adapted, which drastically reduces the dimension to…

    Submitted 24 June, 2017; originally announced June 2017.

    Comments: 14 Pages, 15 Figures, 1 Table, Conference: RTIP2R