Showing 1–9 of 9 results for author: Behpour, S

Searching in archive cs.
  1. arXiv:2406.05271

    cs.CV

    USE: Universal Segment Embeddings for Open-Vocabulary Image Segmentation

    Authors: Xiaoqi Wang, Wenbin He, Xiwei Xuan, Clint Sebastian, Jorge Piazentin Ono, Xin Li, Sima Behpour, Thang Doan, Liang Gou, Han Wei Shen, Liu Ren

    Abstract: The open-vocabulary image segmentation task involves partitioning images into semantically meaningful segments and classifying them with flexible text-defined categories. The recent vision-based foundation models such as the Segment Anything Model (SAM) have shown superior performance in generating class-agnostic image segments. The main challenge in open-vocabulary image segmentation now lies in…

    Submitted 7 June, 2024; originally announced June 2024.

  2. arXiv:2403.06295

    cs.CV

    A streamlined Approach to Multimodal Few-Shot Class Incremental Learning for Fine-Grained Datasets

    Authors: Thang Doan, Sima Behpour, Xin Li, Wenbin He, Liang Gou, Liu Ren

    Abstract: Few-shot Class-Incremental Learning (FSCIL) poses the challenge of retaining prior knowledge while learning from limited new data streams, all without overfitting. The rise of Vision-Language models (VLMs) has unlocked numerous applications, leveraging their existing knowledge to fine-tune on custom data. However, training the whole model is computationally prohibitive, and VLMs while being versat…

    Submitted 10 March, 2024; originally announced March 2024.

  3. arXiv:2308.00310

    cs.CV cs.LG

    GradOrth: A Simple yet Efficient Out-of-Distribution Detection with Orthogonal Projection of Gradients

    Authors: Sima Behpour, Thang Doan, Xin Li, Wenbin He, Liang Gou, Liu Ren

    Abstract: Detecting out-of-distribution (OOD) data is crucial for ensuring the safe deployment of machine learning models in real-world applications. However, existing OOD detection approaches primarily rely on the feature maps or the full gradient space information to derive OOD scores neglecting the role of most important parameters of the pre-trained network over in-distribution (ID) data. In this study,…

    Submitted 1 August, 2023; originally announced August 2023.

  4. arXiv:2307.11227

    cs.CV

    UP-DP: Unsupervised Prompt Learning for Data Pre-Selection with Vision-Language Models

    Authors: Xin Li, Sima Behpour, Thang Doan, Wenbin He, Liang Gou, Liu Ren

    Abstract: In this study, we investigate the task of data pre-selection, which aims to select instances for labeling from an unlabeled dataset through a single pass, thereby optimizing performance for undefined downstream tasks with a limited annotation budget. Previous approaches to data pre-selection relied solely on visual features extracted from foundation models, such as CLIP and BLIP-2, but largely ign…

    Submitted 20 July, 2023; originally announced July 2023.

  5. arXiv:2306.14291

    cs.CV cs.LG

    Hyp-OW: Exploiting Hierarchical Structure Learning with Hyperbolic Distance Enhances Open World Object Detection

    Authors: Thang Doan, Xin Li, Sima Behpour, Wenbin He, Liang Gou, Liu Ren

    Abstract: Open World Object Detection (OWOD) is a challenging and realistic task that extends beyond the scope of standard Object Detection task. It involves detecting both known and unknown objects while integrating learned knowledge for future tasks. However, the level of "unknownness" varies significantly depending on the context. For example, a tree is typically considered part of the background in a se…

    Submitted 15 February, 2024; v1 submitted 25 June, 2023; originally announced June 2023.

    Comments: Accepted at AAAI 2024. Keywords: Open World Object Detection, Hyperbolic Distance, Unknown Detection, Deformable Transformers, Hierarchical Representation Learning

  6. Cryo-shift: Reducing domain shift in cryo-electron subtomograms with unsupervised domain adaptation and randomization

    Authors: Hmrishav Bandyopadhyay, Zihao Deng, Leiting Ding, Sinuo Liu, Mostofa Rafid Uddin, Xiangrui Zeng, Sima Behpour, Min Xu

    Abstract: Cryo-Electron Tomography (cryo-ET) is a 3D imaging technology that enables the visualization of subcellular structures in situ at near-atomic resolution. Cellular cryo-ET images help in resolving the structures of macromolecules and determining their spatial relationship in a single cell, which has broad significance in cell and structural biology. Subtomogram classification and recognition consti…

    Submitted 17 November, 2021; originally announced November 2021.

    Comments: 14 pages

    Journal ref: Bioinformatics 2021

  7. arXiv:1912.12557

    cs.LG cs.CV stat.ML

    Active Learning in Video Tracking

    Authors: Sima Behpour

    Abstract: Active learning methods, like uncertainty sampling, combined with probabilistic prediction techniques have achieved success in various problems like image classification and text classification. For more complex multivariate prediction tasks, the relationships between labels play an important role in designing structured classifiers with better performance. However, computational time complexity l…

    Submitted 20 March, 2020; v1 submitted 28 December, 2019; originally announced December 2019.

    Journal ref: In International Conference on Machine Learning, pp. 563-572. 2019

  8. arXiv:1812.07526

    stat.ML cs.LG

    Consistent Robust Adversarial Prediction for General Multiclass Classification

    Authors: Rizal Fathony, Kaiser Asif, Anqi Liu, Mohammad Ali Bashiri, Wei Xing, Sima Behpour, Xinhua Zhang, Brian D. Ziebart

    Abstract: We propose a robust adversarial prediction framework for general multiclass classification. Our method seeks predictive distributions that robustly optimize non-convex and non-continuous multiclass loss metrics against the worst-case conditional label distributions (the adversarial distributions) that (approximately) match the statistics of the training data. Although the optimized loss metrics ar…

    Submitted 20 November, 2019; v1 submitted 18 December, 2018; originally announced December 2018.

    Comments: 49 pages, 10 figures

  9. arXiv:1710.07735

    cs.CV cs.AI cs.GT

    ADA: A Game-Theoretic Perspective on Data Augmentation for Object Detection

    Authors: Sima Behpour, Kris M. Kitani, Brian D. Ziebart

    Abstract: The use of random perturbations of ground truth data, such as random translation or scaling of bounding boxes, is a common heuristic used for data augmentation that has been shown to prevent overfitting and improve generalization. Since the design of data augmentation is largely guided by reported best practices, it is difficult to understand if those design choices are optimal. To provide a more…

    Submitted 12 December, 2017; v1 submitted 20 October, 2017; originally announced October 2017.