Showing 1–15 of 15 results for author: Athar, A

  1. arXiv:2407.04459  [pdf, other]

    cs.CL

    Generalists vs. Specialists: Evaluating Large Language Models for Urdu

    Authors: Samee Arif, Abdul Hameed Azeemi, Agha Ali Raza, Awais Athar

    Abstract: In this paper, we compare general-purpose pretrained models, GPT-4-Turbo and Llama-3-8b-Instruct with special-purpose models fine-tuned on specific tasks, XLM-Roberta-large, mT5-large, and Llama-3-8b-Instruct. We focus on seven classification and six generation tasks to evaluate the performance of these models on Urdu language. Urdu has 70 million native speakers, yet it remains underrepresented i…

    Submitted 5 July, 2024; originally announced July 2024.

  2. arXiv:2405.01458  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    UQA: Corpus for Urdu Question Answering

    Authors: Samee Arif, Sualeha Farid, Awais Athar, Agha Ali Raza

    Abstract: This paper introduces UQA, a novel dataset for question answering and text comprehension in Urdu, a low-resource language with over 70 million native speakers. UQA is generated by translating the Stanford Question Answering Dataset (SQuAD2.0), a large-scale English QA dataset, using a technique called EATS (Enclose to Anchor, Translate, Seek), which preserves the answer spans in the translated con…

    Submitted 2 May, 2024; originally announced May 2024.

    Journal ref: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 17237-17244, May 2024

  3. arXiv:2404.18960  [pdf]

    q-bio.QM cs.LG

    Leak Proof CMap; a framework for training and evaluation of cell line agnostic L1000 similarity methods

    Authors: Steven Shave, Richard Kasprowicz, Abdullah M. Athar, Denise Vlachou, Neil O. Carragher, Cuong Q. Nguyen

    Abstract: The Connectivity Map (CMap) is a large publicly available database of cellular transcriptomic responses to chemical and genetic perturbations built using a standardized acquisition protocol known as the L1000 technique. Databases such as CMap provide an exciting opportunity to enrich drug discovery efforts, providing a 'known' phenotypic landscape to explore and enabling the development of state o…

    Submitted 29 April, 2024; originally announced April 2024.

  4. arXiv:2311.01520  [pdf, other]

    cs.CV cs.RO

    4D-Former: Multimodal 4D Panoptic Segmentation

    Authors: Ali Athar, Enxu Li, Sergio Casas, Raquel Urtasun

    Abstract: 4D panoptic segmentation is a challenging but practically useful task that requires every point in a LiDAR point-cloud sequence to be assigned a semantic class label, and individual objects to be segmented and tracked over time. Existing approaches utilize only LiDAR inputs which convey limited information in regions with point sparsity. This problem can, however, be mitigated by utilizing RGB cam…

    Submitted 17 November, 2023; v1 submitted 2 November, 2023; originally announced November 2023.

    Comments: accepted at CoRL 2023

  5. arXiv:2301.02657  [pdf, other]

    cs.CV cs.AI cs.LG

    TarViS: A Unified Approach for Target-based Video Segmentation

    Authors: Ali Athar, Alexander Hermans, Jonathon Luiten, Deva Ramanan, Bastian Leibe

    Abstract: The general domain of video segmentation is currently fragmented into different tasks spanning multiple benchmarks. Despite rapid progress in the state-of-the-art, current methods are overwhelmingly task-specific and cannot conceptually generalize to other tasks. Inspired by recent approaches with multi-task capability, we propose TarViS: a novel, unified network architecture that can be applied t…

    Submitted 10 May, 2023; v1 submitted 6 January, 2023; originally announced January 2023.

    Comments: Accepted to CVPR'23 (Highlight). Code is available at: https://github.com/Ali2500/TarViS

    ACM Class: I.4.6; I.4.8; I.4.10

  6. arXiv:2209.12118  [pdf, other]

    cs.CV

    BURST: A Benchmark for Unifying Object Recognition, Segmentation and Tracking in Video

    Authors: Ali Athar, Jonathon Luiten, Paul Voigtlaender, Tarasha Khurana, Achal Dave, Bastian Leibe, Deva Ramanan

    Abstract: Multiple existing benchmarks involve tracking and segmenting objects in video e.g., Video Object Segmentation (VOS) and Multi-Object Tracking and Segmentation (MOTS), but there is little interaction between them due to the use of disparate benchmark datasets and metrics (e.g. J&F, mAP, sMOTSA). As a result, published works usually target a particular benchmark, and are not easily comparable to eac…

    Submitted 22 November, 2022; v1 submitted 24 September, 2022; originally announced September 2022.

  7. arXiv:2206.00182  [pdf, other]

    cs.CV

    Differentiable Soft-Masked Attention

    Authors: Ali Athar, Jonathon Luiten, Alexander Hermans, Deva Ramanan, Bastian Leibe

    Abstract: Transformers have become prevalent in computer vision due to their performance and flexibility in modelling complex operations. Of particular significance is the 'cross-attention' operation, which allows a vector representation (e.g. of an object in an image) to be learned by attending to an arbitrarily sized set of input features. Recently, "Masked Attention" was proposed in which a given object…

    Submitted 5 August, 2022; v1 submitted 31 May, 2022; originally announced June 2022.

    Comments: arXiv admin note: text overlap with arXiv:2112.09131

    ACM Class: I.4.6; I.4.8; I.4.10

  8. HODOR: High-level Object Descriptors for Object Re-segmentation in Video Learned from Static Images

    Authors: Ali Athar, Jonathon Luiten, Alexander Hermans, Deva Ramanan, Bastian Leibe

    Abstract: Existing state-of-the-art methods for Video Object Segmentation (VOS) learn low-level pixel-to-pixel correspondences between frames to propagate object masks across video. This requires a large amount of densely annotated video data, which is costly to annotate, and largely redundant since frames within a video are highly correlated. In light of this, we propose HODOR: a novel method that tackles…

    Submitted 15 July, 2022; v1 submitted 16 December, 2021; originally announced December 2021.

    ACM Class: I.4.6; I.4.8; I.4.10

  9. arXiv:2111.07774  [pdf, other]

    cs.CV

    D^2Conv3D: Dynamic Dilated Convolutions for Object Segmentation in Videos

    Authors: Christian Schmidt, Ali Athar, Sabarinath Mahadevan, Bastian Leibe

    Abstract: Despite receiving significant attention from the research community, the task of segmenting and tracking objects in monocular videos still has much room for improvement. Existing works have simultaneously justified the efficacy of dilated and deformable convolutions for various image-level segmentation tasks. This gives reason to believe that 3D extensions of such convolutions should also yield pe…

    Submitted 15 November, 2021; originally announced November 2021.

    Comments: Accepted to WACV 2022

  10. arXiv:2008.11516  [pdf, other]

    cs.CV

    Making a Case for 3D Convolutions for Object Segmentation in Videos

    Authors: Sabarinath Mahadevan, Ali Athar, Aljoša Ošep, Sebastian Hennen, Laura Leal-Taixé, Bastian Leibe

    Abstract: The task of object segmentation in videos is usually accomplished by processing appearance and motion information separately using standard 2D convolutional networks, followed by a learned fusion of the two sources of information. On the other hand, 3D convolutional networks have been successfully applied for video classification tasks, but have not been leveraged as effectively to problems involv…

    Submitted 1 September, 2023; v1 submitted 26 August, 2020; originally announced August 2020.

    Comments: BMVC '20

  11. STEm-Seg: Spatio-temporal Embeddings for Instance Segmentation in Videos

    Authors: Ali Athar, Sabarinath Mahadevan, Aljoša Ošep, Laura Leal-Taixé, Bastian Leibe

    Abstract: Existing methods for instance segmentation in videos typically involve multi-stage pipelines that follow the tracking-by-detection paradigm and model a video clip as a sequence of images. Multiple networks are used to detect objects in individual frames, and then associate these detections over time. Hence, these methods are often non-end-to-end trainable and highly tailored to specific tasks. In…

    Submitted 1 September, 2023; v1 submitted 18 March, 2020; originally announced March 2020.

    Comments: ECCV 2020; 28 pages, 6 figures

    MSC Class: 68T45; 68T10; 62H30 ACM Class: I.2.10; I.4.6; I.4.8; I.5.3

  12. arXiv:1811.04091  [pdf, other]

    cs.CV

    Multiple People Tracking Using Hierarchical Deep Tracklet Re-identification

    Authors: Maryam Babaee, Ali Athar, Gerhard Rigoll

    Abstract: The task of multiple people tracking in monocular videos is challenging because of the numerous difficulties involved: occlusions, varying environments, crowded scenes, camera parameters and motion. In the tracking-by-detection paradigm, most approaches adopt person re-identification techniques based on computing the pairwise similarity between detections. However, these techniques are less effect…

    Submitted 17 November, 2018; v1 submitted 9 November, 2018; originally announced November 2018.

    Comments: 13 pages (8 main + 2 bibliography + 5 appendices)

    MSC Class: 68T45 ACM Class: I.2.10; I.4.8; I.2.6; I.4.9; I.5.3

  13. arXiv:1808.07530  [pdf, ps, other]

    cs.NE

    An Overview of Datatype Quantization Techniques for Convolutional Neural Networks

    Authors: Ali Athar

    Abstract: Convolutional Neural Networks (CNNs) are becoming increasingly popular due to their superior performance in the domain of computer vision, in applications such as object detection and recognition. However, they demand complex, power-consuming hardware which makes them unsuitable for implementation on low-power mobile and embedded devices. In this paper, a description and comparison of various t…

    Submitted 22 August, 2018; originally announced August 2018.

    Comments: 4 pages, 2 figures

    MSC Class: Computer Vision: I.5.4; Data compaction and compression: E.4

  14. arXiv:1806.05432  [pdf]

    cs.CL

    Urdu Word Segmentation using Conditional Random Fields (CRFs)

    Authors: Haris Bin Zia, Agha Ali Raza, Awais Athar

    Abstract: State-of-the-art Natural Language Processing algorithms rely heavily on efficient word segmentation. Urdu is amongst languages for which word segmentation is a complex task as it exhibits space omission as well as space insertion issues. This is partly due to the Arabic script which although cursive in nature, consists of characters that have inherent joining and non-joining attributes regardless…

    Submitted 14 June, 2018; originally announced June 2018.

    Comments: 8 pages, COLING 2018

  15. arXiv:1801.00409  [pdf]

    cs.CL

    PronouncUR: An Urdu Pronunciation Lexicon Generator

    Authors: Haris Bin Zia, Agha Ali Raza, Awais Athar

    Abstract: State-of-the-art speech recognition systems rely heavily on three basic components: an acoustic model, a pronunciation lexicon and a language model. To build these components, a researcher needs linguistic as well as technical expertise, which is a barrier in low-resource domains. Techniques to construct these three components without having expert domain knowledge are in great demand. Urdu, despi…

    Submitted 5 March, 2018; v1 submitted 1 January, 2018; originally announced January 2018.

    Comments: 5 pages, LREC 2018