
Showing 1–8 of 8 results for author: Raza, A A

  1. arXiv:2408.08688  [pdf, other]

    cs.CL cs.AI

    The Fellowship of the LLMs: Multi-Agent Workflows for Synthetic Preference Optimization Dataset Generation

    Authors: Samee Arif, Sualeha Farid, Abdul Hameed Azeemi, Awais Athar, Agha Ali Raza

    Abstract: This paper presents synthetic Preference Optimization (PO) datasets generated using multi-agent workflows and evaluates the effectiveness and potential of these workflows in the dataset generation process. PO dataset generation requires two modules: (1) response evaluation, and (2) response generation. In the response evaluation module, the responses from Large Language Models (LLMs) are evaluated…

    Submitted 24 August, 2024; v1 submitted 16 August, 2024; originally announced August 2024.

  2. arXiv:2408.08454  [pdf, other]

    cs.CV cs.LG

    Beyond Uniform Query Distribution: Key-Driven Grouped Query Attention

    Authors: Zohaib Khan, Muhammad Khaquan, Omer Tafveez, Burhanuddin Samiwala, Agha Ali Raza

    Abstract: The Transformer architecture has revolutionized deep learning through its Self-Attention mechanism, which effectively captures contextual information. However, the memory footprint of Self-Attention presents significant challenges for long-sequence tasks. Grouped Query Attention (GQA) addresses this issue by grouping queries and mean-pooling the corresponding key-value heads - reducing the number…

    Submitted 28 August, 2024; v1 submitted 15 August, 2024; originally announced August 2024.

    Comments: 11 pages, 9 figures

  3. arXiv:2407.04459  [pdf, other]

    cs.CL

    Generalists vs. Specialists: Evaluating Large Language Models for Urdu

    Authors: Samee Arif, Abdul Hameed Azeemi, Agha Ali Raza, Awais Athar

    Abstract: In this paper, we compare general-purpose pretrained models, GPT-4-Turbo and Llama-3-8b-Instruct with special-purpose models fine-tuned on specific tasks, XLM-Roberta-large, mT5-large, and Llama-3-8b-Instruct. We focus on seven classification and six generation tasks to evaluate the performance of these models on Urdu language. Urdu has 70 million native speakers, yet it remains underrepresented i…

    Submitted 5 July, 2024; originally announced July 2024.

  4. arXiv:2405.01458  [pdf, other]

    cs.CL cs.AI cs.IR cs.LG

    UQA: Corpus for Urdu Question Answering

    Authors: Samee Arif, Sualeha Farid, Awais Athar, Agha Ali Raza

    Abstract: This paper introduces UQA, a novel dataset for question answering and text comprehension in Urdu, a low-resource language with over 70 million native speakers. UQA is generated by translating the Stanford Question Answering Dataset (SQuAD2.0), a large-scale English QA dataset, using a technique called EATS (Enclose to Anchor, Translate, Seek), which preserves the answer spans in the translated con…

    Submitted 22 July, 2024; v1 submitted 2 May, 2024; originally announced May 2024.

    Journal ref: Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), pp. 17237-17244, May 2024

  5. arXiv:2403.09259  [pdf, other]

    cs.CL cs.LG

    To Label or Not to Label: Hybrid Active Learning for Neural Machine Translation

    Authors: Abdul Hameed Azeemi, Ihsan Ayyub Qazi, Agha Ali Raza

    Abstract: Active learning (AL) techniques reduce labeling costs for training neural machine translation (NMT) models by selecting smaller representative subsets from unlabeled data for annotation. Diversity sampling techniques select heterogeneous instances, while uncertainty sampling methods select instances with the highest model uncertainty. Both approaches have limitations - diversity methods may extrac…

    Submitted 14 March, 2024; originally announced March 2024.

    Comments: 11 pages, 3 figures

  6. arXiv:2203.09829  [pdf, other]

    cs.LG cs.SD eess.AS

    Representative Subset Selection for Efficient Fine-Tuning in Self-Supervised Speech Recognition

    Authors: Abdul Hameed Azeemi, Ihsan Ayyub Qazi, Agha Ali Raza

    Abstract: Self-supervised speech recognition models require considerable labeled training data for learning high-fidelity representations for Automatic Speech Recognition (ASR) which is computationally demanding and time-consuming. We consider the task of identifying an optimal subset of data for efficient fine-tuning in self-supervised speech models for ASR. We discover that the dataset pruning strategies…

    Submitted 11 April, 2023; v1 submitted 18 March, 2022; originally announced March 2022.

    Comments: 16 pages, 8 figures

  7. arXiv:1806.05432  [pdf]

    cs.CL

    Urdu Word Segmentation using Conditional Random Fields (CRFs)

    Authors: Haris Bin Zia, Agha Ali Raza, Awais Athar

    Abstract: State-of-the-art Natural Language Processing algorithms rely heavily on efficient word segmentation. Urdu is amongst languages for which word segmentation is a complex task as it exhibits space omission as well as space insertion issues. This is partly due to the Arabic script which although cursive in nature, consists of characters that have inherent joining and non-joining attributes regardless…

    Submitted 14 June, 2018; originally announced June 2018.

    Comments: 8 pages, COLING 2018

  8. arXiv:1801.00409  [pdf]

    cs.CL

    PronouncUR: An Urdu Pronunciation Lexicon Generator

    Authors: Haris Bin Zia, Agha Ali Raza, Awais Athar

    Abstract: State-of-the-art speech recognition systems rely heavily on three basic components: an acoustic model, a pronunciation lexicon and a language model. To build these components, a researcher needs linguistic as well as technical expertise, which is a barrier in low-resource domains. Techniques to construct these three components without having expert domain knowledge are in great demand. Urdu, despi…

    Submitted 5 March, 2018; v1 submitted 1 January, 2018; originally announced January 2018.

    Comments: 5 pages, LREC 2018