Zum Hauptinhalt springen

Showing 1–50 of 99 results for author: Saad, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2408.07445  [pdf, other

    cs.CV

    Modality Invariant Multimodal Learning to Handle Missing Modalities: A Single-Branch Approach

    Authors: Muhammad Saad Saeed, Shah Nawaz, Muhammad Zaigham Zaheer, Muhammad Haris Khan, Karthik Nandakumar, Muhammad Haroon Yousaf, Hassan Sajjad, Tom De Schepper, Markus Schedl

    Abstract: Multimodal networks have demonstrated remarkable performance improvements over their unimodal counterparts. Existing multimodal networks are designed in a multi-branch fashion that, due to the reliance on fusion strategies, exhibit deteriorated performance if one or more modalities are missing. In this work, we propose a modality invariant multimodal learning method, which is less susceptible to t… ▽ More

    Submitted 14 August, 2024; originally announced August 2024.

  2. arXiv:2407.20910  [pdf, other

    cs.CL cs.CR

    Enabling Contextual Soft Moderation on Social Media through Contrastive Textual Deviation

    Authors: Pujan Paudel, Mohammad Hammas Saeed, Rebecca Auger, Chris Wells, Gianluca Stringhini

    Abstract: Automated soft moderation systems are unable to ascertain if a post supports or refutes a false claim, resulting in a large number of contextual false positives. This limits their effectiveness, for example undermining trust in health experts by adding warnings to their posts or resorting to vague warnings instead of granular fact-checks, which result in desensitizing users. In this paper, we prop… ▽ More

    Submitted 30 July, 2024; originally announced July 2024.

  3. arXiv:2407.19970  [pdf

    cs.GR cs.CV cs.ET

    From Flat to Spatial: Comparison of 4 methods constructing 3D, 2 and 1/2D Models from 2D Plans with neural networks

    Authors: Jacob Sam, Karan Patel, Mike Saad

    Abstract: In the field of architecture, the conversion of single images into 2 and 1/2D and 3D meshes is a promising technology that enhances design visualization and efficiency. This paper evaluates four innovative methods: "One-2-3-45," "CRM: Single Image to 3D Textured Mesh with Convolutional Reconstruction Model," "Instant Mesh," and "Image-to-Mesh." These methods are at the forefront of this technology… ▽ More

    Submitted 29 July, 2024; originally announced July 2024.

  4. arXiv:2407.18098  [pdf, other

    cs.CY cs.SI

    Unraveling the Web of Disinformation: Exploring the Larger Context of State-Sponsored Influence Campaigns on Twitter

    Authors: Mohammad Hammas Saeed, Shiza Ali, Pujan Paudel, Jeremy Blackburn, Gianluca Stringhini

    Abstract: Social media platforms offer unprecedented opportunities for connectivity and exchange of ideas; however, they also serve as fertile grounds for the dissemination of disinformation. Over the years, there has been a rise in state-sponsored campaigns aiming to spread disinformation and sway public opinion on sensitive topics through designated accounts, known as troll accounts. Past works on detecti… ▽ More

    Submitted 25 July, 2024; originally announced July 2024.

    Journal ref: International Symposium on Research in Attacks, Intrusions and Defenses (RAID 2024)

  5. arXiv:2407.16243  [pdf, other

    cs.CV

    Chameleon: Images Are What You Need For Multimodal Learning Robust To Missing Modalities

    Authors: Muhammad Irzam Liaqat, Shah Nawaz, Muhammad Zaigham Zaheer, Muhammad Saad Saeed, Hassan Sajjad, Tom De Schepper, Karthik Nandakumar, Muhammad Haris Khan Markus Schedl

    Abstract: Multimodal learning has demonstrated remarkable performance improvements over unimodal architectures. However, multimodal learning methods often exhibit deteriorated performances if one or more modalities are missing. This may be attributed to the commonly used multi-branch design containing modality-specific streams making the models reliant on the availability of a complete set of modalities. In… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

  6. arXiv:2407.04147  [pdf, other

    cs.SE

    ALPINE: An adaptive language-agnostic pruning method for language models for code

    Authors: Mootez Saad, José Antonio Hernández López, Boqi Chen, Dániel Varró, Tushar Sharma

    Abstract: Language models of code have demonstrated state-of-the-art performance across various software engineering and source code analysis tasks. However, their demanding computational resource requirements and consequential environmental footprint remain as significant challenges. This work introduces ALPINE, an adaptive programming language-agnostic pruning technique designed to substantially reduce th… ▽ More

    Submitted 4 July, 2024; originally announced July 2024.

  7. arXiv:2406.18776  [pdf, other

    cs.CL

    Implicit Discourse Relation Classification For Nigerian Pidgin

    Authors: Muhammed Saeed, Peter Bourgonje, Vera Demberg

    Abstract: Despite attempts to make Large Language Models multi-lingual, many of the world's languages are still severely under-resourced. This widens the performance gap between NLP and AI applications aimed at well-financed, and those aimed at less-resourced languages. In this paper, we focus on Nigerian Pidgin (NP), which is spoken by nearly 100 million people, but has comparatively very few NLP resources… ▽ More

    Submitted 26 June, 2024; originally announced June 2024.

  8. arXiv:2406.09630  [pdf, other

    cs.CV cs.LG

    Muharaf: Manuscripts of Handwritten Arabic Dataset for Cursive Text Recognition

    Authors: Mehreen Saeed, Adrian Chan, Anupam Mijar, Joseph Moukarzel, Georges Habchi, Carlos Younes, Amin Elias, Chau-Wai Wong, Akram Khater

    Abstract: We present the Manuscripts of Handwritten Arabic~(Muharaf) dataset, which is a machine learning dataset consisting of more than 1,600 historic handwritten page images transcribed by experts in archival Arabic. Each document image is accompanied by spatial polygonal coordinates of its text lines as well as basic page elements. This dataset was compiled to advance the state of the art in handwritten… ▽ More

    Submitted 13 June, 2024; originally announced June 2024.

  9. arXiv:2405.20987  [pdf, other

    cs.CV cs.LG eess.IV

    Early Stopping Criteria for Training Generative Adversarial Networks in Biomedical Imaging

    Authors: Muhammad Muneeb Saad, Mubashir Husain Rehmani, Ruairi O'Reilly

    Abstract: Generative Adversarial Networks (GANs) have high computational costs to train their complex architectures. Throughout the training process, GANs' output is analyzed qualitatively based on the loss and synthetic images' diversity and quality. Based on this qualitative analysis, training is manually halted once the desired synthetic images are generated. By utilizing an early stopping criterion, the… ▽ More

    Submitted 31 May, 2024; originally announced May 2024.

    Comments: This paper is accepted at the 35th IEEE Irish Signals and Systems Conference (ISSC 2024)

  10. arXiv:2404.19238  [pdf, other

    cs.IT cs.DC cs.GT cs.LG cs.NI

    Pilot Contamination in Massive MIMO Systems: Challenges and Future Prospects

    Authors: Muhammad Kamran Saeed, Ashfaq Khokhar, Shakil Ahmed

    Abstract: Massive multiple input multiple output (M-MIMO) technology plays a pivotal role in fifth-generation (5G) and beyond communication systems, offering a wide range of benefits, from increased spectral efficiency (SE) to enhanced energy efficiency and higher reliability. However, these advantages are contingent upon precise channel state information (CSI) availability at the base station (BS). Ensurin… ▽ More

    Submitted 29 April, 2024; originally announced April 2024.

    Comments: Accepted At IWCMC 2024 Comm & SP Symposium

  11. arXiv:2404.18264  [pdf, other

    cs.CL cs.AI

    Modeling Orthographic Variation Improves NLP Performance for Nigerian Pidgin

    Authors: Pin-Jie Lin, Merel Scholman, Muhammed Saeed, Vera Demberg

    Abstract: Nigerian Pidgin is an English-derived contact language and is traditionally an oral language, spoken by approximately 100 million people. No orthographic standard has yet been adopted, and thus the few available Pidgin datasets that exist are characterised by noise in the form of orthographic variations. This contributes to under-performance of models in critical NLP tasks. The current work is the… ▽ More

    Submitted 28 April, 2024; originally announced April 2024.

    Comments: Accepted to LREC-COLING 2024 Main Conference

  12. arXiv:2404.10188  [pdf, other

    cs.NI cs.GT cs.IT cs.LG cs.SI

    Smart Pilot Assignment for IoT in Massive MIMO Systems: A Path Towards Scalable IoT Infrastructure

    Authors: Muhammad Kamran Saeed, Ashfaq Khokhar

    Abstract: 5G sets the foundation for an era of creativity with its faster speeds, increased data throughput, reduced latency, and enhanced IoT connectivity, all enabled by Massive MIMO (M-MIMO) technology. M-MIMO boosts network efficiency and enhances user experience by employing intelligent user scheduling. This paper presents a user scheduling scheme and pilot assignment strategy designed for IoT devices,… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: Accepted At ICC-2024

  13. arXiv:2404.09342  [pdf, other

    cs.CV cs.SD eess.AS

    Face-voice Association in Multilingual Environments (FAME) Challenge 2024 Evaluation Plan

    Authors: Muhammad Saad Saeed, Shah Nawaz, Muhammad Salman Tahir, Rohan Kumar Das, Muhammad Zaigham Zaheer, Marta Moscati, Markus Schedl, Muhammad Haris Khan, Karthik Nandakumar, Muhammad Haroon Yousaf

    Abstract: The advancements of technology have led to the use of multimodal systems in various real-world applications. Among them, the audio-visual systems are one of the widely used multimodal systems. In the recent years, associating face and voice of a person has gained attention due to presence of unique correlation between them. The Face-voice Association in Multilingual Environments (FAME) Challenge 2… ▽ More

    Submitted 22 July, 2024; v1 submitted 14 April, 2024; originally announced April 2024.

    Comments: ACM Multimedia Conference - Grand Challenge

  14. arXiv:2404.06144  [pdf, other

    cs.LG cs.AI

    Differential Privacy for Anomaly Detection: Analyzing the Trade-off Between Privacy and Explainability

    Authors: Fatima Ezzeddine, Mirna Saad, Omran Ayoub, Davide Andreoletti, Martin Gjoreski, Ihab Sbeity, Marc Langheinrich, Silvia Giordano

    Abstract: Anomaly detection (AD), also referred to as outlier detection, is a statistical process aimed at identifying observations within a dataset that significantly deviate from the expected pattern of the majority of the data. Such a process finds wide application in various fields, such as finance and healthcare. While the primary objective of AD is to yield high detection accuracy, the requirements of… ▽ More

    Submitted 9 April, 2024; originally announced April 2024.

  15. arXiv:2401.17967  [pdf, other

    cs.SE cs.LG

    CONCORD: Towards a DSL for Configurable Graph Code Representation

    Authors: Mootez Saad, Tushar Sharma

    Abstract: Deep learning is widely used to uncover hidden patterns in large code corpora. To achieve this, constructing a format that captures the relevant characteristics and features of source code is essential. Graph-based representations have gained attention for their ability to model structural and semantic information. However, existing tools lack flexibility in constructing graphs across different pr… ▽ More

    Submitted 31 January, 2024; originally announced January 2024.

  16. arXiv:2401.09824  [pdf, other

    cs.CR

    Conning the Crypto Conman: End-to-End Analysis of Cryptocurrency-based Technical Support Scams

    Authors: Bhupendra Acharya, Muhammad Saad, Antonio Emanuele Cinà, Lea Schönherr, Hoang Dai Nguyen, Adam Oest, Phani Vadrevu, Thorsten Holz

    Abstract: The mainstream adoption of cryptocurrencies has led to a surge in wallet-related issues reported by ordinary users on social media platforms. In parallel, there is an increase in an emerging fraud trend called cryptocurrency-based technical support scam, in which fraudsters offer fake wallet recovery services and target users experiencing wallet-related issues. In this paper, we perform a compre… ▽ More

    Submitted 18 January, 2024; originally announced January 2024.

  17. ArabIcros: AI-Powered Arabic Crossword Puzzle Generation for Educational Applications

    Authors: Kamyar Zeinalipour, Mohamed Zaky Saad, Marco Maggini, Marco Gori

    Abstract: This paper presents the first Arabic crossword puzzle generator driven by advanced AI technology. Leveraging cutting-edge large language models including GPT4, GPT3-Davinci, GPT3-Curie, GPT3-Babbage, GPT3-Ada, and BERT, the system generates distinctive and challenging clues. Based on a dataset comprising over 50,000 clue-answer pairs, the generator employs fine-tuning, few/zero-shot learning strat… ▽ More

    Submitted 26 January, 2024; v1 submitted 3 December, 2023; originally announced December 2023.

    Comments: Accepted Paper for ArabicNLP 2023 - The First Arabic Natural Language Processing Conference - Co-located with EMNLP 2023 in Singapore

  18. arXiv:2311.15024  [pdf

    cs.CR

    A Comparative Study of Watering Hole Attack Detection Using Supervised Neural Network

    Authors: Mst. Nishita Aktar, Sornali Akter, Md. Nusaim Islam Saad, Jakir Hosen Jisun, Kh. Mustafizur Rahman, Md. Nazmus Sakib

    Abstract: The state of security demands innovative solutions to defend against targeted attacks due to the growing sophistication of cyber threats. This study explores the nefarious tactic known as "watering hole attacks using supervised neural networks to detect and prevent these attacks. The neural network identifies patterns in website behavior and network traffic associated with such attacks. Testing on… ▽ More

    Submitted 12 February, 2024; v1 submitted 25 November, 2023; originally announced November 2023.

  19. arXiv:2311.13508  [pdf, other

    cs.SE cs.LG

    Naturalness of Attention: Revisiting Attention in Code Language Models

    Authors: Mootez Saad, Tushar Sharma

    Abstract: Language models for code such as CodeBERT offer the capability to learn advanced source code representation, but their opacity poses barriers to understanding of captured properties. Recent attention analysis studies provide initial interpretability insights by focusing solely on attention weights rather than considering the wider context modeling of Transformers. This study aims to shed some ligh… ▽ More

    Submitted 22 November, 2023; originally announced November 2023.

    Comments: Accepted at ICSE-NIER (2024) track

  20. arXiv:2310.11266  [pdf

    cs.CL cs.AI cs.NE

    Emulating Human Cognitive Processes for Expert-Level Medical Question-Answering with Large Language Models

    Authors: Khushboo Verma, Marina Moore, Stephanie Wottrich, Karla Robles López, Nishant Aggarwal, Zeel Bhatt, Aagamjit Singh, Bradford Unroe, Salah Basheer, Nitish Sachdeva, Prinka Arora, Harmanjeet Kaur, Tanupreet Kaur, Tevon Hood, Anahi Marquez, Tushar Varshney, Nanfu Deng, Azaan Ramani, Pawanraj Ishwara, Maimoona Saeed, Tatiana López Velarde Peña, Bryan Barksdale, Sushovan Guha, Satwant Kumar

    Abstract: In response to the pressing need for advanced clinical problem-solving tools in healthcare, we introduce BooksMed, a novel framework based on a Large Language Model (LLM). BooksMed uniquely emulates human cognitive processes to deliver evidence-based and reliable responses, utilizing the GRADE (Grading of Recommendations, Assessment, Development, and Evaluations) framework to effectively quantify… ▽ More

    Submitted 17 October, 2023; originally announced October 2023.

  21. arXiv:2310.03278  [pdf, other

    cs.IT cs.GT cs.LG cs.NI eess.SP

    Mitigating Pilot Contamination and Enabling IoT Scalability in Massive MIMO Systems

    Authors: Muhammad Kamran Saeed, Ahmed E. Kamal, Ashfaq Khokhar

    Abstract: Massive MIMO is expected to play an important role in the development of 5G networks. This paper addresses the issue of pilot contamination and scalability in massive MIMO systems. The current practice of reusing orthogonal pilot sequences in adjacent cells leads to difficulty in differentiating incoming inter- and intra-cell pilot sequences. One possible solution is to increase the number of orth… ▽ More

    Submitted 4 October, 2023; originally announced October 2023.

    Comments: Accepted At GLOBECOM 2023

  22. arXiv:2310.02240  [pdf, other

    cs.RO

    Spherical Rolling Robots Design, Modeling, and Control: A Systematic Literature Review

    Authors: Aminata Diouf, Bruno Belzile, Maarouf Saad, David St-Onge

    Abstract: Spherical robots have garnered increasing interest for their applications in exploration, tunnel inspection, and extraterrestrial missions. Diverse designs have emerged, including barycentric configurations, pendulum-based mechanisms, etc. In addition, a wide spectrum of control strategies has been proposed, ranging from traditional PID approaches to cutting-edge neural networks. Our systematic re… ▽ More

    Submitted 3 October, 2023; originally announced October 2023.

  23. arXiv:2309.12245  [pdf, other

    eess.IV cs.CV cs.LG

    Adaptive Input-image Normalization for Solving the Mode Collapse Problem in GAN-based X-ray Images

    Authors: Muhammad Muneeb Saad, Mubashir Husain Rehmani, Ruairi O'Reilly

    Abstract: Biomedical image datasets can be imbalanced due to the rarity of targeted diseases. Generative Adversarial Networks play a key role in addressing this imbalance by enabling the generation of synthetic images to augment datasets. It is important to generate synthetic images that incorporate a diverse range of features to accurately represent the distribution of features present in the training imag… ▽ More

    Submitted 29 April, 2024; v1 submitted 21 September, 2023; originally announced September 2023.

    Comments: Submitted to the Elsevier Journal

  24. arXiv:2308.05247  [pdf, other

    cs.SI cs.CR

    TUBERAIDER: Attributing Coordinated Hate Attacks on YouTube Videos to their Source Communities

    Authors: Mohammad Hammas Saeed, Kostantinos Papadamou, Jeremy Blackburn, Emiliano De Cristofaro, Gianluca Stringhini

    Abstract: Alas, coordinated hate attacks, or raids, are becoming increasingly common online. In a nutshell, these are perpetrated by a group of aggressors who organize and coordinate operations on a platform (e.g., 4chan) to target victims on another community (e.g., YouTube). In this paper, we focus on attributing raids to their source community, paving the way for moderation approaches that take the conte… ▽ More

    Submitted 22 June, 2024; v1 submitted 9 August, 2023; originally announced August 2023.

    Comments: Accepted for publication at the 18th International AAAI Conference on Web and Social Media (ICWSM 2024). Please cite accordingly

  25. arXiv:2308.02505  [pdf, other

    eess.IV cs.CV cs.LG

    Assessing Intra-class Diversity and Quality of Synthetically Generated Images in a Biomedical and Non-biomedical Setting

    Authors: Muhammad Muneeb Saad, Mubashir Husain Rehmani, Ruairi O'Reilly

    Abstract: In biomedical image analysis, data imbalance is common across several imaging modalities. Data augmentation is one of the key solutions in addressing this limitation. Generative Adversarial Networks (GANs) are increasingly being relied upon for data augmentation tasks. Biomedical image features are sensitive to evaluating the efficacy of synthetic images. These features can have a significant impa… ▽ More

    Submitted 23 July, 2023; originally announced August 2023.

    Comments: This work is accepted in 25th Irish Machine Vision and Image Processing (IMVIP) Conference

  26. arXiv:2307.00382  [pdf, other

    cs.CL

    Low-Resource Cross-Lingual Adaptive Training for Nigerian Pidgin

    Authors: Pin-Jie Lin, Muhammed Saeed, Ernie Chang, Merel Scholman

    Abstract: Developing effective spoken language processing systems for low-resource languages poses several challenges due to the lack of parallel data and limited resources for fine-tuning models. In this work, we target on improving upon both text classification and translation of Nigerian Pidgin (Naija) by collecting a large-scale parallel English-Pidgin corpus and further propose a framework of cross-lin… ▽ More

    Submitted 1 July, 2023; originally announced July 2023.

    Comments: To appear in INTERSPEECH 2023

  27. arXiv:2306.02630  [pdf, other

    stat.ML cs.LG

    Covariance Adaptive Best Arm Identification

    Authors: El Mehdi Saad, Gilles Blanchard, Nicolas Verzelen

    Abstract: We consider the problem of best arm identification in the multi-armed bandit model, under fixed confidence. Given a confidence input $δ$, the goal is to identify the arm with the highest mean reward with a probability of at least 1 -- $δ$, while minimizing the number of arm pulls. While the literature provides solutions to this problem under the assumption of independent arms distributions, we pro… ▽ More

    Submitted 20 December, 2023; v1 submitted 5 June, 2023; originally announced June 2023.

    Comments: New version with some minor corrections

    Journal ref: Neurips 2023

  28. arXiv:2306.02628  [pdf, other

    stat.ML cs.LG

    Active Ranking of Experts Based on their Performances in Many Tasks

    Authors: El Mehdi Saad, Nicolas Verzelen, Alexandra Carpentier

    Abstract: We consider the problem of ranking n experts based on their performances on d tasks. We make a monotonicity assumption stating that for each pair of experts, one outperforms the other on all tasks. We consider the sequential setting where in each round, the learner has access to noisy evaluations of actively chosen pair of expert-task, given the information available up to the actual round. Given… ▽ More

    Submitted 5 June, 2023; originally announced June 2023.

  29. arXiv:2304.13253  [pdf, other

    cs.CR cs.CY cs.LG cs.SE

    Analyzing In-browser Cryptojacking

    Authors: Muhammad Saad, David Mohaisen

    Abstract: Cryptojacking is the permissionless use of a target device to covertly mine cryptocurrencies. With cryptojacking, attackers use malicious JavaScript codes to force web browsers into solving proof-of-work puzzles, thus making money by exploiting the resources of the website visitors. To understand and counter such attacks, we systematically analyze the static, dynamic, and economic aspects of in-br… ▽ More

    Submitted 25 April, 2023; originally announced April 2023.

    Comments: 14 pages, 11 tables, 8 figures, and 69 references. arXiv admin note: substantial text overlap with arXiv:1809.02152

  30. arXiv:2304.00472  [pdf, other

    cs.DB cs.AI

    Querying Large Language Models with SQL

    Authors: Mohammed Saeed, Nicola De Cao, Paolo Papotti

    Abstract: In many use-cases, information is stored in text but not available in structured data. However, extracting data from natural language text to precisely fit a schema, and thus enable querying, is a challenging task. With the rise of pre-trained Large Language Models (LLMs), there is now an effective solution to store and use information extracted from massive corpora of text documents. Thus, we env… ▽ More

    Submitted 25 October, 2023; v1 submitted 2 April, 2023; originally announced April 2023.

    Comments: Accepted for presentation at EDBT 2024 as Vision paper

  31. arXiv:2303.13055  [pdf, other

    cs.HC cs.LG

    Reimagining Application User Interface (UI) Design using Deep Learning Methods: Challenges and Opportunities

    Authors: Subtain Malik, Muhammad Tariq Saeed, Marya Jabeen Zia, Shahzad Rasool, Liaquat Ali Khan, Mian Ilyas Ahmed

    Abstract: In this paper, we present a review of the recent work in deep learning methods for user interface design. The survey encompasses well known deep learning techniques (deep neural networks, convolutional neural networks, recurrent neural networks, autoencoders, and generative adversarial networks) and datasets widely used to design user interface applications. We highlight important problems and eme… ▽ More

    Submitted 23 March, 2023; originally announced March 2023.

    Comments: A review paper on studies of UI design techniques and deep learning

  32. arXiv:2303.08729  [pdf, other

    cs.SE cs.AI cs.LG cs.PL

    DACOS-A Manually Annotated Dataset of Code Smells

    Authors: Himesh Nandani, Mootez Saad, Tushar Sharma

    Abstract: Researchers apply machine-learning techniques for code smell detection to counter the subjectivity of many code smells. Such approaches need a large, manually annotated dataset for training and benchmarking. Existing literature offers a few datasets; however, they are small in size and, more importantly, do not focus on the subjective code snippets. In this paper, we present DACOS, a manually anno… ▽ More

    Submitted 15 March, 2023; originally announced March 2023.

    Comments: 4 pages

  33. arXiv:2303.06129  [pdf, other

    cs.CV

    Single-branch Network for Multimodal Training

    Authors: Muhammad Saad Saeed, Shah Nawaz, Muhammad Haris Khan, Muhammad Zaigham Zaheer, Karthik Nandakumar, Muhammad Haroon Yousaf, Arif Mahmood

    Abstract: With the rapid growth of social media platforms, users are sharing billions of multimedia posts containing audio, images, and text. Researchers have focused on building autonomous systems capable of processing such multimedia data to solve challenging multimodal tasks including cross-modal retrieval, matching, and verification. Existing works use separate networks to extract embeddings of each mod… ▽ More

    Submitted 10 March, 2023; originally announced March 2023.

    Comments: Accepted at ICASSP 2023

  34. arXiv:2302.13033  [pdf, other

    cs.SD cs.CV cs.MM eess.AS

    Speaker Recognition in Realistic Scenario Using Multimodal Data

    Authors: Saqlain Hussain Shah, Muhammad Saad Saeed, Shah Nawaz, Muhammad Haroon Yousaf

    Abstract: In recent years, an association is established between faces and voices of celebrities leveraging large scale audio-visual information from YouTube. The availability of large scale audio-visual datasets is instrumental in developing speaker recognition methods based on standard Convolutional Neural Networks. Thus, the aim of this paper is to leverage large scale audio-visual information to improve… ▽ More

    Submitted 25 February, 2023; originally announced February 2023.

    Comments: Accepted at the International Conference on Artificial Intelligence (ICAI'2023)

  35. arXiv:2211.12009  [pdf

    cs.CV cs.AI

    Deep-Learning-Based Computer Vision Approach For The Segmentation Of Ball Deliveries And Tracking In Cricket

    Authors: Kumail Abbas, Muhammad Saeed, M. Imad Khan, Khandakar Ahmed, Hua Wang

    Abstract: There has been a significant increase in the adoption of technology in cricket recently. This trend has created the problem of duplicate work being done in similar computer vision-based research works. Our research tries to solve one of these problems by segmenting ball deliveries in a cricket broadcast using deep learning models, MobileNet and YOLO, thus enabling researchers to use our work as a… ▽ More

    Submitted 21 November, 2022; originally announced November 2022.

  36. arXiv:2210.06334  [pdf, other

    eess.IV cs.AI cs.CV cs.LG

    A Self-attention Guided Multi-scale Gradient GAN for Diversified X-ray Image Synthesis

    Authors: Muhammad Muneeb Saad, Mubashir Husain Rehmani, Ruairi O'Reilly

    Abstract: Imbalanced image datasets are commonly available in the domain of biomedical image analysis. Biomedical images contain diversified features that are significant in predicting targeted diseases. Generative Adversarial Networks (GANs) are utilized to address the data limitation problem via the generation of synthetic images. Training challenges such as mode collapse, non-convergence, and instability… ▽ More

    Submitted 12 November, 2022; v1 submitted 9 October, 2022; originally announced October 2022.

    Comments: Accepted in AICS-2022 Conference

  37. arXiv:2208.10238  [pdf, other

    cs.CV

    Learning Branched Fusion and Orthogonal Projection for Face-Voice Association

    Authors: Muhammad Saad Saeed, Shah Nawaz, Muhammad Haris Khan, Sajid Javed, Muhammad Haroon Yousaf, Alessio Del Bue

    Abstract: Recent years have seen an increased interest in establishing association between faces and voices of celebrities leveraging audio-visual information from YouTube. Prior works adopt metric learning methods to learn an embedding space that is amenable for associated matching and verification tasks. Albeit showing some progress, such formulations are, however, restrictive due to dependency on distanc… ▽ More

    Submitted 22 August, 2022; originally announced August 2022.

    Comments: Submitted: IEEE Transactions on Multimedia. arXiv admin note: substantial text overlap with arXiv:2112.10483

  38. arXiv:2208.09214  [pdf, other

    cs.IR cs.AI cs.DB

    Crowdsourced Fact-Checking at Twitter: How Does the Crowd Compare With Experts?

    Authors: Mohammed Saeed, Nicolas Traub, Maelle Nicolas, Gianluca Demartini, Paolo Papotti

    Abstract: Fact-checking is one of the effective solutions in fighting online misinformation. However, traditional fact-checking is a process requiring scarce expert human resources, and thus does not scale well on social media because of the continuous flow of new content to be checked. Methods based on crowdsourcing have been proposed to tackle this challenge, as they can scale with a smaller cost, but, wh… ▽ More

    Submitted 19 August, 2022; originally announced August 2022.

    Journal ref: Proceedings of the 31st ACM International Conference on Information and Knowledge Management (CIKM 2022)

  39. arXiv:2208.08224  [pdf, other

    cs.CV eess.IV

    Blind-Spot Collision Detection System for Commercial Vehicles Using Multi Deep CNN Architecture

    Authors: Muhammad Muzammel, Mohd Zuki Yusoff, Mohamad Naufal Mohamad Saad, Faryal Sheikh, Muhammad Ahsan Awais

    Abstract: Buses and heavy vehicles have more blind spots compared to cars and other road vehicles due to their large sizes. Therefore, accidents caused by these heavy vehicles are more fatal and result in severe injuries to other road users. These possible blind-spot collisions can be identified early using vision-based object detection approaches. Yet, the existing state-of-the-art vision-based object dete… ▽ More

    Submitted 19 August, 2022; v1 submitted 17 August, 2022; originally announced August 2022.

  40. arXiv:2208.05593  [pdf, other

    eess.IV cs.CV

    Evaluating the Quality and Diversity of DCGAN-based Generatively Synthesized Diabetic Retinopathy Imagery

    Authors: Cristina-Madalina Dragan, Muhammad Muneeb Saad, Mubashir Husain Rehmani, Ruairi O'Reilly

    Abstract: Publicly available diabetic retinopathy (DR) datasets are imbalanced, containing limited numbers of images with DR. This imbalance contributes to overfitting when training machine learning classifiers. The impact of this imbalance is exacerbated as the severity of the DR stage increases, affecting the classifiers' diagnostic capacity. The imbalance can be addressed using Generative Adversarial Net… ▽ More

    Submitted 30 August, 2023; v1 submitted 10 August, 2022; originally announced August 2022.

    Comments: 29 Pages, 8 Figures, submitted to MEDAL23: Advances in Deep Generative Models for Medical Artificial Intelligence (Springer Nature series)

  41. arXiv:2208.04705  [pdf, other

    cs.CY cs.LG eess.SY

    Classification of Stress via Ambulatory ECG and GSR Data

    Authors: Zachary Dair, Muhammad Muneeb Saad, Urja Pawar, Samantha Dockray, Ruairi O'Reilly

    Abstract: In healthcare, detecting stress and enabling individuals to monitor their mental health and wellbeing is challenging. Advancements in wearable technology now enable continuous physiological data collection. This data can provide insights into mental health and behavioural states through psychophysiological analysis. However, automated analysis is required to provide timely results due to the quant… ▽ More

    Submitted 8 June, 2023; v1 submitted 19 July, 2022; originally announced August 2022.

    Comments: Associated Code to enable reproducible experimental work - https://github.com/ZacDair/EMBC_Release SMILE dataset provided by Computational Wellbeing Group (COMPWELL) https://compwell.rice.edu/workshops/embc2022/dataset - https://compwell.rice.edu/

    ACM Class: I.2.m; J.3; J.4

    Journal ref: EMBC 2022 Compwell Workshop

  42. Medical Dataset Classification for Kurdish Short Text over Social Media

    Authors: Ari M. Saeed, Shnya R. Hussein, Chro M. Ali, Tarik A. Rashid

    Abstract: The Facebook application is used as a resource for collecting the comments of this dataset, The dataset consists of 6756 comments to create a Medical Kurdish Dataset (MKD). The samples are comments of users, which are gathered from different posts of pages (Medical, News, Economy, Education, and Sport). Six steps as a preprocessing technique are performed on the raw dataset to clean and remove noi… ▽ More

    Submitted 26 March, 2022; originally announced April 2022.

    Comments: 11 pages

    Journal ref: DIB, 2020

  43. arXiv:2202.02489  [pdf, other

    cs.CV

    Investigating the Challenges of Class Imbalance and Scale Variation in Object Detection in Aerial Images

    Authors: Ahmed Elhagry, Mohamed Saeed

    Abstract: While object detection is a common problem in computer vision, it is even more challenging when dealing with aerial satellite images. The variety in object scales and orientations can make them difficult to identify. In addition, there can be large amounts of densely packed small objects such as cars. In this project, we propose a few changes to the Faster-RCNN architecture. First, we experiment w… ▽ More

    Submitted 4 February, 2022; originally announced February 2022.

  44. arXiv:2201.12946  [pdf, ps, other

    quant-ph cs.ET

    Pauli Error Propagation-Based Gate Reschedulingfor Quantum Circuit Error Mitigation

    Authors: Vedika Saravanan, Samah Mohamed Saeed

    Abstract: Noisy Intermediate-Scale Quantum (NISQ) algorithms, which run on noisy quantum computers should be carefully designed to boost the output state fidelity. While several compilation approaches have been proposed to minimize circuit errors, they often omit the detailed circuit structure information that does not affect the circuit depth or the gate count. In the presence of spatial variation in the e… ▽ More

    Submitted 30 January, 2022; originally announced January 2022.

  45. arXiv:2201.10324  [pdf, other

    eess.IV cs.CV cs.LG

    Addressing the Intra-class Mode Collapse Problem using Adaptive Input Image Normalization in GAN-based X-ray Images

    Authors: Muhammad Muneeb Saad, Mubashir Husain Rehmani, Ruairi O'Reilly

    Abstract: Biomedical image datasets can be imbalanced due to the rarity of targeted diseases. Generative Adversarial Networks play a key role in addressing this imbalance by enabling the generation of synthetic images to augment datasets. It is important to generate synthetic images that incorporate a diverse range of features to accurately represent the distribution of features present in the training imag… ▽ More

    Submitted 12 April, 2022; v1 submitted 25 January, 2022; originally announced January 2022.

    Comments: Accepted to the IEEE EMBC22 Conference

  46. arXiv:2201.07646  [pdf, other

    cs.LG cs.CV

    A Survey on Training Challenges in Generative Adversarial Networks for Biomedical Image Analysis

    Authors: Muhammad Muneeb Saad, Ruairi O'Reilly, Mubashir Husain Rehmani

    Abstract: In biomedical image analysis, the applicability of deep learning methods is directly impacted by the quantity of image data available. This is due to deep learning models requiring large image datasets to provide high-level performance. Generative Adversarial Networks (GANs) have been widely utilized to address data limitations through the generation of synthetic biomedical images. GANs consist of… ▽ More

    Submitted 10 August, 2023; v1 submitted 19 January, 2022; originally announced January 2022.

    Comments: Submitted to the AI Review Journal

  47. arXiv:2201.07219  [pdf, other

    eess.IV cs.CV cs.LG

    Contrastive Pretraining for Echocardiography Segmentation with Limited Data

    Authors: Mohamed Saeed, Rand Muhtaseb, Mohammad Yaqub

    Abstract: Contrastive learning has proven useful in many applications where access to labelled data is limited. The lack of annotated data is particularly problematic in medical image segmentation as it is difficult to have clinical experts manually annotate large volumes of data such as cardiac structures in ultrasound images of the heart. In this paper, We propose a self supervised contrastive learning me… ▽ More

    Submitted 14 July, 2022; v1 submitted 16 January, 2022; originally announced January 2022.

  48. arXiv:2112.10483  [pdf, other

    cs.CV

    Fusion and Orthogonal Projection for Improved Face-Voice Association

    Authors: Muhammad Saad Saeed, Muhammad Haris Khan, Shah Nawaz, Muhammad Haroon Yousaf, Alessio Del Bue

    Abstract: We study the problem of learning association between face and voice, which is gaining interest in the computer vision community lately. Prior works adopt pairwise or triplet loss formulations to learn an embedding space amenable for associated matching and verification tasks. Albeit showing some progress, such loss formulations are, however, restrictive due to dependency on distance-dependent marg… ▽ More

    Submitted 20 December, 2021; originally announced December 2021.

  49. arXiv:2112.00443  [pdf, other

    cs.CR cs.CY cs.SI

    TROLLMAGNIFIER: Detecting State-Sponsored Troll Accounts on Reddit

    Authors: Mohammad Hammas Saeed, Shiza Ali, Jeremy Blackburn, Emiliano De Cristofaro, Savvas Zannettou, Gianluca Stringhini

    Abstract: Growing evidence points to recurring influence campaigns on social media, often sponsored by state actors aiming to manipulate public opinion on sensitive political topics. Typically, campaigns are performed through instrumented accounts, known as troll accounts; despite their prominence, however, little work has been done to detect these accounts in the wild. In this paper, we present TROLLMAGNIF… ▽ More

    Submitted 1 December, 2021; originally announced December 2021.

  50. arXiv:2110.14205  [pdf, other

    cs.LG

    FedPrune: Towards Inclusive Federated Learning

    Authors: Muhammad Tahir Munir, Muhammad Mustansar Saeed, Mahad Ali, Zafar Ayyub Qazi, Ihsan Ayyub Qazi

    Abstract: Federated learning (FL) is a distributed learning technique that trains a shared model over distributed data in a privacy-preserving manner. Unfortunately, FL's performance degrades when there is (i) variability in client characteristics in terms of computational and memory resources (system heterogeneity) and (ii) non-IID data distribution across clients (statistical heterogeneity). For example,… ▽ More

    Submitted 27 October, 2021; originally announced October 2021.