Showing 1–9 of 9 results for author: Kasem, M

Searching in archive cs.
  1. arXiv:2406.04493  [pdf, other]

    cs.CV cs.CL

    CORU: Comprehensive Post-OCR Parsing and Receipt Understanding Dataset

    Authors: Abdelrahman Abdallah, Mahmoud Abdalla, Mahmoud SalahEldin Kasem, Mohamed Mahmoud, Ibrahim Abdelhalim, Mohamed Elkasaby, Yasser ElBendary, Adam Jatowt

    Abstract: In the fields of Optical Character Recognition (OCR) and Natural Language Processing (NLP), integrating multilingual capabilities remains a critical challenge, especially when considering languages with complex scripts such as Arabic. This paper introduces the Comprehensive Post-OCR Parsing and Receipt Understanding Dataset (CORU), a novel dataset specifically designed to enhance OCR and informati…

    Submitted 6 June, 2024; originally announced June 2024.

  2. arXiv:2405.05900  [pdf, other]

    cs.CV

    A Comprehensive Survey of Masked Faces: Recognition, Detection, and Unmasking

    Authors: Mohamed Mahmoud, Mahmoud SalahEldin Kasem, Hyun-Soo Kang

    Abstract: Masked face recognition (MFR) has emerged as a critical domain in biometric identification, especially since the global COVID-19 pandemic, which introduced widespread face masks. This survey paper presents a comprehensive analysis of the challenges and advancements in recognising and detecting individuals with masked faces, which has seen innovative shifts due to the necessity of adapting to new soci…

    Submitted 9 May, 2024; originally announced May 2024.

  3. arXiv:2403.17848  [pdf, other]

    cs.CL cs.IR

    ArabicaQA: A Comprehensive Dataset for Arabic Question Answering

    Authors: Abdelrahman Abdallah, Mahmoud Kasem, Mahmoud Abdalla, Mohamed Mahmoud, Mohamed Elkasaby, Yasser Elbendary, Adam Jatowt

    Abstract: In this paper, we address the significant gap in Arabic natural language processing (NLP) resources by introducing ArabicaQA, the first large-scale dataset for machine reading comprehension and open-domain question answering in Arabic. This comprehensive dataset, consisting of 89,095 answerable and 3,701 unanswerable questions created by crowdworkers to look similar to answerable ones, along with…

    Submitted 26 March, 2024; originally announced March 2024.

    Comments: Accepted at SIGIR 2024

  4. arXiv:2312.11812  [pdf, other]

    cs.CV cs.AI

    Advancements and Challenges in Arabic Optical Character Recognition: A Comprehensive Survey

    Authors: Mahmoud SalahEldin Kasem, Mohamed Mahmoud, Hyun-Soo Kang

    Abstract: Optical character recognition (OCR) is a vital process that involves the extraction of handwritten or printed text from scanned or printed images, converting it into a format that can be understood and processed by machines. This enables further data processing activities such as searching and editing. The automatic extraction of text through OCR plays a crucial role in digitizing documents, enhan…

    Submitted 18 December, 2023; originally announced December 2023.

  5. arXiv:2302.01786  [pdf, other]

    cs.AI

    Customer Profiling, Segmentation, and Sales Prediction using AI in Direct Marketing

    Authors: Mahmoud SalahEldin Kasem, Mohamed Hamada, Islam Taj-Eddin

    Abstract: In an increasingly customer-centric business environment, effective communication between marketing and senior management is crucial for success. With the rise of globalization and increased competition, utilizing new data mining techniques to identify potential customers is essential for direct marketing efforts. This paper proposes a data mining preprocessing method for developing a customer pro…

    Submitted 3 February, 2023; originally announced February 2023.

  6. arXiv:2211.08469  [pdf, other]

    cs.CV

    Deep learning for table detection and structure recognition: A survey

    Authors: Mahmoud Kasem, Abdelrahman Abdallah, Alexander Berendeyev, Ebrahem Elkady, Mahmoud Abdalla, Mohamed Mahmoud, Mohamed Hamada, Daniyar Nurseitov, Islam Taj-Eddin

    Abstract: Tables are everywhere, from scientific journals, papers, websites, and newspapers all the way to items we buy at the supermarket. Detecting them is thus of utmost importance to automatically understanding the content of a document. The performance of table detection has substantially increased thanks to the rapid development of deep learning networks. The goals of this survey are to provide a prof…

    Submitted 15 November, 2022; originally announced November 2022.

  7. KOHTD: Kazakh Offline Handwritten Text Dataset

    Authors: Nazgul Toiganbayeva, Mahmoud Kasem, Galymzhan Abdimanap, Kairat Bostanbekov, Abdelrahman Abdallah, Anel Alimova, Daniyar Nurseitov

    Abstract: Despite the transition to digital information exchange, many documents, such as invoices, taxes, memos and questionnaires, historical data, and answers to exam questions, still require handwritten inputs. In this regard, there is a need to implement Handwritten Text Recognition (HTR) which is an automatic way to decrypt records using a computer. Handwriting recognition is challenging because of th…

    Submitted 22 September, 2021; originally announced October 2021.

    Journal ref: Signal Processing: Image Communication, Volume 108, October 2022

  8. arXiv:2005.10416  [pdf]

    cs.CL cs.AI cs.LG

    Automated Question Answer medical model based on Deep Learning Technology

    Authors: Abdelrahman Abdallah, Mahmoud Kasem, Mohamed Hamada, Shaymaa Sdeek

    Abstract: Artificial intelligence can now provide more solutions for different problems, especially in the medical field. One of those problems is the lack of answers to any given medical/health-related question. The Internet is full of forums that allow people to ask some specific questions and get great answers for them. Nevertheless, browsing these questions in order to locate one similar to your own, also…

    Submitted 20 May, 2020; originally announced May 2020.

    Report number: 13

    Journal ref: ICEMIS'20: Proceedings of the 6th International Conference on Engineering & MIS 2020

  9. arXiv:1403.3061  [pdf]

    cs.IT

    A Comparative Study of Audio Compression Based on Compressed Sensing and Sparse Fast Fourier Transform (SFFT): Performance and Challenges

    Authors: Hossam M. Kasem, Maha El-Sabrouty

    Abstract: Audio compression has become one of the basic multimedia technologies. Choosing an efficient compression scheme that is capable of preserving the signal quality while providing a high compression ratio is desirable in the different standards worldwide. In this paper we study the application of two highly acclaimed sparse signal processing algorithms, namely, Compressed Sensing (CS) and Sparse Fast…

    Submitted 12 March, 2014; originally announced March 2014.