-
ArzEn-LLM: Code-Switched Egyptian Arabic-English Translation and Speech Recognition Using LLMs
Authors:
Ahmed Heakl,
Youssef Zaghloul,
Mennatullah Ali,
Rania Hossam,
Walid Gomaa
Abstract:
Motivated by the widespread increase in the phenomenon of code-switching between Egyptian Arabic and English in recent times, this paper explores the intricacies of machine translation (MT) and automatic speech recognition (ASR) systems, focusing on translating code-switched Egyptian Arabic-English to either English or Egyptian Arabic. Our goal is to present the methodologies employed in developin…
▽ More
Motivated by the widespread increase in the phenomenon of code-switching between Egyptian Arabic and English in recent times, this paper explores the intricacies of machine translation (MT) and automatic speech recognition (ASR) systems, focusing on translating code-switched Egyptian Arabic-English to either English or Egyptian Arabic. Our goal is to present the methodologies employed in developing these systems, utilizing large language models such as LLama and Gemma. In the field of ASR, we explore the utilization of the Whisper model for code-switched Egyptian Arabic recognition, detailing our experimental procedures including data preprocessing and training techniques. Through the implementation of a consecutive speech-to-text translation system that integrates ASR with MT, we aim to overcome challenges posed by limited resources and the unique characteristics of the Egyptian Arabic dialect. Evaluation against established metrics showcases promising results, with our methodologies yielding a significant improvement of $56\%$ in English translation over the state-of-the-art and $9.3\%$ in Arabic translation. Since code-switching is deeply inherent in spoken languages, it is crucial that ASR systems can effectively handle this phenomenon. This capability is crucial for enabling seamless interaction in various domains, including business negotiations, cultural exchanges, and academic discourse. Our models and code are available as open-source resources. Code: \url{http://github.com/ahmedheakl/arazn-llm}}, Models: \url{http://huggingface.co/collections/ahmedheakl/arazn-llm-662ceaf12777656607b9524e}.
△ Less
Submitted 12 July, 2024; v1 submitted 26 June, 2024;
originally announced June 2024.
-
DroneVis: Versatile Computer Vision Library for Drones
Authors:
Ahmed Heakl,
Fatma Youssef,
Victor Parque,
Walid Gomaa
Abstract:
This paper introduces DroneVis, a novel library designed to automate computer vision algorithms on Parrot drones. DroneVis offers a versatile set of features and provides a diverse range of computer vision tasks along with a variety of models to choose from. Implemented in Python, the library adheres to high-quality code standards, facilitating effortless customization and feature expansion accord…
▽ More
This paper introduces DroneVis, a novel library designed to automate computer vision algorithms on Parrot drones. DroneVis offers a versatile set of features and provides a diverse range of computer vision tasks along with a variety of models to choose from. Implemented in Python, the library adheres to high-quality code standards, facilitating effortless customization and feature expansion according to user requirements. In addition, comprehensive documentation is provided, encompassing usage guidelines and illustrative use cases. Our documentation, code, and examples are available in https://github.com/ahmedheakl/drone-vis.
△ Less
Submitted 1 June, 2024;
originally announced June 2024.
-
BlockCampus: A Blockchain-Based DApp for enhancing Student Engagement and Reward Mechanisms in an Academic Community for E-JUST University
Authors:
Mariam Ayman,
Youssef El-harty,
Ahmed Rashed,
Ahmed Fathy,
Ahmed Abdullah,
Omar Wassim,
Walid Gomaa
Abstract:
In today's digital age, online communities have become an integral part of our lives, fostering collaboration, knowledge sharing, and community engagement. Higher education institutions, in particular, can greatly benefit from dedicated platforms that facilitate academic discussions and provide incentives for active participation. This research paper presents a comprehensive study and implementati…
▽ More
In today's digital age, online communities have become an integral part of our lives, fostering collaboration, knowledge sharing, and community engagement. Higher education institutions, in particular, can greatly benefit from dedicated platforms that facilitate academic discussions and provide incentives for active participation. This research paper presents a comprehensive study and implementation of a decentralized application (DApp) leveraging the blockchain technology to address these needs specifically for E-JUST (Egypt-Japan University of Science and Technology) students and academic staff.
△ Less
Submitted 7 July, 2023;
originally announced July 2023.
-
Markov Switching Model for Driver Behavior Prediction: Use cases on Smartphones
Authors:
Ahmed B. Zaky,
Mohamed A. Khamis,
Walid Gomaa
Abstract:
Several intelligent transportation systems focus on studying the various driver behaviors for numerous objectives. This includes the ability to analyze driver actions, sensitivity, distraction, and response time. As the data collection is one of the major concerns for learning and validating different driving situations, we present a driver behavior switching model validated by a low-cost data col…
▽ More
Several intelligent transportation systems focus on studying the various driver behaviors for numerous objectives. This includes the ability to analyze driver actions, sensitivity, distraction, and response time. As the data collection is one of the major concerns for learning and validating different driving situations, we present a driver behavior switching model validated by a low-cost data collection solution using smartphones. The proposed model is validated using a real dataset to predict the driver behavior in short duration periods. A literature survey on motion detection (specifically driving behavior detection using smartphones) is presented. Multiple Markov Switching Variable Auto-Regression (MSVAR) models are implemented to achieve a sophisticated fitting with the collected driver behavior data. This yields more accurate predictions not only for driver behavior but also for the entire driving situation. The performance of the presented models together with a suitable model selection criteria is also presented. The proposed driver behavior prediction framework can potentially be used in accident prediction and driver safety systems.
△ Less
Submitted 29 August, 2021;
originally announced August 2021.
-
Bearings Fault Detection Using Hidden Markov Models and Principal Component Analysis Enhanced Features
Authors:
Akthem Rehab,
Islam Ali,
Walid Gomaa,
M. Nashat Fors
Abstract:
Asset health monitoring continues to be of increasing importance on productivity, reliability, and cost reduction. Early Fault detection is a keystone of health management as part of the emerging Prognostics and Health Management (PHM) philosophy. This paper proposes a Hidden Markov Model (HMM) to assess the machine health degradation. using Principal Component Analysis (PCA) to enhance features e…
▽ More
Asset health monitoring continues to be of increasing importance on productivity, reliability, and cost reduction. Early Fault detection is a keystone of health management as part of the emerging Prognostics and Health Management (PHM) philosophy. This paper proposes a Hidden Markov Model (HMM) to assess the machine health degradation. using Principal Component Analysis (PCA) to enhance features extracted from vibration signals is considered. The enhanced features capture the second order structure of the data. The experimental results based on a bearing test bed show the plausibility of the proposed method.
△ Less
Submitted 21 April, 2021;
originally announced April 2021.
-
CrossCount: A Deep Learning System for Device-free Human Counting using WiFi
Authors:
Osama T. Ibrahim,
Walid Gomaa,
Moustafa Youssef
Abstract:
Counting humans is an essential part of many people-centric applications. In this paper, we propose CrossCount: an accurate deep-learning-based human count estimator that uses a single WiFi link to estimate the human count in an area of interest. The main idea is to depend on the temporal link-blockage pattern as a discriminant feature that is more robust to wireless channel noise than the signal…
▽ More
Counting humans is an essential part of many people-centric applications. In this paper, we propose CrossCount: an accurate deep-learning-based human count estimator that uses a single WiFi link to estimate the human count in an area of interest. The main idea is to depend on the temporal link-blockage pattern as a discriminant feature that is more robust to wireless channel noise than the signal strength, hence delivering a ubiquitous and accurate human counting system. As part of its design, CrossCount addresses a number of deep learning challenges such as class imbalance and training data augmentation for enhancing the model generalizability. Implementation and evaluation of CrossCount in multiple testbeds show that it can achieve a human counting accuracy to within a maximum of 2 persons 100% of the time. This highlights the promise of CrossCount as a ubiquitous crowd estimator with non-labour-intensive data collection from off-the-shelf devices.
△ Less
Submitted 6 July, 2020;
originally announced July 2020.
-
Zero-Calibration Device-free Localization for the IoT based on Participatory Sensing
Authors:
Osama T. Ibrahim,
Walid Gomaa,
Moustafa Youssef
Abstract:
Device-free localization (DFL) is an emerging technology for estimating the position of a human or object that is not equipped with any electronic tag, nor participate actively in the localization process. Similar to device-based localization, the initial phase in DFL is to build the fingerprint database which is usually done manually using site surveying. This process is tedious, time-consuming,…
▽ More
Device-free localization (DFL) is an emerging technology for estimating the position of a human or object that is not equipped with any electronic tag, nor participate actively in the localization process. Similar to device-based localization, the initial phase in DFL is to build the fingerprint database which is usually done manually using site surveying. This process is tedious, time-consuming, and vulnerable to environmental dynamics. Motivated by the recent advances in the Internet of Things (IoT), this paper introduces RadioGrapher; a system that automates the process of device-free fingerprint calibration in IoT environments. RadioGrapher leverages the device-based locations of entities in the area of interest in a crowd-sensing manner, aided with Fresnel zones of the wirelessly connected IoT devices to automatically construct a device-free fingerprint. Experimental evaluation of RadioGrapher in an IoT testbed using multiple entities shows that it can construct DFL fingerprints with high accuracy. Moreover, its median localization accuracy is comparable to that of manual fingerprinting. This comes with no calibration overhead, highlighting the promise of RadioGrapher as a crowdsourcing device-free fingerprint constructor in IoT environments.
△ Less
Submitted 8 September, 2019;
originally announced September 2019.
-
Trans-Sense: Real Time Transportation Schedule Estimation Using Smart Phones
Authors:
Ali AbdelAziz,
Amin Shoukry,
Walid Gomaa,
Moustafa Youssef
Abstract:
Developing countries suffer from traffic congestion, poorly planned road/rail networks, and lack of access to public transportation facilities. This context results in an increase in fuel consumption, pollution level, monetary losses, massive delays, and less productivity. On the other hand, it has a negative impact on the commuters feelings and moods. Availability of real-time transit information…
▽ More
Developing countries suffer from traffic congestion, poorly planned road/rail networks, and lack of access to public transportation facilities. This context results in an increase in fuel consumption, pollution level, monetary losses, massive delays, and less productivity. On the other hand, it has a negative impact on the commuters feelings and moods. Availability of real-time transit information - by providing public transportation vehicles locations using GPS devices - helps in estimating a passenger's waiting time and addressing the above issues. However, such solution is expensive for developing countries. This paper aims at designing and implementing a crowd-sourced mobile phones-based solution to estimate the expected waiting time of a passenger in public transit systems, the prediction of the remaining time to get on/off a vehicle, and to construct a real time public transit schedule. Trans-Sense has been evaluated using real data collected for over 800 hours, on a daily basis, by different Android phones, and using different light rail transit lines at different time spans. The results show that Trans-Sense can achieve an average recall and precision of 95.35% and 90.1%, respectively, in discriminating lightrail stations. Moreover, the empirical distributions governing the different time delays affecting a passenger's total trip time enable predicting the right time of arrival of a passenger to her destination with an accuracy of 91.81%.In addition, the system estimates the stations dimensions with an accuracy of 95.71%.
△ Less
Submitted 13 June, 2019;
originally announced June 2019.
-
MARL-FWC: Optimal Coordination of Freeway Traffic Control Measures
Authors:
Ahmed Fares,
Walid Gomaa,
Mohamed A. Khamis
Abstract:
The objective of this article is to optimize the overall traffic flow on freeways using multiple ramp metering controls plus its complementary Dynamic Speed Limits (DSLs). An optimal freeway operation can be reached when minimizing the difference between the freeway density and the critical ratio for maximum traffic flow. In this article, a Multi-Agent Reinforcement Learning for Freeways Control (…
▽ More
The objective of this article is to optimize the overall traffic flow on freeways using multiple ramp metering controls plus its complementary Dynamic Speed Limits (DSLs). An optimal freeway operation can be reached when minimizing the difference between the freeway density and the critical ratio for maximum traffic flow. In this article, a Multi-Agent Reinforcement Learning for Freeways Control (MARL-FWC) system for ramps metering and DSLs is proposed. MARL-FWC introduces a new microscopic framework at the network level based on collaborative Markov Decision Process modeling (Markov game) and an associated cooperative Q-learning algorithm. The technique incorporates payoff propagation (Max-Plus algorithm) under the coordination graphs framework, particularly suited for optimal control purposes. MARL-FWC provides three control designs: fully independent, fully distributed, and centralized; suited for different network architectures. MARL-FWC was extensively tested in order to assess the proposed model of the joint payoff, as well as the global payoff. Experiments are conducted with heavy traffic flow under the renowned VISSIM traffic simulator to evaluate MARL-FWC. The experimental results show a significant decrease in the total travel time and an increase in the average speed (when compared with the base case) while maintaining an optimal traffic flow.
△ Less
Submitted 27 August, 2018;
originally announced August 2018.
-
Cyber Physical Systems: Prospects and Challenges
Authors:
Walid Gomaa
Abstract:
Cyber physical systems CPSs embodies the conception as well as the implementation of the integration of the state-of-art technologies in sensing, communication, computing, and control. Such systems incorporate new trends such as cloud computing, mobile computing, mobile sensing, new modes of communications, wearables, etc. In this article we give an exposition of the architecture of a typical CPS…
▽ More
Cyber physical systems CPSs embodies the conception as well as the implementation of the integration of the state-of-art technologies in sensing, communication, computing, and control. Such systems incorporate new trends such as cloud computing, mobile computing, mobile sensing, new modes of communications, wearables, etc. In this article we give an exposition of the architecture of a typical CPS system and the prospects of such systems in the development of the modern world. We illustrate the three major challenges faced by a CPS system: the need for rigorous numerical computation, the limitation of the current wireless communication bandwidth, and the computation/storage limitation by mobility and energy consumption. We address each one of these exposing the current techniques devised to solve each one of them.
△ Less
Submitted 14 February, 2018;
originally announced February 2018.
-
A Framework for Algebraic Characterizations in Recursive Analysis
Authors:
Olivier Bournez,
Walid Gomaa,
Emmanuel Hainry
Abstract:
Algebraic characterizations of the computational aspects of functions defined over the real numbers provide very effective tool to understand what computability and complexity over the reals, and generally over continuous spaces, mean. This is relevant for both communities of computer scientists and mathematical analysts, particularly the latter who do not understand (and/or like) the language of…
▽ More
Algebraic characterizations of the computational aspects of functions defined over the real numbers provide very effective tool to understand what computability and complexity over the reals, and generally over continuous spaces, mean. This is relevant for both communities of computer scientists and mathematical analysts, particularly the latter who do not understand (and/or like) the language of machines and string encodings. Recursive analysis can be considered the most standard framework of computation over continuous spaces, it is however defined in a very machine specific way which does not leave much to intuitiveness. Recently several characterizations, in the form of function algebras, of recursively computable functions and some sub-recursive classes were introduced. These characterizations shed light on the hidden behavior of recursive analysis as they convert complex computational operations on sequences of real objects to "simple" intuitive mathematical operations such as integration or taking limits. The authors previously presented a framework for obtaining algebraic characterizations at the complexity level over compact domains. The current paper presents a comprehensive extension to that framework. Though we focus our attention in this paper on functions defined over the whole real line, the framework, and accordingly the obtained results, can be easily extended to functions defined over arbitrary domains.
△ Less
Submitted 26 September, 2016;
originally announced September 2016.
-
Deep learning is competing random forest in computational docking
Authors:
Mohamed Khamis,
Walid Gomaa,
Basem Galal
Abstract:
Computational docking is the core process of computer-aided drug design; it aims at predicting the best orientation and conformation of a small drug molecule when bound to a target large protein receptor. The docking quality is typically measured by a scoring function: a mathematical predictive model that produces a score representing the binding free energy and hence the stability of the resultin…
▽ More
Computational docking is the core process of computer-aided drug design; it aims at predicting the best orientation and conformation of a small drug molecule when bound to a target large protein receptor. The docking quality is typically measured by a scoring function: a mathematical predictive model that produces a score representing the binding free energy and hence the stability of the resulting complex molecule. We analyze the performance of both learning techniques on the scoring power, the ranking power, docking power, and screening power using the PDBbind 2013 database. For the scoring and ranking powers, the proposed learning scoring functions depend on a wide range of features (energy terms, pharmacophore, intermolecular) that entirely characterize the protein-ligand complexes. For the docking and screening powers, the proposed learning scoring functions depend on the intermolecular features of the RF-Score to utilize a larger number of training complexes. For the scoring power, the DL\_RF scoring function achieves Pearson's correlation coefficient between the predicted and experimentally measured binding affinities of 0.799 versus 0.758 of the RF scoring function. For the ranking power, the DL scoring function ranks the ligands bound to fixed target protein with accuracy 54% for the high-level ranking and with accuracy 78% for the low-level ranking while the RF scoring function achieves (46% and 62%) respectively. For the docking power, the DL\_RF scoring function has a success rate when the three best-scored ligand binding poses are considered within 2 Å root-mean-square-deviation from the native pose of 36.0% versus 30.2% of the RF scoring function. For the screening power, the DL scoring function has an average enrichment factor and success rate at the top 1% level of (2.69 and 6.45%) respectively versus (1.61 and 4.84%) respectively of the RF scoring function.
△ Less
Submitted 23 August, 2016;
originally announced August 2016.
-
On the complexity of bounded time and precision reachability for piecewise affine systems
Authors:
Hugo Bazille,
Olivier Bournez,
Walid Gomaa,
Amaury Pouly
Abstract:
Reachability for piecewise affine systems is known to be undecidable, starting from dimension $2$. In this paper we investigate the exact complexity of several decidable variants of reachability and control questions for piecewise affine systems. We show in particular that the region to region bounded time versions leads to $NP$-complete or co-$NP$-complete problems, starting from dimension $2$. W…
▽ More
Reachability for piecewise affine systems is known to be undecidable, starting from dimension $2$. In this paper we investigate the exact complexity of several decidable variants of reachability and control questions for piecewise affine systems. We show in particular that the region to region bounded time versions leads to $NP$-complete or co-$NP$-complete problems, starting from dimension $2$. We also prove that a bounded precision version leads to $PSPACE$-complete problems.
△ Less
Submitted 17 January, 2017; v1 submitted 20 January, 2016;
originally announced January 2016.
-
Computability and Complexity over the Product Topology of Real Numbers
Authors:
Walid Gomaa
Abstract:
Kawamura and Cook have developed a framework for studying the computability and complexity theoretic problems over "large" topological spaces. This framework has been applied to study the complexity of the differential operator and the complexity of functionals over the space of continuous functions on the unit interval $C[0,1]$. In this paper we apply the ideas of Kawamura and Cook to the product…
▽ More
Kawamura and Cook have developed a framework for studying the computability and complexity theoretic problems over "large" topological spaces. This framework has been applied to study the complexity of the differential operator and the complexity of functionals over the space of continuous functions on the unit interval $C[0,1]$. In this paper we apply the ideas of Kawamura and Cook to the product space of the real numbers endowed with the product topology. We show that no computable norm can be defined over such topology. We investigate computability and complexity of total functions over the product space in two cases: (1) when the computing machine submits a uniformally bounded number of queries to the oracle and (2) when the number of queries submitted by the machine is not uniformally bounded. In the first case we show that the function over the product space can be reduced to a function over a finite-dimensional space. However, in general there exists functions whose computing machines must submit a non-uniform number of queries to the oracle indicating that computing over the product topology can not in general be reduced to computing over finite-dimensional spaces.
△ Less
Submitted 7 May, 2014;
originally announced May 2014.
-
Characterizing Polynomial Time Computability of Rational and Real Functions
Authors:
Walid Gomaa
Abstract:
Recursive analysis was introduced by A. Turing [1936], A. Grzegorczyk [1955], and D. Lacombe [1955]. It is based on a discrete mechanical framework that can be used to model computation over the real numbers. In this context the computational complexity of real functions defined over compact domains has been extensively studied. However, much less have been done for other kinds of real functions…
▽ More
Recursive analysis was introduced by A. Turing [1936], A. Grzegorczyk [1955], and D. Lacombe [1955]. It is based on a discrete mechanical framework that can be used to model computation over the real numbers. In this context the computational complexity of real functions defined over compact domains has been extensively studied. However, much less have been done for other kinds of real functions. This article is divided into two main parts. The first part investigates polynomial time computability of rational functions and the role of continuity in such computation. On the one hand this is interesting for its own sake. On the other hand it provides insights into polynomial time computability of real functions for the latter, in the sense of recursive analysis, is modeled as approximations of rational computations. The main conclusion of this part is that continuity does not play any role in the efficiency of computing rational functions. The second part defines polynomial time computability of arbitrary real functions, characterizes it, and compares it with the corresponding notion over rational functions. Assuming continuity, the main conclusion is that there is a conceptual difference between polynomial time computation over the rationals and the reals manifested by the fact that there are polynomial time computable rational functions whose extensions to the reals are not polynomial time computable and vice versa.
△ Less
Submitted 12 November, 2009;
originally announced November 2009.