-
Tailoring anisotropy in kirigami metamaterial skins with pop-up folding hinges
Authors:
Hamid Reza Tohidvand,
Alexis White,
Ali Khosravi,
Paolo Celli
Abstract:
Kirigami metamaterial sheets and tubes, owing to their capacity to undergo large elastic deformations while developing three-dimensional surface textures, have enormous potential as skins for soft robots. Here, we propose to use kirigami skins with folding hinges in this same context. These recently-introduced kirigami feature counter-rotating panels connected by pop-up folding hinges. So far, res…
▽ More
Kirigami metamaterial sheets and tubes, owing to their capacity to undergo large elastic deformations while developing three-dimensional surface textures, have enormous potential as skins for soft robots. Here, we propose to use kirigami skins with folding hinges in this same context. These recently-introduced kirigami feature counter-rotating panels connected by pop-up folding hinges. So far, researchers have only explored auxetic and highly-symmetric versions of such patterns. Yet, some of these attributes have to be relaxed in order to explore their full potential as robotic skins. Thus, we parameterize these patterns and relax symmetry constraints, with the goal of using this same platform to obtain a wide range of shape-morphing behaviors. We use a combination of: matrix analysis tools and analytical kinematic formulas to thoroughly explore the vast design space that ensues; experiments for validation purposes; and numerical simulations to explore the mechanics of selected planar and tubular patterns. We demonstrate that it is possible to tailor parameters to obtain skins that globally expand or contract due to axial elongation, and that present asymmetric pop-ups that can yield anisotropic friction -- the most desired attribute for one-way locomotion of soft robots.
△ Less
Submitted 2 August, 2024;
originally announced August 2024.
-
Machine Learning Models for the Identification of Cardiovascular Diseases Using UK Biobank Data
Authors:
Sheikh Mohammed Shariful Islam,
Moloud Abrar,
Teketo Tegegne,
Liliana Loranjo,
Chandan Karmakar,
Md Abdul Awal,
Md. Shahadat Hossain,
Muhammad Ashad Kabir,
Mufti Mahmud,
Abbas Khosravi,
George Siopis,
Jeban C Moses,
Ralph Maddison
Abstract:
Machine learning models have the potential to identify cardiovascular diseases (CVDs) early and accurately in primary healthcare settings, which is crucial for delivering timely treatment and management. Although population-based CVD risk models have been used traditionally, these models often do not consider variations in lifestyles, socioeconomic conditions, or genetic predispositions. Therefore…
▽ More
Machine learning models have the potential to identify cardiovascular diseases (CVDs) early and accurately in primary healthcare settings, which is crucial for delivering timely treatment and management. Although population-based CVD risk models have been used traditionally, these models often do not consider variations in lifestyles, socioeconomic conditions, or genetic predispositions. Therefore, we aimed to develop machine learning models for CVD detection using primary healthcare data, compare the performance of different models, and identify the best models. We used data from the UK Biobank study, which included over 500,000 middle-aged participants from different primary healthcare centers in the UK. Data collected at baseline (2006--2010) and during imaging visits after 2014 were used in this study. Baseline characteristics, including sex, age, and the Townsend Deprivation Index, were included. Participants were classified as having CVD if they reported at least one of the following conditions: heart attack, angina, stroke, or high blood pressure. Cardiac imaging data such as electrocardiogram and echocardiography data, including left ventricular size and function, cardiac output, and stroke volume, were also used. We used 9 machine learning models (LSVM, RBFSVM, GP, DT, RF, NN, AdaBoost, NB, and QDA), which are explainable and easily interpretable. We reported the accuracy, precision, recall, and F-1 scores; confusion matrices; and area under the curve (AUC) curves.
△ Less
Submitted 23 July, 2024;
originally announced July 2024.
-
Understanding the Influence of Hydrogen on BCC Iron Grain Boundaries using the Kinetic Activation Relaxation technique (k-ART)
Authors:
Aynour Khosravi,
Jun Song,
Normand Mousseau
Abstract:
Hydrogen embrittlement (HE) poses a significant challenge in the mechanical integrity of iron and its alloys. This study explores the influence of hydrogen atoms on two distinct grain boundaries (GBs), $\Sigma37$ and $\Sigma3$, in body-centered-cubic (BCC) iron. Using the kinetic activation-relaxation technique (k-ART), an off-lattice kinetic Monte Carlo approach, we examine diffusion barriers and…
▽ More
Hydrogen embrittlement (HE) poses a significant challenge in the mechanical integrity of iron and its alloys. This study explores the influence of hydrogen atoms on two distinct grain boundaries (GBs), $\Sigma37$ and $\Sigma3$, in body-centered-cubic (BCC) iron. Using the kinetic activation-relaxation technique (k-ART), an off-lattice kinetic Monte Carlo approach, we examine diffusion barriers and mechanisms associated with these GBs. Our findings reveal distinct behaviors of hydrogen in different GB environments, emphasizing the elastic deformation that arises around the GB in the presence of H that leads to either the predominance of new pathways and diffusion routes or a pinning effect of H atoms. We find that, for these systems, while GB is energetically favorable for H, this element diffuses more slowly at the GBs than in the bulk. Moreover, with detailed information about the evolution landscape around GB, we find that the saturation of a GB with hydrogen both stabilizes the GB by shifting barriers associated with Fe diffusion to higher energies and smooths the energy landscape, reducing the number of diffusion events. This comprehensive analysis enhances our understanding of hydrogen's role in GB behavior, contributing valuable insights for the design and optimization of materials in hydrogen-related applications.
△ Less
Submitted 3 July, 2024; v1 submitted 18 June, 2024;
originally announced June 2024.
-
Accretion Efficiency Evolution of Central Supermassive Black Holes in Quasars
Authors:
Arta Khosravi,
Alireza Karamzadeh,
Seyed Sajad Tabasi,
Javad T. Firouzjaee
Abstract:
The ongoing debate regarding the most accurate accretion model for supermassive black holes at the center of quasars has remained a contentious issue in astrophysics. One significant challenge is the variation in calculated accretion efficiency, with values exceeding the standard range of $0.038 < ε< 0.42$. This discrepancy is especially pronounced in high redshift supermassive black holes, necess…
▽ More
The ongoing debate regarding the most accurate accretion model for supermassive black holes at the center of quasars has remained a contentious issue in astrophysics. One significant challenge is the variation in calculated accretion efficiency, with values exceeding the standard range of $0.038 < ε< 0.42$. This discrepancy is especially pronounced in high redshift supermassive black holes, necessitating the development of a comprehensive model that can address the accretion efficiency for supermassive black holes in both the low and high redshift ranges. In this study, we have focused on low redshift ($z < 0.5$) PG quasars (79 quasars) and high redshift ($z \geq 3$) quasars with standard disks from the flux- and volume-limited QUOTAS+QuasarNET dataset (76 quasars) to establish a model for accretion efficiency. An interesting trend is revealed where in redshift larger than 3, accretion efficiency increases as redshift decreases, while in redshift lower than 0.5, accretion efficiency decreases with reducing redshift. This suggests a peak in accretion efficiency between the low and high redshift quasars. This peak is recognized for the flux- and volume-limited QUOTAS+QuasarNET+DL11 dataset, which is $z \sim 2.675$, and it seems to be related to the peak of the star formation rate. ($1 < z_{SFR} < 3$). This result can potentially lead to a more accurate correlation between the star formation rate in quasars and their relationship with the mass of the central supermassive black holes with a more comprehensive disk model in future studies.
△ Less
Submitted 6 May, 2024;
originally announced May 2024.
-
Relational Graph Convolutional Networks for Sentiment Analysis
Authors:
Asal Khosravi,
Zahed Rahmati,
Ali Vefghi
Abstract:
With the growth of textual data across online platforms, sentiment analysis has become crucial for extracting insights from user-generated content. While traditional approaches and deep learning models have shown promise, they cannot often capture complex relationships between entities. In this paper, we propose leveraging Relational Graph Convolutional Networks (RGCNs) for sentiment analysis, whi…
▽ More
With the growth of textual data across online platforms, sentiment analysis has become crucial for extracting insights from user-generated content. While traditional approaches and deep learning models have shown promise, they cannot often capture complex relationships between entities. In this paper, we propose leveraging Relational Graph Convolutional Networks (RGCNs) for sentiment analysis, which offer interpretability and flexibility by capturing dependencies between data points represented as nodes in a graph. We demonstrate the effectiveness of our approach by using pre-trained language models such as BERT and RoBERTa with RGCN architecture on product reviews from Amazon and Digikala datasets and evaluating the results. Our experiments highlight the effectiveness of RGCNs in capturing relational information for sentiment analysis tasks.
△ Less
Submitted 16 April, 2024;
originally announced April 2024.
-
Comparison of gait phase detection using traditional machine learning and deep learning techniques
Authors:
Farhad Nazari,
Navid Mohajer,
Darius Nahavandi,
Abbas Khosravi
Abstract:
Human walking is a complex activity with a high level of cooperation and interaction between different systems in the body. Accurate detection of the phases of the gait in real-time is crucial to control lower-limb assistive devices like exoskeletons and prostheses. There are several ways to detect the walking gait phase, ranging from cameras and depth sensors to the sensors attached to the device…
▽ More
Human walking is a complex activity with a high level of cooperation and interaction between different systems in the body. Accurate detection of the phases of the gait in real-time is crucial to control lower-limb assistive devices like exoskeletons and prostheses. There are several ways to detect the walking gait phase, ranging from cameras and depth sensors to the sensors attached to the device itself or the human body. Electromyography (EMG) is one of the input methods that has captured lots of attention due to its precision and time delay between neuromuscular activity and muscle movement. This study proposes a few Machine Learning (ML) based models on lower-limb EMG data for human walking. The proposed models are based on Gaussian Naive Bayes (NB), Decision Tree (DT), Random Forest (RF), Linear Discriminant Analysis (LDA) and Deep Convolutional Neural Networks (DCNN). The traditional ML models are trained on hand-crafted features or their reduced components using Principal Component Analysis (PCA). On the contrary, the DCNN model utilises convolutional layers to extract features from raw data. The results show up to 75% average accuracy for traditional ML models and 79% for Deep Learning (DL) model. The highest achieved accuracy in 50 trials of the training DL model is 89.5%.
△ Less
Submitted 7 March, 2024;
originally announced March 2024.
-
Comparison of Deep Learning Techniques on Human Activity Recognition using Ankle Inertial Signals
Authors:
Farhad Nazari,
Darius Nahavandi,
Navid Mohajer,
Abbas Khosravi
Abstract:
Human Activity Recognition (HAR) is one of the fundamental building blocks of human assistive devices like orthoses and exoskeletons. There are different approaches to HAR depending on the application. Numerous studies have been focused on improving them by optimising input data or classification algorithms. However, most of these studies have been focused on applications like security and monitor…
▽ More
Human Activity Recognition (HAR) is one of the fundamental building blocks of human assistive devices like orthoses and exoskeletons. There are different approaches to HAR depending on the application. Numerous studies have been focused on improving them by optimising input data or classification algorithms. However, most of these studies have been focused on applications like security and monitoring, smart devices, the internet of things, etc. On the other hand, HAR can help adjust and control wearable assistive devices, yet there has not been enough research facilitating its implementation. In this study, we propose several models to predict four activities from inertial sensors located in the ankle area of a lower-leg assistive device user. This choice is because they do not need to be attached to the user's skin and can be directly implemented inside the control unit of the device. The proposed models are based on Artificial Neural Networks and could achieve up to 92.8% average classification accuracy
△ Less
Submitted 7 March, 2024;
originally announced March 2024.
-
Reply to "Comment" on "Regular evaporating black holes with stable cores"
Authors:
Alfio Bonanno,
Amir-Pouyan Khosravi,
Frank Saueressig
Abstract:
We reply to the ``Comment'' on ``Regular evaporating black holes with stable cores'' by R. Carballo-Rubio, F. Di Filippo, S. Liberati, C. Pacilio, and M. Visser. As a key result, we show that the regime of mass-inflation identified in the comment connects smoothly to the late-time attractors discovered in our works [A. Bonanno et. al., Regular black holes with stable cores, Phys. Rev. D 103, 12402…
▽ More
We reply to the ``Comment'' on ``Regular evaporating black holes with stable cores'' by R. Carballo-Rubio, F. Di Filippo, S. Liberati, C. Pacilio, and M. Visser. As a key result, we show that the regime of mass-inflation identified in the comment connects smoothly to the late-time attractors discovered in our works [A. Bonanno et. al., Regular black holes with stable cores, Phys. Rev. D 103, 124027 (2021) and Regular evaporating black holes with stable cores, Phys. Rev. D 107, 024005 (2023)]. Hence, the late-time stability of regular black holes is not affected by this intermediate phase.
△ Less
Submitted 12 February, 2024;
originally announced February 2024.
-
Rheological softening of metal nanocontacts sheared under oscillatory strains
Authors:
Ali Khosravi,
Jin Wang,
Andrea Silva,
Andrea Vanossi,
Erio Tosatti
Abstract:
The way metal interfaces evolve during frictional sliding, and how that evolution can be externally influenced under external drivers are important questions, hard to investigate experimentally because the contacts themselves are generally difficult to access. Here we focus on an elementary constituent of a general metal-metal interface, namely an ultra-thin individual nanocontact, where recent rh…
▽ More
The way metal interfaces evolve during frictional sliding, and how that evolution can be externally influenced under external drivers are important questions, hard to investigate experimentally because the contacts themselves are generally difficult to access. Here we focus on an elementary constituent of a general metal-metal interface, namely an ultra-thin individual nanocontact, where recent rheological studies of crystalline gold nanocontacts [Nature 569, 393 (2019)] showed a dramatic and unexpected mechanical softening as a result of external oscillatory tensile stress. The question which we address through realistic nonequilibrium molecular dynamics simulations is to what extent such mechanical softening might influence the shearing habit of gold nanocontacts at room temperature. It is found that the shearing evolution, which occurs through a series of discrete slips, is indeed rheologically softened, even though not completely, by the oscillations. Differences also emerge for different types of external oscillation, tensile or rotational. The relevance of these results for future experiments will be discussed.
△ Less
Submitted 28 December, 2023;
originally announced December 2023.
-
Leveraging Optimal Transport for Enhanced Offline Reinforcement Learning in Surgical Robotic Environments
Authors:
Maryam Zare,
Parham M. Kebria,
Abbas Khosravi
Abstract:
Most Reinforcement Learning (RL) methods are traditionally studied in an active learning setting, where agents directly interact with their environments, observe action outcomes, and learn through trial and error. However, allowing partially trained agents to interact with real physical systems poses significant challenges, including high costs, safety risks, and the need for constant supervision.…
▽ More
Most Reinforcement Learning (RL) methods are traditionally studied in an active learning setting, where agents directly interact with their environments, observe action outcomes, and learn through trial and error. However, allowing partially trained agents to interact with real physical systems poses significant challenges, including high costs, safety risks, and the need for constant supervision. Offline RL addresses these cost and safety concerns by leveraging existing datasets and reducing the need for resource-intensive real-time interactions. Nevertheless, a substantial challenge lies in the demand for these datasets to be meticulously annotated with rewards. In this paper, we introduce Optimal Transport Reward (OTR) labelling, an innovative algorithm designed to assign rewards to offline trajectories, using a small number of high-quality expert demonstrations. The core principle of OTR involves employing Optimal Transport (OT) to calculate an optimal alignment between an unlabeled trajectory from the dataset and an expert demonstration. This alignment yields a similarity measure that is effectively interpreted as a reward signal. An offline RL algorithm can then utilize these reward signals to learn a policy. This approach circumvents the need for handcrafted rewards, unlocking the potential to harness vast datasets for policy learning. Leveraging the SurRoL simulation platform tailored for surgical robot learning, we generate datasets and employ them to train policies using the OTR algorithm. By demonstrating the efficacy of OTR in a different domain, we emphasize its versatility and its potential to expedite RL deployment across a wide range of fields.
△ Less
Submitted 12 October, 2023;
originally announced October 2023.
-
A Survey of Imitation Learning: Algorithms, Recent Developments, and Challenges
Authors:
Maryam Zare,
Parham M. Kebria,
Abbas Khosravi,
Saeid Nahavandi
Abstract:
In recent years, the development of robotics and artificial intelligence (AI) systems has been nothing short of remarkable. As these systems continue to evolve, they are being utilized in increasingly complex and unstructured environments, such as autonomous driving, aerial robotics, and natural language processing. As a consequence, programming their behaviors manually or defining their behavior…
▽ More
In recent years, the development of robotics and artificial intelligence (AI) systems has been nothing short of remarkable. As these systems continue to evolve, they are being utilized in increasingly complex and unstructured environments, such as autonomous driving, aerial robotics, and natural language processing. As a consequence, programming their behaviors manually or defining their behavior through reward functions (as done in reinforcement learning (RL)) has become exceedingly difficult. This is because such environments require a high degree of flexibility and adaptability, making it challenging to specify an optimal set of rules or reward signals that can account for all possible situations. In such environments, learning from an expert's behavior through imitation is often more appealing. This is where imitation learning (IL) comes into play - a process where desired behavior is learned by imitating an expert's behavior, which is provided through demonstrations.
This paper aims to provide an introduction to IL and an overview of its underlying assumptions and approaches. It also offers a detailed description of recent advances and emerging areas of research in the field. Additionally, the paper discusses how researchers have addressed common challenges associated with IL and provides potential directions for future research. Overall, the goal of the paper is to provide a comprehensive guide to the growing field of IL in robotics and AI.
△ Less
Submitted 5 September, 2023;
originally announced September 2023.
-
Kinetics of hydrogen and vacancy diffusion in iron: A Kinetic Activation Relaxation technique (k-ART) study
Authors:
Aynour Khosravi,
Jun Song,
Normand Mousseau
Abstract:
We investigate hydrogen (H) and mono and divacancy-hydrogen complexes (VH$_x$ and V$_2$H$_x$) diffusion in body-centered-cubic (BCC) iron using the kinetic Activation-Relaxation Technique (k-ART), an off-lattice kinetic Monte Carlo approach with on-the-fly event catalog building, to explore diffusion barriers and associated mechanisms for these defects. K-ART uncovers complex diffusion pathways fo…
▽ More
We investigate hydrogen (H) and mono and divacancy-hydrogen complexes (VH$_x$ and V$_2$H$_x$) diffusion in body-centered-cubic (BCC) iron using the kinetic Activation-Relaxation Technique (k-ART), an off-lattice kinetic Monte Carlo approach with on-the-fly event catalog building, to explore diffusion barriers and associated mechanisms for these defects. K-ART uncovers complex diffusion pathways for the bound complexes, with important barrier variations that depend on the geometrical relations between the position of the inserting Fe atom and that of the bound H. Since H is small and brings little lattice deformation around itself, these bound complexes are compact, and H is fully unbound at the second neighbor site already. As more H are added, however, vacancies deform and affect the lattice over longer distances, contributing to increasing the VH$_x$ complex diffusion barrier and its impact on its local environment. We find, moreover, that the importance of this trapping decreases when going from mono to divacancy complexes, although diffusion barriers for these complexes increase with the number of trapped H.
△ Less
Submitted 18 June, 2024; v1 submitted 19 June, 2023;
originally announced June 2023.
-
Anisotropic Rheology and Friction of Suspended Graphene
Authors:
Andrea Mescola,
Andrea Silva,
Ali Khosravi,
Andrea Vanossi,
Erio Tosatti,
Sergio Valeri,
Guido Paolicelli
Abstract:
Graphene is a powerful membrane prototype for both applications and fundamental research. Rheological phenomena including indentation, twisting, and wrinkling in deposited and suspended graphene are actively investigated to unravel the mechanical laws at the nanoscale. Most studies focused on isotropic set-ups, while realistic graphene membranes are often subject to strongly anisotropic constraint…
▽ More
Graphene is a powerful membrane prototype for both applications and fundamental research. Rheological phenomena including indentation, twisting, and wrinkling in deposited and suspended graphene are actively investigated to unravel the mechanical laws at the nanoscale. Most studies focused on isotropic set-ups, while realistic graphene membranes are often subject to strongly anisotropic constraints, with important consequences for the rheology, strain, indentation, and friction in engineering conditions.
△ Less
Submitted 6 June, 2023;
originally announced June 2023.
-
Sliding and Pinning in Structurally Lubric 2D Material Interfaces
Authors:
Jin Wang,
Ali Khosravi,
Andrea Vanossi,
Erio Tosatti
Abstract:
A plethora of two-dimensional (2D) materials entered the physics and engineering scene in the last two decades. Their robust, membrane-like sheet permit -- mostly require -- deposition, giving rise to solid-solid dry interfaces whose bodily mobility, pinning, and general tribological properties under shear stress are currently being understood and controlled, experimentally and theoretically. In t…
▽ More
A plethora of two-dimensional (2D) materials entered the physics and engineering scene in the last two decades. Their robust, membrane-like sheet permit -- mostly require -- deposition, giving rise to solid-solid dry interfaces whose bodily mobility, pinning, and general tribological properties under shear stress are currently being understood and controlled, experimentally and theoretically. In this Colloquium we use simulation case studies of twisted graphene system as a prototype workhorse tool to demonstrate and discuss the general picture of 2D material interface sliding. First, we highlight the crucial mechanical difference, often overlooked, between small and large incommensurabilities, corresponding e.g., to small and large twist angles in graphene interfaces. In both cases, focusing on flat, structurally lubric, "superlubric" geometries, we elucidate and review the generally separate scaling with area of static friction in pinned states and of kinetic friction during sliding, tangled as they are with the effects of velocity, temperature, load, and defects. Including the role of island boundaries and of elasticity, and corroborating when possible the existing case-by-case results in literature beyond graphene, the overall picture proposed is meant for general 2D material interfaces, that are of importance for the physics and technology of existing and future bilayer and multilayer systems.
△ Less
Submitted 31 May, 2023;
originally announced May 2023.
-
Bending Stiffness Collapse, Buckling, Topological Bands of Freestanding Twisted Bilayer Graphene
Authors:
Jin Wang,
Ali Khosravi,
Andrea Silva,
Michele Fabrizio,
Andrea Vanossi,
Erio Tosatti
Abstract:
The freestanding twisted bilayer graphene (TBG) is unstable, below a critical twist angle θ_c~3.7 degrees, against a moire (2 \times 1) buckling distortion at T=0. Realistic simulations reveal the concurrent unexpected collapse of the bending rigidity, an unrelated macroscopic mechanical parameter. An analytical model connects bending and buckling anomalies at T=0, but as temperature rises the for…
▽ More
The freestanding twisted bilayer graphene (TBG) is unstable, below a critical twist angle θ_c~3.7 degrees, against a moire (2 \times 1) buckling distortion at T=0. Realistic simulations reveal the concurrent unexpected collapse of the bending rigidity, an unrelated macroscopic mechanical parameter. An analytical model connects bending and buckling anomalies at T=0, but as temperature rises the former fades, while buckling persists further. The (2 \times 1) electronic properties are also surprising. The magic twist angle narrow bands, now eight in number, fail to show zone boundary splittings despite the new periodicity. Symmetry shows how this is dictated by an effective single valley physics. These structural, critical, and electronic predictions promise to make the freestanding state of TBG especially interesting.
△ Less
Submitted 15 December, 2023; v1 submitted 12 May, 2023;
originally announced May 2023.
-
Uncertainty Aware Neural Network from Similarity and Sensitivity
Authors:
H M Dipu Kabir,
Subrota Kumar Mondal,
Sadia Khanam,
Abbas Khosravi,
Shafin Rahman,
Mohammad Reza Chalak Qazani,
Roohallah Alizadehsani,
Houshyar Asadi,
Shady Mohamed,
Saeid Nahavandi,
U Rajendra Acharya
Abstract:
Researchers have proposed several approaches for neural network (NN) based uncertainty quantification (UQ). However, most of the approaches are developed considering strong assumptions. Uncertainty quantification algorithms often perform poorly in an input domain and the reason for poor performance remains unknown. Therefore, we present a neural network training method that considers similar sampl…
▽ More
Researchers have proposed several approaches for neural network (NN) based uncertainty quantification (UQ). However, most of the approaches are developed considering strong assumptions. Uncertainty quantification algorithms often perform poorly in an input domain and the reason for poor performance remains unknown. Therefore, we present a neural network training method that considers similar samples with sensitivity awareness in this paper. In the proposed NN training method for UQ, first, we train a shallow NN for the point prediction. Then, we compute the absolute differences between prediction and targets and train another NN for predicting those absolute differences or absolute errors. Domains with high average absolute errors represent a high uncertainty. In the next step, we select each sample in the training set one by one and compute both prediction and error sensitivities. Then we select similar samples with sensitivity consideration and save indexes of similar samples. The ranges of an input parameter become narrower when the output is highly sensitive to that parameter. After that, we construct initial uncertainty bounds (UB) by considering the distribution of sensitivity aware similar samples. Prediction intervals (PIs) from initial uncertainty bounds are larger and cover more samples than required. Therefore, we train bound correction NN. As following all the steps for finding UB for each sample requires a lot of computation and memory access, we train a UB computation NN. The UB computation NN takes an input sample and provides an uncertainty bound. The UB computation NN is the final product of the proposed approach. Scripts of the proposed method are available in the following GitHub repository: github.com/dipuk0506/UQ
△ Less
Submitted 26 April, 2023;
originally announced April 2023.
-
A Review of Deep Learning for Video Captioning
Authors:
Moloud Abdar,
Meenakshi Kollati,
Swaraja Kuraparthi,
Farhad Pourpanah,
Daniel McDuff,
Mohammad Ghavamzadeh,
Shuicheng Yan,
Abduallah Mohamed,
Abbas Khosravi,
Erik Cambria,
Fatih Porikli
Abstract:
Video captioning (VC) is a fast-moving, cross-disciplinary area of research that bridges work in the fields of computer vision, natural language processing (NLP), linguistics, and human-computer interaction. In essence, VC involves understanding a video and describing it with language. Captioning is used in a host of applications from creating more accessible interfaces (e.g., low-vision navigatio…
▽ More
Video captioning (VC) is a fast-moving, cross-disciplinary area of research that bridges work in the fields of computer vision, natural language processing (NLP), linguistics, and human-computer interaction. In essence, VC involves understanding a video and describing it with language. Captioning is used in a host of applications from creating more accessible interfaces (e.g., low-vision navigation) to video question answering (V-QA), video retrieval and content generation. This survey covers deep learning-based VC, including but, not limited to, attention-based architectures, graph networks, reinforcement learning, adversarial networks, dense video captioning (DVC), and more. We discuss the datasets and evaluation metrics used in the field, and limitations, applications, challenges, and future directions for VC.
△ Less
Submitted 22 April, 2023;
originally announced April 2023.
-
Survey on Leveraging Uncertainty Estimation Towards Trustworthy Deep Neural Networks: The Case of Reject Option and Post-training Processing
Authors:
Mehedi Hasan,
Moloud Abdar,
Abbas Khosravi,
Uwe Aickelin,
Pietro Lio',
Ibrahim Hossain,
Ashikur Rahman,
Saeid Nahavandi
Abstract:
Although neural networks (especially deep neural networks) have achieved \textit{better-than-human} performance in many fields, their real-world deployment is still questionable due to the lack of awareness about the limitation in their knowledge. To incorporate such awareness in the machine learning model, prediction with reject option (also known as selective classification or classification wit…
▽ More
Although neural networks (especially deep neural networks) have achieved \textit{better-than-human} performance in many fields, their real-world deployment is still questionable due to the lack of awareness about the limitation in their knowledge. To incorporate such awareness in the machine learning model, prediction with reject option (also known as selective classification or classification with abstention) has been proposed in literature. In this paper, we present a systematic review of the prediction with the reject option in the context of various neural networks. To the best of our knowledge, this is the first study focusing on this aspect of neural networks. Moreover, we discuss different novel loss functions related to the reject option and post-training processing (if any) of network output for generating suitable measurements for knowledge awareness of the model. Finally, we address the application of the rejection option in reducing the prediction time for the real-time problems and present a comprehensive summary of the techniques related to the reject option in the context of extensive variety of neural networks. Our code is available on GitHub: \url{https://github.com/MehediHasanTutul/Reject_option}
△ Less
Submitted 10 April, 2023;
originally announced April 2023.
-
Adaptive formation motion planning and control of autonomous underwater vehicles using deep reinforcement learning
Authors:
Behnaz Hadi,
Alireza Khosravi,
Pouria Sarhadi
Abstract:
Creating safe paths in unknown and uncertain environments is a challenging aspect of leader-follower formation control. In this architecture, the leader moves toward the target by taking optimal actions, and followers should also avoid obstacles while maintaining their desired formation shape. Most of the studies in this field have inspected formation control and obstacle avoidance separately. The…
▽ More
Creating safe paths in unknown and uncertain environments is a challenging aspect of leader-follower formation control. In this architecture, the leader moves toward the target by taking optimal actions, and followers should also avoid obstacles while maintaining their desired formation shape. Most of the studies in this field have inspected formation control and obstacle avoidance separately. The present study proposes a new approach based on deep reinforcement learning (DRL) for end-to-end motion planning and control of under-actuated autonomous underwater vehicles (AUVs). The aim is to design optimal adaptive distributed controllers based on actor-critic structure for AUVs formation motion planning. This is accomplished by controlling the speed and heading of AUVs. In obstacle avoidance, two approaches have been deployed. In the first approach, the goal is to design control policies for the leader and followers such that each learns its own collision-free path. Moreover, the followers adhere to an overall formation maintenance policy. In the second approach, the leader solely learns the control policy, and safely leads the whole group towards the target. Here, the control policy of the followers is to maintain the predetermined distance and angle. In the presence of ocean currents, communication delays, and sensing errors, the robustness of the proposed method under realistically perturbed circumstances is shown. The efficiency of the algorithms has been evaluated and approved using a number of computer-based simulations.
△ Less
Submitted 1 April, 2023;
originally announced April 2023.
-
SFE: A Simple, Fast and Efficient Feature Selection Algorithm for High-Dimensional Data
Authors:
Behrouz Ahadzadeh,
Moloud Abdar,
Fatemeh Safara,
Abbas Khosravi,
Mohammad Bagher Menhaj,
Ponnuthurai Nagaratnam Suganthan
Abstract:
In this paper, a new feature selection algorithm, called SFE (Simple, Fast, and Efficient), is proposed for high-dimensional datasets. The SFE algorithm performs its search process using a search agent and two operators: non-selection and selection. It comprises two phases: exploration and exploitation. In the exploration phase, the non-selection operator performs a global search in the entire pro…
▽ More
In this paper, a new feature selection algorithm, called SFE (Simple, Fast, and Efficient), is proposed for high-dimensional datasets. The SFE algorithm performs its search process using a search agent and two operators: non-selection and selection. It comprises two phases: exploration and exploitation. In the exploration phase, the non-selection operator performs a global search in the entire problem search space for the irrelevant, redundant, trivial, and noisy features, and changes the status of the features from selected mode to non-selected mode. In the exploitation phase, the selection operator searches the problem search space for the features with a high impact on the classification results, and changes the status of the features from non-selected mode to selected mode. The proposed SFE is successful in feature selection from high-dimensional datasets. However, after reducing the dimensionality of a dataset, its performance cannot be increased significantly. In these situations, an evolutionary computational method could be used to find a more efficient subset of features in the new and reduced search space. To overcome this issue, this paper proposes a hybrid algorithm, SFE-PSO (particle swarm optimization) to find an optimal feature subset. The efficiency and effectiveness of the SFE and the SFE-PSO for feature selection are compared on 40 high-dimensional datasets. Their performances were compared with six recently proposed feature selection algorithms. The results obtained indicate that the two proposed algorithms significantly outperform the other algorithms, and can be used as efficient and effective algorithms in selecting features from high-dimensional datasets.
△ Less
Submitted 17 March, 2023;
originally announced March 2023.
-
Automated Diagnosis of Cardiovascular Diseases from Cardiac Magnetic Resonance Imaging Using Deep Learning Models: A Review
Authors:
Mahboobeh Jafari,
Afshin Shoeibi,
Marjane Khodatars,
Navid Ghassemi,
Parisa Moridian,
Niloufar Delfan,
Roohallah Alizadehsani,
Abbas Khosravi,
Sai Ho Ling,
Yu-Dong Zhang,
Shui-Hua Wang,
Juan M. Gorriz,
Hamid Alinejad Rokny,
U. Rajendra Acharya
Abstract:
In recent years, cardiovascular diseases (CVDs) have become one of the leading causes of mortality globally. CVDs appear with minor symptoms and progressively get worse. The majority of people experience symptoms such as exhaustion, shortness of breath, ankle swelling, fluid retention, and other symptoms when starting CVD. Coronary artery disease (CAD), arrhythmia, cardiomyopathy, congenital heart…
▽ More
In recent years, cardiovascular diseases (CVDs) have become one of the leading causes of mortality globally. CVDs appear with minor symptoms and progressively get worse. The majority of people experience symptoms such as exhaustion, shortness of breath, ankle swelling, fluid retention, and other symptoms when starting CVD. Coronary artery disease (CAD), arrhythmia, cardiomyopathy, congenital heart defect (CHD), mitral regurgitation, and angina are the most common CVDs. Clinical methods such as blood tests, electrocardiography (ECG) signals, and medical imaging are the most effective methods used for the detection of CVDs. Among the diagnostic methods, cardiac magnetic resonance imaging (CMR) is increasingly used to diagnose, monitor the disease, plan treatment and predict CVDs. Coupled with all the advantages of CMR data, CVDs diagnosis is challenging for physicians due to many slices of data, low contrast, etc. To address these issues, deep learning (DL) techniques have been employed to the diagnosis of CVDs using CMR data, and much research is currently being conducted in this field. This review provides an overview of the studies performed in CVDs detection using CMR images and DL techniques. The introduction section examined CVDs types, diagnostic methods, and the most important medical imaging techniques. In the following, investigations to detect CVDs using CMR images and the most significant DL methods are presented. Another section discussed the challenges in diagnosing CVDs from CMR data. Next, the discussion section discusses the results of this review, and future work in CVDs diagnosis from CMR images and DL techniques are outlined. The most important findings of this study are presented in the conclusion section.
△ Less
Submitted 26 October, 2022;
originally announced October 2022.
-
Regular evaporating black holes with stable cores
Authors:
Alfio Bonanno,
Amir-Pouyan Khosravi,
Frank Saueressig
Abstract:
A feature shared by many regular black hole spacetimes is the occurrence of a Cauchy horizon. It is then commonly believed that this renders the geometry unstable against perturbations through the mass-inflation effect. In this work, we perform the first dynamical study of this effect taking into account the mass-loss of the black hole due to Hawking radiation. It is shown that the time-dependence…
▽ More
A feature shared by many regular black hole spacetimes is the occurrence of a Cauchy horizon. It is then commonly believed that this renders the geometry unstable against perturbations through the mass-inflation effect. In this work, we perform the first dynamical study of this effect taking into account the mass-loss of the black hole due to Hawking radiation. It is shown that the time-dependence of the background leads to two novel types of late-time behavior whose properties are entirely determined by the Hawking flux. The first class of attractor-behavior is operative for regular black holes of the Hayward and renormalization group improved type and characterized by the square of the Weyl curvature growing as $v^6$ at asymptotically late times. This singularity is inaccessible to a radially free-falling observer though. The second class is realized by Reissner-Nordstr{ö}m black holes and regular black holes of the Bardeen type. In this case the curvature scalars remain finite as $v\rightarrow\infty$. Thus the Hawking flux has a profound effect on the mass-inflation instability, either weakening the effect significantly or even expelling it entirely.
△ Less
Submitted 21 September, 2022;
originally announced September 2022.
-
The Moral Foundations Reddit Corpus
Authors:
Jackson Trager,
Alireza S. Ziabari,
Aida Mostafazadeh Davani,
Preni Golazizian,
Farzan Karimi-Malekabadi,
Ali Omrani,
Zhihe Li,
Brendan Kennedy,
Nils Karl Reimer,
Melissa Reyes,
Kelsey Cheng,
Mellow Wei,
Christina Merrifield,
Arta Khosravi,
Evans Alvarez,
Morteza Dehghani
Abstract:
Moral framing and sentiment can affect a variety of online and offline behaviors, including donation, pro-environmental action, political engagement, and even participation in violent protests. Various computational methods in Natural Language Processing (NLP) have been used to detect moral sentiment from textual data, but in order to achieve better performances in such subjective tasks, large set…
▽ More
Moral framing and sentiment can affect a variety of online and offline behaviors, including donation, pro-environmental action, political engagement, and even participation in violent protests. Various computational methods in Natural Language Processing (NLP) have been used to detect moral sentiment from textual data, but in order to achieve better performances in such subjective tasks, large sets of hand-annotated training data are needed. Previous corpora annotated for moral sentiment have proven valuable, and have generated new insights both within NLP and across the social sciences, but have been limited to Twitter. To facilitate improving our understanding of the role of moral rhetoric, we present the Moral Foundations Reddit Corpus, a collection of 16,123 Reddit comments that have been curated from 12 distinct subreddits, hand-annotated by at least three trained annotators for 8 categories of moral sentiment (i.e., Care, Proportionality, Equality, Purity, Authority, Loyalty, Thin Morality, Implicit/Explicit Morality) based on the updated Moral Foundations Theory (MFT) framework. We use a range of methodologies to provide baseline moral-sentiment classification results for this new corpus, e.g., cross-domain classification and knowledge transfer.
△ Less
Submitted 17 August, 2022; v1 submitted 10 August, 2022;
originally announced August 2022.
-
Automatic autism spectrum disorder detection using artificial intelligence methods with MRI neuroimaging: A review
Authors:
Parisa Moridian,
Navid Ghassemi,
Mahboobeh Jafari,
Salam Salloum-Asfar,
Delaram Sadeghi,
Marjane Khodatars,
Afshin Shoeibi,
Abbas Khosravi,
Sai Ho Ling,
Abdulhamit Subasi,
Roohallah Alizadehsani,
Juan M. Gorriz,
Sara A Abdulla,
U. Rajendra Acharya
Abstract:
Autism spectrum disorder (ASD) is a brain condition characterized by diverse signs and symptoms that appear in early childhood. ASD is also associated with communication deficits and repetitive behavior in affected individuals. Various ASD detection methods have been developed, including neuroimaging modalities and psychological tests. Among these methods, magnetic resonance imaging (MRI) imaging…
▽ More
Autism spectrum disorder (ASD) is a brain condition characterized by diverse signs and symptoms that appear in early childhood. ASD is also associated with communication deficits and repetitive behavior in affected individuals. Various ASD detection methods have been developed, including neuroimaging modalities and psychological tests. Among these methods, magnetic resonance imaging (MRI) imaging modalities are of paramount importance to physicians. Clinicians rely on MRI modalities to diagnose ASD accurately. The MRI modalities are non-invasive methods that include functional (fMRI) and structural (sMRI) neuroimaging methods. However, diagnosing ASD with fMRI and sMRI for specialists is often laborious and time-consuming; therefore, several computer-aided design systems (CADS) based on artificial intelligence (AI) have been developed to assist specialist physicians. Conventional machine learning (ML) and deep learning (DL) are the most popular schemes of AI used for diagnosing ASD. This study aims to review the automated detection of ASD using AI. We review several CADS that have been developed using ML techniques for the automated diagnosis of ASD using MRI modalities. There has been very limited work on the use of DL techniques to develop automated diagnostic models for ASD. A summary of the studies developed using DL is provided in the Supplementary Appendix. Then, the challenges encountered during the automated diagnosis of ASD using MRI and AI techniques are described in detail. Additionally, a graphical comparison of studies using ML and DL to diagnose ASD automatically is discussed. We suggest future approaches to detecting ASDs using AI techniques and MRI neuroimaging.
△ Less
Submitted 6 October, 2022; v1 submitted 20 June, 2022;
originally announced June 2022.
-
Human Activity Recognition from Knee Angle Using Machine Learning Techniques
Authors:
Farhad Nazari,
Darius Nahavandi,
Navid Mohajer,
Abbas Khosravi
Abstract:
Human Activity Recognition (HAR) is a crucial technology for many applications such as smart homes, surveillance, human assistance and health care. This technology utilises pattern recognition and can contribute to the development of human-in-the-loop control of different systems such as orthoses and exoskeletons. The majority of reported studies use a small dataset collected from an experiment fo…
▽ More
Human Activity Recognition (HAR) is a crucial technology for many applications such as smart homes, surveillance, human assistance and health care. This technology utilises pattern recognition and can contribute to the development of human-in-the-loop control of different systems such as orthoses and exoskeletons. The majority of reported studies use a small dataset collected from an experiment for a specific purpose. The downsides of this approach include: 1) it is hard to generalise the outcome to different people with different biomechanical characteristics and health conditions, and 2) it cannot be implemented in applications other than the original experiment. To address these deficiencies, the current study investigates using a publicly available dataset collected for pathology diagnosis purposes to train Machine Learning (ML) algorithms. A dataset containing knee motion of participants performing different exercises has been used to classify human activity. The algorithms used in this study are Gaussian Naive Bayes, Decision Tree, Random Forest, K-Nearest Neighbors Vote, Support Vector Machine and Gradient Boosting. Furthermore, two training approaches are compared to raw data (de-noised) and manually extracted features. The results show up to 0.94 performance of the Area Under the ROC Curve (AUC) metric for 11-fold cross-validation for Gradient Boosting algorithm using raw data. This outcome reflects the validity and potential use of the proposed approach for this type of dataset.
△ Less
Submitted 9 June, 2022;
originally announced June 2022.
-
Comparison Study of Inertial Sensor Signal Combination for Human Activity Recognition based on Convolutional Neural Networks
Authors:
Farhad Nazari,
Navid Mohajer,
Darius Nahavandi,
Abbas Khosravi,
Saeid Nahavandi
Abstract:
Human Activity Recognition (HAR) is one of the essential building blocks of so many applications like security, monitoring, the internet of things and human-robot interaction. The research community has developed various methodologies to detect human activity based on various input types. However, most of the research in the field has been focused on applications other than human-in-the-centre app…
▽ More
Human Activity Recognition (HAR) is one of the essential building blocks of so many applications like security, monitoring, the internet of things and human-robot interaction. The research community has developed various methodologies to detect human activity based on various input types. However, most of the research in the field has been focused on applications other than human-in-the-centre applications. This paper focused on optimising the input signals to maximise the HAR performance from wearable sensors. A model based on Convolutional Neural Networks (CNN) has been proposed and trained on different signal combinations of three Inertial Measurement Units (IMU) that exhibit the movements of the dominant hand, leg and chest of the subject. The results demonstrate k-fold cross-validation accuracy between 99.77 and 99.98% for signals with the modality of 12 or higher. The performance of lower dimension signals, except signals containing information from both chest and ankle, was far inferior, showing between 73 and 85% accuracy.
△ Less
Submitted 9 June, 2022;
originally announced June 2022.
-
Automatic diagnosis of schizophrenia and attention deficit hyperactivity disorder in rs-fMRI modality using convolutional autoencoder model and interval type-2 fuzzy regression
Authors:
Afshin Shoeibi,
Navid Ghassemi,
Marjane Khodatars,
Parisa Moridian,
Abbas Khosravi,
Assef Zare,
Juan M. Gorriz,
Amir Hossein Chale-Chale,
Ali Khadem,
U. Rajendra Acharya
Abstract:
Nowadays, many people worldwide suffer from brain disorders, and their health is in danger. So far, numerous methods have been proposed for the diagnosis of Schizophrenia (SZ) and attention deficit hyperactivity disorder (ADHD), among which functional magnetic resonance imaging (fMRI) modalities are known as a popular method among physicians. This paper presents an SZ and ADHD intelligent detectio…
▽ More
Nowadays, many people worldwide suffer from brain disorders, and their health is in danger. So far, numerous methods have been proposed for the diagnosis of Schizophrenia (SZ) and attention deficit hyperactivity disorder (ADHD), among which functional magnetic resonance imaging (fMRI) modalities are known as a popular method among physicians. This paper presents an SZ and ADHD intelligent detection method of resting-state fMRI (rs-fMRI) modality using a new deep learning method. The University of California Los Angeles dataset, which contains the rs-fMRI modalities of SZ and ADHD patients, has been used for experiments. The FMRIB software library toolbox first performed preprocessing on rs-fMRI data. Then, a convolutional Autoencoder model with the proposed number of layers is used to extract features from rs-fMRI data. In the classification step, a new fuzzy method called interval type-2 fuzzy regression (IT2FR) is introduced and then optimized by genetic algorithm, particle swarm optimization, and gray wolf optimization (GWO) techniques. Also, the results of IT2FR methods are compared with multilayer perceptron, k-nearest neighbors, support vector machine, random forest, and decision tree, and adaptive neuro-fuzzy inference system methods. The experiment results show that the IT2FR method with the GWO optimization algorithm has achieved satisfactory results compared to other classifier methods. Finally, the proposed classification technique was able to provide 72.71% accuracy.
△ Less
Submitted 14 November, 2022; v1 submitted 31 May, 2022;
originally announced May 2022.
-
Controlled Dropout for Uncertainty Estimation
Authors:
Mehedi Hasan,
Abbas Khosravi,
Ibrahim Hossain,
Ashikur Rahman,
Saeid Nahavandi
Abstract:
Uncertainty quantification in a neural network is one of the most discussed topics for safety-critical applications. Though Neural Networks (NNs) have achieved state-of-the-art performance for many applications, they still provide unreliable point predictions, which lack information about uncertainty estimates. Among various methods to enable neural networks to estimate uncertainty, Monte Carlo (M…
▽ More
Uncertainty quantification in a neural network is one of the most discussed topics for safety-critical applications. Though Neural Networks (NNs) have achieved state-of-the-art performance for many applications, they still provide unreliable point predictions, which lack information about uncertainty estimates. Among various methods to enable neural networks to estimate uncertainty, Monte Carlo (MC) dropout has gained much popularity in a short period due to its simplicity. In this study, we present a new version of the traditional dropout layer where we are able to fix the number of dropout configurations. As such, each layer can take and apply the new dropout layer in the MC method to quantify the uncertainty associated with NN predictions. We conduct experiments on both toy and realistic datasets and compare the results with the MC method using the traditional dropout layer. Performance analysis utilizing uncertainty evaluation metrics corroborates that our dropout layer offers better performance in most cases.
△ Less
Submitted 6 May, 2022;
originally announced May 2022.
-
DoubleU-Net++: Architecture with Exploit Multiscale Features for Vertebrae Segmentation
Authors:
Simindokht Jahangard,
Mahdi Bonyani,
Abbas Khosravi
Abstract:
Accurate segmentation of the vertebra is an important prerequisite in various medical applications (E.g. tele surgery) to assist surgeons. Following the successful development of deep neural networks, recent studies have focused on the essential rule of vertebral segmentation. Prior works contain a large number of parameters, and their segmentation is restricted to only one view. Inspired by Doubl…
▽ More
Accurate segmentation of the vertebra is an important prerequisite in various medical applications (E.g. tele surgery) to assist surgeons. Following the successful development of deep neural networks, recent studies have focused on the essential rule of vertebral segmentation. Prior works contain a large number of parameters, and their segmentation is restricted to only one view. Inspired by DoubleU-Net, we propose a novel model named DoubleU-Net++ in which DensNet as feature extractor, special attention module from Convolutional Block Attention on Module (CBAM) and, Pyramid Squeeze Attention (PSA) module are employed to improve extracted features. We evaluate our proposed model on three different views (sagittal, coronal, and axial) of VerSe2020 and xVertSeg datasets. Compared with state-of-the-art studies, our architecture is trained faster and achieves higher precision, recall, and F1-score as evaluation (imporoved by 4-6%) and the result of above 94% for sagittal view and above 94% for both coronal view and above 93% axial view were gained for VerSe2020 dataset, respectively. Also, for xVertSeg dataset, we achieved precision, recall,and F1-score of above 97% for sagittal view, above 93% for coronal view ,and above 96% for axial view.
△ Less
Submitted 28 January, 2022;
originally announced January 2022.
-
Applied Exoskeleton Technology: A Comprehensive Review of Physical and Cognitive Human-Robot Interaction
Authors:
Farhad Nazari,
Navid Mohajer,
Darius Nahavandi,
Abbas Khosravi,
Saeid Nahavandi
Abstract:
Exoskeletons and orthoses are wearable mobile systems providing mechanical benefits to the users. Despite significant improvements in the last decades, the technology is not fully mature to be adopted for strenuous and non-programmed tasks. To accommodate this insufficiency, different aspects of this technology need to be analysed and improved. Numerous studies have tried to address some aspects o…
▽ More
Exoskeletons and orthoses are wearable mobile systems providing mechanical benefits to the users. Despite significant improvements in the last decades, the technology is not fully mature to be adopted for strenuous and non-programmed tasks. To accommodate this insufficiency, different aspects of this technology need to be analysed and improved. Numerous studies have tried to address some aspects of exoskeletons, e.g. mechanism design, intent prediction, and control scheme. However, most works have focused on a specific element of design or application without providing a comprehensive review framework. This study aims to analyse and survey the contributing aspects to this technology's improvement and broad adoption. To address this, after introducing assistive devices and exoskeletons, the main design criteria will be investigated from both physical Human-Robot Interaction (HRI) perspectives. In order to establish an intelligent HRI strategy and enable intuitive control for users, cognitive HRI will be investigated after a brief introduction to various approaches to their control strategies. The study will be further developed by outlining several examples of known assistive devices in different categories. And some guidelines for exoskeleton selection and possible mitigation of current limitations will be discussed.
△ Less
Submitted 22 March, 2023; v1 submitted 24 November, 2021;
originally announced November 2021.
-
Black Hole Remnants from Dynamical Dimensional Reduction?
Authors:
Frank Saueressig,
Amir Khosravi
Abstract:
A intriguing feature shared by many quantum gravity programs is the dynamical decrease of the spectral dimension from $D_s = 4$ at macroscopic to $D_s \approx 2$ at microscopic scales. In this note, we study the impact of this transition on the energy loss of static, spherically symmetric black holes due to Hawking radiation. We demonstrate that the decrease in the spectral dimension renders the l…
▽ More
A intriguing feature shared by many quantum gravity programs is the dynamical decrease of the spectral dimension from $D_s = 4$ at macroscopic to $D_s \approx 2$ at microscopic scales. In this note, we study the impact of this transition on the energy loss of static, spherically symmetric black holes due to Hawking radiation. We demonstrate that the decrease in the spectral dimension renders the luminosity of a black hole finite. While this slightly increases the life-time of light black holes, we find that this mechanism is insufficient to generate long-lived black hole remnants. We briefly comment on the relation of our findings to previous work on this topic.
△ Less
Submitted 9 November, 2021;
originally announced November 2021.
-
A Comprehensive Study on Torchvision Pre-trained Models for Fine-grained Inter-species Classification
Authors:
Feras Albardi,
H M Dipu Kabir,
Md Mahbub Islam Bhuiyan,
Parham M. Kebria,
Abbas Khosravi,
Saeid Nahavandi
Abstract:
This study aims to explore different pre-trained models offered in the Torchvision package which is available in the PyTorch library. And investigate their effectiveness on fine-grained images classification. Transfer Learning is an effective method of achieving extremely good performance with insufficient training data. In many real-world situations, people cannot collect sufficient data required…
▽ More
This study aims to explore different pre-trained models offered in the Torchvision package which is available in the PyTorch library. And investigate their effectiveness on fine-grained images classification. Transfer Learning is an effective method of achieving extremely good performance with insufficient training data. In many real-world situations, people cannot collect sufficient data required to train a deep neural network model efficiently. Transfer Learning models are pre-trained on a large data set, and can bring a good performance on smaller datasets with significantly lower training time. Torchvision package offers us many models to apply the Transfer Learning on smaller datasets. Therefore, researchers may need a guideline for the selection of a good model. We investigate Torchvision pre-trained models on four different data sets: 10 Monkey Species, 225 Bird Species, Fruits 360, and Oxford 102 Flowers. These data sets have images of different resolutions, class numbers, and different achievable accuracies. We also apply their usual fully-connected layer and the Spinal fully-connected layer to investigate the effectiveness of SpinalNet. The Spinal fully-connected layer brings better performance in most situations. We apply the same augmentation for different models for the same data set for a fair comparison. This paper may help future Computer Vision researchers in choosing a proper Transfer Learning model.
△ Less
Submitted 13 October, 2021;
originally announced October 2021.
-
An Uncertainty-aware Loss Function for Training Neural Networks with Calibrated Predictions
Authors:
Afshar Shamsi,
Hamzeh Asgharnezhad,
AmirReza Tajally,
Saeid Nahavandi,
Henry Leung
Abstract:
Uncertainty quantification of machine learning and deep learning methods plays an important role in enhancing trust to the obtained result. In recent years, a numerous number of uncertainty quantification methods have been introduced. Monte Carlo dropout (MC-Dropout) is one of the most well-known techniques to quantify uncertainty in deep learning methods. In this study, we propose two new loss fu…
▽ More
Uncertainty quantification of machine learning and deep learning methods plays an important role in enhancing trust to the obtained result. In recent years, a numerous number of uncertainty quantification methods have been introduced. Monte Carlo dropout (MC-Dropout) is one of the most well-known techniques to quantify uncertainty in deep learning methods. In this study, we propose two new loss functions by combining cross entropy with Expected Calibration Error (ECE) and Predictive Entropy (PE). The obtained results clearly show that the new proposed loss functions lead to having a calibrated MC-Dropout method. Our results confirmed the great impact of the new hybrid loss functions for minimising the overlap between the distributions of uncertainty estimates for correct and incorrect predictions without sacrificing the model's overall performance.
△ Less
Submitted 5 February, 2023; v1 submitted 7 October, 2021;
originally announced October 2021.
-
Accurate Prediction Using Triangular Type-2 Fuzzy Linear Regression
Authors:
Assef Zare,
Afshin Shoeibi,
Narges Shafaei,
Parisa Moridian,
Roohallah Alizadehsani,
Majid Halaji,
Abbas Khosravi
Abstract:
Many works have been done to handle the uncertainties in the data using type 1 fuzzy regression. Few type 2 fuzzy regression works used interval type 2 for indeterminate modeling using type 1 fuzzy membership. The current survey proposes a triangular type-2 fuzzy regression (TT2FR) model to ameliorate the efficiency of the model by handling the uncertainty in the data. The triangular secondary mem…
▽ More
Many works have been done to handle the uncertainties in the data using type 1 fuzzy regression. Few type 2 fuzzy regression works used interval type 2 for indeterminate modeling using type 1 fuzzy membership. The current survey proposes a triangular type-2 fuzzy regression (TT2FR) model to ameliorate the efficiency of the model by handling the uncertainty in the data. The triangular secondary membership function is used instead of widely used interval type models. In the proposed model, vagueness in primary and secondary fuzzy sets is minimized and also, a specified x-plane of observed value is included in the same α- plane of the predicted value. Complex calculations of the type-2 fuzzy (T2F) model are simplified by reducing three dimensional type-2 fuzzy set (3DT2FS) into two dimensional interval type-2 fuzzy (2DIT2F) models. The current survey presents a new regression model of T2F by considering the more general form of T2F membership functions and thus avoids high complexity. The performance of the developed model is evaluated using the TAIEX and COVID-19 forecasting datasets. Our developed model reached the highest performance as compared to the other state-of-art techniques. Our developed method is ready to be tested with more uncertain data and has the potential to use to predict the weather and stock prediction.
△ Less
Submitted 12 September, 2021;
originally announced September 2021.
-
Detection of Epileptic Seizures on EEG Signals Using ANFIS Classifier, Autoencoders and Fuzzy Entropies
Authors:
Afshin Shoeibi,
Navid Ghassemi,
Marjane Khodatars,
Parisa Moridian,
Roohallah Alizadehsani,
Assef Zare,
Abbas Khosravi,
Abdulhamit Subasi,
U. Rajendra Acharya,
J. Manuel Gorriz
Abstract:
Epileptic seizures are one of the most crucial neurological disorders, and their early diagnosis will help the clinicians to provide accurate treatment for the patients. The electroencephalogram (EEG) signals are widely used for epileptic seizures detection, which provides specialists with substantial information about the functioning of the brain. In this paper, a novel diagnostic procedure using…
▽ More
Epileptic seizures are one of the most crucial neurological disorders, and their early diagnosis will help the clinicians to provide accurate treatment for the patients. The electroencephalogram (EEG) signals are widely used for epileptic seizures detection, which provides specialists with substantial information about the functioning of the brain. In this paper, a novel diagnostic procedure using fuzzy theory and deep learning techniques is introduced. The proposed method is evaluated on the Bonn University dataset with six classification combinations and also on the Freiburg dataset. The tunable-Q wavelet transform (TQWT) is employed to decompose the EEG signals into different sub-bands. In the feature extraction step, 13 different fuzzy entropies are calculated from different sub-bands of TQWT, and their computational complexities are calculated to help researchers choose the best set for various tasks. In the following, an autoencoder (AE) with six layers is employed for dimensionality reduction. Finally, the standard adaptive neuro-fuzzy inference system (ANFIS), and also its variants with grasshopper optimization algorithm (ANFIS-GOA), particle swarm optimization (ANFIS-PSO), and breeding swarm optimization (ANFIS-BS) methods are used for classification. Using our proposed method, ANFIS-BS method has obtained an accuracy of 99.74% in classifying into two classes and an accuracy of 99.46% in ternary classification on the Bonn dataset and 99.28% on the Freiburg dataset, reaching state-of-the-art performances on both of them.
△ Less
Submitted 7 December, 2021; v1 submitted 6 September, 2021;
originally announced September 2021.
-
MCUa: Multi-level Context and Uncertainty aware Dynamic Deep Ensemble for Breast Cancer Histology Image Classification
Authors:
Zakaria Senousy,
Mohammed M. Abdelsamea,
Mohamed Medhat Gaber,
Moloud Abdar,
U Rajendra Acharya,
Abbas Khosravi,
Saeid Nahavandi
Abstract:
Breast histology image classification is a crucial step in the early diagnosis of breast cancer. In breast pathological diagnosis, Convolutional Neural Networks (CNNs) have demonstrated great success using digitized histology slides. However, tissue classification is still challenging due to the high visual variability of the large-sized digitized samples and the lack of contextual information. In…
▽ More
Breast histology image classification is a crucial step in the early diagnosis of breast cancer. In breast pathological diagnosis, Convolutional Neural Networks (CNNs) have demonstrated great success using digitized histology slides. However, tissue classification is still challenging due to the high visual variability of the large-sized digitized samples and the lack of contextual information. In this paper, we propose a novel CNN, called Multi-level Context and Uncertainty aware (MCUa) dynamic deep learning ensemble model.MCUamodel consists of several multi-level context-aware models to learn the spatial dependency between image patches in a layer-wise fashion. It exploits the high sensitivity to the multi-level contextual information using an uncertainty quantification component to accomplish a novel dynamic ensemble model.MCUamodelhas achieved a high accuracy of 98.11% on a breast cancer histology image dataset. Experimental results show the superior effectiveness of the proposed solution compared to the state-of-the-art histology classification models.
△ Less
Submitted 24 August, 2021;
originally announced August 2021.
-
Uncertainty-Aware Credit Card Fraud Detection Using Deep Learning
Authors:
Maryam Habibpour,
Hassan Gharoun,
Mohammadreza Mehdipour,
AmirReza Tajally,
Hamzeh Asgharnezhad,
Afshar Shamsi,
Abbas Khosravi,
Miadreza Shafie-Khah,
Saeid Nahavandi,
Joao P. S. Catalao
Abstract:
Countless research works of deep neural networks (DNNs) in the task of credit card fraud detection have focused on improving the accuracy of point predictions and mitigating unwanted biases by building different network architectures or learning models. Quantifying uncertainty accompanied by point estimation is essential because it mitigates model unfairness and permits practitioners to develop tr…
▽ More
Countless research works of deep neural networks (DNNs) in the task of credit card fraud detection have focused on improving the accuracy of point predictions and mitigating unwanted biases by building different network architectures or learning models. Quantifying uncertainty accompanied by point estimation is essential because it mitigates model unfairness and permits practitioners to develop trustworthy systems which abstain from suboptimal decisions due to low confidence. Explicitly, assessing uncertainties associated with DNNs predictions is critical in real-world card fraud detection settings for characteristic reasons, including (a) fraudsters constantly change their strategies, and accordingly, DNNs encounter observations that are not generated by the same process as the training distribution, (b) owing to the time-consuming process, very few transactions are timely checked by professional experts to update DNNs. Therefore, this study proposes three uncertainty quantification (UQ) techniques named Monte Carlo dropout, ensemble, and ensemble Monte Carlo dropout for card fraud detection applied on transaction data. Moreover, to evaluate the predictive uncertainty estimates, UQ confusion matrix and several performance metrics are utilized. Through experimental results, we show that the ensemble is more effective in capturing uncertainty corresponding to generated predictions. Additionally, we demonstrate that the proposed UQ methods provide extra insight to the point predictions, leading to elevate the fraud prevention process.
△ Less
Submitted 28 July, 2021;
originally announced July 2021.
-
An Uncertainty-Aware Deep Learning Framework for Defect Detection in Casting Products
Authors:
Maryam Habibpour,
Hassan Gharoun,
AmirReza Tajally,
Afshar Shamsi,
Hamzeh Asgharnezhad,
Abbas Khosravi,
Saeid Nahavandi
Abstract:
Defects are unavoidable in casting production owing to the complexity of the casting process. While conventional human-visual inspection of casting products is slow and unproductive in mass productions, an automatic and reliable defect detection not just enhances the quality control process but positively improves productivity. However, casting defect detection is a challenging task due to diversi…
▽ More
Defects are unavoidable in casting production owing to the complexity of the casting process. While conventional human-visual inspection of casting products is slow and unproductive in mass productions, an automatic and reliable defect detection not just enhances the quality control process but positively improves productivity. However, casting defect detection is a challenging task due to diversity and variation in defects' appearance. Convolutional neural networks (CNNs) have been widely applied in both image classification and defect detection tasks. Howbeit, CNNs with frequentist inference require a massive amount of data to train on and still fall short in reporting beneficial estimates of their predictive uncertainty. Accordingly, leveraging the transfer learning paradigm, we first apply four powerful CNN-based models (VGG16, ResNet50, DenseNet121, and InceptionResNetV2) on a small dataset to extract meaningful features. Extracted features are then processed by various machine learning algorithms to perform the classification task. Simulation results demonstrate that linear support vector machine (SVM) and multi-layer perceptron (MLP) show the finest performance in defect detection of casting images. Secondly, to achieve a reliable classification and to measure epistemic uncertainty, we employ an uncertainty quantification (UQ) technique (ensemble of MLP models) using features extracted from four pre-trained CNNs. UQ confusion matrix and uncertainty accuracy metric are also utilized to evaluate the predictive uncertainty estimates. Comprehensive comparisons reveal that UQ method based on VGG16 outperforms others to fetch uncertainty. We believe an uncertainty-aware automatic defect detection solution will reinforce casting productions quality assurance.
△ Less
Submitted 24 July, 2021;
originally announced July 2021.
-
Confidence Aware Neural Networks for Skin Cancer Detection
Authors:
Donya Khaledyan,
AmirReza Tajally,
Ali Sarkhosh,
Afshar Shamsi,
Hamzeh Asgharnezhad,
Abbas Khosravi,
Saeid Nahavandi
Abstract:
Deep learning (DL) models have received particular attention in medical imaging due to their promising pattern recognition capabilities. However, Deep Neural Networks (DNNs) require a huge amount of data, and because of the lack of sufficient data in this field, transfer learning can be a great solution. DNNs used for disease diagnosis meticulously concentrate on improving the accuracy of predicti…
▽ More
Deep learning (DL) models have received particular attention in medical imaging due to their promising pattern recognition capabilities. However, Deep Neural Networks (DNNs) require a huge amount of data, and because of the lack of sufficient data in this field, transfer learning can be a great solution. DNNs used for disease diagnosis meticulously concentrate on improving the accuracy of predictions without providing a figure about their confidence of predictions. Knowing how much a DNN model is confident in a computer-aided diagnosis model is necessary for gaining clinicians' confidence and trust in DL-based solutions. To address this issue, this work presents three different methods for quantifying uncertainties for skin cancer detection from images. It also comprehensively evaluates and compares performance of these DNNs using novel uncertainty-related metrics. The obtained results reveal that the predictive uncertainty estimation methods are capable of flagging risky and erroneous predictions with a high uncertainty estimate. We also demonstrate that ensemble approaches are more reliable in capturing uncertainties through inference.
△ Less
Submitted 24 July, 2021; v1 submitted 19 July, 2021;
originally announced July 2021.
-
An overview of deep learning techniques for epileptic seizures detection and prediction based on neuroimaging modalities: Methods, challenges, and future works
Authors:
Afshin Shoeibi,
Parisa Moridian,
Marjane Khodatars,
Navid Ghassemi,
Mahboobeh Jafari,
Roohallah Alizadehsani,
Yinan Kong,
Juan Manuel Gorriz,
Javier Ramírez,
Abbas Khosravi,
Saeid Nahavandi,
U. Rajendra Acharya
Abstract:
Epilepsy is a disorder of the brain denoted by frequent seizures. The symptoms of seizure include confusion, abnormal staring, and rapid, sudden, and uncontrollable hand movements. Epileptic seizure detection methods involve neurological exams, blood tests, neuropsychological tests, and neuroimaging modalities. Among these, neuroimaging modalities have received considerable attention from speciali…
▽ More
Epilepsy is a disorder of the brain denoted by frequent seizures. The symptoms of seizure include confusion, abnormal staring, and rapid, sudden, and uncontrollable hand movements. Epileptic seizure detection methods involve neurological exams, blood tests, neuropsychological tests, and neuroimaging modalities. Among these, neuroimaging modalities have received considerable attention from specialist physicians. One method to facilitate the accurate and fast diagnosis of epileptic seizures is to employ computer-aided diagnosis systems (CADS) based on deep learning (DL) and neuroimaging modalities. This paper has studied a comprehensive overview of DL methods employed for epileptic seizures detection and prediction using neuroimaging modalities. First, DL-based CADS for epileptic seizures detection and prediction using neuroimaging modalities are discussed. Also, descriptions of various datasets, preprocessing algorithms, and DL models which have been used for epileptic seizures detection and prediction have been included. Then, research on rehabilitation tools has been presented, which contains brain-computer interface (BCI), cloud computing, internet of things (IoT), hardware implementation of DL techniques on field-programmable gate array (FPGA), etc. In the discussion section, a comparison has been carried out between research on epileptic seizure detection and prediction. The challenges in epileptic seizures detection and prediction using neuroimaging modalities and DL models have been described. In addition, possible directions for future works in this field, specifically for solving challenges in datasets, DL, rehabilitation, and hardware models, have been proposed. The final section is dedicated to the conclusion which summarizes the significant findings of the paper.
△ Less
Submitted 4 September, 2022; v1 submitted 29 May, 2021;
originally announced May 2021.
-
UncertaintyFuseNet: Robust Uncertainty-aware Hierarchical Feature Fusion Model with Ensemble Monte Carlo Dropout for COVID-19 Detection
Authors:
Moloud Abdar,
Soorena Salari,
Sina Qahremani,
Hak-Keung Lam,
Fakhri Karray,
Sadiq Hussain,
Abbas Khosravi,
U. Rajendra Acharya,
Vladimir Makarenkov,
Saeid Nahavandi
Abstract:
The COVID-19 (Coronavirus disease 2019) pandemic has become a major global threat to human health and well-being. Thus, the development of computer-aided detection (CAD) systems that are capable to accurately distinguish COVID-19 from other diseases using chest computed tomography (CT) and X-ray data is of immediate priority. Such automatic systems are usually based on traditional machine learning…
▽ More
The COVID-19 (Coronavirus disease 2019) pandemic has become a major global threat to human health and well-being. Thus, the development of computer-aided detection (CAD) systems that are capable to accurately distinguish COVID-19 from other diseases using chest computed tomography (CT) and X-ray data is of immediate priority. Such automatic systems are usually based on traditional machine learning or deep learning methods. Differently from most of existing studies, which used either CT scan or X-ray images in COVID-19-case classification, we present a simple but efficient deep learning feature fusion model, called UncertaintyFuseNet, which is able to classify accurately large datasets of both of these types of images. We argue that the uncertainty of the model's predictions should be taken into account in the learning process, even though most of existing studies have overlooked it. We quantify the prediction uncertainty in our feature fusion model using effective Ensemble MC Dropout (EMCD) technique. A comprehensive simulation study has been conducted to compare the results of our new model to the existing approaches, evaluating the performance of competing models in terms of Precision, Recall, F-Measure, Accuracy and ROC curves. The obtained results prove the efficiency of our model which provided the prediction accuracy of 99.08\% and 96.35\% for the considered CT scan and X-ray datasets, respectively. Moreover, our UncertaintyFuseNet model was generally robust to noise and performed well with previously unseen data. The source code of our implementation is freely available at: https://github.com/moloud1987/UncertaintyFuseNet-for-COVID-19-Classification.
△ Less
Submitted 30 January, 2022; v1 submitted 18 May, 2021;
originally announced May 2021.
-
Time series forecasting of new cases and new deaths rate for COVID-19 using deep learning methods
Authors:
Nooshin Ayoobi,
Danial Sharifrazi,
Roohallah Alizadehsani,
Afshin Shoeibi,
Juan M. Gorriz,
Hossein Moosaei,
Abbas Khosravi,
Saeid Nahavandi,
Abdoulmohammad Gholamzadeh Chofreh,
Feybi Ariani Goni,
Jiri Jaromir Klemes,
Amir Mosavi
Abstract:
The first known case of Coronavirus disease 2019 (COVID-19) was identified in December 2019. It has spread worldwide, leading to an ongoing pandemic, imposed restrictions and costs to many countries. Predicting the number of new cases and deaths during this period can be a useful step in predicting the costs and facilities required in the future. The purpose of this study is to predict new cases a…
▽ More
The first known case of Coronavirus disease 2019 (COVID-19) was identified in December 2019. It has spread worldwide, leading to an ongoing pandemic, imposed restrictions and costs to many countries. Predicting the number of new cases and deaths during this period can be a useful step in predicting the costs and facilities required in the future. The purpose of this study is to predict new cases and deaths rate one, three and seven-day ahead during the next 100 days. The motivation for predicting every n days (instead of just every day) is the investigation of the possibility of computational cost reduction and still achieving reasonable performance. Such a scenario may be encountered in real-time forecasting of time series. Six different deep learning methods are examined on the data adopted from the WHO website. Three methods are LSTM, Convolutional LSTM, and GRU. The bidirectional extension is then considered for each method to forecast the rate of new cases and new deaths in Australia and Iran countries.
This study is novel as it carries out a comprehensive evaluation of the aforementioned three deep learning methods and their bidirectional extensions to perform prediction on COVID-19 new cases and new death rate time series. To the best of our knowledge, this is the first time that Bi-GRU and Bi-Conv-LSTM models are used for prediction on COVID-19 new cases and new deaths time series. The evaluation of the methods is presented in the form of graphs and Friedman statistical test. The results show that the bidirectional models have lower errors than other models. A several error evaluation metrics are presented to compare all models, and finally, the superiority of bidirectional methods is determined. This research could be useful for organisations working against COVID-19 and determining their long-term plans.
△ Less
Submitted 24 December, 2021; v1 submitted 28 April, 2021;
originally announced April 2021.
-
Combining a Convolutional Neural Network with Autoencoders to Predict the Survival Chance of COVID-19 Patients
Authors:
Fahime Khozeimeh,
Danial Sharifrazi,
Navid Hoseini Izadi,
Javad Hassannataj Joloudari,
Afshin Shoeibi,
Roohallah Alizadehsani,
Juan M. Gorriz,
Sadiq Hussain,
Zahra Alizadeh Sani,
Hossein Moosaei,
Abbas Khosravi,
Saeid Nahavandi,
Sheikh Mohammed Shariful Islam
Abstract:
COVID-19 has caused many deaths worldwide. The automation of the diagnosis of this virus is highly desired. Convolutional neural networks (CNNs) have shown outstanding classification performance on image datasets. To date, it appears that COVID computer-aided diagnosis systems based on CNNs and clinical information have not yet been analysed or explored. We propose a novel method, named the CNN-AE…
▽ More
COVID-19 has caused many deaths worldwide. The automation of the diagnosis of this virus is highly desired. Convolutional neural networks (CNNs) have shown outstanding classification performance on image datasets. To date, it appears that COVID computer-aided diagnosis systems based on CNNs and clinical information have not yet been analysed or explored. We propose a novel method, named the CNN-AE, to predict the survival chance of COVID-19 patients using a CNN trained with clinical information. Notably, the required resources to prepare CT images are expensive and limited compared to those required to collect clinical data, such as blood pressure, liver disease, etc. We evaluated our method using a publicly available clinical dataset that we collected. The dataset properties were carefully analysed to extract important features and compute the correlations of features. A data augmentation procedure based on autoencoders (AEs) was proposed to balance the dataset. The experimental results revealed that the average accuracy of the CNN-AE (96.05%) was higher than that of the CNN (92.49%). To demonstrate the generality of our augmentation method, we trained some existing mortality risk prediction methods on our dataset (with and without data augmentation) and compared their performances. We also evaluated our method using another dataset for further generality verification. To show that clinical data can be used for COVID-19 survival chance prediction, the CNN-AE was compared with multiple pre-trained deep models that were tuned based on CT images.
△ Less
Submitted 8 August, 2021; v1 submitted 18 April, 2021;
originally announced April 2021.
-
Fusion of convolution neural network, support vector machine and Sobel filter for accurate detection of COVID-19 patients using X-ray images
Authors:
Danial Sharifrazi,
Roohallah Alizadehsani,
Mohamad Roshanzamir,
Javad Hassannataj Joloudari,
Afshin Shoeibi,
Mahboobeh Jafari,
Sadiq Hussain,
Zahra Alizadeh Sani,
Fereshteh Hasanzadeh,
Fahime Khozeimeh,
Abbas Khosravi,
Saeid Nahavandi,
Maryam Panahiazar,
Assef Zare,
Sheikh Mohammed Shariful Islam,
U Rajendra Acharya
Abstract:
The coronavirus (COVID-19) is currently the most common contagious disease which is prevalent all over the world. The main challenge of this disease is the primary diagnosis to prevent secondary infections and its spread from one person to another. Therefore, it is essential to use an automatic diagnosis system along with clinical procedures for the rapid diagnosis of COVID-19 to prevent its sprea…
▽ More
The coronavirus (COVID-19) is currently the most common contagious disease which is prevalent all over the world. The main challenge of this disease is the primary diagnosis to prevent secondary infections and its spread from one person to another. Therefore, it is essential to use an automatic diagnosis system along with clinical procedures for the rapid diagnosis of COVID-19 to prevent its spread. Artificial intelligence techniques using computed tomography (CT) images of the lungs and chest radiography have the potential to obtain high diagnostic performance for Covid-19 diagnosis. In this study, a fusion of convolutional neural network (CNN), support vector machine (SVM), and Sobel filter is proposed to detect COVID-19 using X-ray images. A new X-ray image dataset was collected and subjected to high pass filter using a Sobel filter to obtain the edges of the images. Then these images are fed to CNN deep learning model followed by SVM classifier with ten-fold cross validation strategy. This method is designed so that it can learn with not many data. Our results show that the proposed CNN-SVM with Sobel filtering (CNN-SVM+Sobel) achieved the highest classification accuracy of 99.02% in accurate detection of COVID-19. It showed that using Sobel filter can improve the performance of CNN. Unlike most of the other researches, this method does not use a pre-trained network. We have also validated our developed model using six public databases and obtained the highest performance. Hence, our developed model is ready for clinical application
△ Less
Submitted 13 February, 2021;
originally announced February 2021.
-
Uncertainty-Aware Semi-Supervised Method Using Large Unlabeled and Limited Labeled COVID-19 Data
Authors:
Roohallah Alizadehsani,
Danial Sharifrazi,
Navid Hoseini Izadi,
Javad Hassannataj Joloudari,
Afshin Shoeibi,
Juan M. Gorriz,
Sadiq Hussain,
Juan E. Arco,
Zahra Alizadeh Sani,
Fahime Khozeimeh,
Abbas Khosravi,
Saeid Nahavandi,
Sheikh Mohammed Shariful Islam,
U Rajendra Acharya
Abstract:
The new coronavirus has caused more than one million deaths and continues to spread rapidly. This virus targets the lungs, causing respiratory distress which can be mild or severe. The X-ray or computed tomography (CT) images of lungs can reveal whether the patient is infected with COVID-19 or not. Many researchers are trying to improve COVID-19 detection using artificial intelligence. Our motivat…
▽ More
The new coronavirus has caused more than one million deaths and continues to spread rapidly. This virus targets the lungs, causing respiratory distress which can be mild or severe. The X-ray or computed tomography (CT) images of lungs can reveal whether the patient is infected with COVID-19 or not. Many researchers are trying to improve COVID-19 detection using artificial intelligence. Our motivation is to develop an automatic method that can cope with scenarios in which preparing labeled data is time consuming or expensive. In this article, we propose a Semi-supervised Classification using Limited Labeled Data (SCLLD) relying on Sobel edge detection and Generative Adversarial Networks (GANs) to automate the COVID-19 diagnosis. The GAN discriminator output is a probabilistic value which is used for classification in this work. The proposed system is trained using 10,000 CT scans collected from Omid Hospital, whereas a public dataset is also used for validating our system. The proposed method is compared with other state-of-the-art supervised methods such as Gaussian processes. To the best of our knowledge, this is the first time a semi-supervised method for COVID-19 detection is presented. Our system is capable of learning from a mixture of limited labeled and unlabeled data where supervised learners fail due to a lack of sufficient amount of labeled data. Thus, our semi-supervised training method significantly outperforms the supervised training of Convolutional Neural Network (CNN) when labeled training data is scarce. The 95% confidence intervals for our method in terms of accuracy, sensitivity, and specificity are 99.56 +- 0.20%, 99.88 +- 0.24%, and 99.40 +- 0.18%, respectively, whereas intervals for the CNN (trained supervised) are 68.34 +- 4.11%, 91.2 +- 6.15%, and 46.40 +- 5.21%.
△ Less
Submitted 24 December, 2021; v1 submitted 12 February, 2021;
originally announced February 2021.
-
Objective Evaluation of Deep Uncertainty Predictions for COVID-19 Detection
Authors:
Hamzeh Asgharnezhad,
Afshar Shamsi,
Roohallah Alizadehsani,
Abbas Khosravi,
Saeid Nahavandi,
Zahra Alizadeh Sani,
Dipti Srinivasan
Abstract:
Deep neural networks (DNNs) have been widely applied for detecting COVID-19 in medical images. Existing studies mainly apply transfer learning and other data representation strategies to generate accurate point estimates. The generalization power of these networks is always questionable due to being developed using small datasets and failing to report their predictive confidence. Quantifying uncer…
▽ More
Deep neural networks (DNNs) have been widely applied for detecting COVID-19 in medical images. Existing studies mainly apply transfer learning and other data representation strategies to generate accurate point estimates. The generalization power of these networks is always questionable due to being developed using small datasets and failing to report their predictive confidence. Quantifying uncertainties associated with DNN predictions is a prerequisite for their trusted deployment in medical settings. Here we apply and evaluate three uncertainty quantification techniques for COVID-19 detection using chest X-Ray (CXR) images. The novel concept of uncertainty confusion matrix is proposed and new performance metrics for the objective evaluation of uncertainty estimates are introduced. Through comprehensive experiments, it is shown that networks pertained on CXR images outperform networks pretrained on natural image datasets such as ImageNet. Qualitatively and quantitatively evaluations also reveal that the predictive uncertainty estimates are statistically higher for erroneous predictions than correct predictions. Accordingly, uncertainty quantification methods are capable of flagging risky predictions with high uncertainty estimates. We also observe that ensemble methods more reliably capture uncertainties during the inference.
△ Less
Submitted 22 December, 2020;
originally announced December 2020.
-
A Review of Uncertainty Quantification in Deep Learning: Techniques, Applications and Challenges
Authors:
Moloud Abdar,
Farhad Pourpanah,
Sadiq Hussain,
Dana Rezazadegan,
Li Liu,
Mohammad Ghavamzadeh,
Paul Fieguth,
Xiaochun Cao,
Abbas Khosravi,
U Rajendra Acharya,
Vladimir Makarenkov,
Saeid Nahavandi
Abstract:
Uncertainty quantification (UQ) plays a pivotal role in reduction of uncertainties during both optimization and decision making processes. It can be applied to solve a variety of real-world applications in science and engineering. Bayesian approximation and ensemble learning techniques are two most widely-used UQ methods in the literature. In this regard, researchers have proposed different UQ met…
▽ More
Uncertainty quantification (UQ) plays a pivotal role in reduction of uncertainties during both optimization and decision making processes. It can be applied to solve a variety of real-world applications in science and engineering. Bayesian approximation and ensemble learning techniques are two most widely-used UQ methods in the literature. In this regard, researchers have proposed different UQ methods and examined their performance in a variety of applications such as computer vision (e.g., self-driving cars and object detection), image processing (e.g., image restoration), medical image analysis (e.g., medical image classification and segmentation), natural language processing (e.g., text classification, social media texts and recidivism risk-scoring), bioinformatics, etc. This study reviews recent advances in UQ methods used in deep learning. Moreover, we also investigate the application of these methods in reinforcement learning (RL). Then, we outline a few important applications of UQ methods. Finally, we briefly highlight the fundamental research challenges faced by UQ methods and discuss the future research directions in this field.
△ Less
Submitted 5 January, 2021; v1 submitted 12 November, 2020;
originally announced November 2020.
-
Regular black holes with stable cores
Authors:
Alfio Bonanno,
Amir-Pouyan Khosravi,
Frank Saueressig
Abstract:
Non-singular black hole geometries typically come with two spacetime horizons: an (outer) event horizon and an (inner) Cauchy horizon. This nurtures the speculation that they may be subject to a mass-inflation effect which renders the Cauchy horizon unstable. We analyze the dynamics associated with spherically symmetric, regular black holes taking the full backreaction between the infalling matter…
▽ More
Non-singular black hole geometries typically come with two spacetime horizons: an (outer) event horizon and an (inner) Cauchy horizon. This nurtures the speculation that they may be subject to a mass-inflation effect which renders the Cauchy horizon unstable. We analyze the dynamics associated with spherically symmetric, regular black holes taking the full backreaction between the infalling matter and geometry into account. On this basis, we identify the crucial features taming the growth of the mass function and diminishing the curvature singularity at the Cauchy horizon. It is demonstrated explicitly that the regular black hole solutions proposed by Hayward and obtained from Asymptotic Safety satisfy these properties.
△ Less
Submitted 16 November, 2022; v1 submitted 8 October, 2020;
originally announced October 2020.
-
Handling of uncertainty in medical data using machine learning and probability theory techniques: A review of 30 years (1991-2020)
Authors:
Roohallah Alizadehsani,
Mohamad Roshanzamir,
Sadiq Hussain,
Abbas Khosravi,
Afsaneh Koohestani,
Mohammad Hossein Zangooei,
Moloud Abdar,
Adham Beykikhoshk,
Afshin Shoeibi,
Assef Zare,
Maryam Panahiazar,
Saeid Nahavandi,
Dipti Srinivasan,
Amir F. Atiya,
U. Rajendra Acharya
Abstract:
Understanding data and reaching valid conclusions are of paramount importance in the present era of big data. Machine learning and probability theory methods have widespread application for this purpose in different fields. One critically important yet less explored aspect is how data and model uncertainties are captured and analyzed. Proper quantification of uncertainty provides valuable informat…
▽ More
Understanding data and reaching valid conclusions are of paramount importance in the present era of big data. Machine learning and probability theory methods have widespread application for this purpose in different fields. One critically important yet less explored aspect is how data and model uncertainties are captured and analyzed. Proper quantification of uncertainty provides valuable information for optimal decision making. This paper reviewed related studies conducted in the last 30 years (from 1991 to 2020) in handling uncertainties in medical data using probability theory and machine learning techniques. Medical data is more prone to uncertainty due to the presence of noise in the data. So, it is very important to have clean medical data without any noise to get accurate diagnosis. The sources of noise in the medical data need to be known to address this issue. Based on the medical data obtained by the physician, diagnosis of disease, and treatment plan are prescribed. Hence, the uncertainty is growing in healthcare and there is limited knowledge to address these problems. We have little knowledge about the optimal treatment methods as there are many sources of uncertainty in medical science. Our findings indicate that there are few challenges to be addressed in handling the uncertainty in medical raw data and new models. In this work, we have summarized various methods employed to overcome this problem. Nowadays, application of novel deep learning techniques to deal such uncertainties have significantly increased.
△ Less
Submitted 23 August, 2020;
originally announced August 2020.
-
Development of novel algorithm to visualize blood vessels on 3D ultrasound images during liver surgery
Authors:
Fatemeh Salehihafshejani,
Alireza Ahmadian,
Afshin Shoeibi,
Roohallah Alizadehsani,
Habibollah Dashti,
Niloofar Ayoobi Yazdi,
Abbas Khosravi,
Saeid Nahavandi
Abstract:
Volume visualization is a method that displays three-dimensional (3D) data in two-dimensional (2D) space. Using 3D datasets instead of 2D traditional images improves the visualization of anatomical structures, and volume visualization helps radiologists and surgeons to review large datasets comprehensively so that diagnosis and treatment can be enhanced. In liver surgery, blood vessel detection is…
▽ More
Volume visualization is a method that displays three-dimensional (3D) data in two-dimensional (2D) space. Using 3D datasets instead of 2D traditional images improves the visualization of anatomical structures, and volume visualization helps radiologists and surgeons to review large datasets comprehensively so that diagnosis and treatment can be enhanced. In liver surgery, blood vessel detection is important. Liver vessels have various shapes and due to the presence of noise in the ultrasound images, they can be confused with noise. Suboptimal images can sometimes lead to surgical errors where the surgeon may cut the blood vessel in error. The ultrasound system is versatile and portable and has the advantage of being able to be used in the operating theatre. Due to the nature of B-mode ultrasound, 1-D transfer function volume visualization of images cannot abrogate shadow artifacts. While multi-dimensional transfer function improves the ability to define features of interest, the high dimensionality in the parameter domain renders it unwieldy and difficult for clinicians to work with. To overcome these limitations, an algorithm for volume visualization that can provide effective 3D visualization of noisy B-mode ultrasound images, which can be useful for clinicians, is proposed. We propose a method that is appropriate for liver ultrasound images focusing on vessels and tumors (if present) in order to delineate their structure and positions clearly to preempt surgical error during operation. This method can prevent possible errors during liver surgery by providing more detailed high quality 3D images for clinicians. Key Words: Visualization, 3D ultrasound image, Volume Rendering, Liver surgery, Liver vessels.
△ Less
Submitted 19 August, 2020;
originally announced August 2020.