Search | arXiv e-print repository

A Transformer-Based Multi-Stream Approach for Isolated Iranian Sign Language Recognition

Authors: Ali Ghadami, Alireza Taheri, Ali Meghdari

Abstract: Sign language is an essential means of communication for millions of people around the world and serves as their primary language. However, most communication tools are developed for spoken and written languages which can cause problems and difficulties for the deaf and hard of hearing community. By developing a sign language recognition system, we can bridge this communication gap and enable peop… ▽ More Sign language is an essential means of communication for millions of people around the world and serves as their primary language. However, most communication tools are developed for spoken and written languages which can cause problems and difficulties for the deaf and hard of hearing community. By developing a sign language recognition system, we can bridge this communication gap and enable people who use sign language as their main form of expression to better communicate with people and their surroundings. This recognition system increases the quality of health services, improves public services, and creates equal opportunities for the deaf community. This research aims to recognize Iranian Sign Language words with the help of the latest deep learning tools such as transformers. The dataset used includes 101 Iranian Sign Language words frequently used in academic environments such as universities. The network used is a combination of early fusion and late fusion transformer encoder-based networks optimized with the help of genetic algorithm. The selected features to train this network include hands and lips key points, and the distance and angle between hands extracted from the sign videos. Also, in addition to the training model for the classes, the embedding vectors of words are used as multi-task learning to have smoother and more efficient training. This model was also tested on sentences generated from our word dataset using a windowing technique for sentence translation. Finally, the sign language training software that provides real-time feedback to users with the help of the developed model, which has 90.2% accuracy on test data, was introduced, and in a survey, the effectiveness and efficiency of this type of sign language learning software and the impact of feedback were investigated. △ Less

Submitted 27 June, 2024; originally announced July 2024.

Comments: 17 pages, 10 figures

arXiv:2406.18333 [pdf]

Continuous Sign Language Recognition Using Intra-inter Gloss Attention

Authors: Hossein Ranjbar, Alireza Taheri

Abstract: Many continuous sign language recognition (CSLR) studies adopt transformer-based architectures for sequence modeling due to their powerful capacity for capturing global contexts. Nevertheless, vanilla self-attention, which serves as the core module of the transformer, calculates a weighted average over all time steps; therefore, the local temporal semantics of sign videos may not be fully exploite… ▽ More Many continuous sign language recognition (CSLR) studies adopt transformer-based architectures for sequence modeling due to their powerful capacity for capturing global contexts. Nevertheless, vanilla self-attention, which serves as the core module of the transformer, calculates a weighted average over all time steps; therefore, the local temporal semantics of sign videos may not be fully exploited. In this study, we introduce a novel module in sign language recognition studies, called intra-inter gloss attention module, to leverage the relationships among frames within glosses and the semantic and grammatical dependencies between glosses in the video. In the intra-gloss attention module, the video is divided into equally sized chunks and a self-attention mechanism is applied within each chunk. This localized self-attention significantly reduces complexity and eliminates noise introduced by considering non-relative frames. In the inter-gloss attention module, we first aggregate the chunk-level features within each gloss chunk by average pooling along the temporal dimension. Subsequently, multi-head self-attention is applied to all chunk-level features. Given the non-significance of the signer-environment interaction, we utilize segmentation to remove the background of the videos. This enables the proposed model to direct its focus toward the signer. Experimental results on the PHOENIX-2014 benchmark dataset demonstrate that our method can effectively extract sign language features in an end-to-end manner without any prior knowledge, improve the accuracy of CSLR, and achieve the word error rate (WER) of 20.4 on the test set which is a competitive result compare to the state-of-the-art which uses additional supervisions. △ Less

Submitted 26 June, 2024; originally announced June 2024.

arXiv:2404.01749 [pdf, ps, other]

Curvature conditions, Liouville-type theorems and Harnack inequalities for a nonlinear parabolic equation on smooth metric measure spaces

Authors: Ali Taheri, Vahideh Vahidifar

Abstract: In this paper we prove gradient estimates of both elliptic and parabolic types, specifically, of Souplet-Zhang, Hamilton and Li-Yau types for positive smooth solutions to a class of nonlinear parabolic equations involving the Witten or drifting Laplacian on smooth metric measure spaces. These estimates are established under various curvature conditions and lower bounds on the generalised Bakry-Éme… ▽ More In this paper we prove gradient estimates of both elliptic and parabolic types, specifically, of Souplet-Zhang, Hamilton and Li-Yau types for positive smooth solutions to a class of nonlinear parabolic equations involving the Witten or drifting Laplacian on smooth metric measure spaces. These estimates are established under various curvature conditions and lower bounds on the generalised Bakry-Émery Ricci tensor and find utility in proving elliptic and parabolic Harnack-type inequalities as well as general elliptic and parabolic Liouville-type and other global constancy results. Several applications and consequences are presented and discussed. △ Less

Submitted 2 April, 2024; originally announced April 2024.

Comments: 44 pages

arXiv:2402.07853 [pdf, ps, other]

A New Algorithm for Computing the Frobenius Number

Authors: Abbas Taheri, Saeid Alikhani

Abstract: A number $α$ has a representation with respect to the numbers $α_1,...,α_n$, if there exist the non-negative integers $λ_1,... ,λ_n$ such that $α=λ_1α_1+...+λ_n α_n$. The largest natural number that does not have a representation with respect to the numbers $α_1,...,α_n$ is called the Frobenius number and is denoted by the symbol $g(α_1,...,α_n)$. In this paper, we present a new algorithm to calcu… ▽ More A number $α$ has a representation with respect to the numbers $α_1,...,α_n$, if there exist the non-negative integers $λ_1,... ,λ_n$ such that $α=λ_1α_1+...+λ_n α_n$. The largest natural number that does not have a representation with respect to the numbers $α_1,...,α_n$ is called the Frobenius number and is denoted by the symbol $g(α_1,...,α_n)$. In this paper, we present a new algorithm to calculate the Frobenius number. Also we present the sequential form of the new algorithm. △ Less

Submitted 12 February, 2024; v1 submitted 12 February, 2024; originally announced February 2024.

Comments: 6 pages

MSC Class: 01B39; 11D04

arXiv:2312.07671 [pdf]

Reacting like Humans: Incorporating Intrinsic Human Behaviors into NAO through Sound-Based Reactions to Fearful and Shocking Events for Enhanced Sociability

Authors: Ali Ghadami, Mohammadreza Taghimohammadi, Mohammad Mohammadzadeh, Mohammad Hosseinipour, Alireza Taheri

Abstract: Robots' acceptability among humans and their sociability can be significantly enhanced by incorporating human-like reactions. Humans can react to environmental events very quickly and without thinking. An instance where humans show natural reactions is when they encounter a sudden and loud sound that startles or frightens them. During such moments, individuals may instinctively move their hands, t… ▽ More Robots' acceptability among humans and their sociability can be significantly enhanced by incorporating human-like reactions. Humans can react to environmental events very quickly and without thinking. An instance where humans show natural reactions is when they encounter a sudden and loud sound that startles or frightens them. During such moments, individuals may instinctively move their hands, turn toward the origin of the sound, and try to determine the event's cause. This inherent behavior motivated us to explore this less-studied part of social robotics. In this work, a multi-modal system composed of an action generator, sound classifier, and YOLO object detector was designed to sense the environment and, in the presence of sudden loud sounds, show natural human fear reactions; and finally, locate the fear-causing sound source in the environment. These valid generated motions and inferences could imitate intrinsic human reactions and enhance the sociability of robots. For motion generation, a model based on LSTM and MDN networks was proposed to synthesize various motions. Also, in the case of sound detection, a transfer learning model was preferred that used the spectrogram of the sound signals as its input. After developing individual models for sound detection, motion generation, and image recognition, they were integrated into a comprehensive "fear" module implemented on the NAO robot. Finally, the fear module was tested in practical application and two groups of experts and non-experts (in the robotics area) filled out a questionnaire to evaluate the performance of the robot. We indicated that the proposed module could convince the participants that the Nao robot acts and reasons like a human when a sudden and loud sound is in the robot's peripheral environment, and additionally showed that non-experts have higher expectations about social robots and their performance. △ Less

Submitted 5 June, 2024; v1 submitted 12 December, 2023; originally announced December 2023.

Comments: 16 pages, 11 figures

MSC Class: 68T40

arXiv:2309.02402 [pdf, other]

Breaking Barriers to Creative Expression: Co-Designing and Implementing an Accessible Text-to-Image Interface

Authors: Atieh Taheri, Mohammad Izadi, Gururaj Shriram, Negar Rostamzadeh, Shaun Kane

Abstract: Text-to-image generation models have grown in popularity due to their ability to produce high-quality images from a text prompt. One use for this technology is to enable the creation of more accessible art creation software. In this paper, we document the development of an alternative user interface that reduces the typing effort needed to enter image prompts by providing suggestions from a large… ▽ More Text-to-image generation models have grown in popularity due to their ability to produce high-quality images from a text prompt. One use for this technology is to enable the creation of more accessible art creation software. In this paper, we document the development of an alternative user interface that reduces the typing effort needed to enter image prompts by providing suggestions from a large language model, developed through iterative design and testing within the project team. The results of this testing demonstrate how generative text models can support the accessibility of text-to-image models, enabling users with a range of abilities to create visual art. △ Less

Submitted 5 September, 2023; originally announced September 2023.

Comments: 12 pages, 2 figures

ACM Class: J.5; J.6; I.2.7

arXiv:2307.16681 [pdf, other]

Towards Energy Efficient Control for Commercial Heavy-Duty Mobile Cranes: Modeling Hydraulic Pressures using Machine Learning

Authors: Abdolreza Taheri, Robert Pettersson, Pelle Gustafsson, Joni Pajarinen, Reza Ghabcheloo

Abstract: A sizable part of the fleet of heavy-duty machinery in the construction equipment industry uses the conventional valve-controlled load-sensing hydraulics. Rigorous climate actions towards reducing CO$_{2}$ emissions has sparked the development of solutions to lower the energy consumption and increase the productivity of the machines. One promising solution to having a better balance between energy… ▽ More A sizable part of the fleet of heavy-duty machinery in the construction equipment industry uses the conventional valve-controlled load-sensing hydraulics. Rigorous climate actions towards reducing CO$_{2}$ emissions has sparked the development of solutions to lower the energy consumption and increase the productivity of the machines. One promising solution to having a better balance between energy and performance is to build accurate models (digital twins) of the real systems using data together with recent advances in machine learning/model-based optimization to improve the control systems. With a particular focus on real-world machines with multiple flow-controlled actuators and shared variable-displacement pumps, this paper presents a generalized machine learning approach to modeling the working pressure of the actuators and the overall pump pressures. The procedures for deriving reaction forces and flow rates as important input variables to the surrogate models are described in detail. Using data from a real loader crane testbed, we demonstrate training and validation of individual models, and showcase the accuracy of pressure predictions in five different experiments under various utilizations and pressure levels. △ Less

Submitted 31 July, 2023; originally announced July 2023.

Comments: Published in The 18th Scandinavian International Conference on Fluid Power, SICFP'23, May 30- June 1, 2023, Tampere, Finland

Journal ref: 18th Scandinavian International Conference on Fluid Power, SICFP'23, May 30- June 1, 2023, Tampere, Finland

arXiv:2306.08639 [pdf, ps, other]

Souplet-Zhang and Hamilton type gradient estimates for nonlinear elliptic equations on smooth metric measure spaces

Authors: Ali Taheri, Vahideh Vahidifar

Abstract: In this article we present new gradient estimates for positive solutions to a class of nonlinear elliptic equations involving the f-Laplacian on a smooth metric measure space. The gradient estimates of interest are of Souplet-Zhang and Hamilton types respectively and are established under natural lower bounds on the generalised Bakry-Émery Ricci curvature tensor. From these estimates we derive amo… ▽ More In this article we present new gradient estimates for positive solutions to a class of nonlinear elliptic equations involving the f-Laplacian on a smooth metric measure space. The gradient estimates of interest are of Souplet-Zhang and Hamilton types respectively and are established under natural lower bounds on the generalised Bakry-Émery Ricci curvature tensor. From these estimates we derive amongst other things Harnack inequalities and general global constancy and Liouville-type theorems. The results and approach undertaken here provide a unified treatment and extend and improve various existing results in the literature. Some implications and applications are presented and discussed. △ Less

Submitted 14 June, 2023; originally announced June 2023.

Comments: 27 pages

Journal ref: Mathematika 69(3) 2023

arXiv:2304.04005 [pdf]

A new transformation for embedded convolutional neural network approach toward real-time servo motor overload fault-detection

Authors: Seyed Mohammad Hossein Abedy Nejad, Mohammad Amin Behzadi, Abdolrahim Taheri

Abstract: Overloading in DC servo motors is a major concern in industries, as many companies face the problem of finding expert operators, and also human monitoring may not be an effective solution. Therefore, this paper proposed an embedded Artificial intelligence (AI) approach using a Convolutional Neural Network (CNN) using a new transformation to extract faults from real-time input signals without human… ▽ More Overloading in DC servo motors is a major concern in industries, as many companies face the problem of finding expert operators, and also human monitoring may not be an effective solution. Therefore, this paper proposed an embedded Artificial intelligence (AI) approach using a Convolutional Neural Network (CNN) using a new transformation to extract faults from real-time input signals without human interference. Our main purpose is to extract as many as possible features from the input signal to achieve a relaxed dataset that results in an effective but compact network to provide real-time fault detection even in a low-memory microcontroller. Besides, fault detection method a synchronous dual-motor system is also proposed to take action in faulty events. To fulfill this intention, a one-dimensional input signal from the output current of each DC servo motor is monitored and transformed into a 3d stack of data and then the CNN is implemented into the processor to detect any fault corresponding to overloading, finally experimental setup results in 99.9997% accuracy during testing for a model with nearly 8000 parameters. In addition, the proposed dual-motor system could achieve overload reduction and provide a fault-tolerant system and it is shown that this system also takes advantage of less energy consumption. △ Less

Submitted 8 April, 2023; originally announced April 2023.

arXiv:2303.05802 [pdf, ps, other]

Gradient estimates for a nonlinear parabolic equation on smooth metric measure spaces with evolving metrics and potentials

Authors: Ali Taheri, Vahideh Vahidifar

Abstract: This article presents new parabolic and elliptic type gradient estimates for positive smooth solutions to a nonlinear parabolic equation involving the Witten Laplacian in the context of smooth metric measure spaces. The metric and potential here are time dependent and evolve under a super Perelman-Ricci flow. The estimates are derived under natural lower bounds on the associated generalised Bakry-… ▽ More This article presents new parabolic and elliptic type gradient estimates for positive smooth solutions to a nonlinear parabolic equation involving the Witten Laplacian in the context of smooth metric measure spaces. The metric and potential here are time dependent and evolve under a super Perelman-Ricci flow. The estimates are derived under natural lower bounds on the associated generalised Bakry-Émery Ricci curvature tensors and are utilised in establishing fairly general local and global bounds, Harnack-type inequalities and Liouville-type global constancy theorems to mention a few. Other implications and consequences of the results are also discussed. △ Less

Submitted 10 March, 2023; originally announced March 2023.

Comments: 41 pages

Journal ref: Nonlinear Analysis 2023

arXiv:2303.01109 [pdf, ps, other]

Gradient estimates for nonlinear elliptic equations involving the Witten Laplacian on smooth metric measure spaces and implications

Authors: Ali Taheri, Vahideh Vahidifar

Abstract: This article presents new local and global gradient estimates of Li-Yau type for positive solutions to a class of nonlinear elliptic equations on smooth metric measure spaces involving the Witten Laplacian. The estimates are derived under natural lower bounds on the associated Bakry-Émery Ricci curvature tensor and find utility in proving general Harnack inequalities and Liouville-type theorems to… ▽ More This article presents new local and global gradient estimates of Li-Yau type for positive solutions to a class of nonlinear elliptic equations on smooth metric measure spaces involving the Witten Laplacian. The estimates are derived under natural lower bounds on the associated Bakry-Émery Ricci curvature tensor and find utility in proving general Harnack inequalities and Liouville-type theorems to mention a few. The results here unify, extend and improve various existing results in the literature for special nonlinearities already of huge interest and applications. Some important consequences are presented and discussed. △ Less

Submitted 2 March, 2023; originally announced March 2023.

Comments: 20 pages

Journal ref: Advances in Nonlinear Analysis 2023

arXiv:2302.07998 [pdf]

cGAN-Based High Dimensional IMU Sensor Data Generation for Enhanced Human Activity Recognition in Therapeutic Activities

Authors: Mohammad Mohammadzadeh, Ali Ghadami, Alireza Taheri, Saeed Behzadipour

Abstract: Human activity recognition is a core technology for applications such as rehabilitation, health monitoring, and human-computer interactions. Wearable devices, especially IMU sensors, provide rich features of human movements at a reasonable cost, which can be leveraged in activity recognition. Developing a robust classifier for activity recognition has always been of interest to researchers. One ma… ▽ More Human activity recognition is a core technology for applications such as rehabilitation, health monitoring, and human-computer interactions. Wearable devices, especially IMU sensors, provide rich features of human movements at a reasonable cost, which can be leveraged in activity recognition. Developing a robust classifier for activity recognition has always been of interest to researchers. One major problem is that there is usually a deficit of training data, which makes developing deep classifiers difficult and sometimes impossible. In this work, a novel GAN network called TheraGAN was developed to generate IMU signals associated with rehabilitation activities. The generated signal comprises data from a 6-channel IMU, i.e., angular velocities and linear accelerations. Also, introducing simple activities simplified the generation process for activities of varying lengths. To evaluate the generated signals, several qualitative and quantitative studies were conducted, including perceptual similarity analysis, comparing manually extracted features to those from real data, visual inspection, and an investigation into how the generated data affects the performance of three deep classifiers trained on the generated and real data. The results showed that the generated signals closely mimicked the real signals, and adding generated data resulted in a significant improvement in the performance of all tested networks. Among the tested networks, the LSTM classifier demonstrated the most significant improvement, achieving a 13.27% boost, effectively addressing the challenge of data scarcity. This shows the validity of the generated data as well as TheraGAN as a tool to build more robust classifiers in case of imbalanced and insufficient data problems. △ Less

Submitted 14 February, 2024; v1 submitted 15 February, 2023; originally announced February 2023.

arXiv:2206.03076 [pdf, ps, other]

On Multiple Solutions to a Family of Nonlinear Elliptic Systems in Divergence Form Coupled with an Incompressibility Constraint

Authors: Ali Taheri, Vahideh Vahidifar

Abstract: The aim of this paper is to prove the existence of multiple solutions for a family of nonlinear elliptic systems in divergence form coupled with a pointwise gradient constraint: \begin{align*} \left\{ \begin{array}{ll} \dive\{\A(|x|,|u|^2,|\nabla u|^2) \nabla u\} + \B(|x|,|u|^2,|\nabla u|^2) u = \dive \{ \mcP(x) [{\rm cof}\,\nabla u] \} \quad &\text{ in} \ Ω, \\ \text{det}\, \nabla u = 1 \ &\text{… ▽ More The aim of this paper is to prove the existence of multiple solutions for a family of nonlinear elliptic systems in divergence form coupled with a pointwise gradient constraint: \begin{align*} \left\{ \begin{array}{ll} \dive\{\A(|x|,|u|^2,|\nabla u|^2) \nabla u\} + \B(|x|,|u|^2,|\nabla u|^2) u = \dive \{ \mcP(x) [{\rm cof}\,\nabla u] \} \quad &\text{ in} \ Ω, \\ \text{det}\, \nabla u = 1 \ &\text{ in} \ Ω, \\ u =\varphi \ &\text{ on} \ \partial Ω, \end{array} \right. \end{align*} where $Ω\subset \mathbb{R}^n$ ($n \ge 2$) is a bounded domain, $u=(u_1, \dots, u_n)$ is a vector-map and $\varphi$ is a prescribed boundary condition. Moreover $\mathscr{P}$ is a hydrostatic pressure associated with the constraint $\det \nabla u \equiv 1$ and $\A = \A(|x|,|u|^2,|\nabla u|^2)$, $\B = \B(|x|,|u|^2,|\nabla u|^2)$ are sufficiently regular scalar-valued functions satisfying suitable growths at infinity. The system arises in diverse areas, e.g., in continuum mechanics and nonlinear elasticity, as well as geometric function theory to name a few and a clear understanding of the form and structure of the solutions set is of great significance. The geometric type of solutions constructed here draws upon intimate links with the Lie group ${\bf SO}(n)$, its Lie exponential and the multi-dimensional curl operator acting on certain vector fields. Most notably a discriminant type quantity $Δ=Δ(\A,\B)$, prompting from the PDE, will be shown to have a decisive role on the structure and multiplicity of these solutions. △ Less

Submitted 7 June, 2022; originally announced June 2022.

Comments: 24 pages

arXiv:2202.13638 [pdf, other]

GPU-Accelerated Policy Optimization via Batch Automatic Differentiation of Gaussian Processes for Real-World Control

Authors: Abdolreza Taheri, Joni Pajarinen, Reza Ghabcheloo

Abstract: The ability of Gaussian processes (GPs) to predict the behavior of dynamical systems as a more sample-efficient alternative to parametric models seems promising for real-world robotics research. However, the computational complexity of GPs has made policy search a highly time and memory consuming process that has not been able to scale to larger problems. In this work, we develop a policy optimiza… ▽ More The ability of Gaussian processes (GPs) to predict the behavior of dynamical systems as a more sample-efficient alternative to parametric models seems promising for real-world robotics research. However, the computational complexity of GPs has made policy search a highly time and memory consuming process that has not been able to scale to larger problems. In this work, we develop a policy optimization method by leveraging fast predictive sampling methods to process batches of trajectories in every forward pass, and compute gradient updates over policy parameters by automatic differentiation of Monte Carlo evaluations, all on GPU. We demonstrate the effectiveness of our approach in training policies on a set of reference-tracking control experiments with a heavy-duty machine. Benchmark results show a significant speedup over exact methods and showcase the scalability of our method to larger policy networks, longer horizons, and up to thousands of trajectories with a sublinear drop in speed. △ Less

Submitted 28 February, 2022; originally announced February 2022.

Comments: Accepted for publication in 2022 International Conference on Robotics and Automation (ICRA)

arXiv:2112.02662 [pdf]

doi 10.1109/ICTE51655.2021.9584498

Autonomous Heavy-Duty Mobile Machinery: A Multidisciplinary Collaborative Challenge

Authors: Tyrone Machado, David Fassbender, Abdolreza Taheri, Daniel Eriksson, Himanshu Gupta, Amirmasoud Molaei, Paolo Forte, Prashant Rai, Reza Ghabcheloo, Saku Mäkinen, Achim Lilienthal, Henrik Andreasson, Marcus Geimer

Abstract: Heavy-duty mobile machines (HDMMs) are a wide range of machinery used in diverse and critical application areas which are currently facing several issues like skilled labor shortage, poor safety records, and harsh work environments. Consequently, efforts are underway to increase automation in HDMMs for increased productivity and safety, eventually transitioning to operator-less autonomous HDMMs to… ▽ More Heavy-duty mobile machines (HDMMs) are a wide range of machinery used in diverse and critical application areas which are currently facing several issues like skilled labor shortage, poor safety records, and harsh work environments. Consequently, efforts are underway to increase automation in HDMMs for increased productivity and safety, eventually transitioning to operator-less autonomous HDMMs to address skilled labor shortages. However, HDMM are complex machines requiring continuous physical and cognitive inputs from human-operators. Thus, developing autonomous HDMM is a huge challenge, with current research and developments being performed in several independent research domains. Through this study, we use the bounded rationality concept to propose multidisciplinary collaborations for new autonomous HDMMs and apply the transaction cost economics framework to suggest future implications in the HDMM industry. Furthermore, we introduce a conceptual understanding of collaborations in the autonomous HDMM as a unified approach, while highlighting the practical implications and challenges of the complex nature of such multidisciplinary collaborations. The collaborative challenges and potentials are mapped out between the following topics: mechanical systems, AI methods, software systems, sensors, connectivity, simulations and process optimization, business cases, organization theories, and finally, regulatory frameworks. △ Less

Submitted 9 January, 2022; v1 submitted 5 December, 2021; originally announced December 2021.

Comments: published in 2021 IEEE International Conference on Technology and Entrepreneurship (ICTE)

Journal ref: 2021 IEEE International Conference on Technology and Entrepreneurship (ICTE), 2021, Kaunas, Lithuania, pp. 1-8

arXiv:2111.03297 [pdf, other]

doi 10.1109/TETC.2021.3102041

RC-RNN: Reconfigurable Cache Architecture for Storage Systems Using Recurrent Neural Networks

Authors: Shahriar Ebrahimi, Reza Salkhordeh, Seyed Ali Osia, Ali Taheri, Hamid Reza Rabiee, Hossein Asadi

Abstract: Solid-State Drives (SSDs) have significant performance advantages over traditional Hard Disk Drives (HDDs) such as lower latency and higher throughput. Significantly higher price per capacity and limited lifetime, however, prevents designers to completely substitute HDDs by SSDs in enterprise storage systems. SSD-based caching has recently been suggested for storage systems to benefit from higher… ▽ More Solid-State Drives (SSDs) have significant performance advantages over traditional Hard Disk Drives (HDDs) such as lower latency and higher throughput. Significantly higher price per capacity and limited lifetime, however, prevents designers to completely substitute HDDs by SSDs in enterprise storage systems. SSD-based caching has recently been suggested for storage systems to benefit from higher performance of SSDs while minimizing the overall cost. While conventional caching algorithms such as Least Recently Used (LRU) provide high hit ratio in processors, due to the highly random behavior of Input/Output (I/O) workloads, they hardly provide the required performance level for storage systems. In addition to poor performance, inefficient algorithms also shorten SSD lifetime with unnecessary cache replacements. Such shortcomings motivate us to benefit from more complex non-linear algorithms to achieve higher cache performance and extend SSD lifetime. In this paper, we propose RC-RNN, the first reconfigurable SSD-based cache architecture for storage systems that utilizes machine learning to identify performance-critical data pages for I/O caching. The proposed architecture uses Recurrent Neural Networks (RNN) to characterize ongoing workloads and optimize itself towards higher cache performance while improving SSD lifetime. RC-RNN attempts to learn characteristics of the running workload to predict its behavior and then uses the collected information to identify performance-critical data pages to fetch into the cache. Experimental results show that RC-RNN characterizes workloads with an accuracy up to 94.6% for SNIA I/O workloads. RC-RNN can perform similarly to the optimal cache algorithm by an accuracy of 95% on average, and outperforms previous SSD caching architectures by providing up to 7x higher hit ratio and decreasing cache replacements by up to 2x. △ Less

Submitted 5 November, 2021; originally announced November 2021.

Comments: Date of Publication: 09 August 2021

Journal ref: IEEE Transactions on Emerging Topics in Computing (2021)

arXiv:2110.00783 [pdf, other]

doi 10.1016/j.asr.2020.03.021

Dynamic-Programming-Based Failure-Tolerant Control for Satellite with Thrusters in 6-DOF Motion

Authors: Abdolreza Taheri, Nima Assadian

Abstract: In this paper, a dynamic-programming approach to the coupled translational and rotational control of thruster-driven spacecraft is studied. To reduce the complexity of the problem, dynamic-programming-based optimal policies are calculated using decoupled position and attitude dynamics with generalized forces and torques as controls. A quadratic-programming-based control allocation is then used to… ▽ More In this paper, a dynamic-programming approach to the coupled translational and rotational control of thruster-driven spacecraft is studied. To reduce the complexity of the problem, dynamic-programming-based optimal policies are calculated using decoupled position and attitude dynamics with generalized forces and torques as controls. A quadratic-programming-based control allocation is then used to map the controls to actuator commands. To control the spacecraft in the event of thruster failure, both the dynamic programming policies and control allocation are reconfigured to cope with the losses in controls. The control allocation parameters are adjusted dynamically to ensure the satellite always approaches the target from the side with two operative thrusters to achieve a stable control. The effectiveness of the proposed dynamic programming control is compared with a Lyapunov-stable control method, which shows that the proposed method is more fuel-efficient in tracking the same path. △ Less

Submitted 2 October, 2021; originally announced October 2021.

Comments: 33 pages, 21 figures, pre-print version, published in Advances in Space Research

Journal ref: Advances in Space Research, Volume 65, Issue 12, 15 June 2020, Pages 2857-2877

arXiv:2109.01186 [pdf, other]

doi 10.1145/3458709.3458946

Exploratory Design of a Hands-free Video Game Controller for a Quadriplegic Individual

Authors: Atieh Taheri, Ziv Weissman, Misha Sra

Abstract: From colored pixels to hyper-realistic 3D landscapes of virtual reality, video games have evolved immensely over the last few decades. However, video game input still requires two-handed dexterous finger manipulations for simultaneous joystick and trigger or mouse and keyboard presses. In this work, we explore the design of a hands-free game control method using realtime facial expression recognit… ▽ More From colored pixels to hyper-realistic 3D landscapes of virtual reality, video games have evolved immensely over the last few decades. However, video game input still requires two-handed dexterous finger manipulations for simultaneous joystick and trigger or mouse and keyboard presses. In this work, we explore the design of a hands-free game control method using realtime facial expression recognition for individuals with neurological and neuromuscular diseases who are unable to use traditional game controllers. Similar to other Assistive Technologies (AT), our facial input technique is also designed and tested in collaboration with a graduate student who has Spinal Muscular Atrophy. Our preliminary evaluation shows the potential of facial expression recognition for augmenting the lives of quadriplegic individuals by enabling them to accomplish things like walking, running, flying or other adventures that may not be so attainable otherwise. △ Less

Submitted 2 September, 2021; originally announced September 2021.

Comments: Published in: Augmented Humans Conference 2021

arXiv:2011.06304 [pdf, other]

Machine Learning Interpretability Meets TLS Fingerprinting

Authors: Mahdi Jafari Siavoshani, Amir Hossein Khajepour, Amirmohammad Ziaei, Amir Ali Gatmiri, Ali Taheri

Abstract: Protecting users' privacy over the Internet is of great importance; however, it becomes harder and harder to maintain due to the increasing complexity of network protocols and components. Therefore, investigating and understanding how data is leaked from the information transmission platforms and protocols can lead us to a more secure environment. In this paper, we propose a framework to systema… ▽ More Protecting users' privacy over the Internet is of great importance; however, it becomes harder and harder to maintain due to the increasing complexity of network protocols and components. Therefore, investigating and understanding how data is leaked from the information transmission platforms and protocols can lead us to a more secure environment. In this paper, we propose a framework to systematically find the most vulnerable information fields in a network protocol. To this end, focusing on the transport layer security (TLS) protocol, we perform different machine-learning-based fingerprinting attacks on the collected data from more than 70 domains (websites) to understand how and where this information leakage occurs in the TLS protocol. Then, by employing the interpretation techniques developed in the machine learning community and applying our framework, we find the most vulnerable information fields in the TLS protocol. Our findings demonstrate that the TLS handshake (which is mainly unencrypted), the TLS record length appearing in the TLS application data header, and the initialization vector (IV) field are among the most critical leaker parts in this protocol, respectively. △ Less

Submitted 12 September, 2021; v1 submitted 12 November, 2020; originally announced November 2020.

arXiv:1910.11771 [pdf]

doi 10.1021/acs.jpcc.1c00073

Multi-objective optimization of the coiled carbon nanotubes regarding their mechanical performance: A reactive molecular dynamics simulation study

Authors: Ehsan Shahini, Fazel Rangriz, Ali Karimi Taheri

Abstract: Coiled Carbon Nanotubes (CCNTs) are increasingly set to become a vital factor in the new generation of nanodevices and energy-absorbing materials due to their outstanding properties. In the following work, the multi-objective optimization of CCNTs is applied regarding their mechanical performances. Apart from finding the best trade-off between conflicting mechanical properties (e.g. yield stress a… ▽ More Coiled Carbon Nanotubes (CCNTs) are increasingly set to become a vital factor in the new generation of nanodevices and energy-absorbing materials due to their outstanding properties. In the following work, the multi-objective optimization of CCNTs is applied regarding their mechanical performances. Apart from finding the best trade-off between conflicting mechanical properties (e.g. yield stress and yield strain), the optimization enables us to find the astonishing CCNTs concerning their stretchability. To the best of our knowledge, these structures have not been recognized before, both experimentally and computationally. Several highly accurate analytical equations are derived by insights from the findings of multi-objective optimization and fitting a theoretical model to the results of Molecular Dynamics (MD) simulations. The structures resulted from optimizations are highly resilient because of two distinct deformation mechanisms depending on the dimensions of CCNTs. For small CCNTs, extraordinary extensibility is mainly contributed by buckling and nanohinge-like deformation with maintaining the inner coil diameter, whereas for large CCNTs this is accomplished by the creation of a straight CNT-like structure in the inner-edge of the CCNT with a helical graphene ribbon twisted around it. These findings would shed light on the design of CCNT based mechanical nanodevices. △ Less

Submitted 25 October, 2019; originally announced October 2019.

Comments: 31 pages, 13 figures, 2 tables

arXiv:1903.09966

A Unified Approach to Mitigate Voltage Jump Effects in Near Optimal Switching Surface Control of DC-DC Converters

Authors: Amir Ghasemian, Asghar Taheri

Abstract: The Equivalent Series Resistance (ESR) of the output capacitor may cause output voltage Vo jumps, that are not modeled commonly for second order DC-DC converters, i.e., converters with two second order switched subsystems. These jump discontinuities in Vo lead to performance issues in Switching Surface (SS) controllers. In this paper, these ESR effects are modeled using switched systems with state… ▽ More The Equivalent Series Resistance (ESR) of the output capacitor may cause output voltage Vo jumps, that are not modeled commonly for second order DC-DC converters, i.e., converters with two second order switched subsystems. These jump discontinuities in Vo lead to performance issues in Switching Surface (SS) controllers. In this paper, these ESR effects are modeled using switched systems with state jumps, called Jump-Flow Switched (JFS) systems. Furthermore, it is shown that approximating the capacitor voltage (Vc), with Vo, can cause undesired limit cycles, oscillations, chattering or instability issues. To resolve these issues, a non-jumping normal switched system is defined for JFS systems, that is equivalent to the internal continuous dynamics. Also, the challenges of designing SS controllers, for this equivalent switched system is studied, and the Constrained Near Optimal (CNO) SS is designed for the equivalent switched system of buck, boost, and buck-boost converters. To eliminate the required estimations, a general class of switching methods are defined, that also avoids chattering and eliminates the conventional hysteresis blocks. The proposed controller is implemented using analog op-amp circuits. Experimental results show fast and robust responses of the controller board with buck, boost, and buck-boost converters. △ Less

Submitted 1 July, 2019; v1 submitted 24 March, 2019; originally announced March 2019.

Comments: The article was published without the co-Author's notice, and it is withdrawn due to his objection

arXiv:1902.01965 [pdf]

doi 10.1103/PhysRevB.99.235425

Phonon thermal transport in \b{eta}-NX (X=P, As, Sb) monolayers: a first-principles study of the interplay between harmonic and anharmonic phonon properties

Authors: Armin Taheri, Carlos Da Silva, Cristina H. Amon

Abstract: The investigation of thermal properties of recently emerged two-dimensional (2D) materials is a necessary step towards fulfilling their potential applications in nano-electronics devices. In this study, the thermal conductivity of novel \b{eta}-NX (X=P, As, Sb) monolayers are investigated using a first-principles density functional theory (DFT) study based on the full solution of the linearized Pe… ▽ More The investigation of thermal properties of recently emerged two-dimensional (2D) materials is a necessary step towards fulfilling their potential applications in nano-electronics devices. In this study, the thermal conductivity of novel \b{eta}-NX (X=P, As, Sb) monolayers are investigated using a first-principles density functional theory (DFT) study based on the full solution of the linearized Peierls-Boltzmann transport equation (PBTE). The results show that the room temperature thermal conductivities of \b{eta}-NP, \b{eta}-NAs, and \b{eta}-NSb are about 1.1, 5.5, and 34.0 times higher than those of single-element \b{eta}-P, \b{eta}-As, and \b{eta}-Sb monolayers, respectively. The phonon transport analysis reveals that higher phonon group velocities as well as phonon lifetimes are responsible for such an enhancement in the lattice thermal conductivities of \b{eta}-NX (X=P, As, Sb) binary compounds compared to single-element group-VA monolayers. We found that \b{eta}-NP has the minimum thermal conductivity among \b{eta}-NX (X=P, As, Sb) monolayers, while it has the minimum average atomic mass, which is in contrast with the common assumption that lower mass systems exhibit higher thermal conductivities. This work demonstrates the trade-off between harmonic and anharmonic phonon properties in determining the variation of the thermal conductivity among \b{eta}-NX (X=P, As, Sb) monolayers. The higher anharmonicity in \b{eta}-NP is found to be responsible for the lower thermal conductivity of this monolayer. △ Less

Submitted 15 March, 2019; v1 submitted 5 February, 2019; originally announced February 2019.

Journal ref: Phys. Rev. B 99, 235425 (2019)

arXiv:1802.03151 [pdf, other]

Deep Private-Feature Extraction

Authors: Seyed Ali Osia, Ali Taheri, Ali Shahin Shamsabadi, Kleomenis Katevas, Hamed Haddadi, Hamid R. Rabiee

Abstract: We present and evaluate Deep Private-Feature Extractor (DPFE), a deep model which is trained and evaluated based on information theoretic constraints. Using the selective exchange of information between a user's device and a service provider, DPFE enables the user to prevent certain sensitive information from being shared with a service provider, while allowing them to extract approved information… ▽ More We present and evaluate Deep Private-Feature Extractor (DPFE), a deep model which is trained and evaluated based on information theoretic constraints. Using the selective exchange of information between a user's device and a service provider, DPFE enables the user to prevent certain sensitive information from being shared with a service provider, while allowing them to extract approved information using their model. We introduce and utilize the log-rank privacy, a novel measure to assess the effectiveness of DPFE in removing sensitive information and compare different models based on their accuracy-privacy tradeoff. We then implement and evaluate the performance of DPFE on smartphones to understand its complexity, resource demands, and efficiency tradeoffs. Our results on benchmark image datasets demonstrate that under moderate resource utilization, DPFE can achieve high accuracy for primary tasks while preserving the privacy of sensitive features. △ Less

Submitted 28 February, 2018; v1 submitted 9 February, 2018; originally announced February 2018.

arXiv:1710.01727 [pdf, ps, other]

Privacy-Preserving Deep Inference for Rich User Data on The Cloud

Authors: Seyed Ali Osia, Ali Shahin Shamsabadi, Ali Taheri, Kleomenis Katevas, Hamid R. Rabiee, Nicholas D. Lane, Hamed Haddadi

Abstract: Deep neural networks are increasingly being used in a variety of machine learning applications applied to rich user data on the cloud. However, this approach introduces a number of privacy and efficiency challenges, as the cloud operator can perform secondary inferences on the available data. Recently, advances in edge processing have paved the way for more efficient, and private, data processing… ▽ More Deep neural networks are increasingly being used in a variety of machine learning applications applied to rich user data on the cloud. However, this approach introduces a number of privacy and efficiency challenges, as the cloud operator can perform secondary inferences on the available data. Recently, advances in edge processing have paved the way for more efficient, and private, data processing at the source for simple tasks and lighter models, though they remain a challenge for larger, and more complicated models. In this paper, we present a hybrid approach for breaking down large, complex deep models for cooperative, privacy-preserving analytics. We do this by breaking down the popular deep architectures and fine-tune them in a particular way. We then evaluate the privacy benefits of this approach based on the information exposed to the cloud service. We also asses the local inference cost of different layers on a modern handset for mobile applications. Our evaluations show that by using certain kind of fine-tuning and embedding techniques and at a small processing costs, we can greatly reduce the level of information available to unintended tasks applied to the data feature on the cloud, and hence achieving the desired tradeoff between privacy and performance. △ Less

Submitted 11 October, 2017; v1 submitted 4 October, 2017; originally announced October 2017.

Comments: arXiv admin note: substantial text overlap with arXiv:1703.02952

arXiv:1703.02952 [pdf, other]

doi 10.1109/JIOT.2020.2967734

A Hybrid Deep Learning Architecture for Privacy-Preserving Mobile Analytics

Authors: Seyed Ali Osia, Ali Shahin Shamsabadi, Sina Sajadmanesh, Ali Taheri, Kleomenis Katevas, Hamid R. Rabiee, Nicholas D. Lane, Hamed Haddadi

Abstract: Internet of Things (IoT) devices and applications are being deployed in our homes and workplaces. These devices often rely on continuous data collection to feed machine learning models. However, this approach introduces several privacy and efficiency challenges, as the service operator can perform unwanted inferences on the available data. Recently, advances in edge processing have paved the way f… ▽ More Internet of Things (IoT) devices and applications are being deployed in our homes and workplaces. These devices often rely on continuous data collection to feed machine learning models. However, this approach introduces several privacy and efficiency challenges, as the service operator can perform unwanted inferences on the available data. Recently, advances in edge processing have paved the way for more efficient, and private, data processing at the source for simple tasks and lighter models, though they remain a challenge for larger, and more complicated models. In this paper, we present a hybrid approach for breaking down large, complex deep neural networks for cooperative, privacy-preserving analytics. To this end, instead of performing the whole operation on the cloud, we let an IoT device to run the initial layers of the neural network, and then send the output to the cloud to feed the remaining layers and produce the final result. In order to ensure that the user's device contains no extra information except what is necessary for the main task and preventing any secondary inference on the data, we introduce Siamese fine-tuning. We evaluate the privacy benefits of this approach based on the information exposed to the cloud service. We also assess the local inference cost of different layers on a modern handset. Our evaluations show that by using Siamese fine-tuning and at a small processing cost, we can greatly reduce the level of unnecessary, potentially sensitive information in the personal data, and thus achieving the desired trade-off between utility, privacy, and performance. △ Less

Submitted 26 December, 2019; v1 submitted 8 March, 2017; originally announced March 2017.

Comments: To appear in IEEE Internet of Things Journal

Journal ref: IEEE Internet of Things Journal, May 2020

arXiv:1702.01564 [pdf, other]

On Weyl's asymptotics and remainder term for the orthogonal and unitary groups

Authors: Chalres Morris, Ali Taheri

Abstract: We examine the asymptotics of the spectral counting function of a compact Riemannian manifold by V.G.~Avakumovic \cite{Avakumovic} and L.~Hörmander \cite{Hormander-eigen} and show that for the scale of orthogonal and unitary groups ${\bf SO}(N)$, ${\bf SU}(N)$, ${\bf U}(N)$ and ${\bf Spin}(N)$ it is not sharp. While for negative sectional curvature improvements are possible and known, {\it cf.} e.… ▽ More We examine the asymptotics of the spectral counting function of a compact Riemannian manifold by V.G.~Avakumovic \cite{Avakumovic} and L.~Hörmander \cite{Hormander-eigen} and show that for the scale of orthogonal and unitary groups ${\bf SO}(N)$, ${\bf SU}(N)$, ${\bf U}(N)$ and ${\bf Spin}(N)$ it is not sharp. While for negative sectional curvature improvements are possible and known, {\it cf.} e.g., J.J.~Duistermaat $\&$ V.~Guillemin \cite{Duist-Guill}, here, we give sharp and contrasting examples in the positive Ricci curvature case [non-negative for ${\bf U}(N)$]. Furthermore here the improvements are sharp and quantitative relating to the dimension and {\it rank} of the group. We discuss the implications of these results on the closely related problem of closed geodesics and the length spectrum. △ Less

Submitted 6 February, 2017; originally announced February 2017.

arXiv:1701.07987 [pdf, other]

Twist maps as energy minimisers in homotopy classes: symmetrisation and the coarea formula

Authors: Charles Morris, Ali Taheri

Abstract: Let $\X = \X[a, b] = \{x: a<|x|<b\}\subset \R^n$ with $0<a<b<\infty$ fixed be an open annulus and consider the energy functional, \begin{equation*} {\mathbb F} [u; \X] = \frac{1}{2} \int_\X \frac{|\nabla u|^2}{|u|^2} \, dx, \end{equation*} over the space of admissible incompressible Sobolev maps \begin{equation*} {\mathcal A}_φ(\X) = \bigg\{ u \in W^{1,2}(\X, \R^n) : \det \nabla u = 1 \text{ {\it… ▽ More Let $\X = \X[a, b] = \{x: a<|x|<b\}\subset \R^n$ with $0<a<b<\infty$ fixed be an open annulus and consider the energy functional, \begin{equation*} {\mathbb F} [u; \X] = \frac{1}{2} \int_\X \frac{|\nabla u|^2}{|u|^2} \, dx, \end{equation*} over the space of admissible incompressible Sobolev maps \begin{equation*} {\mathcal A}_φ(\X) = \bigg\{ u \in W^{1,2}(\X, \R^n) : \det \nabla u = 1 \text{ {\it a.e.} in $\X$ and $u|_{\partial \X} = φ$} \bigg\}, \end{equation*} where $φ$ is the identity map of $\overline \X$. Motivated by the earlier works \cite{TA2, TA3} in this paper we examine the {\it twist} maps as extremisers of ${\mathbb F}$ over ${\mathcal A}_φ(\X)$ and investigate their minimality properties by invoking the coarea formula and a symmetrisation argument. In the case $n=2$ where ${\mathcal A}_φ(\X)$ is a union of infinitely many disjoint homotopy classes we establish the minimality of these extremising twists in their respective homotopy classes a result that then leads to the latter twists being $L^1$-local minimisers of ${\mathbb F}$ in ${\mathcal A}_φ(\X)$. We discuss variants and extensions to higher dimensions as well as to related energy functionals. △ Less

Submitted 27 January, 2017; originally announced January 2017.

Comments: 32 pages, 1 figure

arXiv:1603.05153 [pdf]

doi 10.1016/j.cap.2016.05.024

Molecular dynamics simulation of nanoindentation on nanocomposite pearlite

Authors: Hadi Ghaffarian, Ali Karimi Taheri, Seunghwa Ryu, Keonwook Kang

Abstract: We carry out molecular dynamics simulations of nanoindentation to investigate the effect of cementite size and temperature on the deformation behavior of nanocomposite pearlite composed of alternating ferrite and cementite layers. We find that, instead of the coherent transmission, dislocation propagates by forming a widespread plastic deformation in cementite layer. We also show that increasing t… ▽ More We carry out molecular dynamics simulations of nanoindentation to investigate the effect of cementite size and temperature on the deformation behavior of nanocomposite pearlite composed of alternating ferrite and cementite layers. We find that, instead of the coherent transmission, dislocation propagates by forming a widespread plastic deformation in cementite layer. We also show that increasing temperature enhances the distribution of plastic strain in the ferrite layer, which reduces the stress acting on the cementite layer. Hence, thickening cementite layer or increasing temperature reduces the likelihood of dislocation propagation through the cementite layer. Our finding sheds a light on the mechanism of dislocation blocking by cementite layer in the pearlite. △ Less

Submitted 16 March, 2016; originally announced March 2016.

Journal ref: Current Applied Physics Volume 16, Issue 9, September 2016, Pages 1015-1025

Showing 1–28 of 28 results for author: Taheri, A