Reinforcement learning algorithm for improving speed response of a five-phase permanent magnet synchronous motor based model predictive control

Ahmed M Hassan; Jafar Ababneh; Hani Attar; Tamer Shamseldin; Ahmed Abdelbaset; Mohamed Eladly Metwally

doi:10.1371/journal.pone.0316326

PLoS One. 2025 Jan 3;20(1):e0316326. doi: 10.1371/journal.pone.0316326. eCollection 2025.

Authors

Ahmed M Hassan¹,², Jafar Ababneh³, Hani Attar⁴,⁵, Tamer Shamseldin⁶, Ahmed Abdelbaset², Mohamed Eladly Metwally²

Affiliations

¹ Department of Electrical Power and Machines Engineering, Faculty of Engineering, Benha University, Shoubra, Cairo, Egypt.
² Department of Electrical Power and Machines Engineering, Higher Institute of Engineering (HIE), El-Shorouk Academy, El-Shorouk City, Egypt.
³ Cyber Security Department, Faculty of Information Technology, Zarqa University, Zarqa, Jordan.
⁴ Faculty of Engineering, Zarqa University, Zarqa, Jordan.
⁵ College of Engineering, University of Business and Technology, Jeddah, Saudi Arabia.
⁶ Technical Research Center, Cairo, Egypt.

Abstract

Enhancing the control performance of the five-phase interior permanent magnet synchronous motor (5ph-IPMSM) plays a crucial role in advancing innovative applications such as electric vehicles. This paper proposes a new reinforcement learning (RL) control algorithm, based on the twin-delayed deep deterministic policy gradient (TD3) algorithm, to tune two cascaded PI controllers in a 5ph-IPMSM drive system based on model predictive control (MPC). The main purpose of the control methodology is to optimize the 5ph-IPMSM speed response in either the constant torque region or the constant power region. The speed responses obtained using the RL control algorithm are compared with those obtained using four recent metaheuristic optimization techniques (MHOT): Transit Search (TS), Honey Badger Algorithm (HBA), Dwarf Mongoose (DM), and Dandelion Optimizer (DO). The speed responses are compared in terms of settling time, rise time, maximum time, and maximum overshoot percentage. The suggested RL-based TD3 algorithm gives the minimum settling time and relatively low values for the rise time, maximum time, and overshoot percentage, so the RL approach provides superior speed responses compared with those obtained from the four MHOT. The drive system speed responses in the constant torque region and the constant power region are obtained using the MATLAB Simulink package.
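The four comparison metrics named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code (the paper reports using MATLAB Simulink), and it assumes common textbook definitions that the abstract does not spell out: rise time from 10% to 90% of the target, a 2% settling band, and "maximum time" read as the time of the response peak. The function name `step_metrics` and the second-order test response (damping ratio 0.5, natural frequency 10 rad/s) are hypothetical and chosen only for illustration.

```python
import numpy as np


def step_metrics(t, y, final, band=0.02):
    """Rise time, peak time, overshoot (%), and settling time of a step response.

    Assumes the response starts near zero and `final` is a positive target.
    If the response never reaches a threshold, np.argmax returns index 0,
    so the corresponding time is not meaningful in that case.
    """
    t = np.asarray(t, dtype=float)
    y = np.asarray(y, dtype=float)

    # Rise time: first crossing of 10% to first crossing of 90% of the target.
    t10 = t[np.argmax(y >= 0.1 * final)]
    t90 = t[np.argmax(y >= 0.9 * final)]

    # Peak ("maximum") time and percentage overshoot above the target.
    i_peak = int(np.argmax(y))
    overshoot_pct = max(0.0, (y[i_peak] - final) / final * 100.0)

    # Settling time: first sample after the last excursion outside the band.
    outside = np.abs(y - final) > band * abs(final)
    if outside.any():
        last_out = int(np.nonzero(outside)[0][-1])
        settling = t[min(last_out + 1, len(t) - 1)]
    else:
        settling = t[0]

    return {
        "rise_time": float(t90 - t10),
        "peak_time": float(t[i_peak]),
        "overshoot_pct": float(overshoot_pct),
        "settling_time": float(settling),
    }


# Hypothetical test signal: unit-step response of an underdamped
# second-order system (zeta and wn are illustrative, not from the paper).
zeta, wn = 0.5, 10.0
wd = wn * np.sqrt(1.0 - zeta**2)
t = np.arange(0.0, 2.0, 0.001)
y = 1.0 - np.exp(-zeta * wn * t) / np.sqrt(1.0 - zeta**2) * np.sin(
    wd * t + np.arccos(zeta)
)

metrics = step_metrics(t, y, final=1.0)
print(metrics)
```

Applied to a simulated speed trace, the same function would give one set of metrics per tuning method, which is the kind of side-by-side comparison the abstract describes.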

Copyright: © 2025 Hassan et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

MeSH terms

Algorithms*
Magnets
Models, Theoretical
Reinforcement, Psychology

Grants and funding

The author(s) received no specific funding for this work.