Artificial intelligence has revolutionized many sectors with unparalleled predictive capabilities supported by machine learning (ML). So far, this tool has not been able to provide the same level of development in pharmaceutical nanotechnology. This review discusses the current data science methodologies related to polymeric drug-loaded nanoparticle production from an innovative multidisciplinary perspective while considering the strictest data science practices. Several methodological and data interpretation flaws were identified by analyzing the few qualified ML studies. Most issues lie in following appropriate analysis steps, such as cross-validation, balancing data, or testing alternative models. Thus, better-planned studies following the recommended data science analysis steps along with adequate numbers of experiments would change the current landscape, allowing the exploration of the full potential of ML.
Keywords: artificial neural network; data mining; data science; machine learning; polymeric nanoparticle; quality by design.
[Box: see text].