Machine learning and design of experiments with an application to product innovation in the chemical industry

J Appl Stat. 2021 Mar 26;49(10):2674-2699. doi: 10.1080/02664763.2021.1907840. eCollection 2022.

Abstract

Industrial statistics plays a major role in the areas of both quality management and innovation. However, existing methodologies must be integrated with the latest tools from the field of Artificial Intelligence. To this end, a background on the joint application of Design of Experiments (DOE) and Machine Learning (ML) methodologies in industrial settings is presented here, along with a case study from the chemical industry. A DOE study is used to collect data, and two ML models are applied to predict responses which performance show an advantage over the traditional modeling approach. Emphasis is placed on causal investigation and quantification of prediction uncertainty, as these are crucial for an assessment of the goodness and robustness of the models developed. Within the scope of the case study, the models learned can be implemented in a semi-automatic system that can assist practitioners who are inexperienced in data analysis in the process of new product development.

Keywords: Experimental design; R&D; artificial neural networks; product development; random forests.