Determining the geographical origin of kimchi holds significance because of the considerable variation in quality and price among kimchi products from different regions. This study explored the feasibility of employing Fourier transform near-infrared spectroscopy in conjunction with supervised chemometric techniques to differentiate domestic and imported kimchi products. A total of 30 domestic and 30 imported kimchi products were used to build datasets. Three categories of preprocessing methods such as scattering correction (multiplicative signal correction and standard normal variate), spectral derivatives (the first derivative and the second derivative), and data smoothing (Savitzky-Golay filtering and Norris derivative filtering) were used. K-nearest neighbors, support vector machine, random forest, and partial least squares-discriminant analysis were employed. By appropriately preprocessing spectral data, these four methods successfully distinguished between the two sample groups based on their origin. Notably, the k-nearest neighbors method exhibited exceptional performance, accurately classifying the sample groups irrespective of the preprocessing method employed and swiftly achieving this classification. In comparison, classification and regression tree as well as naïve Bayes methods were outperformed by the aforementioned four classification techniques. Particularly, the efficiency and accuracy of the k-nearest neighbors method make it the most recommended chemometric tool for determining the geographical origins of kimchi.
Keywords: Data preprocessing; Discrimination; K-nearest neighbors; Partial least squares-discriminant analysis; Supervised classification; Support vector machine.
© 2024. The Author(s).