Cotton is a major economic crop predominantly cultivated under rainfed situations. The accurate prediction of cotton yield invariably helps farmers, industries, and policy makers. The final cotton yield is mostly determined by the weather patterns that prevail during the crop growing phase. Crop yield prediction with greater accuracy is possible due to the development of innovative technologies which analyses the bigdata with its high-performance computing abilities. Machine learning technologies can make yield prediction reasonable and faster and with greater flexibility than process based complex crop simulation models. The present study demonstrates the usability of ML algorithms for yield forecasting and facilitates the comparison of different models. The cotton yield was simulated by employing the weekly weather indices as inputs and the model performance was assessed by nRMSE, MAPE and EF values. Results show that stacked generalised ensemble model and artificial neural networks predicted the cotton yield with lower nRMSE, MAPE and higher efficiency compared to other models. Variable importance studies in LASSO and ENET model found minimum temperature and relative humidity as the main determinates of cotton yield in all districts. The models were ranked based these performance metrics in the order of Stacked generalised ensemble > ANN > PCA ANN > SMLR ANN > LASSO> ENET > SVM > PCA SMLR > SMLR SVM > SMLR. This study shows that stacked generalised ensembling and ANN method can be used for reliable yield forecasting at district or county level and helps stakeholders in timely decision-making.
Keywords: Cotton; Ensembles; Machine learning; Neural network; Yield forecasting.
© 2024. The Author(s) under exclusive licence to International Society of Biometeorology.