Sign in to use this feature.

Years

Between: -

Subjects

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Journals

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Article Types

Countries / Regions

remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline
remove_circle_outline

Search Results (47)

Search Parameters:
Keywords = quantile random forest

Order results
Result details
Results per page
Select all
Export citation of selected articles as:
16 pages, 533 KiB  
Article
Regularizing Lifetime Drift Prediction in Semiconductor Electrical Parameters with Quantile Random Forest Regression
by Lukas Sommeregger and Jürgen Pilz
Technologies 2024, 12(9), 165; https://doi.org/10.3390/technologies12090165 - 13 Sep 2024
Abstract
Semiconductors play a crucial role in a wide range of applications and are integral to essential infrastructures. Manufacturers of these semiconductors must meet specific quality and lifetime targets. To estimate the lifetime of semiconductors, accelerated stress tests are conducted. This paper introduces a [...] Read more.
Semiconductors play a crucial role in a wide range of applications and are integral to essential infrastructures. Manufacturers of these semiconductors must meet specific quality and lifetime targets. To estimate the lifetime of semiconductors, accelerated stress tests are conducted. This paper introduces a novel approach to modeling drift in discrete electrical parameters within stress test devices. It incorporates a machine learning (ML) approach for arbitrary panel data sets of electrical parameters from accelerated stress tests. The proposed model involves an expert-in-the-loop MLOps decision process, allowing experts to choose between an interpretable model and a robust ML algorithm for regularization and fine-tuning. The model addresses the issue of outliers influencing statistical models by employing regularization techniques. This ensures that the model’s accuracy is not compromised by outliers. The model uses interpretable statistically calculated limits for lifetime drift and uncertainty as input data. It then predicts these limits for new lifetime stress test data of electrical parameters from the same technology. The effectiveness of the model is demonstrated using anonymized real data from Infineon technologies. The model’s output can help prioritize parameters by the level of significance for indication of degradation over time, providing valuable insights for the analysis and improvement of electrical devices. The combination of explainable statistical algorithms and ML approaches enables the regularization of quality control limit calculations and the detection of lifetime drift in stress test parameters. This information can be used to enhance production quality by identifying significant parameters that indicate degradation and detecting deviations in production processes. Full article
Show Figures

Figure 1

15 pages, 23629 KiB  
Article
Machine Learning Methods for Evaluation of Technical Factors of Spraying in Permanent Plantations
by Vjekoslav Tadić, Dorijan Radočaj and Mladen Jurišić
Agronomy 2024, 14(9), 1977; https://doi.org/10.3390/agronomy14091977 - 1 Sep 2024
Viewed by 320
Abstract
Considering the demand for the optimization of the technical factors of spraying for a greater area coverage and minimal drift, field tests were carried out to determine the interaction between the area coverage, number of droplets per cm2, droplet diameter, and [...] Read more.
Considering the demand for the optimization of the technical factors of spraying for a greater area coverage and minimal drift, field tests were carried out to determine the interaction between the area coverage, number of droplets per cm2, droplet diameter, and drift. The studies were conducted with two different types of sprayers (axial and radial fan) in an apple orchard and a vineyard. The technical factors of the spraying interactions were nozzle type (ISO code 015, code 02, and code 03), working speed (6 and 8 km h−1), and spraying norm (250–400 L h−1). The airflow of both sprayers was adjusted to the plantation leaf mass and the working pressure was set for each repetition separately. A method using water-sensitive paper and a digital image analysis was used to collect data on coverage factors. The data from the field research were processed using four machine learning models: quantile random forest (QRF), support vector regression with radial basis function kernel (SVR), Bayesian Regularization for Feed-Forward Neural Networks (BRNN), and Ensemble Machine Learning (ENS). Nozzle type had the highest predictive value for the properties of number of droplets per cm2 (axial = 69.1%; radial = 66.0%), droplet diameter (axial = 30.6%; radial = 38.2%), and area coverage (axial = 24.6%; radial = 34.8%). Spraying norm had the greatest predictive value for area coverage (axial = 43.3%; radial = 26.9%) and drift (axial = 72.4%; radial = 62.3%). Greater coverage of the treated area and a greater number of droplets were achieved with the radial sprayer, as well as less drift. The accuracy of the machine learning model for the prediction of the treated surface showed a satisfactory accuracy for most properties (R2 = 0.694–0.984), except for the estimation of the droplet diameter for an axial sprayer (R2 = 0.437–0.503). Full article
Show Figures

Figure 1

28 pages, 1606 KiB  
Article
Evaluating Wind Speed Forecasting Models: A Comparative Study of CNN, DAN2, Random Forest and XGBOOST in Diverse South African Weather Conditions
by Fhulufhelo Walter Mugware, Caston Sigauke and Thakhani Ravele
Forecasting 2024, 6(3), 672-699; https://doi.org/10.3390/forecast6030035 - 19 Aug 2024
Viewed by 756
Abstract
The main source of electricity worldwide stems from fossil fuels, contributing to air pollution, global warming, and associated adverse effects. This study explores wind energy as a potential alternative. Nevertheless, the variable nature of wind introduces uncertainty in its reliability. Thus, it is [...] Read more.
The main source of electricity worldwide stems from fossil fuels, contributing to air pollution, global warming, and associated adverse effects. This study explores wind energy as a potential alternative. Nevertheless, the variable nature of wind introduces uncertainty in its reliability. Thus, it is necessary to identify an appropriate machine learning model capable of reliably forecasting wind speed under various environmental conditions. This research compares the effectiveness of Dynamic Architecture for Artificial Neural Networks (DAN2), convolutional neural networks (CNN), random forest and XGBOOST in predicting wind speed across three locations in South Africa, characterised by different weather patterns. The forecasts from the four models were then combined using quantile regression averaging models, generalised additive quantile regression (GAQR) and quantile regression neural networks (QRNN). Empirical results show that CNN outperforms DAN2 in accurately forecasting wind speed under different weather conditions. This superiority is likely due to the inherent architectural attributes of CNNs, including feature extraction capabilities, spatial hierarchy learning, and resilience to spatial variability. The results from the combined forecasts were comparable with those from the QRNN, which was slightly better than those from the GAQR model. However, the combined forecasts were more accurate than the individual models. These results could be useful to decision-makers in the energy sector. Full article
Show Figures

Figure 1

20 pages, 19235 KiB  
Article
Utilizing Machine Learning Algorithms for the Development of Gully Erosion Susceptibility Maps: Evidence from the Chotanagpur Plateau Region, India
by Md Hasanuzzaman, Pravat Kumar Shit, Saeed Alqadhi, Hussein Almohamad, Fahdah Falah ben Hasher, Hazem Ghassan Abdo and Javed Mallick
Sustainability 2024, 16(15), 6569; https://doi.org/10.3390/su16156569 - 31 Jul 2024
Viewed by 647
Abstract
Gully erosion is a serious environmental threat, compromising soil health, damaging agricultural lands, and destroying vital infrastructure. Pinpointing regions prone to gully erosion demands careful selection of an appropriate machine learning algorithm. This choice is crucial, as the complex interplay of various environmental [...] Read more.
Gully erosion is a serious environmental threat, compromising soil health, damaging agricultural lands, and destroying vital infrastructure. Pinpointing regions prone to gully erosion demands careful selection of an appropriate machine learning algorithm. This choice is crucial, as the complex interplay of various environmental factors contributing to gully formation requires a nuanced analytical approach. To develop the most accurate Gully Erosion Susceptibility Map (GESM) for India’s Raiboni River basin, researchers harnessed the power of two cutting-edge machine learning algorithm: Extreme Gradient Boosting (XGBoost) and Random Forest (RF). For a comprehensive analysis, this study integrated 24 potential control factors. We meticulously investigated a dataset of 200 samples, ensuring an even balance between non-gullied and gullied locations. To assess multicollinearity among the 24 variables, we employed two techniques: the Information Gain Ratio (IGR) test and Variance Inflation Factors (VIF). Elevation, land use, river proximity, and rainfall most influenced the basin’s GESM. Rigorous tests validated XGBoost and RF model performance. XGBoost surpassed RF (ROC 86% vs. 83.1%). Quantile classification yielded a GESM with five levels: very high to very low. Our findings reveal that roughly 12% of the basin area is severely affected by gully erosion. These findings underscore the critical need for targeted interventions in these highly susceptible areas. Furthermore, our analysis of gully characteristics unveiled a predominance of V-shaped gullies, likely in an active developmental stage, supported by an average Shape Index (SI) value of 0.26 and a mean Erosivness Index (EI) of 0.33. This research demonstrates the potential of machine learning to pinpoint areas susceptible to gully erosion. By providing these valuable insights, policymakers can make informed decisions regarding sustainable land management practices. Full article
Show Figures

Figure 1

27 pages, 27911 KiB  
Article
Enhancing the Performance of Landslide Susceptibility Mapping with Frequency Ratio and Gaussian Mixture Model
by Wenchao Huangfu, Haijun Qiu, Weicheng Wu, Yaozu Qin, Xiaoting Zhou, Yang Zhang, Mohib Ullah and Yanfen He
Land 2024, 13(7), 1039; https://doi.org/10.3390/land13071039 - 10 Jul 2024
Viewed by 542
Abstract
A rational landslide susceptibility mapping (LSM) can minimize the losses caused by landslides and enhance the efficiency of disaster prevention and reduction. At present, frequency ratio (FR), information value (IV), and certainty factor (CF) are widely used to quantify the relationships between landslides [...] Read more.
A rational landslide susceptibility mapping (LSM) can minimize the losses caused by landslides and enhance the efficiency of disaster prevention and reduction. At present, frequency ratio (FR), information value (IV), and certainty factor (CF) are widely used to quantify the relationships between landslides and their causative factors; however, it remains unclear which method is the most effective. Moreover, existing landslide susceptibility zoning methods lack full automation; thus, the results are full of uncertainties. To address this, the FR, IV, and CF were used to analyze the relationship between landslides and causative factors. Subsequently, three distinct sets of models were developed, namely random forest models (RF_FR, RF_IV, and RF_CF), support vector machine models (SVM_FR, SVM_IV, and SVM_CF), and logistic regression models (LR_FR, LR_IV, and LR_CF) using the analysis results as inputs. A Gaussian mixture model (GMM) was introduced as a new method for landslide susceptibility zoning, classifying the LSM into five distinct levels. An accuracy evaluation of the models and a rationality analysis of the LSM indicated that the FR is superior to the IV and CF in quantifying the relationship between landslides and causative factors. Additionally, the quantile method was employed as a comparative approach to the GMM, further validating the effectiveness of the GMM. This research contributes to more effective and efficient LSM, ultimately enhancing landslide prevention measures. Full article
Show Figures

Figure 1

22 pages, 17359 KiB  
Article
Comparison of QRNN and QRF Models in Forest Biomass Estimation Based on the Screening of VIs Using an Equidistant Quantile Method
by Xiao Xu, Xiaoli Zhang, Shouyun Shen and Guangyu Zhu
Forests 2024, 15(5), 782; https://doi.org/10.3390/f15050782 - 29 Apr 2024
Viewed by 768
Abstract
The investigation of a potential correlation between the filtered-out vegetation index and forest aboveground biomass (AGB) using the conventional variables screening method is crucial for enhancing the estimation accuracy. In this study, we examined the Pinus densata forests in Shangri-La and utilized 31 [...] Read more.
The investigation of a potential correlation between the filtered-out vegetation index and forest aboveground biomass (AGB) using the conventional variables screening method is crucial for enhancing the estimation accuracy. In this study, we examined the Pinus densata forests in Shangri-La and utilized 31 variables to establish quantile regression models for the AGB across 19 quantiles. The key variables associated with biomass were based on their significant correlation with the AGB in different quantiles, and the QRNN and QRF models were constructed accordingly. Furthermore, the optimal quartile models yielding the minimum mean error were combined as the best QRF (QRFb) and QRNN (QRNNb). The results were as follows: (1) certain bands exhibited significant relationships with the AGB in specific quantiles, highlighting the importance of band selection. (2) The vegetation index involving the band of blue and SWIR was more suitable for estimating the Pinus densata. (3) Both the QRNN and QRF models demonstrated their optimal performance in the 0.5 quantiles, with respective R2 values of 0.68 and 0.7. Moreover, the QRNNb achieved a high R2 value of 0.93, while the QRFb attained an R2 value of 0.86, effectively reducing the underestimation and overestimation. Overall, this research provides valuable insights into the variable screening methods that enhance estimation accuracy and mitigate underestimation and overestimation issues. Full article
Show Figures

Figure 1

23 pages, 4178 KiB  
Article
A Machine Learning-Based High-Resolution Soil Moisture Mapping and Spatial–Temporal Analysis: The mlhrsm Package
by Yuliang Peng, Zhengwei Yang, Zhou Zhang and Jingyi Huang
Agronomy 2024, 14(3), 421; https://doi.org/10.3390/agronomy14030421 - 22 Feb 2024
Cited by 1 | Viewed by 1764
Abstract
Soil moisture is a key environmental variable. There is a lack of software to facilitate non-specialists in estimating and analyzing soil moisture at the field scale. This study presents a new open-sourced R package mlhrsm, which can be used to generate Machine [...] Read more.
Soil moisture is a key environmental variable. There is a lack of software to facilitate non-specialists in estimating and analyzing soil moisture at the field scale. This study presents a new open-sourced R package mlhrsm, which can be used to generate Machine Learning-based high-resolution (30 to 500 m, daily to monthly) soil moisture maps and uncertainty estimates at selected sites across the contiguous USA at 0–5 cm and 0–1 m. The model is based on the quantile random forest algorithm, integrating in situ soil sensors, satellite-derived land surface parameters (vegetation, terrain, and soil), and satellite-based models of surface and rootzone soil moisture. It also provides functions for spatial and temporal analysis of the produced soil moisture maps. A case study is provided to demonstrate the functionality to generate 30 m daily to weekly soil moisture maps across a 70-ha crop field, followed by a spatial–temporal analysis. Full article
Show Figures

Figure 1

26 pages, 13918 KiB  
Article
Short-Term Photovoltaic Power Forecasting Based on a Feature Rise-Dimensional Two-Layer Ensemble Learning Model
by Hui Wang, Su Yan, Danyang Ju, Nan Ma, Jun Fang, Song Wang, Haijun Li, Tianyu Zhang, Yipeng Xie and Jun Wang
Sustainability 2023, 15(21), 15594; https://doi.org/10.3390/su152115594 - 3 Nov 2023
Cited by 5 | Viewed by 945
Abstract
Photovoltaic (PV) power generation has brought about enormous economic and environmental benefits, promoting sustainable development. However, due to the intermittency and volatility of PV power, the high penetration rate of PV power generation may pose challenges to the planning and operation of power [...] Read more.
Photovoltaic (PV) power generation has brought about enormous economic and environmental benefits, promoting sustainable development. However, due to the intermittency and volatility of PV power, the high penetration rate of PV power generation may pose challenges to the planning and operation of power systems. Accurate PV power forecasting is crucial for the safe and stable operation of the power grid. This paper proposes a short-term PV power forecasting method using K-means clustering, ensemble learning (EL), a feature rise-dimensional (FRD) approach, and quantile regression (QR) to improve the accuracy of deterministic and probabilistic forecasting of PV power. The K-means clustering algorithm was used to construct weather categories. The EL method was used to construct a two-layer ensemble learning (TLEL) model based on the eXtreme gradient boosting (XGBoost), random forest (RF), CatBoost, and long short-term memory (LSTM) models. The FRD approach was used to optimize the TLEL model, construct the FRD-XGBoost-LSTM (R-XGBL), FRD-RF-LSTM (R-RFL), and FRD-CatBoost-LSTM (R-CatBL) models, and combine them with the results of the TLEL model using the reciprocal error method, in order to obtain the deterministic forecasting results of the FRD-TLEL model. The QR was used to obtain probability forecasting results with different confidence intervals. The experiments were conducted with data at a time level of 15 min from the Desert Knowledge Australia Solar Center (DKASC) to forecast the PV power of a certain day. Compared to other models, the proposed FRD-TLEL model has the lowest root mean square error (RMSE) and mean absolute percentage error (MAPE) in different seasons and weather types. In probability interval forecasting, the 95%, 75%, and 50% confidence intervals all have good forecasting intervals. The results indicate that the proposed PV power forecasting method exhibits a superior performance in forecasting accuracy compared to other methods. Full article
Show Figures

Figure 1

21 pages, 3195 KiB  
Article
Assessment of Potential Prediction and Calibration Methods of Crown Width for Dahurian Larch (Larix gmelinii Rupr.) in Northeastern China
by Suoming Liu, Junjie Wang and Lichun Jiang
Forests 2023, 14(10), 2022; https://doi.org/10.3390/f14102022 - 9 Oct 2023
Viewed by 816
Abstract
Crown width (CW) is an important indicator for assessing tree health, vitality, and stability, as well as being used to predict forestry models and evaluate forest dynamics. However, acquiring CW data is laborious and time-consuming, making it crucial to establish a convenient and [...] Read more.
Crown width (CW) is an important indicator for assessing tree health, vitality, and stability, as well as being used to predict forestry models and evaluate forest dynamics. However, acquiring CW data is laborious and time-consuming, making it crucial to establish a convenient and accurate CW prediction model for forest management. In this study, we developed three models capable of conducting calibration: generalized models (GM), quantile regression models (QR), and mixed-effects models (MIXED). The aim was to effectively improve the prediction accuracy of CW using data from Dahurian larch (Larix gmelinii Rupr.) in Northeastern China. Different sampling designs were applied, including selecting the thickest, thinnest, intermediate, and random trees, with 1 to 10 sample trees for each design. The results showed that all models achieved accurate CW predictions. MIXED displayed the most superior fitting statistics than GM and QR. In model validation, with the increase in the number of sample trees, the model prediction accuracy gradually improved and the model differences gradually reduced. MIXED produced the smallest RMSE, MAE, and MAPE across all sampling designs. The intermediate tree sampling design with the best validation statistics for the given sample size was selected as the final sampling design. Under intermediate tree sampling design, MIXED required a minimum of five sample trees, while GM and QR required at least five and six sample trees for calibration, respectively. Generally, we suggested selecting MIXED as the final CW prediction model and using the intermediate tree sampling design of five trees per plot. This study could provide ideas and support for forest managers to accurately and efficiently predict CW. Full article
(This article belongs to the Section Forest Inventory, Modeling and Remote Sensing)
Show Figures

Figure 1

17 pages, 33700 KiB  
Article
Comparing Algorithms for Estimation of Aboveground Biomass in Pinus yunnanensis
by Tianbao Huang, Guanglong Ou, Hui Xu, Xiaoli Zhang, Yong Wu, Zihao Liu, Fuyan Zou, Chen Zhang and Can Xu
Forests 2023, 14(9), 1742; https://doi.org/10.3390/f14091742 - 28 Aug 2023
Cited by 3 | Viewed by 1220
Abstract
Comparing algorithms are crucial for enhancing the accuracy of remote sensing estimations of forest biomass in regions with high heterogeneity. Herein, Sentinel 2A, Sentinel 1A, Landsat 8 OLI, and Digital Elevation Model (DEM) were selected as data sources. A total of 12 algorithms, [...] Read more.
Comparing algorithms are crucial for enhancing the accuracy of remote sensing estimations of forest biomass in regions with high heterogeneity. Herein, Sentinel 2A, Sentinel 1A, Landsat 8 OLI, and Digital Elevation Model (DEM) were selected as data sources. A total of 12 algorithms, including 7 types of learners, were utilized for estimating the aboveground biomass (AGB) of Pinus yunnanensis forest. The results showed that: (1) The optimal algorithm (Extreme Gradient Boosting, XGBoost) was selected as the meta-model (referred to as XGBoost-stacking) of the stacking ensemble algorithm, which integrated 11 other algorithms. The R2 value was improved by 0.12 up to 0.61, and RMSE was decreased by 4.53 Mg/ha down to 39.34 Mg/ha compared to the XGBoost. All algorithms consistently showed severe underestimation of AGB in the Pinus yunnanensis forest of Yunnan Province when AGB exceeded 100 Mg/ha. (2) XGBoost-Stacking, XGBoost, BRNN (Bayesian Regularized Neural Network), RF (Random Forest), and QRF (Quantile Random Forest) have good sensitivity to forest AGB. QRNN (Quantile Regression Neural Network), GP (Gaussian Process), and EN (Elastic Network) have more outlier data and their robustness was poor. SVM-RBF (Radial Basis Function Kernel Support Vector Machine), k-NN (K Nearest Neighbors), and SGB (Stochastic Gradient Boosting) algorithms have good robustness, but their sensitivity was poor, and QRF algorithms and BRNN algorithm can estimate low values with higher accuracy. In conclusion, the XGBoost-stacking, XGBoost, and BRNN algorithms have shown promising application prospects in remote sensing estimation of forest biomass. This study could provide a reference for selecting the suitable algorithm for forest AGB estimation. Full article
Show Figures

Figure 1

19 pages, 5297 KiB  
Article
Estimating the Aboveground Biomass of Various Forest Types with High Heterogeneity at the Provincial Scale Based on Multi-Source Data
by Tianbao Huang, Guanglong Ou, Yong Wu, Xiaoli Zhang, Zihao Liu, Hui Xu, Xiongwei Xu, Zhenghui Wang and Can Xu
Remote Sens. 2023, 15(14), 3550; https://doi.org/10.3390/rs15143550 - 14 Jul 2023
Cited by 14 | Viewed by 2253
Abstract
It is important to improve the accuracy of models estimating aboveground biomass (AGB) in large areas with complex geography and high forest heterogeneity. In this study, k-nearest neighbors (k-NN), gradient boosting machine (GBM), random forest (RF), quantile random forest (QRF), regularized random forest [...] Read more.
It is important to improve the accuracy of models estimating aboveground biomass (AGB) in large areas with complex geography and high forest heterogeneity. In this study, k-nearest neighbors (k-NN), gradient boosting machine (GBM), random forest (RF), quantile random forest (QRF), regularized random forest (RRF), and Bayesian regularization neural network (BRNN) machine learning algorithms were constructed to estimate the AGB of four forest types based on environmental factors and the variables selected by the Boruta algorithm in Yunnan Province and using integrated Landsat 8 OLI and Sentinel 2A images. The results showed that (1) DEM was the most important variable for estimating the AGB of coniferous forests, evergreen broadleaved forests, deciduous broadleaved forests, and mixed forests; while the vegetation index was the most important variable for estimating deciduous broadleaved forests, the climatic factors had a higher variable importance for estimating coniferous and mixed forests, and texture features and vegetation index had a higher variable importance for estimating evergreen broadleaved forests. (2) In terms of specific model performance for the four forest types, RRF was the best model both in estimating the AGB of coniferous forests and mixed forests; the R2 and RMSE for coniferous forests were 0.63 and 43.23 Mg ha−1, respectively, and the R2 and RMSE for mixed forests were 0.56 and 47.79 Mg ha−1, respectively. BRNN performed the best in estimating the AGB of evergreen broadleaved forests; the R2 was 0.53 and the RMSE was 68.16 Mg ha−1. QRF was the best in estimating the AGB of deciduous broadleaved forests, with R2 of 0.43 and RMSE of 45.09 Mg ha−1. (3) RRF was the best model for the four forest types according to the mean values, with R2 and RMSE of 0.503 and 52.335 Mg ha−1, respectively. In conclusion, different variables and suitable models should be considered when estimating the AGB of different forest types. This study could provide a reference for the estimation of forest AGB based on remote sensing in complex terrain areas with a high degree of forest heterogeneity. Full article
Show Figures

Figure 1

24 pages, 2999 KiB  
Article
Sentinel-2 and Sentinel-1 Bare Soil Temporal Mosaics of 6-Year Periods for Soil Organic Carbon Content Mapping in Central France
by Diego Urbina-Salazar, Emmanuelle Vaudour, Anne C. Richer-de-Forges, Songchao Chen, Guillaume Martelet, Nicolas Baghdadi and Dominique Arrouays
Remote Sens. 2023, 15(9), 2410; https://doi.org/10.3390/rs15092410 - 4 May 2023
Cited by 14 | Viewed by 4353
Abstract
Satellite-based soil organic carbon content (SOC) mapping over wide regions is generally hampered by the low soil sampling density and the diversity of soil sampling periods. Some unfavorable topsoil conditions, such as high moisture, rugosity, the presence of crop residues, the limited amplitude [...] Read more.
Satellite-based soil organic carbon content (SOC) mapping over wide regions is generally hampered by the low soil sampling density and the diversity of soil sampling periods. Some unfavorable topsoil conditions, such as high moisture, rugosity, the presence of crop residues, the limited amplitude of SOC values and the limited area of bare soil when a single image is used, are also among the influencing factors. To generate a reliable SOC map, this study addresses the use of Sentinel-2 (S2) temporal mosaics of bare soil (S2Bsoil) over 6 years jointly with soil moisture products (SMPs) derived from Sentinel 1 and 2 images, SOC measurement data and other environmental covariates derived from digital elevation models, lithology maps and airborne gamma-ray data. In this study, we explore (i) the dates and periods that are preferable to construct temporal mosaics of bare soils while accounting for soil moisture and soil management; (ii) which set of covariates is more relevant to explain the SOC variability. From four sets of covariates, the best contributing set was selected, and the median SOC content along with uncertainty at 90% prediction intervals were mapped at a 25-m resolution from quantile regression forest models. The accuracy of predictions was assessed by 10-fold cross-validation, repeated five times. The models using all the covariates had the best model performance. Airborne gamma-ray thorium, slope and S2 bands (e.g., bands 6, 7, 8, 8a) and indices (e.g., calcareous sedimentary rocks, “calcl”) from the “late winter–spring” time series were the most important covariates in this model. Our results also indicated the important role of neighboring topographic distances and oblique geographic coordinates between remote sensing data and parent material. These data contributed not only to optimizing SOC mapping performance but also provided information related to long-range gradients of SOC spatial variability, which makes sense from a pedological point of view. Full article
(This article belongs to the Special Issue Remote Sensing for Soil Mapping and Monitoring)
Show Figures

Figure 1

26 pages, 1750 KiB  
Article
Drivers of Realized Volatility for Emerging Countries with a Focus on South Africa: Fundamentals versus Sentiment
by Rangan Gupta, Jacobus Nel and Christian Pierdzioch
Mathematics 2023, 11(6), 1371; https://doi.org/10.3390/math11061371 - 12 Mar 2023
Cited by 1 | Viewed by 1202
Abstract
We use a quantile machine learning (random forests) approach to analyse the predictive ability of newspapers-based macroeconomic attention indexes (MAIs) on eight major fundamentals of the United States on the realized volatility of a major commodity-exporting emerging stock market, namely South Africa. We [...] Read more.
We use a quantile machine learning (random forests) approach to analyse the predictive ability of newspapers-based macroeconomic attention indexes (MAIs) on eight major fundamentals of the United States on the realized volatility of a major commodity-exporting emerging stock market, namely South Africa. We compare the performance of the MAIs with the performance of a news sentiment index (NSI) of the US. We find that both fundamentals and sentiment improve predictive performance, but the relative impact of the former is stronger. We document how the impact of fundamentals and sentiment on predictive performance varies across the quantiles of the conditional distribution of realized volatility, and across different prediction horizons. Specifically, fundamentals matter more at the extreme quantiles at short horizons, and at the median in the long-run. In addition, we report several robustness checks (involving sample period and alternative definitions of realized volatility), and indicate that the obtained results for South Africa also tend to carry over to other emerging countries such as, Brazil, China, India, and Russia. Our results have important implications for investors with volatility being an input for portfolio allocation decisions. In addition, with stock market variability also capturing financial uncertainty, its accurate prediction based on US fundamentals and sentiment also has a role in policy design to prevent possible collapse. Full article
Show Figures

Figure 1

20 pages, 2750 KiB  
Article
Reduction in Uncertainty in Forest Aboveground Biomass Estimation Using Sentinel-2 Images: A Case Study of Pinus densata Forests in Shangri-La City, China
by Lu Li, Boqi Zhou, Yanfeng Liu, Yong Wu, Jing Tang, Weiheng Xu, Leiguang Wang and Guanglong Ou
Remote Sens. 2023, 15(3), 559; https://doi.org/10.3390/rs15030559 - 17 Jan 2023
Cited by 13 | Viewed by 2528
Abstract
The uncertainty from the under-estimation and over-estimation of forest aboveground biomass (AGB) is an urgent problem in optical remote sensing estimation. In order to more accurately estimate the AGB of Pinus densata forests in Shangri-La City, we mainly discuss three non-parametric models—the artificial [...] Read more.
The uncertainty from the under-estimation and over-estimation of forest aboveground biomass (AGB) is an urgent problem in optical remote sensing estimation. In order to more accurately estimate the AGB of Pinus densata forests in Shangri-La City, we mainly discuss three non-parametric models—the artificial neural network (ANN), random forests (RFs), and the quantile regression neural network (QRNN) based on 146 sample plots and Sentinel-2 images in Shangri-La City, China. Moreover, we selected the corresponding optical quartile models with the lowest mean error at each AGB segment to combine as the best QRNN (QRNNb). The results showed that: (1) for the whole biomass segment, the QRNNb has the best fitting performance compared with the ANN and RFs, the ANN has the lowest R2 (0.602) and the highest RMSE (48.180 Mg/ha), and the difference between the QRNNb and RFs is not apparent. (2) For the different biomass segments, the QRNNb has a better performance. Especially when AGB is lower than 40 Mg/ha, the QRNNb has the highest R2 of 0.961 and the lowest RMSE of 1.733 (Mg/ha). Meanwhile, when AGB is larger than 160 Mg/ha, the QRNNb has the highest R2 of 0.867 and the lowest RMSE of 18.203 Mg/ha. This indicates that the QRNNb is more robust and can improve the over-estimation and under-estimation in AGB estimation. This means that the QRNNb combined with the optimal quantile model of each biomass segment provides a method with more potential for reducing the uncertainties in AGB estimation using optical remote sensing images. Full article
(This article belongs to the Special Issue Monitoring Forest Carbon Sequestration with Remote Sensing)
Show Figures

Graphical abstract

16 pages, 2455 KiB  
Article
Solar Irradiance Probabilistic Forecasting Using Machine Learning, Metaheuristic Models and Numerical Weather Predictions
by Vateanui Sansine, Pascal Ortega, Daniel Hissel and Marania Hopuare
Sustainability 2022, 14(22), 15260; https://doi.org/10.3390/su142215260 - 17 Nov 2022
Cited by 12 | Viewed by 2089
Abstract
Solar-power-generation forecasting tools are essential for microgrid stability, operation, and planning. The prediction of solar irradiance (SI) usually relies on the time series of SI and other meteorological data. In this study, the considered microgrid was a combined cold- and power-generation system, located [...] Read more.
Solar-power-generation forecasting tools are essential for microgrid stability, operation, and planning. The prediction of solar irradiance (SI) usually relies on the time series of SI and other meteorological data. In this study, the considered microgrid was a combined cold- and power-generation system, located in Tahiti. Point forecasts were obtained using a particle swarm optimization (PSO) algorithm combined with three stand-alone models: XGboost (PSO-XGboost), the long short-term memory neural network (PSO-LSTM), and the gradient boosting regression algorithm (PSO-GBRT). The implemented daily SI forecasts relied on an hourly time-step. The input data were composed of outputs from the numerical forecasting model AROME (Météo France) combined with historical meteorological data. Our three hybrid models were compared with other stand-alone models, namely, artificial neural network (ANN), convolutional neural network (CNN), random forest (RF), LSTM, GBRT, and XGboost. The probabilistic forecasts were obtained by mapping the quantiles of the hourly residuals, which enabled the computation of 38%, 68%, 95%, and 99% prediction intervals (PIs). The experimental results showed that PSO-LSTM had the best accuracy for day-ahead solar irradiance forecasting compared with the other benchmark models, through overall deterministic and probabilistic metrics. Full article
(This article belongs to the Topic Artificial Intelligence and Sustainable Energy Systems)
Show Figures

Figure 1

Back to TopTop