Integrating machine learning regression and classification for enhanced interpretability in optimizing the Fenton process for real wastewater treatment conditions


Ergan B. T., Yucel O., GENGEÇ E., Aydin E. S.

Separation and Purification Technology, cilt.363, 2025 (SCI-Expanded) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 363
  • Basım Tarihi: 2025
  • Doi Numarası: 10.1016/j.seppur.2025.132182
  • Dergi Adı: Separation and Purification Technology
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, Aerospace Database, Biotechnology Research Abstracts, Chemical Abstracts Core, Chimica, Communication Abstracts, Compendex, INSPEC, Metadex, Pollution Abstracts, Civil Engineering Abstracts
  • Anahtar Kelimeler: Decision tree regression, Fenton process, Gaussian process regression, Generalized additive model, Machine learning methods, Random forest regression
  • Kocaeli Üniversitesi Adresli: Evet

Özet

The Fenton process is an important process used in treating textile wastewater such as Jeans-wash wastewater (JWW). Predicting the results of this process according to Fenton parameters is a critical step to evaluating the treatment performance. One of the important ways to improve treatment performance is to use machine learning methods. Therefore, machine learning methods using experimental Fenton treatment data were proposed to mathematically demonstrate the effect of the hydrogen peroxide (H2O2) and iron sulfate (FeSO4) dosage on dye and total organic carbon (TOC) concentration within this research. To increase the predictive capability of machine learning methods, in addition to concentration of H2O2 and FeSO4 inputs, dye and TOC initial concentration values were used as inputs in the machine learning methods. Four regression techniques were used to forecast the dye and TOC concentration outputs of Fenton process, namely Random Forest Regression (RFR), Gaussian Process Regression (GPR), Decision Tree Regression (DTR), and Generalized Additive Model (GAM) in this study. Hold-out and k-fold cross-validation were used in combination to examine the effectiveness of the suggested regression techniques. Among these machine learning methods, GPR was more successful than the other proposed models with predicted the dye concentration as R2 > 0.97 and TOC concentration as R2 > 0.86. Finally, illustration of decision tree classifiers indicating the process operation were placed with interpretable machine learning. With these trees, the input-range/target result relationship depending on the input parameters of the process was established to eliminate the side reactions that occur due to the nature of the Fenton process.