A genetic and graph-guided feature learning strategy for improving decision tree construction


Creative Commons License

Karabadji N. E., Amara Korba A., Assi A., Seridi H., Karabadji M. A., Ghamri-Doudane Y., ...Daha Fazla

Cluster Computing, cilt.28, sa.7, 2025 (SCI-Expanded, Scopus) identifier identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 28 Sayı: 7
  • Basım Tarihi: 2025
  • Doi Numarası: 10.1007/s10586-025-05474-y
  • Dergi Adı: Cluster Computing
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier, PASCAL, Applied Science & Technology Source, Compendex, Computer & Applied Sciences, INSPEC
  • Anahtar Kelimeler: Decision tree, Feature construction, Feature selection, Genetic algorithm, Internet of Vehicles
  • Kocaeli Üniversitesi Adresli: Evet

Özet

Machine learning algorithms have offered unprecedented solutions for many real-world problems. These algorithms frequently involve using a large number of features. However, several of these features could not be very informative due to data uncertainties, such as noise and residual variation. Decision trees are among the most preferred classification models. This is due to their simplicity, explainability, and readability. However, data inaccuracies could impact the construction of decision trees and thus hinder their results. Feature selection and construction present promising research direction to enhance the performance of decision tree models. In this paper, we present a strategy that combines feature selection and construction where the construction of new features is performed by using the ones that were not chosen during the selection step. However, the search space of combinations of selected/constructed features is extremely large. To find the best solution, a genetic algorithm has been developed combined with a graph covering vertices set guided approach. The obtained results on a large number of datasets from the UCI Repository demonstrate that our approach outperforms both recent and classical decision tree construction techniques. We also present a successful use case of our approach in detecting Botnet traffic in the Internet of Vehicles.