Machine Learning Classification of Return on Equity from Sustainability Reporting and Corporate Governance Metrics: A SHAP-Based Explanation


TERZİOĞLU M., Ersoy Bozcuk A., ÜNAL UYAR G. F., KAYA N., TUTCU B., Dursun G. D.

Sustainability (Switzerland), cilt.18, sa.1, 2026 (SCI-Expanded, SSCI, Scopus) identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 18 Sayı: 1
  • Basım Tarihi: 2026
  • Doi Numarası: 10.3390/su18010194
  • Dergi Adı: Sustainability (Switzerland)
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Social Sciences Citation Index (SSCI), Scopus, Geobase, INSPEC
  • Anahtar Kelimeler: corporate sustainability, ESG reporting standards, financial performance, machine learning
  • Akdeniz Üniversitesi Adresli: Evet

Özet

The aim of this study was to develop a model that classifies companies into high or low categories based on their return on equity (RoE), the most important indicator of financial performance, using sustainability and governance-related committee reports and reports shared with the public. As a sample, the RoE, sustainability, and governance variables of all 427 companies traded on the Istanbul Stock Exchange in 2024 were used. Using a 70:30 stratified split between the training and test sets, three tree-based models (XGBoost, LightGBM, and Random Forest) were used to perform a binary classification task. The findings show that tree-based models perform only slightly better than the naive majority class rule, and therefore, have limited overall classification power. A noteworthy finding from the study is that SHAP-based explainability analysis shows that the Corporate Governance Report (IMNG), the Integrated Report (IREP) and the existence of a Sustainability Committee (ICOM) rank higher in terms of SHAP-based global importance in the High RoE classification model, although their average contributions are small and, in the case of IMNG, predominantly negative for the probability of belonging to the High RoE class. Methodologically, the article moves away from traditional econometric methods based on ESG scores, instead combining a predictive classification structure with TreeSHAP-based explanations. These findings indicate a need for reporting practices that offer deeper content, clearer evidence of governance quality, and stronger data integrity to better support investors’ decision-making processes through sustainability and governance.