Inter-Hammett: Enhancing Interpretability in Hammett‘s Constant Prediction via Extracting Rules

UĞURLU, SADETTİN

doi:10.1002/slct.202501778

Inter-Hammett: Enhancing Interpretability in Hammett‘s Constant Prediction via Extracting Rules

Atıf İçin Kopyala

UĞURLU S. Y.

ChemistrySelect, cilt.10, sa.30, 2025 (SCI-Expanded)

Yayın Türü: Makale / Tam Makale
Cilt numarası: 10 Sayı: 30
Basım Tarihi: 2025
Doi Numarası: 10.1002/slct.202501778
Dergi Adı: ChemistrySelect
Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Academic Search Premier
Anahtar Kelimeler: Hammett's constant, Interpretable machine learning, Molecular descriptors, Rulefit
Akdeniz Üniversitesi Adresli: Evet

Özet

The Hammett constants ((Formula presented.)) describe the electron-withdrawing and electron-donating effects of substituents in aromatic compounds and are widely used in structure–activity relationship studies. However, their experimental determination is resource-intensive and time-consuming. Although graph neural networks (GNNs), such as GCN and Weave, have been proposed for predicting Hammett constants using graph-based features, they suffer from poor interpretability. To address limited interpretability, we introduce Inter-Hammett, a framework designed to enhance interpretability while maintaining high predictive performance. Inter-Hammett leverages cheminformatics-derived descriptors from RDKit, Mordred, PyBioMed, and CDK, followed by rigorous AutoGluon-based feature selection to mitigate the curse of dimensionality. The model core is trained using RuleFit on 85% of the dataset, ensuring a balance between accuracy and interpretability. On unseen data, Inter-Hammett achieved an R2 of 0.880 and an RMSE of 0.128, outperforming eleven models, including four recently published state-of-the-art deep learning approaches. Additionally, a comprehensive interpretability analysis using seven different methods further enhances transparency, making Inter-Hammett a robust alternative for Hammett's constant prediction.