Engineering Structures, cilt.340, 2025 (SCI-Expanded)
Accurate and rapid classification of earthquake-damaged buildings is crucial for effective first response and recovery. Conventional inspection methods are labor-intensive, and thus, automated machine learning offers a promising alternative. Yet, real-world seismic datasets that can be used for training machine learning models are often highly imbalanced, with severely damaged or collapsed structures representing only a small minority. Previous studies that have explored class-imbalance methods focused on basic data-level or conventional ensemble techniques, as well as advanced strategies—such as SMOTEENN, CTGAN, and stacked ensembles with custom weighting—remain underexplored. In this study, building survey data from the 2023 Kahramanmaraş earthquakes were used to comprehensively evaluate data-level, algorithm-level, and hybrid methods. Guided by the results of these evaluations, a novel class-weighted stacked ensemble model featuring Balanced Random Forest and XGBoost as base learners was devised. This new model leverages customized misclassification penalties to improve minority class detection and achieves a balanced accuracy of 0.62 and a G-Mean of 0.75, markedly outperforming models employing data-level balancing alone.