The authors propose QC-SMOTE, a quality-controlled oversampling framework designed to address the generation of low-quality synthetic samples in noisy or overlapping regions common in imbalanced classification tasks. This method estimates minority sample reliability using a composite neighborhood trustworthiness score and employs an IPQ-guided best-of-K strategy for generating synthetic candidates.
- QC-SMOTE combines local density, safe-level, and isolation from the majority class to estimate minority sample reliability.
- Synthetic candidates are generated using an IPQ-guided best-of-K strategy that evaluates midpoint purity and majority clearance.
- Generation behavior adapts across overlap-imbalance regimes by adjusting interpolation range and selection criteria to match local data geometry.
- Low-quality synthetic samples are replaced with original minority duplicates when neighborhood purity falls below an adaptive threshold.
Experiments on 30 imbalanced datasets demonstrate that QC-SMOTE achieves the strongest average AUC-ROC and Macro F1 among compared oversampling methods, particularly under moderate and severe imbalance conditions.