Similar Items: A Leakage-Aware Machine Learning Pipeline for Credit Default Prediction Using LightGBM