How to Avoid Data Leakage When Performing Data Preparation

Data preparation is the process of transforming raw data into a form that is appropriate for modeling. A naive approach to preparing data applies the transform on the entire dataset before evaluating the performance of the model. This results in a problem referred to as data leakage, where knowledge of the hold-out test set leaks […]

The post How to Avoid Data Leakage When Performing Data Preparation appeared first on Machine Learning Mastery.

Comments