Abstract
Real-world data often present challenges such as imbalance, noise, and missing values. Recently, the Machine Learning literature has demonstrated that improving model performance also requires enhancing the quality of the data used for training, giving rise to a new research field known as Data-Centric Artificial Intelligence. Missing values, defined as the absence of information in one o…