Optimal selection of benchmarking datasets for unbiased machine learning algorithm evaluation

Pereira, Joao Luiz Junho; Smith-Miles, Kate; Munoz, Mario Andres; Lorena, Ana Carolina

Author(s):	Pereira, Joao Luiz Junho; Smith-Miles, Kate; Munoz, Mario Andres; Lorena, Ana Carolina
Total Authors:	4
Document type:	Journal article
Source:	DATA MINING AND KNOWLEDGE DISCOVERY; v. 38, n. 2, p. 40-pg., 2023-10-20.
Abstract
Whenever a new supervised machine learning (ML) algorithm or solution is developed, it is imperative to evaluate its predictive performance on diverse datasets. This stress-tests the strengths and weaknesses of the new algorithm and provides evidence of the situations in which it is most useful. A common practice is to gather some datasets from public benchmark repositories for such an evaluation, but few or no specific criteria are used in selecting these datasets, and the selection is often ad hoc. This paper investigates the importance of gathering a diverse benchmark of datasets in order to properly evaluate ML models and understand their capabilities. Drawing on meta-learning studies that evaluate the diversity of public dataset repositories, it introduces an optimization method for choosing varied classification and regression datasets from a pool of candidates. The method is based on maximum coverage, circular packing, and the Lichtenberg Algorithm meta-heuristic, so that the chosen datasets are diverse and able to challenge the ML algorithms more broadly. The selections were compared experimentally with random selection and with k-medoids clustering, and proved more effective in terms of the diversity of the chosen benchmarks and their ability to challenge the ML algorithms at different levels.
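
The abstract names maximum coverage as one ingredient of the selection method but gives no implementation details. As a rough illustration of that idea only, the sketch below treats each dataset as a point in a two-dimensional space and picks k of them so that circles of a fixed radius around the picks cover as many candidates as possible. Everything in it is an assumption made for illustration: the 2-D coordinates, the radius, the function names, and above all the simple greedy heuristic, which stands in for the Lichtenberg Algorithm and is not the authors' method.

```python
import math


def covered(center, points, radius):
    """Return the indices of all points within `radius` of `center`."""
    return {i for i, p in enumerate(points) if math.dist(center, p) <= radius}


def greedy_max_coverage(points, k, radius):
    """Greedily pick up to k points whose circles cover the most candidates.

    Assumption: a plain greedy heuristic, used here as a stand-in for the
    meta-heuristic optimizer mentioned in the abstract.
    """
    cover_sets = [covered(p, points, radius) for p in points]
    chosen, covered_so_far = [], set()
    for _ in range(min(k, len(points))):
        # Pick the unchosen candidate that adds the most newly covered points.
        best = max(
            (i for i in range(len(points)) if i not in chosen),
            key=lambda i: len(cover_sets[i] - covered_so_far),
        )
        chosen.append(best)
        covered_so_far |= cover_sets[best]
    return chosen, len(covered_so_far) / len(points)


# Hypothetical candidate pool: a tight group of three, a pair, and an outlier.
demo_points = [(0, 0), (0.1, 0), (0, 0.1), (5, 5), (5.1, 5), (10, 10)]
demo_chosen, demo_coverage = greedy_max_coverage(demo_points, k=2, radius=0.5)
print(demo_chosen, demo_coverage)
```

With two picks the heuristic takes one point from each of the two densest groups rather than two near-duplicates, which is the behaviour a diversity-oriented selection is after. A random draw or a k-medoids clustering could be scored with the same coverage fraction for comparison.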

FAPESP's process:	21/06870-3 - Beyond algorithm selection: meta-learning for data and algorithm analysis and understanding
Grantee:	Ana Carolina Lorena
Support Opportunities:	Research Grants - Young Investigators Grants - Phase 2


FAPESP's process:	22/10683-7 - Is my benchmark of datasets challenging enough?
Grantee:	João Luiz Junho Pereira
Support Opportunities:	Scholarships in Brazil - Post-Doctoral
