Advanced search
Start date
Betweenand

Methods for selecting mixture models

Abstract

Mixture models have gained a lot of visibility in the last two decades and have been widely used in many areas of research due to their flexibility in accommodating heterogeneous, independent or dependent, structured or unstructured data. With the availability of high-dimensional data and an increasing amount of information, two major challenges have been identifying the ideal number of components (groups) in the mixture and selecting the variables (covariates or predictors) which, in some cases, are used in a regression function to define some parameters involved in the model. This project, therefore, consists of two simultaneous sub-projects. One of them is focused on proposing more efficient frequentist and Bayesian methodologies for selecting variables in non-homogeneous hidden Markov models, which are mixture models with dependent data over time. The objective of the second sub-project is to compare the performance of several known metrics for selecting the number of topics in mixture models for textual analysis and latent Dirichlet allocation, which can be seen as a flexibilization of mixture models in the analysis and clustering of texts. The performance of the proposed and studied methods will be analyzed and compared in simulated and real data. (AU)

Articles published in Agência FAPESP Newsletter about the research grant:
More itemsLess items
Articles published in other media outlets ( ):
More itemsLess items
VEICULO: TITULO (DATA)
VEICULO: TITULO (DATA)

Please report errors in scientific publications list using this form.
X

Report errors in this page


Error details: