Scholarship 17/02975-0 - Machine learning, Data clustering

Grant number:	17/02975-0
Support Opportunities:	Scholarships in Brazil - Doctorate
Start date:	May 01, 2017
End date:	October 31, 2018
Field of knowledge:	Physical Sciences and Mathematics - Computer Science - Computing Methodologies and Techniques

Principal Investigator:	André Carlos Ponce de Leon Ferreira de Carvalho
Grantee:	Victor Alexandre Padilha

Host Institution:	Instituto de Ciências Matemáticas e de Computação (ICMC). Universidade de São Paulo (USP). São Carlos, SP, Brazil

Associated research grant:	13/07375-0 - CeMEAI - Center for Mathematical Sciences Applied to Industry, AP.CEPID


Abstract

One of the main subjects in machine learning is data clustering, which aims at finding clusters that describe a set of objects in such a way that the intra-cluster similarity and the inter-cluster dissimilarity are both maximized. In general, such techniques are based on measures that take into account all the available features of a dataset. However, in several real-world situations the clusters contained in the data are defined only by a subset of the features. For this reason, the biclustering paradigm aims at providing algorithms capable of simultaneously clustering the rows and columns of a data matrix in order to find homogeneous submatrices. This paradigm became widely used after its importance for gene expression data analysis was shown. However, one of the main problems of the biclustering field is that there is no universal definition of which patterns constitute a bicluster. Each algorithm therefore relies on different heuristics, mathematical formulations and assumptions, which leads to different outcomes for the same input data. Consequently, developing techniques capable of combining the solutions of several different algorithms that search for similar patterns, and/or of the same algorithm run under different experimental parameters, can be an important step toward more meaningful and robust results, which may not be identified by applying a single algorithm on its own.
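To make the notion of a "homogeneous submatrix" concrete, the sketch below scores a candidate bicluster with the mean squared residue, a well-known coherence measure from the biclustering literature. It is an illustrative assumption: the abstract does not say which measures the project uses, and the function name `mean_squared_residue` and the toy matrix `data` are hypothetical. A score near zero means the selected rows follow the same additive pattern across the selected columns.

```python
def mean_squared_residue(matrix, rows, cols):
    """Mean squared residue of the submatrix picked out by rows and cols.

    Illustrative only: one common coherence measure, not necessarily
    the one used in the project described above.
    """
    sub = [[matrix[i][j] for j in cols] for i in rows]
    n_rows, n_cols = len(sub), len(sub[0])

    # Means of each row, each column, and the whole submatrix.
    row_means = [sum(r) / n_cols for r in sub]
    col_means = [sum(sub[i][j] for i in range(n_rows)) / n_rows
                 for j in range(n_cols)]
    overall = sum(row_means) / n_rows

    total = 0.0
    for i in range(n_rows):
        for j in range(n_cols):
            residue = sub[i][j] - row_means[i] - col_means[j] + overall
            total += residue ** 2
    return total / (n_rows * n_cols)


# Hypothetical toy data: rows 0-2 share an additive pattern on columns 0 and 2
# only, so the pattern is invisible when all features are considered together.
data = [
    [1.0, 9.0, 2.0, 4.0],
    [2.0, 0.0, 3.0, 7.0],
    [5.0, 3.0, 6.0, 1.0],
    [8.0, 2.0, 0.0, 5.0],
]

bicluster_score = mean_squared_residue(data, [0, 1, 2], [0, 2])
full_score = mean_squared_residue(data, range(4), range(4))
print(bicluster_score, full_score)
```

The submatrix restricted to the chosen rows and columns scores (numerically) zero, while the full matrix does not, which is the situation the abstract describes: the structure exists only on a subset of the features.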

Scientific publications

(References retrieved automatically from Web of Science and SciELO through information on FAPESP grants and their corresponding numbers as mentioned in the publications by the authors)

PADILHA, VICTOR A.; DE CARVALHO, ANDRE C. P. L. F.; IEEE. A Study of Biclustering Coherence Measures for Gene Expression Data. 2018 7TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), v. N/A, p. 6-pg., 2018-01-01. (16/18615-0, 17/02975-0, 13/07375-0)

PADILHA, VICTOR ALEXANDRE; DE LEON FERREIRA DE CARVALHO, ANDRE CARLOS PONCE. Experimental correlation analysis of bicluster coherence measures and gene ontology information. APPLIED SOFT COMPUTING, v. 85, DEC 2019. (16/18615-0, 13/07375-0, 17/02975-0)

PADILHA, VICTOR A.; DE CARVALHO, ANDRE C. P. L. F.; IEEE. A Comparison of Hierarchical Biclustering Ensemble Methods. 2017 6TH BRAZILIAN CONFERENCE ON INTELLIGENT SYSTEMS (BRACIS), v. N/A, p. 6-pg., 2017-01-01. (13/07375-0, 17/02975-0, 16/18615-0)
