(Reference retrieved automatically from Web of Science through information on FAPESP grant and its corresponding number as mentioned in the publication by the authors.)

Experimental correlation analysis of bicluster coherence measures and gene ontology information

Author(s):
Padilha, Victor Alexandre [1] ; de Leon Ferreira de Carvalho, Andre Carlos Ponce [1]
Total Authors: 2
Affiliation:
[1] Univ Sao Paulo, Inst Math & Comp Sci, Av Trabalhador Sao Carlense 400, BR-13566590 Sao Carlos, SP - Brazil
Total Affiliations: 1
Document type: Journal article
Source: APPLIED SOFT COMPUTING; v. 85, DEC 2019.
Web of Science Citations: 0
Abstract

Biclustering algorithms have become popular tools for gene expression data analysis. They can identify local patterns defined by subsets of genes and subsets of samples, which traditional clustering algorithms cannot detect. Although useful, biclustering is an NP-hard problem. Therefore, most biclustering algorithms search for biclusters that optimize a pre-established coherence measure. Many heuristics and validation measures have been proposed for biclustering over the last 20 years. However, an extensive comparison of bicluster coherence measures in practical scenarios is still lacking. To address this gap, this paper experimentally analyzes 17 bicluster coherence measures and external measures calculated from gene ontology information. In this analysis, results were produced by 10 algorithms from the literature on 19 gene expression datasets. According to the experimental results, a few pairs of strongly correlated coherence measures could be identified, which suggests redundancy. Moreover, the pairs of strongly correlated measures may change depending on whether the data are normalized and on which ontology the biclusters are enriched in. Finally, there was no clear relation between coherence measures and assessment based on gene ontology information. (AU)
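The kind of analysis the abstract describes can be illustrated with a minimal sketch. The abstract does not name the 17 coherence measures or the correlation coefficient used, so everything below is an assumption chosen for illustration: two well-known measures (mean squared residue and submatrix variance), the Pearson coefficient, a made-up toy matrix, and hypothetical function names.

```python
from math import sqrt


def mean_squared_residue(data, rows, cols):
    """Mean squared residue of the bicluster given by rows x cols.

    Illustrative choice of measure; 0 means a perfectly additive pattern.
    """
    sub = [[data[i][j] for j in cols] for i in rows]
    n, m = len(rows), len(cols)
    row_means = [sum(r) / m for r in sub]
    col_means = [sum(sub[i][j] for i in range(n)) / n for j in range(m)]
    total_mean = sum(row_means) / n
    return sum(
        (sub[i][j] - row_means[i] - col_means[j] + total_mean) ** 2
        for i in range(n)
        for j in range(m)
    ) / (n * m)


def variance(data, rows, cols):
    """Population variance of all values in the bicluster (second illustrative measure)."""
    values = [data[i][j] for i in rows for j in cols]
    mean = sum(values) / len(values)
    return sum((v - mean) ** 2 for v in values) / len(values)


def pearson(x, y):
    """Pearson correlation of two equally long score lists (assumed coefficient)."""
    mx, my = sum(x) / len(x), sum(y) / len(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sx = sqrt(sum((a - mx) ** 2 for a in x))
    sy = sqrt(sum((b - my) ** 2 for b in y))
    return cov / (sx * sy)


# Toy expression matrix (rows = genes, columns = samples); values are made up.
data = [
    [1, 2, 3, 4],
    [2, 3, 4, 5],
    [3, 4, 5, 9],
    [7, 1, 8, 2],
]

# Hypothetical biclusters, each a (gene indices, sample indices) pair.
biclusters = [
    ([0, 1], [0, 1, 2]),
    ([0, 1, 2], [0, 1, 2, 3]),
    ([2, 3], [0, 1]),
    ([0, 1, 2, 3], [0, 1, 2, 3]),
]

msr_scores = [mean_squared_residue(data, r, c) for r, c in biclusters]
var_scores = [variance(data, r, c) for r, c in biclusters]

# A coefficient close to 1 or -1 would hint that the two measures are redundant.
r = pearson(msr_scores, var_scores)
print(f"Pearson correlation between MSR and variance: {r:.3f}")
```

With a larger set of measures, the same pairwise computation would yield a correlation matrix. Repeating it on normalized and non-normalized data is one way to observe the shift in correlated pairs that the abstract reports.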

FAPESP's process: 16/18615-0 - Advanced machine learning
Grantee: André Carlos Ponce de Leon Ferreira de Carvalho
Support type: Research Grants - Research Partnership for Technological Innovation - PITE
FAPESP's process: 13/07375-0 - CeMEAI - Center for Mathematical Sciences Applied to Industry
Grantee: José Alberto Cuminato
Support type: Research Grants - Research, Innovation and Dissemination Centers - RIDC
FAPESP's process: 17/02975-0 - Ensembles of biclustering results
Grantee: Victor Alexandre Padilha
Support type: Scholarships in Brazil - Doctorate