Busca avançada
Ano de início
Entree
(Referência obtida automaticamente do Web of Science, por meio da informação sobre o financiamento pela FAPESP e o número do processo correspondente, incluída na publicação pelos autores.)

A systematic comparative evaluation of biclustering techniques

Texto completo
Autor(es):
Padilha, Victor A. ; Campello, Ricardo J. G. B.
Número total de Autores: 2
Tipo de documento: Artigo Científico
Fonte: BMC Bioinformatics; v. 18, JAN 23 2017.
Citações Web of Science: 23
Resumo

Background: Biclustering techniques are capable of simultaneously clustering rows and columns of a data matrix. These techniques became very popular for the analysis of gene expression data, since a gene can take part of multiple biological pathways which in turn can be active only under specific experimental conditions. Several biclustering algorithms have been developed in the past recent years. In order to provide guidance regarding their choice, a few comparative studies were conducted and reported in the literature. In these studies, however, the performances of the methods were evaluated through external measures that have more recently been shown to have undesirable properties. Furthermore, they considered a limited number of algorithms and datasets. Results: We conducted a broader comparative study involving seventeen algorithms, which were run on three synthetic data collections and two real data collections with a more representative number of datasets. For the experiments with synthetic data, five different experimental scenarios were studied: different levels of noise, different numbers of implanted biclusters, different levels of symmetric bicluster overlap, different levels of asymmetric bicluster overlap and different bicluster sizes, for which the results were assessed with more suitable external measures. For the experiments with real datasets, the results were assessed by gene set enrichment and clustering accuracy. Conclusions: We observed that each algorithm achieved satisfactory results in part of the biclustering tasks in which they were investigated. The choice of the best algorithm for some application thus depends on the task at hand and the types of patterns that one wants to detect. (AU)

Processo FAPESP: 14/08840-0 - Avaliação Sistemática de Técnicas de Bi-Agrupamento de Dados
Beneficiário:Victor Alexandre Padilha
Modalidade de apoio: Bolsas no Brasil - Mestrado
Processo FAPESP: 13/18698-4 - Métodos e algoritmos em aprendizado de máquina não supervisionado e semi-supervisionado
Beneficiário:Ricardo José Gabrielli Barreto Campello
Modalidade de apoio: Auxílio à Pesquisa - Regular