Busca avançada
Ano de início
Entree
(Referência obtida automaticamente do Web of Science, por meio da informação sobre o financiamento pela FAPESP e o número do processo correspondente, incluída na publicação pelos autores.)

MIA: Mutual Information Analyzer, a graphic user interface program that calculates entropy, vertical and horizontal mutual information of molecular sequence sets

Texto completo
Autor(es):
Lichtenstein, Flavio [1, 2] ; Antoneli, Jr., Fernando [1, 2] ; Briones, Marcelo R. S. [2, 3]
Número total de Autores: 3
Afiliação do(s) autor(es):
[1] Univ Fed Sao Paulo, Escola Paulista Med, Dept Informat Saude, BR-04023062 Sao Paulo, SP - Brazil
[2] Univ Fed Sao Paulo, Escola Paulista Med, Lab Evolutionary Genom & Biocomplex, BR-04039032 Sao Paulo, SP - Brazil
[3] Univ Fed Sao Paulo, Escola Paulista Med, Dept Microbiol, BR-04023062 Sao Paulo, SP - Brazil
Número total de Afiliações: 3
Tipo de documento: Artigo Científico
Fonte: BMC Bioinformatics; v. 16, DEC 10 2015.
Citações Web of Science: 0
Resumo

Background: Short and long range correlations in biological sequences are central in genomic studies of covariation. These correlations can be studied using mutual information because it measures the amount of information one random variable contains about the other. Here we present MIA (Mutual Information Analyzer) a user friendly graphic interface pipeline that calculates spectra of vertical entropy (VH), vertical mutual information (VMI) and horizontal mutual information (HMI), since currently there is no user friendly integrated platform that in a single package perform all these calculations. MIA also calculates Jensen-Shannon Divergence (JSD) between pair of different species spectra, herein called informational distances. Thus, the resulting distance matrices can be presented by distance histograms and informational dendrograms, giving support to discrimination of closely related species. Results: In order to test MIA we analyzed sequences from Drosophila Adh locus, because the taxonomy and evolutionary patterns of different Drosophila species are well established and the gene Adh is extensively studied. The search retrieved 959 sequences of 291 species. From the total, 450 sequences of 17 species were selected. With this dataset MIA performed all tasks in less than three hours: gathering, storing and aligning fasta files; calculating VH, VMI and HMI spectra; and calculating JSD between pair of different species spectra. For each task MIA saved tables and graphics in the local disk, easily accessible for future analysis. Conclusions: Our tests revealed that the ``informational model free{''} spectra may represent species signatures. Since JSD applied to Horizontal Mutual Information spectra resulted in statistically significant distances between species, we could calculate respective hierarchical clusters, herein called Informational Dendrograms (ID). When compared to phylogenetic trees all Informational Dendrograms presented similar taxonomy and species clusterization. (AU)

Processo FAPESP: 13/07838-0 - Microdiversidade mitocondrial de Candida albicans e suas implicações em infecção hospitalar e em padrões macroevolutivos do genoma mitocondrial
Beneficiário:Marcelo Ribeiro da Silva Briones
Modalidade de apoio: Auxílio à Pesquisa - Regular