Advanced search
Start date
Betweenand
(Reference retrieved automatically from Web of Science through information on FAPESP grant and its corresponding number as mentioned in the publication by the authors.)

MIA: Mutual Information Analyzer, a graphic user interface program that calculates entropy, vertical and horizontal mutual information of molecular sequence sets

Full text
Author(s):
Lichtenstein, Flavio [1, 2] ; Antoneli, Jr., Fernando [1, 2] ; Briones, Marcelo R. S. [2, 3]
Total Authors: 3
Affiliation:
[1] Univ Fed Sao Paulo, Escola Paulista Med, Dept Informat Saude, BR-04023062 Sao Paulo, SP - Brazil
[2] Univ Fed Sao Paulo, Escola Paulista Med, Lab Evolutionary Genom & Biocomplex, BR-04039032 Sao Paulo, SP - Brazil
[3] Univ Fed Sao Paulo, Escola Paulista Med, Dept Microbiol, BR-04023062 Sao Paulo, SP - Brazil
Total Affiliations: 3
Document type: Journal article
Source: BMC Bioinformatics; v. 16, DEC 10 2015.
Web of Science Citations: 0
Abstract

Background: Short and long range correlations in biological sequences are central in genomic studies of covariation. These correlations can be studied using mutual information because it measures the amount of information one random variable contains about the other. Here we present MIA (Mutual Information Analyzer) a user friendly graphic interface pipeline that calculates spectra of vertical entropy (VH), vertical mutual information (VMI) and horizontal mutual information (HMI), since currently there is no user friendly integrated platform that in a single package perform all these calculations. MIA also calculates Jensen-Shannon Divergence (JSD) between pair of different species spectra, herein called informational distances. Thus, the resulting distance matrices can be presented by distance histograms and informational dendrograms, giving support to discrimination of closely related species. Results: In order to test MIA we analyzed sequences from Drosophila Adh locus, because the taxonomy and evolutionary patterns of different Drosophila species are well established and the gene Adh is extensively studied. The search retrieved 959 sequences of 291 species. From the total, 450 sequences of 17 species were selected. With this dataset MIA performed all tasks in less than three hours: gathering, storing and aligning fasta files; calculating VH, VMI and HMI spectra; and calculating JSD between pair of different species spectra. For each task MIA saved tables and graphics in the local disk, easily accessible for future analysis. Conclusions: Our tests revealed that the ``informational model free{''} spectra may represent species signatures. Since JSD applied to Horizontal Mutual Information spectra resulted in statistically significant distances between species, we could calculate respective hierarchical clusters, herein called Informational Dendrograms (ID). When compared to phylogenetic trees all Informational Dendrograms presented similar taxonomy and species clusterization. (AU)

FAPESP's process: 13/07838-0 - Mitochondrial microdiversity of Candida albicans and its implications in hospital-acquired infections and patterns of mitochondrial genome macroevolution
Grantee:Marcelo Ribeiro da Silva Briones
Support type: Regular Research Grants