SIFTER-T: A scalable and optimized framework for t... - BV FAPESP
Advanced search
Start date
Betweenand
(Reference retrieved automatically from Web of Science through information on FAPESP grant and its corresponding number as mentioned in the publication by the authors.)

SIFTER-T: A scalable and optimized framework for the SIFTER phylogenomic method of probabilistic protein domain annotation

Full text
Author(s):
Almeida-e-Silva, Danillo C. [1] ; Vencio, Ricardo Z. N. [1]
Total Authors: 2
Affiliation:
[1] Univ Sao Paulo, Dept Comp & Math, FFCLRP USP, BR-14049 Ribeirao Preto - Brazil
Total Affiliations: 1
Document type: Journal article
Source: BIOTECHNIQUES; v. 58, n. 3, p. 140-142, MAR 2015.
Web of Science Citations: 0
Abstract

Statistical Inference of Function Through Evolutionary Relationships (SIFTER) is a powerful computational platform for probabilistic protein domain annotation. Nevertheless, SIFTER is not widely used, likely due to usability and scalability issues. Here we present SIFTER-T (SIFTER Throughput-optimized), a substantial improvement over SIFTER's original proof-of-principle implementation. SIFTER-T is optimized for better performance, allowing it to be used at the genome-wide scale. Compared to SIFTER 2.0, SIFTER-T achieved an 87-fold performance improvement using published test data sets for the known annotations recovering module and a 72.3% speed increase for the gene tree generation module in quad-core machines, as well as a major decrease in memory usage during the realignment phase. Memory optimization allowed an expanded set of proteins to be handled by SIFTER's probabilistic method. The improvement in performance and automation that we achieved allowed us to build a web server to bring the power of Bayesian phylogenomic inference to the genomics community. SIFTER-T and its online interface are freely available under GNU license at http://labpib.fmrp.usp.br/methods/SIFTER-t/ and https://github.com/dcasbioinfo/SIFTER-t. (AU)

FAPESP's process: 09/53161-6 - Information technology applied to bioenergy genomics: probabilistic annotation using artificial intelligence
Grantee:Ricardo Zorzetto Nicoliello Vêncio
Support Opportunities: Program for Research on Bioenergy (BIOEN) - Regular Program Grants