Advanced search
Start date
Betweenand


Improving annotation propagation on molecular networks through random walks: introducing ChemWalker

Full text
Author(s):
Borelli, Tiago Cabral ; Arini, Gabriel Santos ; Feitosa, Luis G. P. ; Dorrestein, Pieter C. ; Lopes, Norberto Peporine ; da Silva, Ricardo R.
Total Authors: 6
Document type: Journal article
Source: Bioinformatics; v. 39, n. 3, p. 2-pg., 2023-03-01.
Abstract

Motivation: Annotation of the mass signals is still the biggest bottleneck for the untargeted mass spectrometry analysis of complex mixtures. Molecular networks are being increasingly adopted by the mass spectrometry community as a tool to annotate large-scale experiments. We have previously shown that the process of propagating annotations from spectral library matches on molecular networks can be automated using Network Annotation Propagation (NAP). One of the limitations of NAP is that the information for the spectral matches is only propagated locally, to the first neighbor of a spectral match. Here, we show that annotation propagation can be expanded to nodes not directly connected to spectral matches using random walks on graphs, introducing the ChemWalker python library.Results: Similarly to NAP, ChemWalker relies on combinatorial in silico fragmentation results, performed by MetFrag, searching biologically relevant databases. Departing from the combination of a spectral network and the structural similarity among candidate structures, we have used MetFusion Scoring function to create a weight function, producing a weighted graph. This graph was subsequently used by the random walk to calculate the probability of 'walking' through a set of candidates, departing from seed nodes (represented by spectral library matches). This approach allowed the information propagation to nodes not directly connected to the spectral library match. Compared with NAP, ChemWalker has a series of improvements, on running time, scalability and maintainability and is available as a standalone python package.Availability and implementation: ChemWalker is freely available atContact: ridasilva@usp.brSupplementary information: are available at Bioinformatics online. (AU)

FAPESP's process: 21/08235-3 - Unraveling the evolutionary logic and structure of the marine resistome: a deep structured learning approach to discovery of evolutionary constraints
Grantee:Tiago Cabral Borelli
Support Opportunities: Scholarships in Brazil - Doctorate
FAPESP's process: 20/02207-5 - Inventorying secondary metabolism applying metabolomic strategies: contribution to the Brazilian biodiversity valuation
Grantee:Norberto Peporine Lopes
Support Opportunities: BIOTA-FAPESP Program - Thematic Grants
FAPESP's process: 17/18922-2 - Development of a computing platform extensible and modular for metabolomics and metagenomics analysis: innovation with the discovery of new enzymatic activities and natural products of pharmaceutical interest derived
Grantee:Ricardo Roberto da Silva
Support Opportunities: BIOTA-FAPESP Program - Young Investigators Grants
FAPESP's process: 19/05026-4 - Development of a computational platform extensible and modular for analysis of metabolomics and metagenomics experiments: Innovating with the discovery of new enzymatic activities and derived natural products of interest
Grantee:Ricardo Roberto da Silva
Support Opportunities: Scholarships in Brazil - BIOTA - Young Researchers
FAPESP's process: 21/10401-9 - Detection and repository optimization of fragmentation spectra from biological signals in metabolomics studies
Grantee:Gabriel Santos Arini
Support Opportunities: Scholarships in Brazil - Doctorate