Advanced search
Start date
Betweenand
(Reference retrieved automatically from Web of Science through information on FAPESP grant and its corresponding number as mentioned in the publication by the authors.)

Identification of novel protein-coding sequences in Eucalyptus grandis plants by high-resolution mass spectrometry

Full text
Author(s):
Jorge, Gabriel Lemes [1] ; Balbuena, Tiago Santana [1]
Total Authors: 2
Affiliation:
[1] Sao Paulo State Univ, Dept Technol, Jaboticabal, SP - Brazil
Total Affiliations: 1
Document type: Journal article
Source: BIOCHIMICA ET BIOPHYSICA ACTA-PROTEINS AND PROTEOMICS; v. 1869, n. 3 MAR 2021.
Web of Science Citations: 0
Abstract

Eucalyptus species are widely used in the forestry industry, and a significant increase in the number of sequences available in database repositories has been observed for these species. In proteomics, a protein is identified by correlating the theoretical fragmentation spectrum derived from genomic/transcriptomic data against the experimental fragmentation mass spectrum acquired from large-scale analysis of protein mixtures. Proteogenomics is an alternative approach that can identify novel proteins encoded by regions previously considered as non-coding. This study aimed to confidently identify and confirm the existence of previously unknown protein-coding sequences in the Eucalyptus grandis genome. To this end, we used a modified spectral correlation strategy and a dedicated de novo peptide sequencing pipeline. Upon the strategy used here, we confidently identified 41 novel peptide forms and six peptides containing at least one single amino acid substitution. The most representative genomic class of novel peptides was identified as originating from alternative reading frames. In contrast, no clear single amino acid substitution pattern was identified. Validation of the identifications was carried out using a parallel reaction monitoring approach that provided further mass spectrometry support for the existence of the novel peptide sequences. Data are available via ProteomeXchange with identifier PXD022110. (AU)

FAPESP's process: 18/15035-8 - Probing Eucalytpus plant performance to atmospheric carbon dioxide levels: the source/sink relationships unveiled by targeted proteomics approach
Grantee:Tiago Santana Balbuena
Support Opportunities: Research Grants - Young Investigators Grants - Phase 2