Conditional Analysis of Audio and Speech Signals for Coding and Recognition

Grant number: 15/25512-0
Support Opportunities: Regular Research Grants
Start date: July 01, 2016
End date: December 31, 2018
Field of knowledge: Engineering - Electrical Engineering - Telecommunications
Principal Investigator: Miguel Arjona Ramírez
Grantee: Miguel Arjona Ramírez
Host Institution: Escola Politécnica (EP). Universidade de São Paulo (USP). São Paulo, SP, Brazil
Associated researchers: Demostenes Zegarra Rodriguez; Emilio Del Moral Hernandez; Mario Minami

Abstract

This research plan addresses themes common to several areas of signal processing: speech analysis, speech and audio coding, speech recognition, audio feature recognition, and source separation, with regularization used to adapt each method to the target application. Speech analysis, in addition to its own importance, also provides signal representations and model parameters that the other areas require. Novel types of time-frequency decomposition and modification, together with autoregressive analysis, will be explored for coding short-term spectra, mainly by means of vector quantization techniques, density mixture models and optimization. Parameters and representations of the speech signal will also be used to build models for defining non-intrusive speech quality measures. Several parametric representations will also be applied to audio source separation tasks, occasionally including identification of the speaker or the instrument, conditioned on other sources of knowledge or constraints. (AU)

Scientific publications (10)
(References retrieved automatically from Web of Science and SciELO through information on FAPESP grants and their corresponding numbers as mentioned in the publications by the authors)
AFFONSO, EMMANUEL T.; NUNES, RODRIGO D.; ROSA, RENATA L.; PIVARO, GABRIEL F.; RODRIGUEZ, DEMOSTENES Z. Speech Quality Assessment in Wireless VoIP Communication Using Deep Belief Network. IEEE ACCESS, v. 6, p. 77022-77032. (15/24496-0, 15/25512-0)
RAMIREZ, MIGUEL ARJONA. Non-Negative Temporal Decomposition Regularization With an Augmented Lagrangian. IEEE SIGNAL PROCESSING LETTERS, v. 23, n. 5, p. 663-667. (12/24789-0, 15/25512-0)
BEGAZO, DANTE COAQUIRA; RODRIGUEZ, DEMOSTENES ZEGARRA; RAMIREZ, MIGUEL ARJONA; IEEE. No-reference Video Quality Metric based on the Packet Delay Variation Parameter. 2016 IEEE INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS - 20TH IEEE ISCE, v. N/A, p. 2-pg. (15/25512-0)
RODRIGUEZ, DEMOSTENES Z.; ROSA, RENATA L.; ALMEIDA JR, FRANCISCONE L.; MITTAG, GABRIEL; MOELLER, SEBASTIAN. Speech Quality Assessment in Wireless Communications With MIMO Systems Using a Parametric Model. IEEE ACCESS, v. 7, p. 35719-35730. (15/24496-0, 15/25512-0)
RODRIGUEZ, DEMOSTENES ZEGARRA; RAMIREZ, MIGUEL ARJONA; BERNARDES, LEONARDO FERNANDES; MITTAG, GABRIEL; MOELLER, SEBASTIAN; IEEE. Impact of FEC codes on speech communication quality using WB E-model algorithm. 2019 WIRELESS DAYS (WD), v. N/A, p. 4-pg. (15/24496-0, 15/25512-0)
RAMIREZ, MIGUEL ARJONA. Hybrid Autoregressive Resonance Estimation and Density Mixture Formant Tracking Model. IEEE ACCESS, v. 6, p. 30217-30224. (15/25512-0)
RODRIGUEZ, DEMOSTENES ZEGARRA; MOELLER, SEBASTIAN; IEEE. Speech Quality Parametric Model that Considers Wireless Network Characteristics. 2019 ELEVENTH INTERNATIONAL CONFERENCE ON QUALITY OF MULTIMEDIA EXPERIENCE (QOMEX), v. N/A, p. 6-pg. (15/24496-0, 15/25512-0)
AFFONSO, EMMANUEL T.; ROSA, RENATA L.; RODRIGUEZ, DEMOSTENES Z. Speech Quality Assessment Over Lossy Transmission Channels Using Deep Belief Networks. IEEE SIGNAL PROCESSING LETTERS, v. 25, n. 1, p. 70-74. (15/25512-0, 15/24496-0)
RODRIGUEZ, DEMOSTENES Z.; PIVARO, GABRIEL F.; ROSA, RENATA L.; MITTAG, GABRIEL; MOELLER, SEBASTIAN; IEEE. Improving a Parametric Model for Speech Quality Assessment in Wireless Communication Systems. 2022 25TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION 2022), v. N/A, p. 5-pg. (15/24496-0, 15/25512-0)
RODRIGUEZ, DEMOSTENES Z.; PIVARO, GABRIEL F.; ROSA, RENATA L.; MITTAG, GABRIEL; MOELLER, SEBASTIAN; IEEE. Quantifying the Quality Improvement of MIMO Transmission Systems in VoIP Communication. 2022 25TH INTERNATIONAL CONFERENCE ON INFORMATION FUSION (FUSION 2022), v. N/A, p. 5-pg. (15/24496-0, 15/25512-0)