Analysis of audio and speech signals for reconstruction and recognition

Grant number: 12/24789-0
Support Opportunities: Regular Research Grants
Start date: July 01, 2013
End date: December 31, 2015
Field of knowledge: Engineering - Electrical Engineering - Telecommunications
Principal Investigator: Miguel Arjona Ramírez
Grantee: Miguel Arjona Ramírez
Host Institution: Escola Politécnica (EP). Universidade de São Paulo (USP). São Paulo, SP, Brazil
Associated researchers: Mario Minami

Abstract

This research plan comprises several related themes: speech analysis, speech and audio coding, speech recognition, audio feature recognition, and speaker identification. Speech analysis is important in its own right and also provides the signal and parameter representations required by the other themes. Novel types of autoregressive analysis will be explored for coding the short-term spectral envelope of speech signals, mainly by means of vector quantization techniques and Gaussian mixture models. Several parametric representations will also be applied to speaker identification or audio source separation tasks, taking into account the dynamic aspects intrinsic to these tasks and applying long-term modeling tools. (AU)

Scientific publications
(References retrieved automatically from Web of Science and SciELO through information on FAPESP grants and their corresponding numbers as mentioned in the publications by the authors)
RAMIREZ, MIGUEL ARJONA. Non-Negative Temporal Decomposition Regularization With an Augmented Lagrangian. IEEE SIGNAL PROCESSING LETTERS, v. 23, n. 5, p. 663-667, . (12/24789-0, 15/25512-0)
RAMIREZ, MIGUEL ARJONA. Intra-Predictive Switched Split Vector Quantization of Speech Spectra. IEEE SIGNAL PROCESSING LETTERS, v. 20, n. 8, p. 791-794, . (12/24789-0)
AFFONSO, EMMANUEL T.; RODRIGUEZ, DEMOSTENES Z.; ROSA, RENATA L.; ANDRADE, THIAGO; BRESSAN, GRACA. Voice Quality Assessment in Mobile Devices Considering Different Fading Models. 2016 IEEE INTERNATIONAL SYMPOSIUM ON CONSUMER ELECTRONICS - 20TH IEEE ISCE, v. N/A, p. 2-pg., . (12/24789-0)