Speech Quality Assessment Over Lossy Transmission Channels Using Deep Belief Networks

Affonso, Emmanuel T.; Rosa, Renata L.; Rodriguez, Demostenes Z.

Full text
Author(s):	Affonso, Emmanuel T. ^[1] ; Rosa, Renata L. ^[1] ; Rodriguez, Demostenes Z. ^[1] Total Authors: 3
Affiliation:	^[1] Univ Fed Lavras, BR-37200000 Lavras, MG - Brazil Total Affiliations: 1
Document type:	Journal article
Source:	IEEE SIGNAL PROCESSING LETTERS; v. 25, n. 1, p. 70-74, JAN 2018.
Web of Science Citations:	10
Abstract
Nowadays, there are several telephone services based on IP networks. However, the networks can present many disturbances, such as packet loss rate (PLR), which is one of the most impairing network factors. An impaired speech communication affects the users' quality of experience; hence, the assessment of speech quality is relevant to the telephone operators. Therefore, the determination of a methodology to predict a speech quality with a higher accuracy in telephone services is relevant. In this context, this letter introduces a novel nonintrusive speech quality classifier (SQC) model based on deep belief networks (DBN), in which the support vector machine with radial basis function kernel is the classifier applied in DBN, in order to identify four speech quality classes. A speech database was built, based on unimpaired speech files of public databases, in which different PLR models and values are applied, and a standardized intrusive method is used to calculate the index quality of each file. Results show that SQC largely overcomes the results obtained by ITU-T Recommendation P.563. Also, subjective tests are performed to validate the SQC performance, and it reached an accuracy of 95% on speech quality classification. Furthermore, a solution architecture is introduced, demonstrating the usefulness and flexibility of the proposed SQC. (AU)

FAPESP's process:	15/25512-0 - Conditional Analysis of Audio and Speech Signals for Coding and Recognition
Grantee:	Miguel Arjona Ramírez
Support Opportunities:	Regular Research Grants


FAPESP's process:	15/24496-0 - Evaluation of the service of communication operators using the voice Quality Index
Grantee:	Demostenes Zegarra Rodriguez
Support Opportunities:	Regular Research Grants

Short URL