
Speech Tera Ltda: development of computational resources for speech technologies

Abstract

This project aims to create computational resources for the development of speech technologies, with a focus on Brazilian Portuguese. With the development of robust algorithms for processing speech databases, applications involving speech recognition and speech synthesis, that is, ASR (Automatic Speech Recognition) and TTS (Text-to-Speech), respectively, have gained ground in everyday life and become increasingly accurate. However, although Brazilian Portuguese is the sixth most spoken language in the world, the resources available for developing speech processing technologies for the language are scarce: there are few databases, grapheme-phoneme converters, and acoustic or pronunciation models on the market. This project seeks to fill precisely that gap. Our purpose is to develop computational resources that encourage the development of speech technologies for Brazilian Portuguese, both in industry and in academia. The project proposes to develop four types of products: i) speech corpora, ii) acoustic models, iii) pronunciation models, and iv) grapheme-phoneme converters. For the speech corpora, we propose methods of collecting and annotating data based on crawling and crowdsourcing, which will enable the development of speech resources at the most competitive and affordable prices currently on the market. State-of-the-art techniques, such as Deep Neural Networks, will be employed in preparing the acoustic models, and the grapheme-phoneme converters will be built as hybrid models based on manual rules and machine learning techniques (SVM, CART, MARS). The proposed business model takes a business-to-business (B2B) approach, aimed at information processing, speech, and natural language processing technology companies, with the start-up niche especially in mind. (AU)

Articles published in Pesquisa para Inovação FAPESP about research grant:
Startup develops computational resources for speech technologies 