Advanced search
Start date
Betweenand

Speech Tera Ltda: development of computational resources for speech technologies

Grant number: 14/21750-0
Support type:Research Grants - Innovative Research in Small Business - PIPE
Duration: September 01, 2015 - July 31, 2016
Field of knowledge:Interdisciplinary Subjects
Principal Investigator:Vanessa Marquiafável Serrani
Grantee:Vanessa Marquiafável Serrani
Company:SpeechTera Desenvolvimento de Programas para Computadores Ltda
City: Araras
Associated grant(s):16/08355-0 - SpeechTera Ltda: development of computational resources for speech technologies, AP.PIPE
Associated scholarship(s):15/21900-5 - Speech Tera Ltda: development of computational resources for speech technologies, BP.PIPE

Abstract

This project aims to create computational resources for the development of Speech Technologies, focused on Brazilian Portuguese. With the development of robust algorithms to treat speech databases, applications involving recognition or speech synthesis, respectively, ASR (Automatic Speech Recognition) and TTS (Text-to-Speech), have gained more space in our everyday life and become increasingly accurate. However, although Brazilian Portuguese is the sixth most spoken language in the world, resources available for the development of speech processing technologies for that for that language are scarce: there are few databases, grapheme-phoneme converters and acoustic or pronunciation models on the market. This project seeks to act precisely in that gap. Our purpose is to develop computational resources in order to encourage the development of speech technologies for Brazilian Portuguese, both for the industry and academia. Its proposes are the development of four types of products: i) speech corpora ii) acoustic models, iii) models of pronunciation and iv) grapheme-phoneme converters. For speech corpora, we propose methods of collecting and annotating data based on crawling and crowd-sourcing, that will enable the development of speech resources at the most competitive and affordable prices that currently in existence on the market. State of the art techniques will be employed in the preparation of the acoustic models, like Deep Neural Networks; and grapheme-phoneme converters as hybrid models based on manual rules and machine learning techniques (SVM, CART, MARS). The proposed business model focuses on a business-to-business approach (B2B), focused on information processing; speech and natural language processing technology companies, especially with the start-ups niche in mind. (AU)

Articles published in Agência FAPESP Newsletter about the research grant
Startup develops computational resources for speech technologies 
Articles published in Pesquisa para Inovação FAPESP about research grant:
Startup develops computational resources for speech technologies 
Distribution map of accesses to this page
Click here to view the access summary to this page.