SCRIPT: Speech Synthesis for Spoken Content Production
How can we support the research of hybrid text-to-speech synthesis technologies in low-resource languages?
SCRIPT is a three-year research and innovation project looking to develop synthetic voices for low-resource languages. The idea for SCRIPT originated at a language technology #newsHACK that we hosted in collaboration with BBC Connected Studio.
The BBC is a SCRIPT project partner, and is collaborating with the University of Edinburgh's Centre for Speech Technology Research (CSTR), where the project began in January 2017.
Our aim in partnering with the CSTR is to support research into hybrid text-to-speech synthesis technologies for low-resource languages.
The CSTR is researching how to integrate two methods for producing synthetic voices: unit selection, which assembles speech from segments of recorded audio, and parametric text-to-speech based on deep neural networks. The goal is to combine the broadcast-quality, "natural"-sounding voice recordings used in unit selection with deep neural network technology that enables parametric changes to tone, pitch and speed. We have decided to focus our effort on creating voices in Hausa, Swahili and Bengali, all important languages for the BBC World Service.
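To make the hybrid idea more concrete, here is a minimal Python sketch of one common way such systems are described: a parametric model predicts the pitch and duration wanted at each position, and a unit-selection search then picks the recorded segments that best match those predictions while joining smoothly. This is an illustration only, not the CSTR's implementation. The `Unit` and `Target` classes, the feature set, the cost weights and the toy inventory are all assumptions made for the example, and the target values are hard-coded where a real system would take them from a neural network.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Unit:
    """A recorded speech segment (hypothetical, heavily simplified)."""
    name: str
    phone: str
    pitch_hz: float
    duration_ms: float


@dataclass(frozen=True)
class Target:
    """What a parametric model would predict for one position (assumed features)."""
    phone: str
    pitch_hz: float
    duration_ms: float


def target_cost(unit, target):
    # How far the recording is from the predicted pitch and duration.
    # The 1/10 weights are arbitrary choices for this sketch.
    return (abs(unit.pitch_hz - target.pitch_hz) / 10.0
            + abs(unit.duration_ms - target.duration_ms) / 10.0)


def join_cost(prev_unit, next_unit):
    # Penalise audible pitch jumps between neighbouring recordings.
    return abs(prev_unit.pitch_hz - next_unit.pitch_hz) / 10.0


def select_units(targets, inventory):
    """Pick one recorded unit per target, minimising total cost (Viterbi search)."""
    if not targets:
        return []
    candidates = [[u for u in inventory if u.phone == t.phone] for t in targets]
    if any(not c for c in candidates):
        raise ValueError("no recorded unit for at least one target phone")

    # costs[i][j]: cheapest total cost of any path ending in candidates[i][j].
    costs = [[target_cost(u, targets[0]) for u in candidates[0]]]
    back = [[None] * len(candidates[0])]
    for i in range(1, len(targets)):
        row_costs, row_back = [], []
        for unit in candidates[i]:
            options = [costs[i - 1][k] + join_cost(prev, unit)
                       for k, prev in enumerate(candidates[i - 1])]
            k_best = min(range(len(options)), key=options.__getitem__)
            row_costs.append(options[k_best] + target_cost(unit, targets[i]))
            row_back.append(k_best)
        costs.append(row_costs)
        back.append(row_back)

    # Trace back from the cheapest final unit to recover the whole sequence.
    j = min(range(len(costs[-1])), key=costs[-1].__getitem__)
    path = []
    for i in range(len(targets) - 1, -1, -1):
        path.append(candidates[i][j])
        j = back[i][j]
    path.reverse()
    return path


# Toy inventory: two recordings of each of two phones, at low and high pitch.
INVENTORY = [
    Unit("a_low", "a", 110.0, 90.0),
    Unit("a_high", "a", 180.0, 100.0),
    Unit("b_low", "b", 115.0, 80.0),
    Unit("b_high", "b", 190.0, 80.0),
]

# Stand-ins for model predictions: the same two phones at two pitch settings.
low_targets = [Target("a", 120.0, 100.0), Target("b", 120.0, 80.0)]
high_targets = [Target("a", 185.0, 100.0), Target("b", 185.0, 80.0)]

print([u.name for u in select_units(low_targets, INVENTORY)])
print([u.name for u in select_units(high_targets, INVENTORY)])
```

In this toy setup, raising the predicted pitch changes which recordings are chosen, which is the sense in which the parametric side "drives" the selection of natural recordings. A real system would use far richer acoustic features and a much larger inventory.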
This project has been developed as part of the multilingual solutions workstream.