MBROLA 



a joint cooperation between ISTC-SPFD CNR & TCTS Lab

The goal of the MBROLA project is to obtain a set of high-quality speech synthesizers for as many languages as possible, free for use in non-commercial applications. The ultimate goal is to boost academic research on speech synthesis, and particularly on prosody generation, known as one of the biggest challenges in Text-to-Speech synthesis for the years to come. Brazilian Portuguese, Breton, British English, Dutch, French, German, Romanian, Spanish, Swedish and now Italian are already available as full software multilingual speech synthesizers (i.e., the DSP part of a TTS system). Many other languages are in preparation.

MBROLA is a speech synthesizer based on the concatenation of diphones, developed at the TCTS Lab of the Faculté Polytechnique de Mons in Mons, Belgium.

Central to the MBROLA project is MBROLA, a speech synthesizer based on the concatenation of diphones. It takes a list of phonemes as input, together with prosodic information (the duration of each phoneme and a piecewise linear description of pitch), and produces 16-bit linear speech samples at the sampling frequency of the diphone database used. It is therefore NOT a Text-To-Speech (TTS) synthesizer, since it does not accept raw text as input. The synthesizer is provided free of charge, for non-commercial, non-military applications only.
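The input described above can be illustrated with a short Python sketch. It assumes the plain-text layout commonly used for MBROLA input (".pho" files): one phoneme per line, with its symbol, its duration in milliseconds, and optional pairs of pitch targets given as (position in percent of the duration, pitch in Hz). The function names, the phoneme symbols and the numeric values are hypothetical and chosen only for illustration; the actual symbol set depends on the diphone database used. Holding the pitch constant outside the first and last target is an assumption of this sketch, not a statement about the synthesizer's behaviour.

```python
from typing import List, Tuple

# One phoneme: (symbol, duration in ms, [(position in % of duration, pitch in Hz), ...])
Phoneme = Tuple[str, int, List[Tuple[int, int]]]


def to_pho(phonemes: List[Phoneme]) -> str:
    """Format phonemes as lines of: symbol, duration, then flattened pitch pairs."""
    lines = []
    for symbol, duration_ms, targets in phonemes:
        fields = [symbol, str(duration_ms)]
        for position_pct, pitch_hz in targets:
            fields += [str(position_pct), str(pitch_hz)]
        lines.append(" ".join(fields))
    return "\n".join(lines) + "\n"


def pitch_targets(phonemes: List[Phoneme]) -> List[Tuple[float, float]]:
    """Convert per-phoneme percentage positions to absolute (time_ms, pitch_hz) points."""
    points = []
    start_ms = 0.0
    for _, duration_ms, targets in phonemes:
        for position_pct, pitch_hz in targets:
            points.append((start_ms + duration_ms * position_pct / 100.0, float(pitch_hz)))
        start_ms += duration_ms
    return points


def pitch_at(points: List[Tuple[float, float]], t_ms: float) -> float:
    """Piecewise linear pitch at time t_ms (held constant outside the targets: an assumption)."""
    if not points:
        raise ValueError("no pitch targets given")
    if t_ms <= points[0][0]:
        return points[0][1]
    for (t0, f0), (t1, f1) in zip(points, points[1:]):
        if t_ms <= t1:
            if t1 == t0:
                return f1
            return f0 + (f1 - f0) * (t_ms - t0) / (t1 - t0)
    return points[-1][1]


# Hypothetical example: silence, three phonemes, silence ("_" denotes silence).
example: List[Phoneme] = [
    ("_", 100, []),
    ("tS", 90, []),
    ("a", 120, [(0, 130), (100, 110)]),
    ("o", 150, [(50, 100)]),
    ("_", 100, []),
]

if __name__ == "__main__":
    print(to_pho(example), end="")
    print(pitch_at(pitch_targets(example), 250.0))
```

The text produced by `to_pho` is the kind of phoneme list the synthesizer consumes, while `pitch_at` shows how a few pitch targets define a continuous contour by linear interpolation between them.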

Diphone databases tailored to the MBROLA format are needed to run the synthesizer. French voices have been made available by the authors of MBROLA, and the MBROLA project itself has been organized to encourage other research labs and companies to share their diphone databases. And, finally, MBROLA speaks Italian!

 

MBROLA Italian Diphone DataBase

The new male Italian database, developed for RE-LPC Festival and OGI RE-LPC for Festival, has been converted to MBROLA format by TCTS Lab, and the FESTIVAL modules specifically created for the MBROLA synthesizer have been adapted to Italian. Various scripts for automatic segmentation, diphone extraction and MBROLA format adaptation have been developed to speed up the creation of a new voice.

  • (the new Italian MBROLA database and scripts, developed by ISTC-SPFD, are now available for download)
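To give a rough idea of what diphone extraction has to cover, the following Python sketch lists the diphones needed to render a phoneme sequence. A diphone spans the transition from one phoneme to the next, so a sequence of N phonemes requires N-1 diphones, and each of them must exist in the database. The function names, the "a-b" naming and the tiny inventory are hypothetical and used only for illustration; they do not describe the actual scripts or the MBROLA database format.

```python
from typing import Iterable, List, Set


def diphones(phonemes: List[str]) -> List[str]:
    """Return the consecutive phoneme pairs (diphones) needed for a sequence."""
    return [f"{left}-{right}" for left, right in zip(phonemes, phonemes[1:])]


def missing_diphones(phonemes: List[str], inventory: Iterable[str]) -> List[str]:
    """Return the required diphones that are absent from a given inventory."""
    available: Set[str] = set(inventory)
    return [d for d in diphones(phonemes) if d not in available]


# Hypothetical sequence with leading and trailing silence ("_").
sequence = ["_", "tS", "a", "o", "_"]

# Hypothetical, deliberately incomplete inventory.
inventory = ["_-tS", "tS-a", "o-_"]

if __name__ == "__main__":
    print(diphones(sequence))
    print(missing_diphones(sequence, inventory))
```

A check of this kind is one simple way to verify that a recorded corpus covers every transition a voice needs before the database is built.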


References

T. Dutoit, H. Leich, "MBR-PSOLA: Text-To-Speech Synthesis based on an MBE Re-Synthesis of the Segments Database", Speech Communication, Elsevier Publisher, November 1993, vol. 13, n° 3-4.

T. Dutoit, An Introduction to Text-To-Speech Synthesis, Kluwer Academic Publishers, 1996, 326 pp.

T. Dutoit, V. Pagel, N. Pierret, F. Bataille, O. Van Der Vrecken, "The MBROLA Project: Towards a Set of High-Quality Speech Synthesizers Free of Use for Non-Commercial Purposes", Proc. ICSLP'96, Philadelphia, vol. 3, pp. 1393-1396.


Author

Piero Cosi
 

ISTC-SPFD CNR
Istituto di Scienze e Tecnologie della Cognizione
Sezione di Padova "Fonetica e Dialettologia"
Consiglio Nazionale delle Ricerche
e-mail: cosi@pd.istc.cnr.it

Project Leader

