An End to End Bilingual TTS System for Fongbe and Yoruba

BOCO, Charbel Arnaud Cedrique Y.; DAGBA, THÉOPHILE KOMLAN

doi:10.1007/978-3-031-16210-7

dc.contributor.author	BOCO, Charbel Arnaud Cedrique Y.
dc.contributor.author	DAGBA, THÉOPHILE KOMLAN
dc.date.accessioned	2026-06-02T16:06:57Z
dc.date.available	2026-06-02T16:06:57Z
dc.date.issued	2022
dc.description.abstract	This paper presents an end-to-end bilingual TTS system for Yoruba and Fongbe based on FastSpeech 2, a non-autoregressive model. Starting from this baseline, a simple concatenation of speaker, language and phoneme embeddings is used as input to the encoder and the decoder. The model was trained on a multi-speaker dataset collected for both languages. Two types of input were used: a phoneme representation shared between both languages and a language-specific phoneme representation. Experiments on both input representations show that results are smoother with the shared phoneme representation. With either input set, however, the proposed model was able to synthesize speech in each language with voice cloning ability. The model produces waveforms of good quality, with high fidelity and naturalness, for both languages. A comparison between the proposed bilingual system and the same model trained on a monolingual dataset shows that the bilingual dataset yields more accurate results.
dc.identifier.doi	10.1007/978-3-031-16210-7
dc.identifier.other	BECDB-12043
dc.identifier.uri	https://dspace.uac.bj/handle/123456789/10417
dc.language.iso	fr
dc.relation.ispartof	Advances in Computational Collective Intelligence
dc.subject	Bilingual text-to-speech · African language · Tonal language
dc.title	An End to End Bilingual TTS System for Fongbe and Yoruba
dc.type	Article

Collections

Articles
