Publications

2018

Building open javanese and sundanese corpora for multilingual text-tospeech

J. A. E. Wibawa, S. Sarin, C. F. Li, K. Pipatsrisawat, K. Sodimana, O. Kjartansson, A. Gutkin, M. Jansche, and L. Ha

Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)

2023

A Step-by-Step Process for Building TTS Voices Using Open Source Data and Framework for Bangla, Khmer, Nepali, Javanese, Sinhala, and Sundanese

K. Sodimana, P. De Silva, S. Sarin, K. Pipatsrisawat

Google Research

2023

Text Normalization for Bangla, Khmer, Nepali, Javanese, Sinhala and Sundanese Text-to-Speech Systems

K. Sodimana, P. De Silva, R. Sproat, T. Wattanavekin, A. Gutkin, K. Pipatsrisawat

Google Research

2023

Google Crowdsourced Speech Corpora and Related Open-Source Resources for Low-Resource Languages and Dialects: An Overview

Alena Butryna, Shan{-}Hui Cathy Chu, Isin Demirsahin, Alexander Gutkin, Linne Ha, Fei He, Martin Jansche, Cibu Johny, Anna Katanova, Oddur Kjartansson, Chenfang Li, Tatiana Merkulova, Yin May Oo, Knot Pipatsrisawat, Clara Rivera, Supheakmungkol Sarin, Pasindu De Silva, Keshan Sodimana, Richard Sproat, Theeraphol Wattanavekin, Jaka Aris Eko Wibawa

Google Research