Convert audio to text using automatic speech recognition
Generate Vietnamese voice from text and sample audio
Generate Vietnamese voice from text and audio sample