How to identify word ends (boundary event ) while using non-native voice TTS

Hello Team,

I am working on a web app which would require a non-native voice TTS with
word by word highlighting in sync with the voice.
I hit the first hurdle when the boundary event failed to work with
non-native voices. Is there any way to identify word ends or to get the
boundary event to work while using non-native voices?
Please guide me.
Thank you in advance.

*KC*

<https://www.avast.com/sig-email?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=webmail>
Virus-free.
www.avast.com
<https://www.avast.com/sig-email?utm_medium=email&utm_source=link&utm_campaign=sig-email&utm_content=webmail>
<#DAB4FAD8-2DD7-40BB-A1B8-4E2AA1F9FDF2>

Received on Monday, 26 April 2021 23:10:37 UTC