Authors : Ihsan Al-Hassani, Oumayma Al-Dakkak and Abdlnaser Assami
Abstract: Speech segmentation is the process of breaking speech signal into distinct acoustic blocks that could be words, syllabus or phonemes. Phonetic segmentation is about finding the exact boundaries for the different phonemes that composes a specific speech signal. Phonetic segmentation is crucial for many applications basically speech recognition ASR and speech to text systems STT as ASR needs phonetically transcribed training corpus, STT needs phoneme database. Phonetic segmentation techniques are divided into two major categories: Text-Dependent (TD) and Text-Independent (TI). In the text-dependent segmentation techniques, the phonetic annotation of the speech signal is already known and we only need to find the boundaries of each phoneme segment. In this study, we present a thorough survey of the different algorithm and techniques proposed so far for solving the problem of text-dependent phonetic segmentation.
Ihsan Al-Hassani, Oumayma Al-Dakkak and Abdlnaser Assami, 2021. An Inclusive Survey for Text Dependent Automatic Speech Segmentation Techniques. Research Journal of Applied Sciences, 16: 65-74.