Segmentation of speech on phonetic elements for systems of speech information protection
https://doi.org/10.35596/1729-7648-2019-123-5-66-71
Abstract
The article is devoted to the development of speech segmentation algorithm on phonetic elements for the synthesis of speech-like signals in speech information protection systems. The main attention is paid to establishing the boundaries of phonetic units of speech, taking into account the influence of this factor on the quality of the synthesized speech by the compilation method. The features of establishing the boundaries of phonemes for fused speech and the influence of this factor on the quality of synthesized speech on the basis of phonemes are considered. It is proposed to ensure the quality of synthesized speech beginning and ending phonemes at the segmentation set in the transition implementation of a signal through zero and in the synthesis of speech-like signals to use the spline function at the boundaries of segments phonemes.
About the Authors
Y. N. SeitkulovKazakhstan
PhD, director of the institute of information security and cryptology
S. N. Boranbayev
Kazakhstan
Eurasian national university named after L.N. Gumilyov
A. V. Patapovich
Belarus
Patapovich Aleksandr Vladimirovich - researcher of SRL 5.3 of R&D department.
20013, Minsk, P. Brovka st., 6, tel. +375-29-670-30-40
H. V. Davydau
Belarus
PhD, researcher of SRL 5.3 of R&D department.
20013, Minsk, P. Brovka st., 6
References
1. Sakoe H., Chiba S. Dynamic Programming Algorithm Optimization for Spoken Word Recognition // IEEE Transactions on Acoustics, Speech, and Signal Processing. 1978. Vol. ASSP-26, No. 1. P. 43-49.
2. Scharenborg O., Wan V., Ernestus M. Unsupervised speech segmentation: An analysis of the hypothesized phone boundaries // The Journal of the Acoustical Society of America. 2010. Vol. 127, No. 2. P. 1084-1095.
3. Gomez J.A., Calvo M. Improvements on automatic speech segmentation at the phonetic level // Materials of 16th Iberoamerican CongressProgress in Pattern Recognition, Image Analysis, Computer Vision and Applikations. 2011. P. 557-564.
4. Bemdt D.J., Clifford J. Using Dynamic Time Warping to FindPatterns in Time Series // AAAI Proc. knowledge discovery in databases. 1994. P. 359-370.
5. A Review: Automatic Speech Segmentation / Sakran A.E. [et al.] // International Jornal of Computer Science and Mobile Computing. 2017. Vol. 6, No. 4. P. 308-315.
6. Makowski R., Hossa R. Automatic speech signal segmentation based on the innovation adaptive filter // International Journal of Applied Mathematics and Computer Science. 2014. Vol. 24, No. 2. P. 259-270.
7. Kamarauskas J. Automatic Segmetation of Phonemes using Artificial Neural Networks // Elektronika ir Elektrotechnika. 2006. Vol. 72, No. 8. P. 39-42.
8. Automatic Silence/Unvoiced/Voiced Classification of Bangla Velar Phonemes: New Approach / Syed Akhter Hossain [et al.] // 8th ICCIT. Dhaka, 2005.
9. . Highly accurate phonetic segmentation using boundary correction models and system fusion / A. Stolcke [et al.] // 2014 IEEE International Conference on Acoustic, Speech and Signal Processing (ICASSP). IEEE, 2014. P. 5552-5556.
10. Method for protecting speech information / H.V. Davydau [et al.] // Doklady BGUIR. 2015. N° 8 (94). P. 107-110.
11. Rationale for the method of formation of the combined speech masking signals / Y. Seitkulov [et al.] // IEEE 8th International Conference on Application on Information and Communication Technologies (AICT). Astana, Kazakhstan, 2014.
12. Sorokin V.N. Segmentation of the period of the fundamental tone of a voice source // Acoustical Physics. 2016. Vol. 62, No. 2. P. 244-254.
13. Algoritym of forming speech base units using the method of dynamic programming / Seitkulov Y.N. [et al.] // Journal of Theoretical and Applied Information Technology. 2018. Vol. 96, No 23. P. 7928-7941.
Review
For citations:
Seitkulov Y.N., Boranbayev S.N., Patapovich A.V., Davydau H.V. Segmentation of speech on phonetic elements for systems of speech information protection. Doklady BGUIR. 2019;(5):66-71. (In Russ.) https://doi.org/10.35596/1729-7648-2019-123-5-66-71