Combined Method for Informative Feature Selection for Speech Pathology Detection
https://doi.org/10.35596/1729-7648-2023-21-4-110-117
Abstract
The task of detecting vocal abnormalities is characterized by a small amount of available data for training, as a consequence of which classification systems that use low-dimensional data are the most relevant. We propose to use LASSO (least absolute shrinkage and selection operator) and BSS (backward stepwise selection) methods together to select the most significant features for the detection of vocal pathologies, in particular amyotrophic lateral sclerosis. Features based on fine-frequency cepstral coefficients, traditionally used in speech signal processing, and features based on discrete estimation of the autoregressive spectrum envelope are used. Spectral features based on the autoregressive process envelope spectrum are extracted using the generative method, which involves calculating a discrete Fourier transform of the report sequence generated using the autoregressive model of the input voice signal. The sequence is generated by the autoregressive model so as to account for the periodic nature of the Fourier transform. This improves the accuracy of the spectrum estimation and reduces the spectral leakage effect. Using LASSO in conjunction with BSS allowed us to improve the classification efficiency using a smaller number of features as compared to using the LASSO method alone.
About the Authors
D. S. LikhachovBelarus
Likhachov Denis Sergeevich - Cand. of Sci., Associate Professor, Associate Professor at Computer Engineering De partment.
220013, Minsk, P. Brovki St., 6. Tel.: +375 17 293-85-05
M. I. Vashkevich
Belarus
Maxim I. Vashkevich - Dr. of Sci. (Tech.), Associate Professor at Computer Engineering Department.
220013, Minsk, P. Brovki St., 6
N. A. Petrovsky
Belarus
Nick A. Petrovsky - Cand. of Sci., Associate Professor, Associate Professor at Computer Engineering De partment.
220013, Minsk, P. Brovki St., 6
E. S. Azarov
Belarus
Elias S. Azarov - Dr. of Sci. (Tech.), Associate Professor, Head of Computer Engineering Department.
220013, Minsk, P. Brovki St., 6
References
1. Rabiner L. R., Juang B. H. (1993) Fundamentals of Speech Recognition. Pearson Education.
2. Benba A., Jilbab A., Hammouch A. (2016) Discriminating between Patients with Parkinson’s and Neurological Diseases Using Cepstral Analysis. IEEE Transactions on Neural Systems and Rehabilitation Engineering. 24 (10), 1100–1108.
3. Vashkevich M., Rushkevich Y. (2021) Classification of ALS Patients Based on Acoustic Analysis of Sustained Vowel Phonations. Biomedical Signal Processing and Control. 65, 1–14.
4. Kiernan M. C. (2011) Amyotrophic Lateral Sclerosis. Lancet. 377 (9769), 942–955.
5. Yunusova Y. (2013) Detection of Bulbar ALS Using a Comprehensive Speech Assessment Battery. Proceedings of the International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications. 217–220.
6. Spangler T. (2017) Fractal Features for Automatic Detection of Dysarthria. IEEE EMBS International Conference on Biomedical Health Informatics. 437–440.
7. Likhachov D. S., Vashkevich M. I., Petrovsky N. A., Azarov E. S. (2023) Small-Size Spectral Features for Machine Learning in Voice Signal Analysis and Classification Tasks. Informatics. (20), 102–112. DOI: 10.37661/1816-0301-2023-20-1-102-112 (in Russian).
8. Markel J. D., Gray A. H. (1976) Linear Prediction of Speech. Berlin, New York, Springer-Verlag. 290.
9. Vashkevich M. I., Likhachov D. S., Azarov E. S. (2022) Voice Analysis and Classification System Based on Perturbation Parameters and Cepstral Presentation in Psychoacoustic Scales. Doklady BGUIR. 20 (1), 73–82. DOI: 10.35596/1729-7648-2022-20-1-73-82 (in Russian).
10. Vashkevich M. I., Burak A. A., Kanoika N. S., Daldova V. S. (2020) Analysis of Acoustic Voice Parameters for Larynx Pathology Detection. Informatics. 17 (1), 78–86 (in Russian).
11. Flach P. (2012) Machine Learning: the Art and Science of Algorithms that Make Sense of Data. Great Britain, Cambridge University Press Publ. 416.
12. James G., Witten D., Hastie T., Tibshirani R. (2013) An Introduction to Statistical Learning with Applications in R. Springer Publ. 440.
13. Kotu V., Deshpande B. (2019) Data Science: Concepts and Practice. 2 ed. USA, Morgan Kaufmann Publishers an Imprint of Elsevier.
14. Voice Database Used in the Article Classification of ALS Patients Based on Acoustic Analysis of Sustained Vowel Phonations. Available: https://github.com/Mak-Sim/Minsk2020_ALS_database (Accessed 12 May 2023).
15. Kunjan S., Grummett T. S., Pope K. J., Powers D. M. W., Fitzgibbon S. P., Lewis T. W. (2021) The Necessity of Leave One Subject Out (LOSO) Cross Validation for EEG Disease Diagnosis. Brain Informatics. Springer Publ. 558–567.
Review
For citations:
Likhachov D.S., Vashkevich M.I., Petrovsky N.A., Azarov E.S. Combined Method for Informative Feature Selection for Speech Pathology Detection. Doklady BGUIR. 2023;21(4):110-117. (In Russ.) https://doi.org/10.35596/1729-7648-2023-21-4-110-117