Preview

Doklady BGUIR

Advanced search

Combined Method for Informative Feature Selection for Speech Pathology Detection

https://doi.org/10.35596/1729-7648-2023-21-4-110-117

Abstract

The task of detecting vocal abnormalities is characterized by a small amount of available data for training, as a consequence of which classification systems that use low-dimensional data are the most relevant. We propose to use LASSO (least absolute shrinkage and selection operator) and BSS (backward stepwise selection) methods together to select the most significant features for the detection of vocal pathologies, in particular amyotrophic lateral sclerosis. Features based on fine-frequency cepstral coefficients, traditionally used in speech signal processing, and features based on discrete estimation of the autoregressive spectrum envelope are used. Spectral features based on the autoregressive process envelope spectrum are extracted using the generative method, which involves calculating a discrete Fourier transform of the report sequence generated using the autoregressive model of the input voice signal. The sequence is generated by the autoregressive model so as to account for the periodic nature of the Fourier transform. This improves the accuracy of the spectrum estimation and reduces the spectral leakage effect. Using LASSO in conjunction with BSS allowed us to improve the classification efficiency using a smaller number of features as compared to using the LASSO method alone.

About the Authors

D. S. Likhachov
Belarusian State University of Informatics and Radioelectronics
Belarus

Likhachov Denis Sergeevich - Cand. of Sci., Associate Professor, Associate Professor at Computer Engineering De partment.

220013, Minsk, P. Brovki St., 6. Tel.: +375 17 293-85-05



M. I. Vashkevich
Belarusian State University of Informatics and Radioelectronics
Belarus

Maxim I. Vashkevich - Dr. of Sci. (Tech.), Associate Professor at Computer Engineering Department.

220013, Minsk, P. Brovki St., 6



N. A. Petrovsky
Belarusian State University of Informatics and Radioelectronics
Belarus

Nick A. Petrovsky - Cand. of Sci., Associate Professor, Associate Professor at Computer Engineering De partment.

220013, Minsk, P. Brovki St., 6



E. S. Azarov
Belarusian State University of Informatics and Radioelectronics
Belarus

Elias S. Azarov - Dr. of Sci. (Tech.), Associate Professor, Head of Computer Engineering Department.

220013, Minsk, P. Brovki St., 6



References

1. Rabiner L. R., Juang B. H. (1993) Fundamentals of Speech Recognition. Pearson Education.

2. Benba A., Jilbab A., Hammouch A. (2016) Discriminating between Patients with Parkinson’s and Neurological Diseases Using Cepstral Analysis. IEEE Transactions on Neural Systems and Rehabilitation Engineering. 24 (10), 1100–1108.

3. Vashkevich M., Rushkevich Y. (2021) Classification of ALS Patients Based on Acoustic Analysis of Sustained Vowel Phonations. Biomedical Signal Processing and Control. 65, 1–14.

4. Kiernan M. C. (2011) Amyotrophic Lateral Sclerosis. Lancet. 377 (9769), 942–955.

5. Yunusova Y. (2013) Detection of Bulbar ALS Using a Comprehensive Speech Assessment Battery. Proceedings of the International Workshop on Models and Analysis of Vocal Emissions for Biomedical Applications. 217–220.

6. Spangler T. (2017) Fractal Features for Automatic Detection of Dysarthria. IEEE EMBS International Conference on Biomedical Health Informatics. 437–440.

7. Likhachov D. S., Vashkevich M. I., Petrovsky N. A., Azarov E. S. (2023) Small-Size Spectral Features for Machine Learning in Voice Signal Analysis and Classification Tasks. Informatics. (20), 102–112. DOI: 10.37661/1816-0301-2023-20-1-102-112 (in Russian).

8. Markel J. D., Gray A. H. (1976) Linear Prediction of Speech. Berlin, New York, Springer-Verlag. 290.

9. Vashkevich M. I., Likhachov D. S., Azarov E. S. (2022) Voice Analysis and Classification System Based on Perturbation Parameters and Cepstral Presentation in Psychoacoustic Scales. Doklady BGUIR. 20 (1), 73–82. DOI: 10.35596/1729-7648-2022-20-1-73-82 (in Russian).

10. Vashkevich M. I., Burak A. A., Kanoika N. S., Daldova V. S. (2020) Analysis of Acoustic Voice Parameters for Larynx Pathology Detection. Informatics. 17 (1), 78–86 (in Russian).

11. Flach P. (2012) Machine Learning: the Art and Science of Algorithms that Make Sense of Data. Great Britain, Cambridge University Press Publ. 416.

12. James G., Witten D., Hastie T., Tibshirani R. (2013) An Introduction to Statistical Learning with Applications in R. Springer Publ. 440.

13. Kotu V., Deshpande B. (2019) Data Science: Concepts and Practice. 2 ed. USA, Morgan Kaufmann Publishers an Imprint of Elsevier.

14. Voice Database Used in the Article Classification of ALS Patients Based on Acoustic Analysis of Sustained Vowel Phonations. Available: https://github.com/Mak-Sim/Minsk2020_ALS_database (Accessed 12 May 2023).

15. Kunjan S., Grummett T. S., Pope K. J., Powers D. M. W., Fitzgibbon S. P., Lewis T. W. (2021) The Necessity of Leave One Subject Out (LOSO) Cross Validation for EEG Disease Diagnosis. Brain Informatics. Springer Publ. 558–567.


Review

For citations:


Likhachov D.S., Vashkevich M.I., Petrovsky N.A., Azarov E.S. Combined Method for Informative Feature Selection for Speech Pathology Detection. Doklady BGUIR. 2023;21(4):110-117. (In Russ.) https://doi.org/10.35596/1729-7648-2023-21-4-110-117

Views: 286


Creative Commons License
This work is licensed under a Creative Commons Attribution 4.0 License.


ISSN 1729-7648 (Print)
ISSN 2708-0382 (Online)