Computational Analysis of the Structural Composition of Coronavirus Genomes
https://doi.org/10.35596/1729-7648-2023-21-2-104-113
Abstract
Ecological disasters, wars in regions with microbiological weapons depots, deforestation, domestication of wild animals, consumption of infected animals, contamination of water and food products and their components, experiments with viruses, deficiencies and other defects of the immune system in modern humans and other mammals became the impetus for the evolution of new dangerous and extremely dangerous viruses. Due to the emergence of new dangerous viruses, the importance and demand for knowledge and skills of computational biology, epidemiology and virology in modern society have increased. Modern sequencers are capable of producing large amounts of bioinformatic data that is represented in the form of genomic texts. Comparative сomputational analysis of this information is necessary to clarify the issues of phylogenesis, mutational profiling, molecular evolution, identification of insertions of other genomes, annotation of genome regions, search for targets for vaccine development and pharmacotherapy. In this сontext, authors conducted a computational experiment of comparative analysis of the genomic texts of Belarusian coronavirus samples against a number of selected complete genomes of dangerous and extremely dangerous viruses and coronaviruses of various origins. Data analysis was performed using the YASS, genomic texts were downloaded from the GISAID, the custom genomic data processing pipeline based on the Galaxy bioinformatics platform was also applied. The article presents the results of an analysis of the available scientific literature and the computational experiment comparing the genomic texts of Belarusian coronavirus samples with a number of selected complete genomes of dangerous and especially dangerous viruses and coronaviruses of various origin. A significant similarity of the new coronavirus with the recombinant coronavirus, as well as partial similarity with synthetic coronavirus, Rubella, Ebola 1976, HIV-2 (human immunodeficiency virus), Middle East respiratory syndrome, simian immunodeficiency and Marburg fever viruses have been found.
Keywords
About the Authors
M. V. SprindzukBelarus
Sprindzuk Matvey Vladimirovich, Cand. of Sci., Senior Researcher at the United Institute of Informatics Problems of the NAS of Belarus, the Institute of Mathematics of the NAS of Belarus, Dr. Student at the Department of Electronic Computing Media of the Belarusian State University of Informatics and Radioelectronics
220012, Minsk, Surganova St., 6
Tel.: +375 33 682-57-55
V. I. Bernik
Belarus
Dr. of Sci. (Phys. and Math.), Professor, Principal Researcher at the Department of Number Theory
Minsk
A. S. Vladyko
Belarus
Dr. of Sci. (Med.), Professor, Principal Researcher
Minsk
L. Zhuozhuang
China
Dr. of Sci. (Med.), Professor
Beijing
L. P. Titov
Belarus
Academic of the NAS of Belarus, Dr. of Sci. (Med.), Professor, Head of Laboratory of Experimental Immunology
Minsk
References
1. Behbahani M., Rabiei P., Mohabatkar H. (2021) Analysis and Comparison of Physiochemical Properties, Mutations and Glycosylation Patterns between RNA Polymerase and Membrane Protein of SARS-CoV and SARS-CoV-2. Molecular Biology Research Communications. 10 (4), 171–178. Doi: 10.22099/ mbrc.2021.42187.1692.
2. Kaur N., Singh R., Dar Z., Bijarnia R. K., Dhingra N., Kaur T. (2021) Genetic Comparison Among Various Coronavirus Strains for the Identification of Potential Vaccine Targets of SARS-CoV-2. Infection, Genetics and Evolution. 89. Doi: 10.1016/j.meegid.2020.104490.
3. Makarov L. M., Ivanov D. O., Pozdnyakov A. V. (2020) Comparative Analysis of Coronavirus Strains. European Science. 53 (4), 61–66 (in Russian).
4. Kharchenko E. P. (2020) Coronavirus SARS-CoV-2: Features of Structural Proteins, Contagiousness and Possible Immune Collisions. Epidemiology and Vaccine Prevention. 19 (2), 13–30. https://doi. org/10.31631/2073-3046-2020-19-2-13-30 (in Russian).
5. Noé L., Kucherov G. (2005) YASS: Enhancing the Sensitivity of DNA Similarity Search. Nucleic Acids Research. 33, W540–W543. https://doi.org/10.1093/nar/gki478.
6. Vilsker M., Moosa Y., Nooij S., Fonseca V., Ghysens Y., Dumon K., Pauwels R., Alcantara L. C., Eynden E. V., Vandamme A.-M., Deforche K., de Oliveira T. (2019) Genome Detective: an Automated System for Virus Identification from High-Throughput Sequencing Data. Bioinformatics. 35 (5), 871–873. https://doi. org/10.1093/bioinformatics/bty695.
7. Cleemput S., Dumon W., Fonseca V., Wasim A. K., Giovanetti M., Alcantara L. C., Deforche K., de Oliveira T. (2020) Genome Detective Coronavirus Typing Tool for Rapid Identification and Characterization of Novel Coronavirus Genomes. Bioinformatics. 36 (11), 3552–3555. Doi: 10.1093/bioinformatics/btaa145.
8. Sprindzuk M. V., Bernik V. I., Kalosha N. I., Vladyko A. S., Ulziybat B., Batgerel B., Titov L. P., Klimuk D.A., Skriahina E. M., Skriahin A. E., Glinskaya T. N. (2022) Automated Data Analysis Pipeline for Coronavirus Genomes. Medeelectronics-2022. Means of Medical Electronics and New Medical Technologies, Collection of Scientific Articles XIII International Scientific and Technical Conference, Minsk, 8–9 Dec. 2022. Minsk, Belarusian State University of Informatics and Radioelectronics Publ. 60–65 (in Russian).
9. Voropaev E. V., Stoma I. O., Tapalsky D. V. (2021) Experience of Modern Genomic Technologies Application for the Study of Microorganisms and their Communities. Problems of Health and Ecology. 18 (3), 159–167 (in Russian)
Review
For citations:
Sprindzuk M.V., Bernik V.I., Vladyko A.S., Zhuozhuang L., Titov L.P. Computational Analysis of the Structural Composition of Coronavirus Genomes. Doklady BGUIR. 2023;21(2):104-113. (In Russ.) https://doi.org/10.35596/1729-7648-2023-21-2-104-113