<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xml:lang="ru"><front><journal-meta><journal-id journal-id-type="publisher-id">bsuir</journal-id><journal-title-group><journal-title xml:lang="ru">Доклады БГУИР</journal-title><trans-title-group xml:lang="en"><trans-title>Doklady BGUIR</trans-title></trans-title-group></journal-title-group><issn pub-type="ppub">1729-7648</issn><issn pub-type="epub">2708-0382</issn><publisher><publisher-name>БГУИР</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.35596/1729-7648-2020-18-6-33-40</article-id><article-id custom-type="elpub" pub-id-type="custom">bsuir-2782</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research Article</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="ru"><subject>ЭЛЕКТРОНИКА, РАДИОФИЗИКА, РАДИОТЕХНИКА, ИНФОРМАТИКА</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="en"><subject>ELECTRONICS, RADIOPHYSICS, RADIOENGINEERING, INFORMATICS</subject></subj-group></article-categories><title-group><article-title>String processing model for knowledge-driven systems</article-title><trans-title-group xml:lang="en"><trans-title>String processing model for knowledge-driven systems</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Ivashenko</surname><given-names>V. P.</given-names></name><name name-style="western" xml:lang="en"><surname>Ivashenko</surname><given-names>V. P.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Ivashenko Valerian Petrovich, PhD, Associate Professor of the Intelligent Information Technologies Department</p><p>220013, Belarus, Minsk, P. Brovki str., 6</p><p>tel. 
+375-17-293-80-92</p></bio><bio xml:lang="en"><p>Ivashenko Valerian Petrovich, PhD, Associate Professor of the Intelligent Information Technologies Department</p><p>220013, Belarus, Minsk, P. Brovki str., 6</p><p>tel. +375-17-293-80-92</p></bio><email xlink:type="simple">ivashenko@bsuir.by</email><xref ref-type="aff" rid="aff-1"/></contrib></contrib-group><aff-alternatives id="aff-1"><aff xml:lang="ru"><institution>Belarusian State University of Informatics and Radioelectronics</institution></aff><aff xml:lang="en"><institution>Belarusian State University of Informatics and Radioelectronics</institution></aff></aff-alternatives><pub-date pub-type="collection"><year>2020</year></pub-date><pub-date pub-type="epub"><day>30</day><month>09</month><year>2020</year></pub-date><volume>18</volume><issue>6</issue><fpage>33</fpage><lpage>40</lpage><permissions><copyright-statement>Copyright &#x00A9; Ivashenko V.P., 2020</copyright-statement><copyright-year>2020</copyright-year><copyright-holder xml:lang="ru">Ivashenko V.P.</copyright-holder><copyright-holder xml:lang="en">Ivashenko V.P.</copyright-holder><license xml:lang="ru" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>Данная работа распространяется под лицензией Creative Commons Attribution 4.0.</license-p></license><license xml:lang="en" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>This work is licensed under a Creative Commons Attribution 4.0 License.</license-p></license></permissions><self-uri xlink:href="https://doklady.bsuir.by/jour/article/view/2782">https://doklady.bsuir.by/jour/article/view/2782</self-uri><abstract><p>The purpose of the work is to experimentally confirm the theoretical estimates of the time complexity of the operations of the string processing model linked with the metric space for solving data processing problems in 
knowledge-driven systems, and to compare the performance characteristics of these operations with those of similar operations on the most relevant data structures. Integration and unit testing were used to obtain the results of the computational experiments and to verify their correctness. The C/C++ implementation of the operations of the string processing model was tested. The paper defines the concepts needed to compute metric features over strings. The experiments confirmed the theoretical estimates of the computational complexity of the implemented operations and the validity of the parameters chosen for the underlying data structures, which provide near-optimal throughput and operation times. According to the results, the main advantage is that the time complexity of the string processing operations is guaranteed to be no higher than O  at all stages of the life cycle of the data structures used to represent strings, from their creation to their destruction, which allows for high data processing throughput and responsiveness in systems built on the implemented operations. For particular string processing problems where better-suited data structures such as a vector or a map can be used, the implemented operations are at a disadvantage: they process less data per unit of time. 
The string processing model is intended for use in knowledge-driven systems at the data management level.</p></abstract><trans-abstract xml:lang="en"><p>The purpose of the work is to experimentally confirm the theoretical estimates of the time complexity of the operations of the string processing model linked with the metric space for solving data processing problems in knowledge-driven systems, and to compare the performance characteristics of these operations with those of similar operations on the most relevant data structures. Integration and unit testing were used to obtain the results of the computational experiments and to verify their correctness. The C/C++ implementation of the operations of the string processing model was tested. The paper defines the concepts needed to compute metric features over strings. The experiments confirmed the theoretical estimates of the computational complexity of the implemented operations and the validity of the parameters chosen for the underlying data structures, which provide near-optimal throughput and operation times. According to the results, the main advantage is that the time complexity of the string processing operations is guaranteed to be no higher than O  at all stages of the life cycle of the data structures used to represent strings, from their creation to their destruction, which allows for high data processing throughput and responsiveness in systems built on the implemented operations. For particular string processing problems where better-suited data structures such as a vector or a map can be used, the implemented operations are at a disadvantage: they process less data per unit of time. 
The string processing model is intended for use in knowledge-driven systems at the data management level.</p></trans-abstract><kwd-group xml:lang="ru"><kwd>string processing</kwd><kwd>list processing</kwd><kwd>string operation</kwd><kwd>concatenation</kwd><kwd>string splitting</kwd><kwd>searching by key</kwd></kwd-group><kwd-group xml:lang="en"><kwd>string processing</kwd><kwd>list processing</kwd><kwd>string operation</kwd><kwd>concatenation</kwd><kwd>string splitting</kwd><kwd>searching by key</kwd></kwd-group></article-meta></front><back><ref-list><title>References</title><ref id="cit1"><label>1</label><citation-alternatives><mixed-citation xml:lang="ru">Ivashenko V.P., Sintsov S.V. [Operations on management data sets in linear memory]. Doklady BGUIR = Doklady BGUIR. 2016;6(100):86-93. (in Russ.)</mixed-citation><mixed-citation xml:lang="en">Ivashenko V.P., Sintsov S.V. [Operations on management data sets in linear memory]. Doklady BGUIR = Doklady BGUIR. 2016;6(100):86-93. (in Russ.)</mixed-citation></citation-alternatives></ref><ref id="cit2"><label>2</label><citation-alternatives><mixed-citation xml:lang="ru">Bates R. Language Definition in the Schütz Semantic Editor. JMLC 2003. 2003;2789:229-240. DOI: 10.1007/978-3-540-45213-3_29.</mixed-citation><mixed-citation xml:lang="en">Bates R. Language Definition in the Schütz Semantic Editor. JMLC 2003. 2003;2789:229-240. DOI: 10.1007/978-3-540-45213-3_29.</mixed-citation></citation-alternatives></ref><ref id="cit3"><label>3</label><citation-alternatives><mixed-citation xml:lang="ru">Xie T., Taneja K., Kale Sh., Marinov D. Towards a Framework for Differential Unit Testing of Object-Oriented Programs. Proceedings – International Conference on Software Engineering. 2007;1:5-5. DOI: 10.1109/AST.2007.15.</mixed-citation><mixed-citation xml:lang="en">Xie T., Taneja K., Kale Sh., Marinov D. Towards a Framework for Differential Unit Testing of Object-Oriented Programs. 
Proceedings – International Conference on Software Engineering. 2007;1:5-5. DOI: 10.1109/AST.2007.15.</mixed-citation></citation-alternatives></ref><ref id="cit4"><label>4</label><citation-alternatives><mixed-citation xml:lang="ru">Cadar C., Sen K. Symbolic execution for software testing: three decades later. Communications of the ACM. 2013;56(2):82-90. DOI: 10.1145/2408776.2408795.</mixed-citation><mixed-citation xml:lang="en">Cadar C., Sen K. Symbolic execution for software testing: three decades later. Communications of the ACM. 2013;56(2):82-90. DOI: 10.1145/2408776.2408795.</mixed-citation></citation-alternatives></ref><ref id="cit5"><label>5</label><citation-alternatives><mixed-citation xml:lang="ru">Le Traon Y., Jéron T., Jézéquel J.-M., Morel P. Efficient Object-Oriented Integration and Regression Testing. IEEE Transactions on Reliability. 2000;49:12-25. DOI: 10.1109/24.855533.</mixed-citation><mixed-citation xml:lang="en">Le Traon Y., Jéron T., Jézéquel J.-M., Morel P. Efficient Object-Oriented Integration and Regression Testing. IEEE Transactions on Reliability. 2000;49:12-25. DOI: 10.1109/24.855533.</mixed-citation></citation-alternatives></ref><ref id="cit6"><label>6</label><citation-alternatives><mixed-citation xml:lang="ru">Hirschberg D.S. A linear space algorithm for computing maximal common subsequences. Communications of the ACM. 1975;18(6):341-343. DOI: 10.1145/360825.360861.</mixed-citation><mixed-citation xml:lang="en">Hirschberg D.S. A linear space algorithm for computing maximal common subsequences. Communications of the ACM. 1975;18(6):341-343. DOI: 10.1145/360825.360861.</mixed-citation></citation-alternatives></ref><ref id="cit7"><label>7</label><citation-alternatives><mixed-citation xml:lang="ru">Gusfield D. Algorithms on Strings, Trees, and Sequences: Computer Science and Computational Biology. Cambridge: Cambridge University Press; 1997. DOI: 10.1086/420407.</mixed-citation><mixed-citation xml:lang="en">Gusfield D. 
Algorithms on Strings, Trees, and Sequences: Computer Science and Computational Biology. Cambridge: Cambridge University Press; 1997. DOI: 10.1086/420407.</mixed-citation></citation-alternatives></ref><ref id="cit8"><label>8</label><citation-alternatives><mixed-citation xml:lang="ru">Levenshtein V.I. Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics Doklady. 1966;10(8):707-710. (in Russ.)</mixed-citation><mixed-citation xml:lang="en">Levenshtein V.I. Binary codes capable of correcting deletions, insertions, and reversals. Soviet Physics Doklady. 1966;10(8):707-710. (in Russ.)</mixed-citation></citation-alternatives></ref><ref id="cit9"><label>9</label><citation-alternatives><mixed-citation xml:lang="ru">Damerau F.J. A technique for computer detection and correction of spelling errors. Communications of the ACM. 1964;7(3):171-176. DOI: 10.1145/363958.363994.</mixed-citation><mixed-citation xml:lang="en">Damerau F.J. A technique for computer detection and correction of spelling errors. Communications of the ACM. 1964;7(3):171-176. DOI: 10.1145/363958.363994.</mixed-citation></citation-alternatives></ref><ref id="cit10"><label>10</label><citation-alternatives><mixed-citation xml:lang="ru">Navarro G. A guided tour to approximate string matching. ACM Computing Surveys. 2001;33(1):31-88. DOI: 10.1145/375360.375365.</mixed-citation><mixed-citation xml:lang="en">Navarro G. A guided tour to approximate string matching. ACM Computing Surveys. 2001;33(1):31-88. DOI: 10.1145/375360.375365.</mixed-citation></citation-alternatives></ref><ref id="cit11"><label>11</label><citation-alternatives><mixed-citation xml:lang="ru">Ehrenfeucht A., Haussler D. A New Distance Metric on Strings Computable in Linear Time. Discrete Applied Mathematics. 1988;20(3):191-203. DOI: 10.1016/0166-218X(88)90076-5.</mixed-citation><mixed-citation xml:lang="en">Ehrenfeucht A., Haussler D. A New Distance Metric on Strings Computable in Linear Time. 
Discrete Applied Mathematics. 1988;20(3):191-203. DOI: 10.1016/0166-218X(88)90076-5.</mixed-citation></citation-alternatives></ref><ref id="cit12"><label>12</label><citation-alternatives><mixed-citation xml:lang="ru">Ivashenko V.P. [Ontological model of space-time relations for events and phenomena in processing of knowledge]. Vestnik BrGTU. 2017;5(107):13-17. (in Russ.)</mixed-citation><mixed-citation xml:lang="en">Ivashenko V.P. [Ontological model of space-time relations for events and phenomena in processing of knowledge]. Vestnik BrGTU. 2017;5(107):13-17. (in Russ.)</mixed-citation></citation-alternatives></ref></ref-list><fn-group><fn fn-type="conflict"><p>The authors declare that there are no conflicts of interest present.</p></fn></fn-group></back></article>
