<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xml:lang="ru"><front><journal-meta><journal-id journal-id-type="publisher-id">bsuir</journal-id><journal-title-group><journal-title xml:lang="ru">Доклады БГУИР</journal-title><trans-title-group xml:lang="en"><trans-title>Doklady BGUIR</trans-title></trans-title-group></journal-title-group><issn pub-type="ppub">1729-7648</issn><issn pub-type="epub">2708-0382</issn><publisher><publisher-name>БГУИР</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.35596/1729-7648-2019-123-5-60-65</article-id><article-id custom-type="elpub" pub-id-type="custom">bsuir-1152</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research Article</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="ru"><subject>ЭЛЕКТРОНИКА, РАДИОФИЗИКА, РАДИОТЕХНИКА, ИНФОРМАТИКА</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="en"><subject>ELECTRONICS, RADIOPHYSICS, RADIOENGINEERING, INFORMATICS</subject></subj-group></article-categories><title-group><article-title>Анализ методов разрешения лексической многозначности в области биомедицины</article-title><trans-title-group xml:lang="en"><trans-title>Analysis of the methods of word sense disambiguation in the biomedical domain</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Пашук</surname><given-names>А. В.</given-names></name><name name-style="western" xml:lang="en"><surname>Pashuk</surname><given-names>A. V.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Пашук Александр Владимирович - аспирант кафедры систем управления.</p><p>220013, Минск, ул. П. Бровки, 6, тел. +375-29-875-23-34</p></bio><bio xml:lang="en"><p>Pashuk Aleksandr Vladimirovich - PG student of the control systems department.</p><p>220013, Minsk, P. Brovka str., 6, tel. +375-29-875-23-34</p></bio><email xlink:type="simple">pashuk@bsuir.by</email><xref ref-type="aff" rid="aff-1"/></contrib><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Гуринович</surname><given-names>А. Б.</given-names></name><name name-style="western" xml:lang="en"><surname>Gurinovich</surname><given-names>A. B.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Кандидат физико-математических наук, доцент кафедры вычислительных методов и программирования.</p><p>220013, Минск, ул. П. Бровки, 6</p></bio><bio xml:lang="en"><p>PhD, associate professor of computational methods and programming department.</p><p>220013, Minsk, P. Brovka str., 6</p></bio><xref ref-type="aff" rid="aff-1"/></contrib><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Волорова</surname><given-names>Н. А.</given-names></name><name name-style="western" xml:lang="en"><surname>Volorova</surname><given-names>N. A.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Кандидат технических наук, доцент кафедры информатики.</p><p>220013, Минск, ул. П. Бровки, 6</p></bio><bio xml:lang="en"><p>PhD, associate professor of the informatics department of Belarusian state university of informatics and radioelectronics.</p><p>220013, Minsk, P. Brovka str., 6</p></bio><xref ref-type="aff" rid="aff-1"/></contrib><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Кузнецов</surname><given-names>А. П.</given-names></name><name name-style="western" xml:lang="en"><surname>Kuznetsov</surname><given-names>A. P.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Доктор технических наук, профессор кафедры систем управления.</p><p>220013, Минск, ул. П. Бровки, 6</p></bio><bio xml:lang="en"><p>D.Sci, professor of control systems department.</p><p>220013, Minsk, P. Brovka str., 6</p></bio><xref ref-type="aff" rid="aff-1"/></contrib></contrib-group><aff-alternatives id="aff-1"><aff xml:lang="ru"><institution>Белорусский государственный университет информатики и радиоэлектроники</institution></aff><aff xml:lang="en"><institution>Belarusian state university of informatics and radioelectronics</institution></aff></aff-alternatives><pub-date pub-type="collection"><year>2019</year></pub-date><pub-date pub-type="epub"><day>03</day><month>07</month><year>2019</year></pub-date><volume>0</volume><issue>5</issue><fpage>60</fpage><lpage>65</lpage><permissions><copyright-statement>Copyright &amp;#x00A9; Пашук А.В., Гуринович А.Б., Волорова Н.А., Кузнецов А.П., 2019</copyright-statement><copyright-year>2019</copyright-year><copyright-holder xml:lang="ru">Пашук А.В., Гуринович А.Б., Волорова Н.А., Кузнецов А.П.</copyright-holder><copyright-holder xml:lang="en">Pashuk A.V., Gurinovich A.B., Volorova N.A., Kuznetsov A.P.</copyright-holder><license xml:lang="ru" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>Данная работа распространяется под лицензией Creative Commons Attribution 4.0.</license-p></license><license xml:lang="en" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>This work is licensed under a Creative Commons Attribution 4.0 License.</license-p></license></permissions><self-uri xlink:href="https://doklady.bsuir.by/jour/article/view/1152">https://doklady.bsuir.by/jour/article/view/1152</self-uri><abstract><p>Предложен метод разрешения лексической многозначности биомедицинских терминов на основе сравнения «мешков слов», полученных из контекста, определений и информации о связанных терминах из метатезауруса UMLS [<xref ref-type="bibr" rid="cit1">1</xref>], а также модификация метода с использованием оценки важности слов с помощью статистической меры TF-IDF. Проведена экспериментальная проверка метода на открытом тестовом наборе данных MSH WSD [<xref ref-type="bibr" rid="cit2">2</xref>], разработанном с целью поддержки исследований в области разрешения лексической многозначности.</p></abstract><trans-abstract xml:lang="en"><p>A method for resolving the lexical ambiguity of biomedical terms has been proposed. The method is based on a comparison of «word bags» obtained from the context, definitions and information on related terms from the UMLS metathesaurus [<xref ref-type="bibr" rid="cit1">1</xref>]. Modification of the method using the analysis of word importance using the statistical measure TF-IDF has been proposed. Experimental verification of the method has been performed on the open test MSH WSD data set [<xref ref-type="bibr" rid="cit2">2</xref>], developed to support research in the field of lexical resolution.</p></trans-abstract><kwd-group xml:lang="ru"><kwd>машинное обучение</kwd><kwd>обработка текста естественного языка</kwd><kwd>разрешение лексической многозначности</kwd><kwd>извлечение информации</kwd></kwd-group><kwd-group xml:lang="en"><kwd>machine learning</kwd><kwd>natural language processing</kwd><kwd>word sense disambiguation</kwd><kwd>information retrieval</kwd></kwd-group></article-meta></front><back><ref-list><title>References</title><ref id="cit1"><label>1</label><citation-alternatives><mixed-citation xml:lang="ru">. Unified Medical Language System (UMLS) // U.S. National Library of Medicine. URL: https://www.nlm.nih.gov/research/umls/ (date of access: 20.11.2018).</mixed-citation><mixed-citation xml:lang="en">. Unified Medical Language System (UMLS) // U.S. National Library of Medicine. URL: https://www.nlm.nih.gov/research/umls/ (date of access: 20.11.2018).</mixed-citation></citation-alternatives></ref><ref id="cit2"><label>2</label><citation-alternatives><mixed-citation xml:lang="ru">Word Sense Disambiguation (WSD) Test Collections // U.S. National Library of Medicine. URL: https://wsd.nlm.nih.gov/ (date of access: 30.11.2018).</mixed-citation><mixed-citation xml:lang="en">Word Sense Disambiguation (WSD) Test Collections // U.S. National Library of Medicine. URL: https://wsd.nlm.nih.gov/ (date of access: 30.11.2018).</mixed-citation></citation-alternatives></ref><ref id="cit3"><label>3</label><citation-alternatives><mixed-citation xml:lang="ru">Statistical Reports on MEDLINE/PubMed Baseline Data // U.S. National Library of Medicine. URL: https://www.nlm.nih.gov/bsd/licensee/baselinestats.html (date of access: 16.11.2018).</mixed-citation><mixed-citation xml:lang="en">Statistical Reports on MEDLINE/PubMed Baseline Data // U.S. National Library of Medicine. URL: https://www.nlm.nih.gov/bsd/licensee/baselinestats.html (date of access: 16.11.2018).</mixed-citation></citation-alternatives></ref><ref id="cit4"><label>4</label><citation-alternatives><mixed-citation xml:lang="ru">Ide N., Veronis J. Introduction to the special issue on word sense disambiguation: the state of the art // Computational Linguistics - Special issue on word sense disambiguation. 1998. № 24. P. 2-40.</mixed-citation><mixed-citation xml:lang="en">Ide N., Veronis J. Introduction to the special issue on word sense disambiguation: the state of the art // Computational Linguistics - Special issue on word sense disambiguation. 1998. № 24. P. 2-40.</mixed-citation></citation-alternatives></ref><ref id="cit5"><label>5</label><citation-alternatives><mixed-citation xml:lang="ru">Navigli R. Word sense disambiguation: a survey // ACM Computing Surveys. 2009. № 41. P. 1-69.</mixed-citation><mixed-citation xml:lang="en">Navigli R. Word sense disambiguation: a survey // ACM Computing Surveys. 2009. № 41. P. 1-69.</mixed-citation></citation-alternatives></ref><ref id="cit6"><label>6</label><citation-alternatives><mixed-citation xml:lang="ru">Lesk M. Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone // Proceeding SIGDOC '86 Proceedings of the 5th annual international conference on Systems documentation. Toronto, Ontario, Canada: ACM, 1986. P. 24-26.</mixed-citation><mixed-citation xml:lang="en">Lesk M. Automatic sense disambiguation using machine readable dictionaries: how to tell a pine cone from an ice cream cone // Proceeding SIGDOC '86 Proceedings of the 5th annual international conference on Systems documentation. Toronto, Ontario, Canada: ACM, 1986. P. 24-26.</mixed-citation></citation-alternatives></ref><ref id="cit7"><label>7</label><citation-alternatives><mixed-citation xml:lang="ru">Leacock C., Miller G.A. Using corpus statistics and WordNet relations for sense identification // Computational Linguistics - Special issue on word sense disambiguation. 1998. № 24. P. 147-165.</mixed-citation><mixed-citation xml:lang="en">Leacock C., Miller G.A. Using corpus statistics and WordNet relations for sense identification // Computational Linguistics - Special issue on word sense disambiguation. 1998. № 24. P. 147-165.</mixed-citation></citation-alternatives></ref><ref id="cit8"><label>8</label><citation-alternatives><mixed-citation xml:lang="ru">Preiss J., Stevenson M. DALE: A Word Sense Disambiguation System for Biomedical Documents Trained using Automatically Labeled Examples // Proceedings of the 2013 NAACL HLT Demonstration Session. Atlanta, Georgia: Association for Computational Linguistics, 2013. P. 1-4.</mixed-citation><mixed-citation xml:lang="en">Preiss J., Stevenson M. DALE: A Word Sense Disambiguation System for Biomedical Documents Trained using Automatically Labeled Examples // Proceedings of the 2013 NAACL HLT Demonstration Session. Atlanta, Georgia: Association for Computational Linguistics, 2013. P. 1-4.</mixed-citation></citation-alternatives></ref><ref id="cit9"><label>9</label><citation-alternatives><mixed-citation xml:lang="ru">Liu H., Teller V., Friedman C.A Multi-aspect Comparison Study of Supervised Word Sense Disambiguation // Journal of the American Medical Informatics Association. 2004. № 11. P. 320-331.</mixed-citation><mixed-citation xml:lang="en">Liu H., Teller V., Friedman C.A Multi-aspect Comparison Study of Supervised Word Sense Disambiguation // Journal of the American Medical Informatics Association. 2004. № 11. P. 320-331.</mixed-citation></citation-alternatives></ref><ref id="cit10"><label>10</label><citation-alternatives><mixed-citation xml:lang="ru">Word sense disambiguation across two domains: Biomedical literature and clinical notes / G.K. Savova [et al.] // Journal of Biomedical Informatics. 2008. № 41. P. 1088-1100.</mixed-citation><mixed-citation xml:lang="en">Word sense disambiguation across two domains: Biomedical literature and clinical notes / G.K. Savova [et al.] // Journal of Biomedical Informatics. 2008. № 41. P. 1088-1100.</mixed-citation></citation-alternatives></ref><ref id="cit11"><label>11</label><citation-alternatives><mixed-citation xml:lang="ru">Jimeno-Yepes A. J., Aronson A. R. Knowledge-based biomedical word sense disambiguation: comparison of approaches // BMC Bioinformatics. 2010. № 11. P. 569-581.</mixed-citation><mixed-citation xml:lang="en">Jimeno-Yepes A. J., Aronson A. R. Knowledge-based biomedical word sense disambiguation: comparison of approaches // BMC Bioinformatics. 2010. № 11. P. 569-581.</mixed-citation></citation-alternatives></ref></ref-list><fn-group><fn fn-type="conflict"><p>The authors declare that there are no conflicts of interest present.</p></fn></fn-group></back></article>
