<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xml:lang="ru"><front><journal-meta><journal-id journal-id-type="publisher-id">bsuir</journal-id><journal-title-group><journal-title xml:lang="ru">Доклады БГУИР</journal-title><trans-title-group xml:lang="en"><trans-title>Doklady BGUIR</trans-title></trans-title-group></journal-title-group><issn pub-type="ppub">1729-7648</issn><issn pub-type="epub">2708-0382</issn><publisher><publisher-name>БГУИР</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.35596/1729-7648-2020-18-1-43-51</article-id><article-id custom-type="elpub" pub-id-type="custom">bsuir-2591</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research Article</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="ru"><subject>ЭЛЕКТРОНИКА, РАДИОФИЗИКА, РАДИОТЕХНИКА, ИНФОРМАТИКА</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="en"><subject>ELECTRONICS, RADIOPHYSICS, RADIOENGINEERING, INFORMATICS</subject></subj-group></article-categories><title-group><article-title>МЕТОД КОРРЕКЦИИ СЛУХА НА ОСНОВЕ ПСИХОАКУСТИЧЕСКИ ОБУСЛОВЛЕННОГО ПЕРЕНОСА ЧАСТОТ В РЕЧЕВОМ СИГНАЛЕ</article-title><trans-title-group xml:lang="en"><trans-title>HEARING CORRECTION METHOD BASED ON PSYCHOACOUSTICALLY MOTIVATED FREQUENCY TRANSPOSITION IN A SPEECH SIGNAL</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Порхун</surname><given-names>М. И.</given-names></name><name name-style="western" xml:lang="en"><surname>Porhun</surname><given-names>M. I.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Порхун Максим Игоревич, ассистент кафедры электронных вычислительных средств</p><p>220013, г. 
Минск, ул. П. Бровки, д. 6, тел. +375-17-293-84-20</p></bio><bio xml:lang="en"><p>Porhun Maxim Igorevich, Assistant Lecturer at the Computer Engineering Department</p><p>220013, Minsk, P. Brovki str., 6, tel. +375-17-293-84-20</p></bio><email xlink:type="simple">porhun@bsuir.by</email><xref ref-type="aff" rid="aff-1"/></contrib><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Вашкевич</surname><given-names>М. И.</given-names></name><name name-style="western" xml:lang="en"><surname>Vashkevich</surname><given-names>M. I.</given-names></name></name-alternatives><bio xml:lang="ru"><p>к.т.н., доцент, доцент кафедры электронных вычислительных средств</p></bio><bio xml:lang="en"><p>PhD, Associate Professor, Associate Professor at the Computer Engineering Department</p></bio><xref ref-type="aff" rid="aff-1"/></contrib></contrib-group><aff-alternatives id="aff-1"><aff xml:lang="ru"><institution>Белорусский государственный университет информатики и радиоэлектроники</institution></aff><aff xml:lang="en"><institution>Belarusian State University of Informatics and Radioelectronics</institution></aff></aff-alternatives><pub-date pub-type="collection"><year>2020</year></pub-date><pub-date pub-type="epub"><day>06</day><month>03</month><year>2020</year></pub-date><volume>18</volume><issue>1</issue><fpage>43</fpage><lpage>51</lpage><permissions><copyright-statement>Copyright &#x00A9; Порхун М.И., Вашкевич М.И., 2020</copyright-statement><copyright-year>2020</copyright-year><copyright-holder xml:lang="ru">Порхун М.И., Вашкевич М.И.</copyright-holder><copyright-holder xml:lang="en">Porhun M.I., Vashkevich M.I.</copyright-holder><license xml:lang="ru" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>Данная работа распространяется под лицензией Creative Commons Attribution 4.0.</license-p></license><license xml:lang="en" 
license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>This work is licensed under a Creative Commons Attribution 4.0 License.</license-p></license></permissions><self-uri xlink:href="https://doklady.bsuir.by/jour/article/view/2591">https://doklady.bsuir.by/jour/article/view/2591</self-uri><abstract><p>Целью работы являлась разработка метода обработки речевого сигнала для коррекции слуховых патологий на основе психоакустически обусловленного переноса высокочастотных составляющих спектра сигнала в низкочастотную область с последующим частотно-зависимым усилением. Для достижения поставленной цели были решены задачи, связанные с разработкой принципов переноса частот в речевом сигнале. Разработанный метод является адаптивным, его настройка осуществляется согласно аудиограмме тугоухого человека. Для переноса частот выбираются две частотных полосы: исходная (откуда производится перенос) и целевая (куда производится перенос). Ширина исходной частотной полосы фиксирована, а ширина целевой полосы выбирается адаптивно. Перенос спектра выполняется только для согласных звуков, восприятие которых тугоухими людьми затруднено. Классификация звуков по признаку гласный/согласный/пауза реализована на базе нейронной сети. В качестве информационных признаков выбирались: среднее число переходов через нуль, кратковременная энергия, кратковременная амплитуда, нормализованная автокорреляционная функция и первый спектральный момент. Чтобы сохранить максимально натуральное звучание переносимых звуков, используется концепция равной громкости. Для компенсации ослабления восприятия звука тугоухим человеком используется частотно-зависимое усиление сигнала на основе аудиограммы. Эффективность предложенного метода проверена экспериментально с использованием моделирования эффекта потери слуха. 
В эксперименте участвовали 10 человек, которым давали прослушивать записи, пропущенные через модель потери слуха, а также записи, пропущенные через модель потери слуха с последующей коррекцией. Результаты показали, что применение разработанного метода коррекции слуха в среднем улучшает разборчивость речи на 6 %.</p></abstract><trans-abstract xml:lang="en"><p>The purpose of the work was to develop a speech signal processing method for correcting hearing pathologies, based on psychoacoustically motivated transposition of high-frequency components of the signal spectrum to the low-frequency region, followed by frequency-dependent amplification. To achieve this goal, several tasks related to developing the principles of frequency transposition in a speech signal were solved. The developed method is adaptive: it is tuned according to the audiogram of the hearing-impaired person. For frequency transposition, two frequency bands are selected: a source band (from which components are transposed) and a target band (to which they are transposed). The width of the source band is fixed, while the width of the target band is chosen adaptively. Spectrum transposition is performed only for consonants, which are difficult for hearing-impaired people to perceive. The classification of sounds into vowel, consonant and pause classes is implemented using a one-layer neural network. The feature vector consists of the zero-crossing rate, short-term energy, short-term magnitude, normalized autocorrelation function and the first spectral moment. To keep the transposed sounds as natural as possible, the concept of equal loudness is used. To compensate for the attenuated perception of sound by a hearing-impaired person, frequency-dependent signal amplification based on the audiogram is used. The effectiveness of the proposed method was verified experimentally using a simulation of the hearing loss effect. 
The experiment involved 10 people who listened to recordings passed through the hearing loss model, as well as recordings passed through the hearing loss model with subsequent correction by the proposed method. The results showed that the proposed hearing correction method improves speech intelligibility by 6 % on average.</p></trans-abstract><kwd-group xml:lang="ru"><kwd>коррекция слуха</kwd><kwd>слуховые патологии</kwd><kwd>моделирование потери слуха</kwd></kwd-group><kwd-group xml:lang="en"><kwd>hearing correction</kwd><kwd>hearing impairments</kwd><kwd>hearing loss simulation</kwd></kwd-group></article-meta></front><back><ref-list><title>References</title><ref id="cit1"><label>1</label><citation-alternatives><mixed-citation xml:lang="ru">Simpson A. Frequency-lowering devices for managing high-frequency hearing loss: a review. Trends in Amplification. 2009;13(2):87-106. DOI: 10.1177/1084713809336421.</mixed-citation><mixed-citation xml:lang="en">Simpson A. Frequency-lowering devices for managing high-frequency hearing loss: a review. Trends in Amplification. 2009;13(2):87-106. DOI: 10.1177/1084713809336421.</mixed-citation></citation-alternatives></ref><ref id="cit2"><label>2</label><citation-alternatives><mixed-citation xml:lang="ru">Alexander J.M. Individual variability in recognition of frequency-lowered speech. Seminars in Hearing. 2013;34(2):86-109. DOI: 10.1055/s-0033-1341346.</mixed-citation><mixed-citation xml:lang="en">Alexander J.M. Individual variability in recognition of frequency-lowered speech. Seminars in Hearing. 2013;34(2):86-109. DOI: 10.1055/s-0033-1341346.</mixed-citation></citation-alternatives></ref><ref id="cit3"><label>3</label><citation-alternatives><mixed-citation xml:lang="ru">Robinson J.D., Baer T., Moore B. Using transposition to improve consonant discrimination and detection for listeners with severe high-frequency hearing loss. International Journal of Audiology. 2007;46(6):293-308. 
DOI: 10.1080/14992020601188591.</mixed-citation><mixed-citation xml:lang="en">Robinson J.D., Baer T., Moore B. Using transposition to improve consonant discrimination and detection for listeners with severe high-frequency hearing loss. International Journal of Audiology. 2007;46(6):293-308. DOI: 10.1080/14992020601188591.</mixed-citation></citation-alternatives></ref><ref id="cit4"><label>4</label><citation-alternatives><mixed-citation xml:lang="ru">Hogan C.A., Turner C.W. High-frequency audibility: Benefits for hearing-impaired listeners. The Journal of the Acoustical Society of America. 1998;104:432-441. DOI: 10.1121/1.423247.</mixed-citation><mixed-citation xml:lang="en">Hogan C.A., Turner C.W. High-frequency audibility: Benefits for hearing-impaired listeners. The Journal of the Acoustical Society of America. 1998;104:432-441. DOI: 10.1121/1.423247.</mixed-citation></citation-alternatives></ref><ref id="cit5"><label>5</label><citation-alternatives><mixed-citation xml:lang="ru">Королёва И.В. Введение в аудиологию и слухопротезирование. СПб: КАРО; 2012.</mixed-citation><mixed-citation xml:lang="en">Korolyova I.V. [Introduction to Audiology and Hearing Prosthetics]. SPb: KARO; 2012. (In Russ.)</mixed-citation></citation-alternatives></ref><ref id="cit6"><label>6</label><citation-alternatives><mixed-citation xml:lang="ru">Фонлантен А., Хорст А. Слуховые аппараты. Ростов н/Д.: Феникс; 2009.</mixed-citation><mixed-citation xml:lang="en">Vonlanthen A., Horst A. [Hearing Aids]. Rostov n/D: Phoenix; 2009. (In Russ.)</mixed-citation></citation-alternatives></ref><ref id="cit7"><label>7</label><citation-alternatives><mixed-citation xml:lang="ru">Traunmuller H. Analytical expressions for the tonotopic sensory scale. The Journal of the Acoustical Society of America. 1990;88(1):97-100. DOI: 10.1121/1.399849.</mixed-citation><mixed-citation xml:lang="en">Traunmuller H. Analytical expressions for the tonotopic sensory scale. The Journal of the Acoustical Society of America. 1990;88(1):97-100. 
DOI: 10.1121/1.399849.</mixed-citation></citation-alternatives></ref><ref id="cit8"><label>8</label><citation-alternatives><mixed-citation xml:lang="ru">Liu Y.-T., Chang R.Y., Tsao Y., Chang Y.-P. A new frequency lowering technique for Mandarin-speaking hearing aid users. IEEE Global Conference on Signal and Information Processing (GlobalSIP), Orlando, FL. 2015;722-726. DOI: 10.1109/GlobalSIP.2015.7418291.</mixed-citation><mixed-citation xml:lang="en">Liu Y.-T., Chang R.Y., Tsao Y., Chang Y.-P. A new frequency lowering technique for Mandarin-speaking hearing aid users. IEEE Global Conference on Signal and Information Processing (GlobalSIP), Orlando, FL. 2015;722-726. DOI: 10.1109/GlobalSIP.2015.7418291.</mixed-citation></citation-alternatives></ref><ref id="cit9"><label>9</label><citation-alternatives><mixed-citation xml:lang="ru">Николенко С.И., Архангельская Е.В., Кадурин А.А. Глубокое обучение. Погружение в мир нейронных сетей. СПб.: Питер; 2019.</mixed-citation><mixed-citation xml:lang="en">Nikolenko S.I., Arhangel'skaya E.V., Kadurin A.A. [Deep Learning. Immersion in the World of Neural Networks]. SPb: Piter; 2019. (In Russ.)</mixed-citation></citation-alternatives></ref></ref-list><fn-group><fn fn-type="conflict"><p>The authors declare that there are no conflicts of interest present.</p></fn></fn-group></back></article>
