<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xml:lang="ru"><front><journal-meta><journal-id journal-id-type="publisher-id">izvestswsu</journal-id><journal-title-group><journal-title xml:lang="ru">Известия Юго-Западного государственного университета</journal-title><trans-title-group xml:lang="en"><trans-title>Proceedings of the Southwest State University</trans-title></trans-title-group></journal-title-group><issn pub-type="ppub">2223-1560</issn><issn pub-type="epub">2686-6757</issn><publisher><publisher-name>ЮЗГУ</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.21869/2223-1560-2019-23-6-225-240</article-id><article-id custom-type="elpub" pub-id-type="custom">izvestswsu-670</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research Article</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="ru"><subject>Информатика, вычислительная техника и управление</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="en"><subject>Computer science, computer engineering and IT management</subject></subj-group></article-categories><title-group><article-title>Модели и методика определения речевой активности пользователя социо-киберфизической системы</article-title><trans-title-group xml:lang="en"><trans-title>Models and a Technique for Determining the Speech Activity of a User of a Socio-Cyberphysical System</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Усина</surname><given-names>Е. Е.</given-names></name><name name-style="western" xml:lang="en"><surname>Usina</surname><given-names>E. 
E.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Усина Елизавета Евгеньевна, младший научный сотрудник лаборатории технологий больших данных социокиберфизических систем</p><p>Санкт-Петербург</p></bio><bio xml:lang="en"><p>Elizaveta E. Usina, Junior Researcher, Laboratory of Big Data Technologies of Sociocyberphysical Systems</p><p>St. Petersburg</p></bio><xref ref-type="aff" rid="aff-1"/></contrib><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Шабанова</surname><given-names>А. Р.</given-names></name><name name-style="western" xml:lang="en"><surname>Shabanova</surname><given-names>A. R.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Шабанова Александра Романовна, младший научный сотрудник лаборатории технологий больших данных социокиберфизических систем</p><p>Санкт-Петербург</p></bio><bio xml:lang="en"><p>Alexandra R. Shabanova, Junior Researcher, Laboratory of Big Data Technologies of Sociocyberphysical Systems</p><p>St. Petersburg</p></bio><xref ref-type="aff" rid="aff-2"/></contrib><contrib contrib-type="author" corresp="yes"><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Лебедев</surname><given-names>И. В.</given-names></name><name name-style="western" xml:lang="en"><surname>Lebedev</surname><given-names>I. V.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Лебедев Игорь Владимирович, младший научный сотрудник лаборатории технологий больших данных социокиберфизических систем</p><p>Санкт-Петербург</p></bio><bio xml:lang="en"><p>Igor V. Lebedev, Junior Researcher, Laboratory of Big Data Technologies of Sociocyberphysical Systems</p><p>St. Petersburg</p></bio><xref ref-type="aff" rid="aff-2"/></contrib></contrib-group><aff-alternatives id="aff-1"><aff xml:lang="ru"><institution>Санкт-Петербургский институт информатики и автоматизации Российской академии наук</institution></aff><aff xml:lang="en"><institution>St. 
Petersburg Institute for Informatics and Automation of the Russian Academy of Sciences</institution></aff></aff-alternatives><aff-alternatives id="aff-2"><aff xml:lang="ru"><institution>Санкт-Петербургский институт информатики и автоматизации Российской академии наук</institution></aff><aff xml:lang="en"><institution>St. Petersburg Institute for Informatics and Automation of the Russian Academy of Sciences</institution></aff></aff-alternatives><pub-date pub-type="collection"><year>2019</year></pub-date><pub-date pub-type="epub"><day>23</day><month>02</month><year>2020</year></pub-date><volume>23</volume><issue>6</issue><fpage>225</fpage><lpage>240</lpage><permissions><copyright-statement>Copyright &#x00A9; Усина Е.Е., Шабанова А.Р., Лебедев И.В., 2020</copyright-statement><copyright-year>2020</copyright-year><copyright-holder xml:lang="ru">Усина Е.Е., Шабанова А.Р., Лебедев И.В.</copyright-holder><copyright-holder xml:lang="en">Usina E.E., Shabanova A.R., Lebedev I.V.</copyright-holder><license xml:lang="ru" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>Данная работа распространяется под лицензией Creative Commons Attribution 4.0.</license-p></license><license xml:lang="en" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>This work is licensed under a Creative Commons Attribution 4.0 License.</license-p></license></permissions><self-uri xlink:href="https://izvestswsu.elpub.ru/jour/article/view/670">https://izvestswsu.elpub.ru/jour/article/view/670</self-uri><abstract><sec><title>Цель исследования</title><p>Цель исследования. Статья посвящена разработке модельно-алгоритмического обеспечения процесса определения речевой активности пользователя социо-киберфизической системы. 
Предложена топологическая модель распределенной подсистемы аудиозаписи, реализуемой в ограниченных физических пространствах (помещениях), позволяющая оценить качество воспринимаемых аудиосигналов для случая распределения микрофонов в таком помещении. На основе данной модели разработана методика определения речевой активности пользователя социо-киберфизической системы, максимизирующая качество воспринимаемых аудиосигналов при перемещении пользователя в помещении за счет определения координат установки микрофонов.</p></sec><sec><title>Методы</title><p>Методы. Для наиболее полного анализа и формального описания распределенной подсистемы аудиозаписи был использован математический аппарат теории графов и теории множеств. С целью определения координат размещения микрофонов в одном помещении была разработана соответствующая методика, которая предполагает проведение таких операций, как излучение в помещении речевого сигнала с помощью акустического оборудования и замер уровней сигнала в предполагаемых для установки микрофонов местах с использованием шумомера.</p></sec><sec><title>Результаты</title><p>Результаты. Были рассчитаны зависимости коэффициента корреляции объединенного сигнала и исходного тестового сигнала от расстояния до источника сигнала для различного количества микрофонов. Полученные зависимости позволяют определить минимально необходимое количество разнесенных микрофонов для обеспечения качественной записи речи пользователя. Результаты апробации разработанной методики речевой активности в конкретном помещении свидетельствуют о возможности и высокой эффективности определения речевой активности пользователя социо-киберфизической системы.</p></sec><sec><title>Заключение</title><p>Заключение. 
Использование предложенной методики определения речевой активности пользователя социокиберфизической системы позволит повысить качество записи аудиосигнала и, как следствие, его последующей обработки с учетом возможного перемещения пользователя.</p></sec></abstract><trans-abstract xml:lang="en"><sec><title>Purpose of research</title><p>Purpose of research. The article presents the development of model and algorithmic support for determining the speech activity of a user of a socio-cyberphysical system. A topological model of a distributed audio recording subsystem deployed in confined physical spaces (rooms) is proposed; the model makes it possible to assess the quality of the received audio signals when microphones are distributed around such a room. Based on this model, a technique has been developed for determining the speech activity of a user of a socio-cyberphysical system; it maximizes the quality of the received audio signals as the user moves around the room by determining the installation coordinates of the microphones.</p></sec><sec><title>Methods</title><p>Methods. The mathematical tools of graph theory and set theory were used for the most complete analysis and formal description of the distributed audio recording subsystem. To determine the coordinates for placing microphones in a single room, a corresponding technique was developed; it involves emitting a speech signal in the room using acoustic equipment and measuring signal levels with a sound level meter at the candidate microphone locations.</p></sec><sec><title>Results</title><p>Results. The dependences of the correlation coefficient between the combined signal and the original test signal on the distance to the signal source were calculated for different numbers of microphones. The obtained dependences make it possible to determine the minimum number of spaced microphones required to ensure high-quality recording of the user’s speech. 
The results of testing the developed technique for determining speech activity in a particular room indicate that determining the speech activity of a user of a socio-cyberphysical system is feasible and highly efficient.</p></sec><sec><title>Conclusion</title><p>Conclusion. Application of the proposed technique for determining the speech activity of a user of a socio-cyberphysical system will improve the quality of audio signal recording and, as a consequence, of its subsequent processing, taking into account the possible movement of the user.</p></sec></trans-abstract><kwd-group xml:lang="ru"><kwd>социо-киберфизическая система</kwd><kwd>речь</kwd><kwd>микрофоны</kwd><kwd>распределенная аудиозапись</kwd></kwd-group><kwd-group xml:lang="en"><kwd>socio-cyberphysical system</kwd><kwd>speech</kwd><kwd>microphones</kwd><kwd>distributed audio recording</kwd></kwd-group></article-meta></front><back><ref-list><title>References</title><ref id="cit1"><label>1</label><citation-alternatives><mixed-citation xml:lang="ru">Internet of Things, IoT European Research Cluster. [процитировано 6 ноября 2019]. URL: http://www.internet-of-things-research.eu/about_iot.htm</mixed-citation><mixed-citation xml:lang="en">Internet of Things, IoT European Research Cluster. [Accessed November 6, 2019]. Available at: http://www.internet-of-things-research.eu/about_iot.htm</mixed-citation></citation-alternatives></ref><ref id="cit2"><label>2</label><citation-alternatives><mixed-citation xml:lang="ru">Teaching Smart Production: An insight into the Learning Factory for Cyber-Physical Production Systems (LVP) / L. Merkel, J. Atug, L. Merhar, C. Schultz, S. Braunreuther, G. Reinhart // Procedia Manufacturing. 2017. №9. P. 269-274. https://doi.org/10.1016/j.promfg.2017.04.034</mixed-citation><mixed-citation xml:lang="en">Merkel L., Atug J., Merhar L., Schultz C., Braunreuther S., Reinhart G. 
Teaching Smart Production: An insight into the Learning Factory for Cyber-Physical Production Systems (LVP). Procedia Manufacturing, 2017, no. 9, pp. 269-274. https://doi.org/10.1016/j.promfg.2017.04.034</mixed-citation></citation-alternatives></ref><ref id="cit3"><label>3</label><citation-alternatives><mixed-citation xml:lang="ru">Knowledge-Based Decision Making in a Cyber-Physical Production Scenario / J. Klöber-Koch, S. Pielmeier, J. Grimm, M. Brandt, M. Schneider, G. Reinhart // 7th Conference on Learning Factories. 2017. №7. P. 167-174. https://doi.org/10.1016/j.promfg.2017.04.014</mixed-citation><mixed-citation xml:lang="en">Klöber-Koch J., Pielmeier S., Grimm J., Brandt M., Schneider M., Reinhart G. Knowledge-Based Decision Making in a Cyber-Physical Production Scenario. 7th Conference on Learning Factories, 2017, no. 7, pp. 167-174. https://doi.org/10.1016/j.promfg.2017.04.014</mixed-citation></citation-alternatives></ref><ref id="cit4"><label>4</label><citation-alternatives><mixed-citation xml:lang="ru">Jiang P., Ding K., Leng J. Towards a cyber-physical-social-connected and service-oriented manufacturing paradigm: Social Manufacturing // Manufacturing Letters. 2016. №7. P. 15-21. https://doi.org/10.1016/j.mfglet.2015.12.002</mixed-citation><mixed-citation xml:lang="en">Jiang P., Ding K., Leng J. Towards a cyber-physical-social-connected and service-oriented manufacturing paradigm: Social Manufacturing. Manufacturing Letters, 2016, no. 7, pp. 15-21. https://doi.org/10.1016/j.mfglet.2015.12.002</mixed-citation></citation-alternatives></ref><ref id="cit5"><label>5</label><citation-alternatives><mixed-citation xml:lang="ru">Cassandras C.G. Smart Cities as Cyber-Physical Social Systems // Engineering. 2016. №2. P. 156-158. https://doi.org/10.1016/J.ENG.2016.02.012</mixed-citation><mixed-citation xml:lang="en">Cassandras C.G. Smart Cities as Cyber-Physical Social Systems. Engineering, 2016, no. 2, pp. 156-158. 
https://doi.org/10.1016/J.ENG.2016.02.012</mixed-citation></citation-alternatives></ref><ref id="cit6"><label>6</label><citation-alternatives><mixed-citation xml:lang="ru">Смирнов А.В., Левашова Т.В. Приобретение знаний в социокиберфизических системах в процессе информационного взаимодействия ресурсов // Информационно-управляющие системы. 2017. №6. C. 113–122.</mixed-citation><mixed-citation xml:lang="en">Smirnov A.V., Levashova T.V. Priobretenie znanii v sotsiokiberfizicheskikh sistemakh v protsesse informatsionnogo vzaimodeistviya resursov [The acquisition of knowledge in sociocyberphysical systems in the process of information interaction of resources]. Informatsionno-upravlyayushchie sistemy = Information and Control Systems, 2017, no. 6, pp. 113–122 (In Russ.)</mixed-citation></citation-alternatives></ref><ref id="cit7"><label>7</label><citation-alternatives><mixed-citation xml:lang="ru">Мазуренко И.Л. Многоканальная система распознавания речи // Сборник трудов VI всероссийской конференции «Нейрокомпьютеры и их применение». М., 2000.</mixed-citation><mixed-citation xml:lang="en">Mazurenko I.L. Mnogokanal'naya sistema raspoznavaniya rechi [Multi-channel speech recognition system]. Sbornik trudov VI vserossiiskoi konferentsii "Neirokomp'yutery i ikh primenenie" [Proceedings of the VI All-Russian conference "Neurocomputers and their application"]. Moscow, 2000. (In Russ.).</mixed-citation></citation-alternatives></ref><ref id="cit8"><label>8</label><citation-alternatives><mixed-citation xml:lang="ru">Beyond Close-talk – Issues in Distant Speech Acquisition, Conditioning Classification, and Recognition / V. Stanford, C. Rochet, M. Michel, J.N. Garofolo // Proc. ICASSP 2004 Meeting Recognition Workshop. 2004. P. 123-127.</mixed-citation><mixed-citation xml:lang="en">Stanford V., Rochet C., Michel M., Garofolo J. Beyond Close-talk – Issues in Distant Speech Acquisition, Conditioning Classification, and Recognition. Proc. 
ICASSP 2004 Meeting Recognition Workshop, 2004, pp. 123-127.</mixed-citation></citation-alternatives></ref><ref id="cit9"><label>9</label><citation-alternatives><mixed-citation xml:lang="ru">Pfau T., Ellis D. P. W., Stolcke A. Multispeaker speech activity detection for the ICSI meeting recorder // IEEE Workshop on Automatic Speech Recognition and Understanding. 2001. P. 107-110. https://doi.org/10.1109/ASRU.2001.1034599</mixed-citation><mixed-citation xml:lang="en">Pfau T., Ellis D. P. W., Stolcke A. Multispeaker speech activity detection for the ICSI meeting recorder. IEEE Workshop on Automatic Speech Recognition and Understanding, 2001, pp. 107-110. https://doi.org/10.1109/ASRU.2001.1034599</mixed-citation></citation-alternatives></ref><ref id="cit10"><label>10</label><citation-alternatives><mixed-citation xml:lang="ru">Центр речевых технологий. 2019 [процитировано 6 ноября 2019]. URL: http://www.speechpro.ru</mixed-citation><mixed-citation xml:lang="en">Centr rechevykh tekhnologiy [Speech technology center]. 2019 [Accessed November 6, 2019]. Available at: http://www.speechpro.ru (In Russ.).</mixed-citation></citation-alternatives></ref><ref id="cit11"><label>11</label><citation-alternatives><mixed-citation xml:lang="ru">АО «ОКБ «Октава». 2019 [процитировано 6 ноября 2019]. URL: https://www.окбоктава.рф</mixed-citation><mixed-citation xml:lang="en">AO "OKB "Oktava". 2019 [Accessed November 6, 2019]. Available at: https://www.окбоктава.рф (In Russ.).</mixed-citation></citation-alternatives></ref><ref id="cit12"><label>12</label><citation-alternatives><mixed-citation xml:lang="ru">Акустика / Ш.Я. Вахитов, Ю.А. Ковалгин, А.А. Фадеев, Ю.П. Щевьев. М.: Горячая линия, 2009.</mixed-citation><mixed-citation xml:lang="en">Vakhitov Sh.Ya., Kovalgin Yu.A., Fadeev A.A., Shcheviev Yu.P. Akustika [Acoustics]. Moscow, Goryachaya liniya Publ., 2009. 
(In Russ.).</mixed-citation></citation-alternatives></ref><ref id="cit13"><label>13</label><citation-alternatives><mixed-citation xml:lang="ru">Разработка многомодального информационного киоска / А.Л. Ронжин, А.А. Карпов, А.Б. Леонтьева, Б.Е. Костюченко // Труды СПИИРАН. 2007. №5(1). C. 227-245.</mixed-citation><mixed-citation xml:lang="en">Ronzhin A.L., Karpov A.A., Leontyeva A.B., Kostyuchenko B.E. Razrabotka mnogomodal'nogo informatsionnogo kioska [The development of the multimodal information kiosk]. Trudy SPIIRAN = SPIIRAS Proceedings, 2007, no. 5(1), pp. 227-245 (In Russ.)</mixed-citation></citation-alternatives></ref><ref id="cit14"><label>14</label><citation-alternatives><mixed-citation xml:lang="ru">Ронжин А.Л., Карпов А.А., Кагиров И.А. Особенности дистанционной записи и обработки речи в автоматах самообслуживания // Информационно-управляющие системы. 2009. №42(5). C. 32-38.</mixed-citation><mixed-citation xml:lang="en">Ronzhin A.L., Karpov A.A., Kagirov I.A. Osobennosti distantsionnoi zapisi i obrabotki rechi v avtomatakh samoobsluzhivaniya [Features of remote recording and speech processing in self-service machines]. Informatsionno-upravlyayushchie sistemy = Information and Control Systems, 2009, no. 42(5), pp. 32–38 (In Russ.)</mixed-citation></citation-alternatives></ref><ref id="cit15"><label>15</label><citation-alternatives><mixed-citation xml:lang="ru">Харкевич А.А. Борьба с помехами. Изд. 4-е. М.: Книжный дом «ЛИБРОКОМ», 2013.</mixed-citation><mixed-citation xml:lang="en">Kharkevich A.A. Bor'ba s pomekhami [Struggle against interference]. Moscow, Knizhnyi dom "LIBROKOM" Publ., 2013 (In Russ.)</mixed-citation></citation-alternatives></ref><ref id="cit16"><label>16</label><citation-alternatives><mixed-citation xml:lang="ru">Скляр Б. Цифровая связь. Теоретические основы и практическое применение. Изд. 2-е, испр. М.: Издательский дом «Вильямс», 2003.</mixed-citation><mixed-citation xml:lang="en">Sklar B. Tsifrovaya svyaz'. 
Teoreticheskie osnovy i prakticheskoe primenenie [Digital communication. Theoretical foundations and practical application]. Moscow, Izdatel'skii dom "Vil'yams" Publ., 2003 (In Russ.)</mixed-citation></citation-alternatives></ref><ref id="cit17"><label>17</label><citation-alternatives><mixed-citation xml:lang="ru">Ogunfunmi T., Togneri R., Narasimha M. Speech and audio processing for coding, enhancement and recognition. New York: Springer, 2015.</mixed-citation><mixed-citation xml:lang="en">Ogunfunmi T., Togneri R., Narasimha M. Speech and audio processing for coding, enhancement and recognition. New York, Springer Publ., 2015.</mixed-citation></citation-alternatives></ref><ref id="cit18"><label>18</label><citation-alternatives><mixed-citation xml:lang="ru">Марковников Н.М., Кипяткова И.С. Аналитический обзор интегральных систем распознавания речи // Труды СПИИРАН. 2018. №3. C. 77-110. https://doi.org/10.15622/sp.58.4</mixed-citation><mixed-citation xml:lang="en">Markovnikov N.M., Kipyatkova I.S. Analiticheskii obzor integral'nykh sistem raspoznavaniya rechi [An Analytic Survey of End-to-End Speech Recognition Systems]. Trudy SPIIRAN = SPIIRAS Proceedings, 2018, no. 3, pp. 77-110 (In Russ.). https://doi.org/10.15622/sp.58.4</mixed-citation></citation-alternatives></ref></ref-list><fn-group><fn fn-type="conflict"><p>The authors declare that there are no conflicts of interest present.</p></fn></fn-group></back></article>
