Цель исследования

izvestswsu

Известия Юго-Западного государственного университета

Proceedings of the Southwest State University

2223-15602686-6757

ЮЗГУ

10.21869/2223-1560-2019-23-3-86-99

izvestswsu-531

Research Article

Информатика, вычислительная техника и управление

Computer science, computer engineering and IT managment

Алгоритмы автоматизированного обучения диалоговых систем

Automated Training Algorithms of Dialog Systems

Спирин

Д. В.

Spirin

D. V.

spirin.dmitrij@list.ru

Брежнев

О. С.

Brezhnev

O. S.

oleg-423@yandex.ru

Пензенский государственный университетPenza State University

2019

06092019

2338699

2019

Спирин Д.В., Брежнев О.С.

Spirin D.V., Brezhnev O.S.

Данная работа распространяется под лицензией Creative Commons Attribution 4.0.

This work is licensed under a Creative Commons Attribution 4.0 License.

https://izvestswsu.elpub.ru/jour/article/view/531

Цель исследования

Цель исследования. Представленное в данной статье исследование проведено в рамках проекта Salebot.pro (на ресурсе https://salebot.pro) и было нацелено на разработку простой и эффективной реализации диалоговой системы.

Методы

Методы. План исследования предусматривал анализ различных методов обработки естественных язы-ков и машинного обучения. Реализации методов были взяты из популярных библиотек с открытым исход-ным кодом. Построена модель диалоговой системы в двух вариантах: на основе фреймворка Spacy и метрического алгоритма оценки, на основе расстояния Левенштейна. Сравнивались простота реализа-ции и затраты на обучение системы и персонала.

Результаты

Результаты. Описанные в статье алгоритмы сопоставляют наиболее похожие слова из двух текстов и подсчитывают средний процент совпадений. Такой подход обеспечивает возможность приемлемой работы на языках со свободным порядком слов, к которым относится и русский язык. Выполненное исследование позволило разработать алгоритм автоматизированного обучения диалоговых систем в режиме реального времени без потери контекста. На той же основе разработан алгоритм обучения диалоговой системы по истории диалога. Предлагается использовать данные алгоритмы совместно. При создании диалоговой системы первоначально необходимо ее обучить на истории диалогов, а затем перманентно обучать в режиме реального времени.

Заключение

Заключение. Достоинством разработанного алгоритма является легкость в реализации и дешевизна построения инфраструктуры, необходимой для обучения модели, и ее обслуживания, а также простота в эксплуатации. Применяется подход, который отличается от обучения с учителем, что позволяет ускорить процесс обучения и ввода в систему новых данных. Особенностью разработанных алгоритмов является игнорирование семантики текста, что делает обучение автоматизированным, а не автома-тическим.

Purpose of research

Purpose of research. The research described in this article is conducted within the Salebot.pro project (on the https://salebot.pro resource) and aimed at development of simple and effective realization of a dialog system.

Methods

Methods. The research plan provided the analysis of various methods of natural processing languages and machine learning languages. Implementation of these methods was taken from popular libraries with an open source code. The model of a dialog system was made in two options: on the basis of Spacy freymvork and metric assessment algorithm, on the basis of Levenstein's distance. Simplicity of implementation and costs on training of a system and personnel were compared.

Results

Results. The algorithms described in article compare the most similar words from two texts and count average percent of coincidence. Such approach provides a possibility of acceptable work in languages with free word order. Russian is one such languages. The executed research allowed developing an automated training algorithm of dialog systems in real time without context loss. On the same basis training algorithm of a dialog system in dialog history is developed. It is offered to use these algorithms together. It is originally necessary to train it at history of dialogues during creation of a dialogue system. And then it is necessary to train it permanently in real time.

Conclusion

Conclusion. The advantage of the developed algorithm is ease in implementation and low cost of infrastructure which is necessary for model training and its service and also operation simplicity. Approach which differs from training with the teacher allows accelerating training process and input of new data into the system. Specific feature of the developed algorithms is ignoring of text semantics that makes training automated but not automatic.

диалоговая системаконечный автоматфреймавтоматизированное обучениеалгоритм

dialog systemfinite-state machineframeautomated trainingalgorithm

References1

Провотар А.И., Клочко К. А. Особенности и проблемы виртуального общения с помощью чат-ботов // Информационные технологии и компьютерная техника. Научные работы ВНТУ. 2013. № 3. С. 1-6.

Provotar A. I., Klochko K. A. Osobennosti i problemy virtual'nogo obshcheniya s pomoshch'yu chat-botov [Features and problems of virtual communication using chat bots]. Informatsionnye tekhnologii i komp'yuternaya tekhnika. Nauchnye raboty VNTU. = Information technologies and computer equipment Scientific works VNTU, 2013, no. 3, pp. 1-6 (In Russ.).

Training spaCy’s Statistical Models. URL: https://spacy.io/usage/training (дата обращения: 07.05.2019).

[Training spaCy’s Statistical Models]. Available at: https://spacy.io/usage/training (accessed 07.05.2019).

Apache OpenNLP DeveloperDocumentation. URL: https://opennlp.apache.org/ docs/1.9.0/manual/ opennlp.html (дата обращения: 07.05.2019).

Apache OpenNLP Developer Documentation. Available at: https:// opennlp.apache.org/ docs/1.9.0/manual/ opennlp.html (accessed 07.05.2019).

Задача о редакционном расстоянии, алгоритм Вагнера-Фишера. URL: https:// neerc.ifmo.ru/wiki/index.php?title= Задача_о_редакционном_расстоянии,_ алгоритм_ Вагнера-Фишера (дата обращения: 07.05.2019).

Zadacha o redaktsionnom rasstoyanii, algoritm Vagnera-Fishera [The task of the editorial distance, the algorithm of Wagner-Fisher]. Available at: The access method is free: https://neerc.ifmo.ru/wiki/index.php?title=Task_about_education_distance ,_algorithm_Wagner-Fisher (accessed 07.05.2019) (In Russ.).

Ramsay A. Discourse. In Mitkov, R. (Ed.). The Oxford Handbook of Computational Linguistics. Oxford University Press, USA, 2003. 717 p.

Ramsay A. Discourse. In Mitkov, R. (Ed.). The Oxford Handbook of Computational Linguistics. Oxford University Press, USA, 2003, 717 p.

Traum D., Larsson S. The information state approach to dialogue management // In J. van Kuppevelt & R. Smith (Eds.), Current and new directions in discourse and dialogue. Springer, 2003. P. 325–354.

Traum D., Larsson S. The information state approach to dialogue management. In J. van Kuppevelt & R. Smith (Eds.), Current and new directions in discourse and dialogue Springer, 2003, p. 325–354.

Computing Power Throughout History. URL: https://www.alternatewars.com/ BBOW/ Computing/Computing_Power.htm (дата обращения: 07.05.2019).

Computing Power Throughout History Available at: https:// www.alternatewars.com/ BBOW/ Computing / Computing_Power.htm (accessed 07.05.2019).

Автоматизированное обучение. URL: https://salebot.pro/articles/9 (дата обращения: 07.05.2019).

Avtomatizirovannoe obuchenie. Available at: The access method is free: https://salebot.pro/articles/9 (accessed 07.05.2019) (In Russ.).

Спирин Д.В., Брежнев О.С., Баринов А.Д. Алгоритм автоматизированного обучения // Сборник статей II Международной научно-практической конференции. Пенза: МЦНС «Наука и Просвещение», 2018. С. 49-53.

Spirin D.V., Brezhnev O. S., Barinov A. D. [Algorithm of automated learning]. Sbornik statei II Mezhdunarodnoi nauchno-prakticheskoi konferentsii [Collection of articles of the II International Scientific and Practical Conference]. Penza, 2018, pp. 49-53 (In Russ.).

A multi-task approach for named entity recognition in social media data / G. Aguilar, S. Maharjan, A. Pastor Lopez-Monroy, T. Solorio // In Proceedings of the 3rd Workshop on Noisy User-generated Text, 2017. P. 148–153.

Aguilar G., Maharjan S., Pastor Lopez-Monroy A., Solorio T..A multi-task approach for named entity recognition in social media data. In Proceedings of the 3rd Workshop on Noisy User-generated Text, 2017, pp. 148–153.

Daniken P., Cieliebak M. Transfer learning and sentence level features for named entity recognition on tweets // In Proceedings of the 3rd Workshop on Noisy User-generated Text, 2017. P. 166–171.

Daniken P., Cieliebak M. Transfer learning and sentence level features for named entity recognition on tweets. In Proceedings of the 3rd Workshop on Noisy User-generated Text, 2017, pp. 166–171.

Neural Architectures for Named Entity Recognition / G. Lample, M. Ballesteros, S. Subramanian, K. Kawakami, C. Dyer // In Proceedings of NAACL-HLT 2016, San Diego, California, June 12-17, 2016. P. 260–270.

Lample G., Ballesteros M., S Subramanian., Kawakami K., Dyer C. Neural Architectures for Named Entity Recognition. In Proceedings of NAACL-HLT 2016, San Diego, California, June 12-17, 2016, pp. 260–270.

Strakova J. Neural Network Based Named Entity Recognition. – Institute of Formal and Applied Linguistics, Prague. 2017. 120 p.

Strakova J. Neural Network Based Named Entity Recognition. Institute of Formal and Applied Linguistics, Prague, 2017, 120 p.

Akkaya E.K. Deep neural networks for named entity recognition on social media. Computer Engineering Dept., Hacettepe University. Beytepe-Ankara, Turkey, 2018. 126 p.

Akkaya E.K. Deep neural networks for named entity recognition on social media. Computer Engineering Dept., Hacettepe University, Beytepe-Ankara, Turkey, 2018. 126 p.

The authors declare that there are no conflicts of interest present.