<?xml version="1.0" encoding="UTF-8"?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.3 20210610//EN" "JATS-journalpublishing1-3.dtd">
<article article-type="research-article" dtd-version="1.3" xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance" xml:lang="ru"><front><journal-meta><journal-id journal-id-type="publisher-id">izvestswsu</journal-id><journal-title-group><journal-title xml:lang="ru">Известия Юго-Западного государственного университета</journal-title><trans-title-group xml:lang="en"><trans-title>Proceedings of the Southwest State University</trans-title></trans-title-group></journal-title-group><issn pub-type="ppub">2223-1560</issn><issn pub-type="epub">2686-6757</issn><publisher><publisher-name>ЮЗГУ</publisher-name></publisher></journal-meta><article-meta><article-id pub-id-type="doi">10.21869/2223-1560-2025-29-2-186-200</article-id><article-id custom-type="elpub" pub-id-type="custom">izvestswsu-1463</article-id><article-categories><subj-group subj-group-type="heading"><subject>Research Article</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="ru"><subject>Информатика, вычислительная техника и управление</subject></subj-group><subj-group subj-group-type="section-heading" xml:lang="en"><subject>Computer science, computer engineering and IT managment</subject></subj-group></article-categories><title-group><article-title>Программное обеспечение для преобразования двумерных изображений в трехмерные модели</article-title><trans-title-group xml:lang="en"><trans-title>Software for converting two-dimensional images into three-dimensional models</trans-title></trans-title-group></title-group><contrib-group><contrib contrib-type="author" corresp="yes"><contrib-id contrib-id-type="orcid">https://orcid.org/0000-0002-2497-6433</contrib-id><name-alternatives><name name-style="eastern" xml:lang="ru"><surname>Зоткина</surname><given-names>А. А.</given-names></name><name name-style="western" xml:lang="en"><surname>Zotkina</surname><given-names>A. A.</given-names></name></name-alternatives><bio xml:lang="ru"><p>Зоткина Алена Александровна, старший преподаватель кафедры «Программирование», </p><p>пр. Байдукова/ул. Гагарина, д. 1а/11, г. Пенза 440039.</p></bio><bio xml:lang="en"><p>Alena A. Zotkina, Senior Lecturer of the Programming Department, </p><p>1a/11, Baidukova ave. / Gagarina str., Penza 440039.</p></bio><email xlink:type="simple">alena.zotkina.97@mail.ru</email><xref ref-type="aff" rid="aff-1"/></contrib></contrib-group><aff-alternatives id="aff-1"><aff xml:lang="ru"><institution>Пензенский государственный технологический университет</institution></aff><aff xml:lang="en"><institution>Penza State Technological University</institution></aff></aff-alternatives><pub-date pub-type="collection"><year>2025</year></pub-date><pub-date pub-type="epub"><day>01</day><month>10</month><year>2025</year></pub-date><volume>29</volume><issue>2</issue><fpage>186</fpage><lpage>200</lpage><permissions><copyright-statement>Copyright &amp;#x00A9; Зоткина А.А., 2025</copyright-statement><copyright-year>2025</copyright-year><copyright-holder xml:lang="ru">Зоткина А.А.</copyright-holder><copyright-holder xml:lang="en">Zotkina A.A.</copyright-holder><license xml:lang="ru" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>Данная работа распространяется под лицензией Creative Commons Attribution 4.0.</license-p></license><license xml:lang="en" license-type="creative-commons-attribution" xlink:href="https://creativecommons.org/licenses/by/4.0/" xlink:type="simple"><license-p>This work is licensed under a Creative Commons Attribution 4.0 License.</license-p></license></permissions><self-uri xlink:href="https://izvestswsu.elpub.ru/jour/article/view/1463">https://izvestswsu.elpub.ru/jour/article/view/1463</self-uri><abstract><sec><title>Цель исследования</title><p>Цель исследования. Разработка программного комплекса для автоматического создания фотореалистичных трехмерных моделей лиц на основе одного входного изображения, который обеспечит высокую степень детализации и реалистичности моделей, а также простоту использования,</p></sec><sec><title>Методы</title><p>Методы. В исследовании используется комплексный подход для создания фотореалистичных трехмерных моделей лиц из двумерных изображений, основанный на методах обратного рендеринга и каскадных сверточных нейронных сетей (CNN). Основным элементом является трехмерная трансформируемая модель (3DMM), которая описывает геометрию и альбедо лица через линейные комбинации базисов главных компонент (PCA). Для соответствия 3D-геометрии и 2D-изображению применяется слабая перспективная проекция, учитывающая углы Эйлера и условия освещения. Оптимизация целевой функции с использованием метода Гаусса-Ньютона минимизирует различия между входным и визуализированным изображениями, а коррекция глубины и деталей лица достигается через адаптацию 3D-графики. Линейная интерполяция альбедо уточняет детали модели в ключевых областях, что способствует созданию высококачественных и реалистичных 3D-моделей лиц.</p></sec><sec><title>Результаты</title><p>Результаты. В данной статье успешно реализован программный комплекс, способный генерировать фотореалистичные трехмерные модели лиц из одномерных изображений с использованием обратного рендеринга и каскадных сверточных нейронных сетей. Проведенные эксперименты подтвердили способность алгоритма к восприятию важных характеристик лиц и создают возможности для дальнейших приложений в сферах компьютерной графики, анимации и виртуальных интерфейсов. </p></sec><sec><title>Заключение</title><p>Заключение. Полученные результаты свидетельствуют о высокой эффективности разработанного алгоритма для генерации фотореалистичных трехмерных моделей лиц из двумерных изображений. Кроме того, результаты подтверждают, что применение методов обратного рендеринга совместно с каскад-ными сверточными нейронными сетями позволяет добиться значительных улучшений в качестве визуализации.</p></sec></abstract><trans-abstract xml:lang="en"><sec><title>Purpose of reseach</title><p>Purpose of reseach. Development of a software package for the automatic creation of photorealistic threedimensional models of faces based on a single input image, which will provide a high degree of detail and realism of models, as well as ease of use,</p></sec><sec><title>Methods</title><p>Methods. The study uses an integrated approach to create photorealistic three-dimensional models of faces from two-dimensional images based on reverse rendering methods and cascading convolutional neural networks (CNN). The main element is a three-dimensional transformable model (3DMM), which describes the geometry and albedo of a face through linear combinations of principal component bases (PCA). To match the 3D geometry and the 2D image, a weak perspective projection is used, taking into account Euler angles and lighting conditions. Optimization of the objective function using the Gauss-Newton method minimizes the differences between the input and rendered images, and correction of depth and facial details is achieved through the adaptation of 3D graphics. Linear albedo interpolation clarifies the details of the model in key areas, which contributes to the creation of high-quality and realistic 3D models of faces.</p></sec><sec><title>Results</title><p>Results. This article successfully implements a software package capable of generating photorealistic threedimensional models of faces from one-dimensional images using reverse rendering and cascading convolutional neural networks. The experiments have confirmed the algorithm's ability to perceive important facial characteristics and create opportunities for further applications in the fields of computer graphics, animation and virtual interfaces.</p></sec><sec><title>Conclusion</title><p>Conclusion. The results obtained indicate the high efficiency of the developed algorithm for generating photorealistic three-dimensional models of faces from two-dimensional images. In addition, the results confirm that the use of reverse rendering methods in conjunction with cascading convolutional neural networks allows for significant improvements in visualization quality.</p></sec></trans-abstract><kwd-group xml:lang="ru"><kwd>программный комплекс</kwd><kwd>3D-объект/модель</kwd><kwd>сверточные нейронные сети</kwd><kwd>рендеринг</kwd><kwd>3DMM</kwd></kwd-group><kwd-group xml:lang="en"><kwd>software package</kwd><kwd>3D object/model</kwd><kwd>convolutional neural networks</kwd><kwd>rendering</kwd><kwd>3DMM</kwd></kwd-group></article-meta></front><back><ref-list><title>References</title><ref id="cit1"><label>1</label><citation-alternatives><mixed-citation xml:lang="ru">Lee H., Ranganath R., Ng A.Y. Convolutional Deep Belief Networks for Scalable Unsupervised Learning of Hierarchical Representations. Grosse // Proceedings of the 26th Annual International Conference on Machine Learning. 2009. Р. 34–45.</mixed-citation><mixed-citation xml:lang="en">Lee H., Ranganath R., Ng A.Y. Convolutional Deep Belief Networks for Scalable Unsupervised Learning of Hierarchical Representations. Grosse. Proceedings of the 26th Annual International Conference on Machine Learning. 2009. P. 34-45.</mixed-citation></citation-alternatives></ref><ref id="cit2"><label>2</label><citation-alternatives><mixed-citation xml:lang="ru">Bengio Y. Learning deep architectures for AI // Foundations and Trends in Machine Learning, 2009. P. 245–257.</mixed-citation><mixed-citation xml:lang="en">Bengio Y. Learning deep architectures for AI. Foundations and Trends in Machine Learning. 2009. P. 245-257.</mixed-citation></citation-alternatives></ref><ref id="cit3"><label>3</label><citation-alternatives><mixed-citation xml:lang="ru">Ignatenko A., Konushin A. A Framework for Depth Image-Based Modeling and Rendering // Graphicon-2003 Proceedings. Moscow, 2003. 246 р.</mixed-citation><mixed-citation xml:lang="en">Ignatenko A., Konushin A. Framework for Depth Image-Based Modeling and Rendering. Graphicon-2003 Proceedings. Moscow; 2003. 246 p.</mixed-citation></citation-alternatives></ref><ref id="cit4"><label>4</label><citation-alternatives><mixed-citation xml:lang="ru">Василенко B. A. Сплайн-функции: теория, алгоритмы, программы. Новосибирск: Наука, 1983. 215 с.</mixed-citation><mixed-citation xml:lang="en">Vasilenko B. A. Spline functions: theory, algorithms, programs. Novosibirsk: Nauka; 1983. 215 p. (In Russ.).</mixed-citation></citation-alternatives></ref><ref id="cit5"><label>5</label><citation-alternatives><mixed-citation xml:lang="ru">Соловьева А. Алгоритм модификации типового трехмерного портрета по заданным фотоизображениям // Труды XX международной конференции по компьютерной графике и машинному зрению. M.: изд-во «Графикон», 2010. С. 346–347.</mixed-citation><mixed-citation xml:lang="en">Solovyova A. Algorithm for modification of a typical three-dimensional portrait based on specified photographic images. In: Trudy XX mezhdunarodnoi konferentsii po komp'yuternoi grafike i mashinnomu zreniyu = Proceedings of the XX International Conference on Computer Graphics and Machine Vision. Moscow;  2010. P. 346-347. (In Russ.).</mixed-citation></citation-alternatives></ref><ref id="cit6"><label>6</label><citation-alternatives><mixed-citation xml:lang="ru">Zhao H., Osher S. Visualization, analysis and shape reconstruction of unorganized data sets // Geometric Level Set Methods in Imaging and Vision and Graphics. SpringerVerlag, 2002. 256 p.</mixed-citation><mixed-citation xml:lang="en">Zhao H., Osher S. Visualization, analysis and shape reconstruction of unorganized data sets. In: Geometric Level Set Methods in Imaging and Vision and Graphics. SpringerVerlag, 2002. 256 p.</mixed-citation></citation-alternatives></ref><ref id="cit7"><label>7</label><citation-alternatives><mixed-citation xml:lang="ru">Зоткина А.А. Анализ алгоритмов машинного обучения, используемых в классификации изображений, публикуемых пользователями социальных сетей // Современные информационные технологии. 2023. № 38 (38). С. 38-40.</mixed-citation><mixed-citation xml:lang="en">Zotkina A.A. Analysis of machine learning algorithms used in the classification of images published by users of social networks. Sovremennye informatsionnye tekhnologii = Modern information technologies. 2023; (38): 38-40. (In Russ.).</mixed-citation></citation-alternatives></ref><ref id="cit8"><label>8</label><citation-alternatives><mixed-citation xml:lang="ru">Зоткина А.А. Распознавание изображений с помощью сверточных нейронных сетей // Современные информационные технологии. 2023. № 38 (38). С. 60-63.</mixed-citation><mixed-citation xml:lang="en">Zotkina A.A. Image recognition using convolutional neural networks. Sovremennye informatsionnye tekhnologii=  Modern information technologies. 2023; (38): 60-63. (In Russ.).</mixed-citation></citation-alternatives></ref><ref id="cit9"><label>9</label><citation-alternatives><mixed-citation xml:lang="ru">Зоткина А.А., Мартышкин А.И., Ткаченко А.В. Особенности работы сверточных нейронных сетей: архитектура и применение // Современные методы и средства обработки пространственно-временных сигналов: сборник статей XX Всероссийской научно-технической конференции. Пенза, 2023. С. 32-35.</mixed-citation><mixed-citation xml:lang="en">Zotkina A.A., Martyshkin A.I., Tkachenko A.V. Features of convolutional neural networks: architecture and application. In: Sovremennye metody i sredstva obrabotki prostranstvennovremennykh signalov. Sbornik statei XX Vserossiiskoi nauchno-tekhnicheskoi konferentsii = Modern methods and means of processing spatiotemporal signals. Collection of articles of the XX AllRussian Scientific and Technical Conference. Penza; 2023. P. 32-35. (In Russ.).</mixed-citation></citation-alternatives></ref><ref id="cit10"><label>10</label><citation-alternatives><mixed-citation xml:lang="ru">Blanz V., Vetter T. Face recognition based on fitting a 3-D morphable model // IEEE Trans Pattern Anal Mach Intell. 2003. 25(9). Р. 1063-1074.</mixed-citation><mixed-citation xml:lang="en">Blanz V., Vetter T. Face recognition is based on fitting a 3-D morphable model. IEEE Trans Pattern Anal Mach Intell. 2003; 25(9):1063-1074.</mixed-citation></citation-alternatives></ref><ref id="cit11"><label>11</label><citation-alternatives><mixed-citation xml:lang="ru">Learning detailed face reconstruction from a single image // IEEE conference on computer vision and pattern recognition (CVPR) / E. Richardson, M. Sela, R. OR-EL, R. Kimmel. Honolulu, HI, 2017. P. 5553-5562.</mixed-citation><mixed-citation xml:lang="en">Richardson E., Sela M., OR-EL R., Kimmel R. Learning detailed face reconstruction from a single image. In: IEEE conference on computer vision and pattern recognition (CVPR). Honolulu, HI; 2017. P. 5553-5562.</mixed-citation></citation-alternatives></ref><ref id="cit12"><label>12</label><citation-alternatives><mixed-citation xml:lang="ru">Blanz V., Vetter T.A. Morphable model for the synthesis of 3-D faces // 26th annual conference on Computer graphics and interactive techniques (SIGGRAPH ’99). ACM Press/AddisonWesley Publishing Co., USA, 1999. P. 5553-5562.</mixed-citation><mixed-citation xml:lang="en">Blanz V., Vetter T.A. Morphable model for the synthesis of 3-D faces. In: 26th annual conference on Computer graphics and interactive techniques (SIGGRAPH ’99). ACM Press/AddisonWesley Publishing Co., USA; 1999. P. 5553-5562.</mixed-citation></citation-alternatives></ref><ref id="cit13"><label>13</label><citation-alternatives><mixed-citation xml:lang="ru">Heo J.Three-dimensional generic elastic models for two-dimensionalpose synthesis and face recognition. Proquest, Umi Dissertation Publishing, 2011. 154 c.</mixed-citation><mixed-citation xml:lang="en">Heo J.Three-dimensional generic elastic models for two-dimensionalpose synthesis and face recognition. Proquest, Umi Dissertation Publishing; 2011. 154 p.</mixed-citation></citation-alternatives></ref><ref id="cit14"><label>14</label><citation-alternatives><mixed-citation xml:lang="ru">Korikov A.M., Tungusova A.V. Neural network technologies for image classification // 21st Int. Symp. Atmos. Ocean Opt. Atmos. Phys. 2015. Vol. 9680. P. 426–429.</mixed-citation><mixed-citation xml:lang="en">Korikov A.M., Tungusova A.V. Neural network technologies for image classification. 21st Int. Symp. Atmos. Ocean Opt. Atmos. Phys. 2015; 9680: 426–429.</mixed-citation></citation-alternatives></ref><ref id="cit15"><label>15</label><citation-alternatives><mixed-citation xml:lang="ru">Krizhevsky A., Sutskever I., Hinton G.E. Imagenet classification with deep convolutional neural networks // Adv. Neural Inf. Process. Syst. 2012. Vol. 25.</mixed-citation><mixed-citation xml:lang="en">Krizhevsky A., Sutskever I., Hinton G.E. Imagenet classification with deep convolutional neural networks. Adv. Neural Inf. Process. Syst. 2012; 25.</mixed-citation></citation-alternatives></ref><ref id="cit16"><label>16</label><citation-alternatives><mixed-citation xml:lang="ru">Face recognition: A literature survey / W. Zhao, R. Chellappa, P.J. Phillips, A. Rosenfeld // ACM Comput. Surv. 2003. Vol. 35, № 4. P. 399– 458.</mixed-citation><mixed-citation xml:lang="en">Zhao W., Chellappa R., Phillips P.J., Rosenfeld A. Face recognition: A literature survey. ACM Comput. Surv. 2003; 35(4): 399 – 458.</mixed-citation></citation-alternatives></ref><ref id="cit17"><label>17</label><citation-alternatives><mixed-citation xml:lang="ru">Is faster R-CNN doing well for pedestrian detection? / L. Zhang, L. Lin, X. Liang, K. He // Eur. Conf. Comput. vision. Springer, Cham, 2016. P. 443–457.</mixed-citation><mixed-citation xml:lang="en">Zhang L., Lin L., Liang X., He K. Is faster R-CNN doing well for pedestrian detection? Eur. Conf. Comput. vision. Springer, Cham; 2016. P. 443–457.</mixed-citation></citation-alternatives></ref><ref id="cit18"><label>18</label><citation-alternatives><mixed-citation xml:lang="ru">Коэльо Л. П., Ричарт В. Построение систем машинного обучения на языке Python / пер. с англ. А. А. Слинкин. 2-е изд. М.: ДМК Пресс, 2016. 302 с. URL: https://e.lanbook.com/book/82818</mixed-citation><mixed-citation xml:lang="en">Coelho L. P., Richart  V. Building machine learning systems in Python. Moscow: DMK Press; 2016. 302 p. (In Russ.). Available at: https://e.lanbook.com/book/82818</mixed-citation></citation-alternatives></ref><ref id="cit19"><label>19</label><citation-alternatives><mixed-citation xml:lang="ru">Флах П. Машинное обучение. Наука и искусство построения алгоритмов, которые извлекают знания из данных. М.: ДМК Пресс, 2015. 400 с. URL: https://e.lanbook.com/book/69955</mixed-citation><mixed-citation xml:lang="en">Flach P. Machine learning. The science and art of building algorithms that extract knowledge from data. Moscow: DMK Press; 2015. 400 p. (In Russ.). Available at: https://e.lanbook.com/book/69955</mixed-citation></citation-alternatives></ref><ref id="cit20"><label>20</label><citation-alternatives><mixed-citation xml:lang="ru">PyTorch. URL: https://pytorch.org/</mixed-citation><mixed-citation xml:lang="en">PyTorch. Available at: https://pytorch.org/</mixed-citation></citation-alternatives></ref><ref id="cit21"><label>21</label><citation-alternatives><mixed-citation xml:lang="ru">Numpy. URL: https://numpy.org/</mixed-citation><mixed-citation xml:lang="en">Numpy.  Available at: https://numpy.org/</mixed-citation></citation-alternatives></ref><ref id="cit22"><label>22</label><citation-alternatives><mixed-citation xml:lang="ru">How to install dlib library for Python in Windows 10. URL: https://medium.com/analytics-vidhya/how-to-install-dlib-library-for-python-in-windows-10-57348ba1117f 23. Opencv. URL: https://opencv.org/</mixed-citation><mixed-citation xml:lang="en">How to install dlib library for Python in Windows 10. Available at: https://medium.com/analytics-vidhya/how-to-install-dlib-library-for-python-in-windows-10-57348ba1117f23  Opencv. Available at: https://opencv.org/</mixed-citation></citation-alternatives></ref></ref-list><fn-group><fn fn-type="conflict"><p>The authors declare that there are no conflicts of interest present.</p></fn></fn-group></back></article>
