Development of an algorithm for automatic synchronization of lips and facial features in a video stream with audio

Андронік, Владислав

Files

Andronik_Bakalavrska_robota.pdf (3.46 MB)

Date

2021

Authors

Андронік, Владислав

Abstract

This work presents a deep learning solution for generating talking-face images. We survey the existing literature to compose a more efficient network design. The final version adds a pre-trained discriminator network to improve lip synchronization and uses adversarial training to improve the visual quality of the images. We provide a comparative analysis and ablation studies that show how the different components of the solution affect the result. The approach achieves lip-movement consistency comparable to other solutions in the field, with higher visual quality.

Keywords

deep learning, face animation, lip synchronization, generative adversarial networks, bachelor thesis

URI

https://ekmair.ukma.edu.ua/handle/123456789/22540

Collections

F3 Computer Science