Modelling prosody in the task of human speech synthesis with the use of machine learning

dc.contributor.advisorКрюкова, Галина
dc.contributor.authorПроцик, Олексій
dc.date.accessioned2020-12-05T21:08:26Z
dc.date.available2020-12-05T21:08:26Z
dc.date.issued2020
dc.description.abstractGenerating high fidelity speech using a text-to-speech (TTS) system remains a challenging task despite the decades of research and investigations. Modern TTS systems are very complex. For example, it is a common practice for a statistical TTS system to have a linguistic extractor in the front, which extracts different linguistic features. It is followed by a duration model to estimate the speech length in time of a given text and an acoustic feature prediction model. Given these features, it is all fed into a vocoder, which synthesizes speech out of acoustic features. All these components are trained independently and require extensive field knowledge to be sophisticated enough and produce considerable results. Because it has a modular design, it is prone to errors which will proceed in the following modules and can accumulate.uk_UA
dc.identifier.urihttps://ekmair.ukma.edu.ua/handle/123456789/18999
dc.language.isoenuk_UA
dc.statusfirst publisheduk_UA
dc.subjectmodelling prosodyuk_UA
dc.subjectthe task of human speechuk_UA
dc.subjectsynthesisuk_UA
dc.subjectthe useuk_UA
dc.subjectmachine learninguk_UA
dc.subjectбакалаврська роботаuk_UA
dc.titleModelling prosody in the task of human speech synthesis with the use of machine learninguk_UA
dc.typeOtheruk_UA
Files
Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Protsyk_Bakalavrska_robota.pdf
Size:
1.11 MB
Format:
Adobe Portable Document Format
Description:
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
7.54 KB
Format:
Item-specific license agreed upon to submission
Description: