Prosodic feature extraction and singing voice synthesis with an End-to-end Neural network model sequence

Kočí, Ondřej


Master's thesis

Extrakce prozódických vlastností a syntéza zpěvu pomocí end-to-end sekvence neurálních modulů

Abstract (translated from the Czech):

Prosody is an inherent property of the human voice. However, most available voice synthesizers either ignore it or use only its average representation when generating artificial voice. This thesis proposes a new prototype composed of an end-to-end sequence of models, based on a deep neural network architecture. Using Mellotron (Rafael Valle et al. 2019), Tacotron (Yuxuan Wang et al …

Abstract:

Prosody is an intrinsic aspect of human speech. However, most popular speech synthesizers either ignore it or use only its average representation when synthesizing artificial voices. This thesis proposes a new prototype of an end-to-end model sequence based on a deep neural network architecture. Utilizing Mellotron (Rafael Valle et al. 2019), Tacotron (Yuxuan Wang et al. 2017), WaveGlow (Ryan Prenger et al. 2018 …

Language used: English

Date on which the thesis was submitted / produced: 1. 5. 2022

Identifier: https://vskp.vse.cz/eid/86021

Thesis defence

Date of defence: 2. 6. 2022
Supervisor: Jan Mittner
Reader: Petr Polák

Citation record

ISO 690-compliant citation record:

KOČÍ, Ondřej. Prosodic feature extraction and singing voice synthesis with an End-to-end Neural network model sequence. Online. Master's thesis. Praha: University of Economics, Prague. 2022. Available from: https://theses.cz/id/m4ma99/.

{{Citace kvalifikační práce
 | příjmení = Kočí
 | jméno = Ondřej
 | instituce = University of Economics, Prague
 | titul = Prosodic feature extraction and singing voice synthesis with an End-to-end Neural network model sequence
 | url = https://theses.cz/id/m4ma99/
 | typ práce = Master's thesis
 | vedoucí = Jan Mittner
 | rok = 2022
 | počet stran =
 | strany =
 | citace = 2024-04-26
 | poznámka =
 | jazyk = 
}}

Full text of thesis

Contents of on-line thesis archive

Published in Theses:

to authenticated staff of the same school/faculty

Other ways of accessing the text

Institution archiving the thesis and making it accessible: Vysoká škola ekonomická v Praze
https://vskp.vse.cz/eid/86021

Vysoká škola ekonomická v Praze

Master programme / field:
Information Systems and Technologies / Information Systems Development (Informační systémy a technologie / Vývoj informačních systémů)

Theses on a related topic

Segmentation of Dense Cell Populations using Convolutional Neural Networks
Filip Lux
Artificial Neural Networks in Space of Stock Returns: Volatility Prediction
Šimon Škorňa
Modul LSTM a Rekurentních neuronových sítí pro program Modeler neuronových sítí
Jiří Lagan
Modul samoorganizačních map pro program Modeler neuronových sítí
Jakub Komoráš
Modul SARSA pro Modeler neuronových sítí
Tomáš Mariňák
Modul RBM a DBM pro program Modeler neuronových sítí
Patrik Lyčka
Modul pro Reinforcment Learning pro Modeler neuronových sítí
Jakub Holaza
Modul pro Q-Learning pro Modeler neuronových sítí
Jan Bauer
