Reinforcement Learning of Risk-Averse Policies in Markov Decision Processes

Vahala, Jiří

CS SKLog in Log in (EduId)

Theses eoq4da

Reinforcement Learning of Risk-Averse Policies in Markov Decision Processes – Bc. Jiří Vahala

Back to search

Bc. Jiří Vahala

Master's thesis

Reinforcement Learning of Risk-Averse Policies in Markov Decision Processes

Abstract:

Optimalizace průměrné kumulované odměny za nejistoty výsledku je stěžejní problém v mnoha aplikacích. Typické metody posilovaného učení se soustředí pouze na maximalizaci průmřené kumulované odměny bez jakéhokoli přihlížení k risku. Tato práce shrnuje již existujicí metody zaobírající se maximalizací nejistého výsledku a navrhuje nový algoritmus posilovaného učení Ralf0, který optimalizuje strategie …more

Abstract:

Optimizing the expected cumulative reward under uncertainty is a crucial problem in many applications. A typical reinforcement learning approach is to maximize the expected cumulative reward without any sense of risk. In this thesis, we summarize already existing risk-averse learning techniques and introduce a new reinforcement learning algorithm Ralf0, which optimizes risk-averse policies without …more

Keywords

Ralf0 Risk-averse Reinforcement learning Policy MCTS MDP

Language used: English

Date on which the thesis was submitted / produced: 20. 5. 2019

Identifier: https://is.muni.cz/th/gv8zz/

Thesis defence

Date of defence: 18. 6. 2019
Supervisor: doc. RNDr. Tomáš Brázdil, Ph.D.
Reader: Mgr. Branislav Bošanský, Ph.D.

Citation record

Cite this text

ISO 690-compliant citation record:

VAHALA, Jiří. \textit{Reinforcement Learning of Risk-Averse Policies in Markov Decision Processes}. Online. Master's thesis. Brno: Masaryk University, Faculty of Informatics. 2019. Available from: https://theses.cz/id/eoq4da/.

@MastersThesis{Vahala2019thesis,
AUTHOR = {Vahala, Jiří},
TITLE = {Reinforcement Learning of Risk-Averse Policies in Markov Decision Processes},
YEAR = {2019},
TYPE = {Master's thesis},
INSTITUTION = {Masaryk University, Faculty of Informatics},
LOCATION = {Brno},
SUPERVISOR = {doc. RNDr. Tomáš Brázdil, Ph.D.},
URL = {https://theses.cz/id/eoq4da/},
URL_DATE = {2024-11-11},
}

{{Citace kvalifikační práce
 | příjmení = Vahala
 | jméno = Jiří
 | instituce = Masaryk University, Faculty of Informatics
 | titul = Reinforcement Learning of Risk-Averse Policies in Markov Decision Processes
 | url = https://theses.cz/id/eoq4da/
 | typ práce = Master's thesis
 | vedoucí = doc. RNDr. Tomáš Brázdil, Ph.D.
 | rok = 2019
 | počet stran =
 | strany =
 | citace = 2024-11-11
 | poznámka =
 | jazyk = 
}}

Full text of thesis

Contents of on-line thesis archive

Published in Theses:

světu

Other ways of accessing the text

Institution archiving the thesis and making it accessible: Masarykova univerzita, Fakulta informatiky

Reference to the local database directory of the institution

Masaryk University

Faculty of Informatics

Master programme / field:
Informatics / Artificial Intelligence and Natural Language Processing

Theses on a related topic

Experimental Evaluation of Risk-Averse Planners
Martin Bendel
Sampling Methods for Risk-Averse MDP Solvers
Václav Nevyhoštěný
Extending the Synthesis Algorithm for Consumption MDPs with LTL Objectives
Dávid Meluš
Sampling Methods for Risk-Averse MDP Solvers
Václav Nevyhoštěný
Vacant taxi routing in Markov Decision Process (MDP)
Nurbulat Shektbayev