Program
Verification
Book

PV3 ⊧ Mungojerriegenerates a reward scheme from an ω-regular objective and checks it on finite models

Tool for testing reinforcement learning reward schemes for

ω

-regular objectives

Application domain/field

Reinforcement learning
$ω$ -regular objectives
Linear Temporal Logic (LTL)
Branching Markov Decision Processes (BMDPs)
Reward functions

Expected input

Model (Branching Markov Decision Processes (BMDPs))
Properties ( $ω$ -automata)

Format:

Model: finite state and action models in PRISM language
Properties: $ω$ -automata in HOA

Links

Project page: https://plv.colorado.edu/wwwmungojerrie/

Related papers

Model-Free Reinforcement Learning for Branching Markov Decision Processes (CAV '21)

Last publication date

15 July 2021

ProVerB specific

Markdown description: view/edit
Contained in the ProVerB22 dataset (paper + artefact)

ProVerB is a part of SLEBoK. Last updated: February 2023.