PV3 ⊧ Mungojerriegenerates a reward scheme from an ω-regular objective and checks it on finite models
Tool for testing reinforcement learning reward schemes for -regular objectivesApplication domain/field
- Reinforcement learning
- -regular objectives
- Linear Temporal Logic (LTL)
- Branching Markov Decision Processes (BMDPs)
- Reward functions
Expected input
- Model (Branching Markov Decision Processes (BMDPs))
- Properties (-automata)
Format: