Skip to content

Latest commit

 

History

History
17 lines (13 loc) · 851 Bytes

README.md

File metadata and controls

17 lines (13 loc) · 851 Bytes

text2task

This repo contains code for my master's thesis, Synthesis of reward functions from natural language descriptions.

YouTube video

The core task it accomplishes is captured by this workflow:

  1. The user describes the task in English. /images/ulaz.png
  2. The language model is invoked to translate the user's description into a reward machine formalism. /images/model.png
  3. QRM and the reward machine from the previous step are used to train the agent to perform the task described by the user in the first step. /images/training.png
  4. The results are visually demonstrated to the user in a small number of episodes. /images/demo.png