Derived from following repository: https://github.com/uoe-agents/uoe-rl2023-coursework
Algorithm implementations and wandb integration/hyperparameter tuning are my work. The rest of the code is from the above repository. (aside from some refactoring and minor changes)