Using the Hugging Face Trainer, I apply a RoBERTa model to the Microsoft Research Sentence Completion Challenge (MRSCC) and achieve an accuracy of 82.6%. A CBOW model is also implemented for the MRSCC.
Two notebooks are presented, one per architectural approach: the CBOW model, and the RoBERTa model trained with the Hugging Face Trainer.
Both notebooks were developed in Colab with a dedicated GPU for training. The CBOW model is built in PyTorch; specifically, PyTorch Lightning is used to reduce boilerplate code and improve the development experience. Ray Tune is used to launch hyper-parameter optimisation trials for both approaches. For the RoBERTa model, the Hugging Face Trainer (which uses PyTorch internally) is used, as it offers excellent, easy-to-use functionality.
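To give a flavour of the Lightning approach, here is a minimal sketch of a CBOW module; the vocabulary size, embedding dimension, and learning rate are illustrative assumptions rather than the notebook's actual values.

```python
import torch
import torch.nn as nn
import pytorch_lightning as pl

class CBOW(pl.LightningModule):
    """Continuous bag-of-words: predict the centre word from its context."""

    def __init__(self, vocab_size=20_000, embed_dim=128, lr=1e-3):  # illustrative sizes
        super().__init__()
        self.save_hyperparameters()
        self.embeddings = nn.Embedding(vocab_size, embed_dim)
        self.out = nn.Linear(embed_dim, vocab_size)
        self.loss_fn = nn.CrossEntropyLoss()

    def forward(self, context_ids):
        # Average the context-word embeddings, then score every vocabulary word.
        return self.out(self.embeddings(context_ids).mean(dim=1))

    def training_step(self, batch, batch_idx):
        context_ids, target_ids = batch
        loss = self.loss_fn(self(context_ids), target_ids)
        self.log("train_loss", loss)
        return loss

    def configure_optimizers(self):
        return torch.optim.Adam(self.parameters(), lr=self.hparams.lr)

# Training is then driven by the Lightning Trainer, e.g.
# pl.Trainer(max_epochs=5).fit(CBOW(), train_dataloader)
```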
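On the RoBERTa side, a minimal sketch of a Hugging Face Trainer setup might look like the following; the stand-in corpus, output directory, and hyper-parameter values are assumptions for illustration, not the notebook's actual configuration.

```python
from transformers import (
    DataCollatorForLanguageModeling,
    RobertaForMaskedLM,
    RobertaTokenizerFast,
    Trainer,
    TrainingArguments,
)

tokenizer = RobertaTokenizerFast.from_pretrained("roberta-base")
model = RobertaForMaskedLM.from_pretrained("roberta-base")

# Stand-in corpus; the notebook trains on a real text corpus instead.
texts = ["Holmes leaned back in his chair and lit his pipe."]
encodings = tokenizer(texts, truncation=True, max_length=64)
train_dataset = [
    {"input_ids": ids, "attention_mask": mask}
    for ids, mask in zip(encodings["input_ids"], encodings["attention_mask"])
]

# Dynamic masking for masked-language-model fine-tuning.
collator = DataCollatorForLanguageModeling(tokenizer=tokenizer, mlm_probability=0.15)

args = TrainingArguments(
    output_dir="roberta-mrscc",      # illustrative path
    per_device_train_batch_size=16,  # illustrative value
    num_train_epochs=3,              # illustrative value
    learning_rate=2e-5,              # illustrative value
)

Trainer(
    model=model,
    args=args,
    data_collator=collator,
    train_dataset=train_dataset,
).train()
```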
The main sections of each notebook have comparable headings; the general workflow for both is as follows.
- Import libraries
- Load MRSCC data
- Load the data into datasets for the model
- Declare the model and prepare it for training.
- Tune hyper-parameters, automatically or manually, over a range of values (a Ray Tune sketch follows this list).
- Test the model on the MRSCC sentences using a variety of techniques and report the accuracy (a scoring sketch is shown after the tuning example below).
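For the tuning step, a minimal Ray Tune sketch follows; the toy trainable, search space, and trial count are illustrative assumptions, and details of the metric-reporting API vary between Ray versions.

```python
from ray import tune

def train_model(config):
    # Stand-in for the notebooks' real training loops: pretend accuracy
    # peaks at a mid-range learning rate. Returning a dict from a function
    # trainable reports it as the trial's final result.
    accuracy = 1.0 - abs(config["lr"] - 1e-4) * 1_000
    return {"accuracy": accuracy}

analysis = tune.run(
    train_model,
    config={
        "lr": tune.loguniform(1e-5, 1e-3),        # illustrative search space
        "batch_size": tune.choice([16, 32, 64]),
    },
    num_samples=10,                               # illustrative trial count
)
print(analysis.get_best_config(metric="accuracy", mode="max"))
```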
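And for the evaluation step, one plausible technique is to place RoBERTa's mask token in the MRSCC blank and rank the five candidate words by the model's prediction at that position; the sentence and candidates below are made up for illustration, not taken from the test set.

```python
import torch
from transformers import RobertaForMaskedLM, RobertaTokenizerFast

tokenizer = RobertaTokenizerFast.from_pretrained("roberta-base")
model = RobertaForMaskedLM.from_pretrained("roberta-base")
model.eval()

# Illustrative question in the MRSCC style: one blank, five candidates.
sentence = "The detective examined the _____ with great care."
candidates = ["clue", "banana", "melody", "ocean", "thunder"]

inputs = tokenizer(sentence.replace("_____", tokenizer.mask_token), return_tensors="pt")
mask_index = (inputs["input_ids"][0] == tokenizer.mask_token_id).nonzero().item()

with torch.no_grad():
    logits = model(**inputs).logits[0, mask_index]

# Score each candidate by the logit of its first sub-token at the mask;
# a fuller approach would handle multi-token candidates properly.
scores = {}
for word in candidates:
    first_token_id = tokenizer(" " + word, add_special_tokens=False)["input_ids"][0]
    scores[word] = logits[first_token_id].item()

print(max(scores, key=scores.get))
```

Other techniques, such as scoring the full sentence with each candidate substituted in, can be compared within the same framework.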