Automating Empathy in Dialogue Systems

This is the official respository for the Automating Empathy in Dialogue Systems project. A complete technical report is available here. All our model checkpoints can be found here. In case of any queries, please contact [email protected].

Please run pip install -r requirements.txt to install all third-party libraries required by the framework.

To setup the EPITOME, EMO-ACC, and INTENT-ACC models, use the following steps:

Download all files from the Google Drive hosted by Lee et al., 2022.
Create a directory called empathy_metric_models in your root folder.
Create a sub-directory in empathy_metric_models called epitome_models and place the fine_tuned_ER.pth, fine_tuned_EX.pth, and fine_tuned_IP.pth files in this sub-directory.
Create a sub-directory in empathy_metric_models called emo_classifier. Unpack emotion.tar.gz and place all its contents in this sub-directory.
Create a sub-directory in empathy_metric_models called intent_classifier. Unpack empintent.tar.gz and place all its contents in this sub-directory.

We provide some example commands for the training and testing framework below. Additional command line arguments can also be included. Simply refer to the available list of arguments in the parse_args function of the corresponding python script. We also have additional scripts such as test_gpt3.py, interact.py, evaluate.py, compare.py, and multi-turn.py to aide with testing and evaluation as well as interact with models. Please refer to these scripts to learn how to use them.

Fine-tuning a dialogue model

Following is an example command:

python3 src/train_dialogue_model.py --dataset_dir datasets/empathetic_dialogues --model GODEL_LARGE --output_dir ~/fine_tuned_models/GODEL_LARGE --max_epochs 5 --batch_size 8 --initial_lr 0.00001

To add a custom model, follow these steps:

Add the model class inherited from EncoderDecoderModel or DecoderModel to src/dialogue_models.py with appropriately defined functions (see base_classes.py for all top-level functions for the model class)
Add the tokenizer class inherited from TokenizerBase or HuggingFaceTokenizerBase to src/custom_tokenizers.py with appropriately defined functions (see base_classes.py for all top-level functions for the tokenizer class)
Add a model configuration in configs.json with any custom name and the corresponding model and tokenizer class as well as any additional arguments. The format can be copied from some pre-defined configurations.

A custom dataset can be added by splitting the dataset directory into train, val, and test subdirectories and formatting the data in each directory in the following categories which should be self-explanatory: contexts.json, targets.json, instructions.json(optional), and knowledge.json (optional). See datasets/empathetic_dialogues or datasets/blended_skill_talk for reference.

Fine-tuning a reward model

Following is an example command:

python3 src/train_reward_model.py --dataset_dir datasets/reward_dataset --model GODEL_LARGE --output_dir ~/reward_models/GODEL_LARGE --max_epochs 1 --initial_lr 0.00001.

Batch size support is not included yet. Custom models are added exactly as discussed in the last section. To add a custom dataset, specify a train subdirectory in the dataset directory formatted in the following categories: contexts.json, responses.json, and ratings.json. We use the same dataset for validation and have not yet included support for testing reward models. See datasets/reward_dataset for reference.

Reinforcement Learning using PPO

Following is an example command:

python3 src/train_rlhf_pipeline.py --dataset_dir datasets/empathetic_dialogues --pretrained_model_dir ~/fine_tuned_models/GODEL_LARGE/tensorboard_logs/version_0 --reward_model_dir ~/reward_models/GODEL_LARGE/tensorboard_logs/version_0 --output_dir ~/fine_tuned_models/GODEL_LARGE_RLHF --max_epochs 5 --batch_size 8 --initial_lr 0.00001 --beam_width 1 --sample --top_p 0.9 --top_k 50.

Testing

Following is an example command:

python3 src/test.py --dataset_dir datasets/empathetic_dialogues --pretrained_model_dir ~/fine_tuned_models/GODEL_LARGE_RLHF/tensorboard_logs/version_0 --emo_classifier_dir ~/empathy_metric_models/emo_classifier --intent_classifier_dir ~/empathy_metric_models/intent_classifier --epitome_dir ~/empathy_metric_models/epitome_models --reward_model_dir ~/reward_models/GODEL_LARGE --batch_size 128.

Name		Name	Last commit message	Last commit date
Latest commit History 131 Commits
crowd_sourced_feedback		crowd_sourced_feedback
datasets		datasets
fine_tuned_models		fine_tuned_models
human_evaluation		human_evaluation
pretrained_models		pretrained_models
src		src
README.md		README.md
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Automating Empathy in Dialogue Systems

Fine-tuning a dialogue model

Fine-tuning a reward model

Reinforcement Learning using PPO

Testing

About

Releases

Packages

Languages

tarushbansal/automating-empathy-project

Folders and files

Latest commit

History

Repository files navigation

Automating Empathy in Dialogue Systems

Fine-tuning a dialogue model

Fine-tuning a reward model

Reinforcement Learning using PPO

Testing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages