Commit

update parameters and fix grammar for roberta-base and bert-base-cased notebook
Bruce Zhang committed Sep 14, 2022
1 parent 952b68e commit 1b3e9ca
Showing 2 changed files with 16 additions and 37 deletions.
@@ -50,7 +50,7 @@
"source": [
"## Introduction\n",
"\n",
"This notebooks is an end-to-end binary text classification example. In this demo, we use the Hugging Face's `transformers` and `datasets` libraries with SageMaker Training Compiler to compile and fine-tune a pre-trained transformer for binary text classification. In particular, the pre-trained model will be fine-tuned using the Stanford Sentiment Treebank (SST) dataset. To get started, you need to set up the environment with a few prerequisite steps, for permissions, configurations, and so on. \n",
"This notebook is an end-to-end binary text classification example. In this demo, we use the Hugging Face's `transformers` and `datasets` libraries with SageMaker Training Compiler to compile and fine-tune a pre-trained transformer for binary text classification. In particular, the pre-trained model will be fine-tuned using the `Stanford Sentiment Treebank (SST)` dataset. To get started, you need to set up the environment with a few prerequisite steps, for permissions, configurations, and so on. \n",
"\n",
"![image.png](attachment:image.png)\n",
"\n",
@@ -81,7 +81,7 @@
"metadata": {},
"outputs": [],
"source": [
"!pip install \"sagemaker>=2.108.0\" botocore boto3 awscli s3fs typing-extensions --upgrade"
"!pip install \"sagemaker>=2.108.0\" botocore boto3 awscli s3fs typing-extensions \"torch==1.11.0\" --upgrade"
]
},
{
@@ -112,7 +112,7 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"Copy and run the following code if you need to upgrade ipywidgets for `datasets` library and restart kernel. This is only needed when preprocessing is done in the notebook.\n",
"Copy and run the following code if you need to upgrade \"ipywidgets\" for `datasets` library and restart kernel. This is only needed when preprocessing is done in the notebook.\n",
"\n",
"```python\n",
"%%capture\n",
@@ -134,7 +134,7 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"**Note:** If you are going to use Sagemaker in a local environment. You need access to an IAM Role with the required permissions for SageMaker. To learn more, see [SageMaker Roles](https://docs.aws.amazon.com/sagemaker/latest/dg/sagemaker-roles.html)."
"**Note:** If you are going to use SageMaker in a local environment. You need access to an IAM Role with the required permissions for SageMaker. To learn more, see [SageMaker Roles](https://docs.aws.amazon.com/sagemaker/latest/dg/sagemaker-roles.html)."
]
},
{
@@ -176,7 +176,7 @@
"\n",
"If you'd like to try other training datasets later, you can simply use this method.\n",
"\n",
"For this example notebook, we prepared the [SST2 dataset](https://www.tensorflow.org/datasets/catalog/glue#gluesst2) in the public SageMaker sample S3 bucket. The following code cells show how you can directly load the dataset and convert to a HuggingFace DatasetDict."
"For this example notebook, we prepared the [SST2 dataset](https://www.tensorflow.org/datasets/catalog/glue#gluesst2) in the public SageMaker sample S3 bucket. The following code cells show how you can directly load the dataset and convert to a `HuggingFace DatasetDict`."
]
},
{
@@ -406,7 +406,7 @@
"source": [
"from sagemaker.pytorch import PyTorch\n",
"\n",
"hyperparameters = {\"epochs\": 5, \"train_batch_size\": 14, \"model_name\": \"bert-base-cased\"}\n",
"hyperparameters = {\"epochs\": 5, \"train_batch_size\": 16, \"model_name\": \"bert-base-cased\"}\n",
"\n",
"# Scale the learning rate by batch size, as original LR was using batch size of 32\n",
"hyperparameters[\"learning_rate\"] = float(\"5e-5\") / 32 * hyperparameters[\"train_batch_size\"]\n",
@@ -712,7 +712,7 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"### Plot and compare throughputs of compiled training and native training"
"### Plot and compare throughput of compiled training and native training"
]
},
{
@@ -765,7 +765,7 @@
"cell_type": "markdown",
"metadata": {},
"source": [
"#### Example output for SageMaker Training Compiler traing job\n",
"#### Example output for SageMaker Training Compiler training job\n",
"\n",
"{'train_runtime': 3742.9028,\n",
" 'train_samples_per_second': 89.969,\n",
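Review note: the two metrics visible in this example output can be combined to check how many samples a job processed, and to compare throughput between a compiled and a native run. The sketch below is only an illustration. `compiled_metrics` copies the two values shown above, while `baseline_metrics` holds placeholder numbers that are not measurements from the notebook. Both helper functions are hypothetical.

```python
def samples_processed(metrics: dict) -> float:
    """Approximate number of training samples seen: runtime (s) x samples/s."""
    return metrics["train_runtime"] * metrics["train_samples_per_second"]


def throughput_speedup(compiled: dict, baseline: dict) -> float:
    """Ratio of compiled throughput to baseline throughput (> 1 means faster)."""
    return compiled["train_samples_per_second"] / baseline["train_samples_per_second"]


# Values copied from the example output in this hunk.
compiled_metrics = {"train_runtime": 3742.9028, "train_samples_per_second": 89.969}

# PLACEHOLDER values for a native (uncompiled) run; replace with real job output.
baseline_metrics = {"train_runtime": 5600.0, "train_samples_per_second": 60.0}

total = samples_processed(compiled_metrics)
speedup = throughput_speedup(compiled_metrics, baseline_metrics)
print(f"samples processed (compiled job): {total:.0f}")
print(f"throughput speedup vs. placeholder baseline: {speedup:.2f}x")
```

A throughput ratio above 1 does not by itself guarantee a shorter end-to-end job, since total training time also includes work outside the measured training loop.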
@@ -801,27 +801,6 @@
"plt.xticks(ticks=[1, 1.5], labels=[\"Baseline PT\", \"SM-Training-Compiler-enhanced PT\"])"
]
},
-{
-"attachments": {
-"throughput.png": {
-"image/png": ""
-}
-},
-"cell_type": "markdown",
-"metadata": {},
-"source": [
-"## Training Throughput Example Plot\n",
-"\n",
-"![throughput.png](attachment:throughput.png)"
-]
-},
-{
-"cell_type": "markdown",
-"metadata": {},
-"source": [
-"**Note:** For this example, the compiler delivers higher throughput for an ML model as measured by samples per second. However, you might not see an improvement in the total training time for your model. The total training time depends on several other factors, such as key components of the Trainer and TFTrainer APIs."
-]
-},
{
"cell_type": "markdown",
"metadata": {},