minor change to avoid confusion in ch2-5 #310

daspartho · 2022-09-12T07:13:22Z

very small PR

Change

Replaced "it" with "the tokenizer" on one line in ch2-5

Motivation

I was going through the course (which is fantastic by the way), and I was a little confused by this one line; perhaps it's just me, but I think it's better to replace "it" with the exact thing here to avoid confusion.

HuggingFaceDocBuilderDev · 2022-09-12T07:23:51Z

The documentation is not available anymore as the PR was closed or merged.

lewtun

Thanks for this tweak @daspartho and welcome to the 🤗 community!

I've left a small suggestion and then we can merge this!

lewtun · 2022-09-13T11:59:21Z

chapters/en/chapter2/5.mdx

@@ -87,7 +87,7 @@ InvalidArgumentError: Input to reshape is a tensor with 14 values, but the reque

 Oh no! Why did this fail? "We followed the steps from the pipeline in section 2.

-The problem is that we sent a single sequence to the model, whereas 🤗 Transformers models expect multiple sentences by default. Here we tried to do everything the tokenizer did behind the scenes when we applied it to a `sequence`, but if you look closely, you'll see that it didn't just convert the list of input IDs into a tensor, it added a dimension on top of it:
+The problem is that we sent a single sequence to the model, whereas 🤗 Transformers models expect multiple sentences by default. Here we tried to do everything the tokenizer did behind the scenes when we applied it to a `sequence`, but if you look closely, you'll see that the tokenizer didn't just convert the list of input IDs into a tensor, it added a dimension on top of it:


I think we can use this as an opportunity to split the long sentence :)

Suggested change

The problem is that we sent a single sequence to the model, whereas 🤗 Transformers models expect multiple sentences by default. Here we tried to do everything the tokenizer did behind the scenes when we applied it to a `sequence`, but if you look closely, you'll see that the tokenizer didn't just convert the list of input IDs into a tensor, it added a dimension on top of it:

The problem is that we sent a single sequence to the model, whereas 🤗 Transformers models expect multiple sentences by default. Here we tried to do everything the tokenizer did behind the scenes when we applied it to a `sequence`. But if you look closely, you'll see that the tokenizer didn't just convert the list of input IDs into a tensor, it added a dimension on top of it:

daspartho · 2022-09-13T15:15:41Z

@lewtun I've made the suggested changes :)

lewtun · 2022-09-13T15:30:51Z

Thanks for iterating!

minor change to avoid confusion

003a761

lewtun approved these changes Sep 13, 2022

View reviewed changes

added suggestion

d84f79c

lewtun merged commit 7a4a051 into huggingface:main Sep 13, 2022

daspartho deleted the ch2-5 branch September 13, 2022 15:51

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

minor change to avoid confusion in ch2-5 #310

minor change to avoid confusion in ch2-5 #310

daspartho commented Sep 12, 2022

HuggingFaceDocBuilderDev commented Sep 12, 2022 •

edited

Loading

lewtun left a comment

lewtun Sep 13, 2022

daspartho commented Sep 13, 2022

lewtun commented Sep 13, 2022

	The problem is that we sent a single sequence to the model, whereas 🤗 Transformers models expect multiple sentences by default. Here we tried to do everything the tokenizer did behind the scenes when we applied it to a `sequence`, but if you look closely, you'll see that the tokenizer didn't just convert the list of input IDs into a tensor, it added a dimension on top of it:
	The problem is that we sent a single sequence to the model, whereas 🤗 Transformers models expect multiple sentences by default. Here we tried to do everything the tokenizer did behind the scenes when we applied it to a `sequence`. But if you look closely, you'll see that the tokenizer didn't just convert the list of input IDs into a tensor, it added a dimension on top of it:

minor change to avoid confusion in ch2-5 #310

minor change to avoid confusion in ch2-5 #310

Conversation

daspartho commented Sep 12, 2022

Change

Motivation

HuggingFaceDocBuilderDev commented Sep 12, 2022 • edited Loading

lewtun left a comment

Choose a reason for hiding this comment

lewtun Sep 13, 2022

Choose a reason for hiding this comment

daspartho commented Sep 13, 2022

lewtun commented Sep 13, 2022

HuggingFaceDocBuilderDev commented Sep 12, 2022 •

edited

Loading