This repository has been archived by the owner on Oct 11, 2024. It is now read-only.

Benchmark Fix : Remove special tokens from warmup prompts #140

Merged

@varun-sundar-rabindranath varun-sundar-rabindranath commented Mar 20, 2024

Summary:
When sampling words at random for prompt generation, we sometimes pick up the <pad> token.
The tokenizer doesn't recognize it as a special token and leaves it in the prompt as-is. This causes the backend to fail with errors like:

../aten/src/ATen/native/cuda/Indexing.cu:1292: indexSelectLargeIndex: block: [312,0,0], thread: [64,0,0] Assertion `srcIndex < srcSelectDimSize` failed
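
A minimal sketch of the idea behind the fix, not the code from this PR: exclude special tokens from the candidate pool before sampling warmup words. The function name `sample_warmup_prompt` and its parameters are hypothetical and chosen only for illustration.

```python
import random
from typing import Iterable, Optional


def sample_warmup_prompt(
    vocab: Iterable[str],
    special_tokens: Iterable[str],
    num_words: int,
    seed: Optional[int] = None,
) -> str:
    """Build a random prompt from vocab entries, skipping special tokens.

    Hypothetical helper for illustration; not the benchmark's actual code.
    """
    special = set(special_tokens)
    # Drop entries such as "<pad>" so they can never appear in the prompt.
    candidates = [word for word in vocab if word not in special]
    if not candidates:
        raise ValueError("no non-special tokens available to sample from")
    rng = random.Random(seed)
    return " ".join(rng.choice(candidates) for _ in range(num_words))


if __name__ == "__main__":
    toy_vocab = ["hello", "<pad>", "world", "</s>", "benchmark"]
    print(sample_warmup_prompt(toy_vocab, ["<pad>", "</s>"], num_words=8, seed=0))
```

With a Hugging Face tokenizer, the two inputs could plausibly come from `tokenizer.get_vocab().keys()` and `tokenizer.all_special_tokens`; whether the benchmark script obtains them this way is an assumption.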

Test:
Manual tests

@varun-sundar-rabindranath varun-sundar-rabindranath merged commit d630f71 into main Mar 20, 2024
2 checks passed
@varun-sundar-rabindranath varun-sundar-rabindranath deleted the varun/fix-sporadic-benchmark-failure branch March 20, 2024 21:27