Unpin numba #23162
Failing test is unrelated (tf compile test)
@@ -116,7 +116,7 @@ def test_retrieve(self):
         retriever = self.get_dummy_retriever()
         tokenizer = retriever.tokenizer

-        retrieved_block_ids = np.array([0, 3], dtype=np.long)
+        retrieved_block_ids = np.array([0, 3], dtype="long")
np.long is also deprecated in favour of dtype="long"
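As a minimal sketch of the replacement suggested in this comment (the variable name `block_ids` is illustrative and not taken from the test), passing the dtype as a string avoids touching the `np.long` attribute at all. The exact bit width of "long" is platform-dependent, so the check below only assumes it is an integer type:

```python
import numpy as np

# Request the dtype by its string name instead of the np.long alias.
block_ids = np.array([0, 3], dtype="long")

# "long" maps to the platform's C long, so only the integer kind is checked here.
is_integer = np.issubdtype(block_ids.dtype, np.integer)
```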
Thanks!
* fix for ragged list
* unpin numba
* make style
* np.object -> object
* propagate changes to tokenizer as well
* np.long -> "long"
* revert tokenization changes
* check with tokenization changes
* list/tuple logic
* catch numpy
* catch else case
* clean up
* up
* better check
* trigger ci
* Empty commit to trigger CI
What does this PR do?
Numba was pinned to <0.57.0 in #23118 because Numba 0.57.0 forced an update of the numpy package to >= 1.24. From numpy >= 1.24, converting a ragged list to a numpy array requires the user to explicitly set dtype=object (previously this happened automatically, but raised a deprecation warning).

This PR updates the feature extraction and tokenisation utils to explicitly specify dtype=object when converting ragged lists to numpy arrays.
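The behaviour described above can be sketched as follows. This is an illustration only, not the code from this PR: the helper name `to_numpy` and its ragged-detection logic are assumptions made for the example. It requests an object array only when the nested sequences differ in length, and otherwise lets numpy infer the dtype as usual.

```python
import numpy as np


def to_numpy(value):
    """Hypothetical helper: convert a (possibly ragged) nested list to an array."""
    if isinstance(value, (list, tuple)) and len(value) > 0:
        nested = all(isinstance(item, (list, tuple)) for item in value)
        if nested and len({len(item) for item in value}) > 1:
            # Rows differ in length: numpy >= 1.24 raises an error unless
            # dtype=object is given explicitly.
            return np.asarray(value, dtype=object)
    # Rectangular input: let numpy infer the dtype.
    return np.asarray(value)


ragged = to_numpy([[1, 2, 3], [4, 5]])
regular = to_numpy([[1, 2], [3, 4]])
```

Here `ragged` is a one-dimensional object array holding two Python lists, while `regular` is an ordinary two-dimensional integer array.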