Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Remove CoGBK in MLTransform's TFTProcessHandler #30146
Remove CoGBK in MLTransform's TFTProcessHandler #30146
Changes from 5 commits
abc1522
ee770c4
48614b3
98404ca
094a888
066c4ce
cddc98a
b8dd486
e0fc00b
File filter
Filter by extension
Conversations
Jump to
There are no files selected for viewing
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
interesting. Is it possible for
exclude_columns
be emtpy? I'd imagine it could rather be the opposite, where all columns are being processed, so there is nothing to encode/decode.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yes, that is right but it errors because we are adding the temp id column name to the schema during construction so TFT errors out if the pcoll doesn't have the temp id column. So when the unused columns are none, we have to encode the empty dict and pass it to the PColl.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
what is the function of
.item()
here? what is the type of clone[_TEMP_KEY]? are the elements in given that we call .item() here - will elements inclone
have consistent type after decoding?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Type of clone[_TEMP_KEY] is a numpy array and .item() returns underlying element of the numpy array.
It should be. depending on the Coder.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
leftover comment, also we no longer add keys , so
keyed_
might not be the best name.There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Removed the keyed_ from variable names
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
for my understanding, why is this called raw_data_list? it's modified, so not raw i think, and what's here about
_list
?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yes, it is modified. I removed raw from the variable name.
_list: we convert the scalar element to list (len:1) to maintain uniformity. Users can pass list/np arrays to TFT ops and TFT outputs numpy arrays. Users when pass scalars, TFT outputs scalars. to maintain consistent output format, we convert scalar to list.