Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

One-Hot Encoding #45

Closed
luccalb opened this issue Nov 13, 2024 · 1 comment · Fixed by #71
Closed

One-Hot Encoding #45

luccalb opened this issue Nov 13, 2024 · 1 comment · Fixed by #71
Assignees
Labels
component Related to a RTDIP component

Comments

@luccalb
Copy link

luccalb commented Nov 13, 2024

User Story

  1. As a RTDIP user
  2. I want to one hot encode incoming data
  3. So that i can use it for further ML processing

Additional context

Acceptance Criteria

  • A component exists that
    • receives the name of a column to apply the encoding to
    • receives an optional list of values to encode (one-hot encoding depends on the number of distinct values, having them as input parameter upfront makes it easier, otherwise you would have to identify distinct values from the data, which could be incomplete)

Definition of Done

  • Test cases have been created and are running successfully
  • Documentation for the new component was added
  • Github Actions are running without errors
@luccalb luccalb converted this from a draft issue Nov 13, 2024
@luccalb luccalb assigned luccalb and unassigned luccalb Nov 13, 2024
@luccalb luccalb added the component Related to a RTDIP component label Nov 13, 2024
@kristen149 kristen149 self-assigned this Nov 19, 2024
kristen149 pushed a commit that referenced this issue Nov 19, 2024
Signed-off-by: Minh Khue Tran <[email protected]>
@kristen149 kristen149 linked a pull request Nov 26, 2024 that will close this issue
@github-project-automation github-project-automation bot moved this from Awaiting Review to Feature Archive in amos2024ws01-feature-board Nov 26, 2024
@kristen149
Copy link

I move the component to src\sdk\python\rtdip_sdk\pipelines\transformers\spark\machine_learning\one_hot_encoding.py

@kristen149 kristen149 moved this from Feature Archive to Awaiting Review in amos2024ws01-feature-board Nov 27, 2024
@sanalmert sanalmert moved this from Awaiting Review to Feature Archive in amos2024ws01-feature-board Nov 27, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
component Related to a RTDIP component
Projects
Status: Feature Archive
Development

Successfully merging a pull request may close this issue.

2 participants