-
Notifications
You must be signed in to change notification settings - Fork 540
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[FEA] TfidfVectorizer to accept Pandas Series as input #3403
Comments
Thanks for the issue @abhipn , I was wondering if you could provide the script and data that caused this to happen? This would very helpful to triage the issue. Thanks! |
@dantegd I don't have the permissions to share the data, it's text sentences and i want to convert them to tfidf vector matrix.
This is all I have used, and it returned that error |
Based on the error message, it looks like you may be passing a pandas Series to @dantegd , perhaps this should be updated to be a feature request for input type conversion on the TFIDF vectorizer, and perhaps the other vectorizers if need be. |
This issue has been labeled |
Resolves #3403 This PR adds support for using `pandas.Series` as an input to `TfidfVectorizer`, `HashingVectorizer` and `CountVectorizer`. Authors: - Shaswat Anand (https://github.com/shaswat-indian) - Ray Douglass (https://github.com/raydouglass) Approvers: - Dante Gama Dessavre (https://github.com/dantegd) URL: #4811
Resolves rapidsai#3403 This PR adds support for using `pandas.Series` as an input to `TfidfVectorizer`, `HashingVectorizer` and `CountVectorizer`. Authors: - Shaswat Anand (https://github.com/shaswat-indian) - Ray Douglass (https://github.com/raydouglass) Approvers: - Dante Gama Dessavre (https://github.com/dantegd) URL: rapidsai#4811
I am trying to use tfidf vectorizer and I keep getting this below error,
I am using the cuml 0.17 stable.
The text was updated successfully, but these errors were encountered: