create_vectorized_features error #103
Comments
I have the same problem. @mrphilroth is this a common problem?
I have the same problem. If you have fixed it, please tell me how.
I have not been able to find a fix for this yet, although I have not spent a lot of time on it.
I had the same issue. Downgrading Python to 3.6 in my environment worked like a charm.
A way to fix it is to replace: in
Same problem. I started a fork to curate this repo. Also, my PR #108 fixes the issue.
Downgrading to Python 3.6 will easily solve this.
Fixed: ValueError: Samples can not be a single string. The input must be an iterable over iterables of strings.
Replace, at line 192:
entry_name_hashed = FeatureHasher(50, input_type="string").transform([raw_obj['entry']]).toarray()[0]
with:
entry_name_hashed = FeatureHasher(50, input_type="string").transform([ [raw_obj['entry']] ]).toarray()[0]
This way transform() receives an iterable over iterables of raw features, as the method requires.
Can anyone provide any insight on what the intended output for the entry name hash table is supposed to be? Using it the way it's written with Python 3.6 or earlier, the FeatureHasher hashes each character in the entry string. For example, if Using the fixed version, the FeatureHasher hashes the entire string, so an entry point string of In the grand scheme of the model, I don't know if either way has much of an impact, but it would be good to know if the authors intended the hash table to be one way or the other.
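For anyone trying to picture the difference between the two call shapes, here is a minimal pure-Python sketch of the hashing trick. It is not ember's or scikit-learn's code: toy_hash_features is a hypothetical stand-in, it uses MD5 rather than the hash FeatureHasher uses internally, and the entry point name is made up. It only shows why passing the string directly hashes each character, while wrapping it in a list hashes the whole string as one token.

```python
import hashlib

N_FEATURES = 50  # same width as the FeatureHasher(50, ...) call in this thread


def toy_hash_features(samples, n_features=N_FEATURES):
    """Count-based hashing trick; each sample is an iterable of string tokens.

    Illustration only: the bucket indices will not match scikit-learn's
    FeatureHasher, which uses a different hash function.
    """
    rows = []
    for sample in samples:
        row = [0] * n_features
        for token in sample:  # iterating over a bare str yields its characters
            digest = hashlib.md5(token.encode("utf-8")).digest()
            index = int.from_bytes(digest[:4], "little") % n_features
            row[index] += 1
        rows.append(row)
    return rows


entry = "entry_point"  # hypothetical entry point name (11 characters)

# Original call shape: the sample is the string itself,
# so every character becomes its own token.
per_char = toy_hash_features([entry])[0]

# Fixed call shape: the sample is a one-element list,
# so the whole string is hashed as a single token.
whole = toy_hash_features([[entry]])[0]

print(sum(per_char))  # 11, one count per character
print(sum(whole))     # 1, a single count for the whole string
```

Both vectors have 50 slots, but the per-character version spreads counts over several buckets, while the fixed version sets exactly one. Which of the two the authors intended is still the open question above.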
I have problems running the following commands in python:
I have installed the dependencies and tried on Docker with lief versions 0.9.0 and 0.10.1, and I still get the same failure:
It seems from the error message that the input is not in the format the vectorizer expects.
Is there any fix for this?