-
Notifications
You must be signed in to change notification settings - Fork 239
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add ClusterSeries to Hero Series Types #170
base: master
Are you sure you want to change the base?
Add ClusterSeries to Hero Series Types #170
Conversation
suport MultiIndex as function parameter returns MultiIndex, where Representation was returned * missing: correct test Co-authored-by: Henri Froese <[email protected]>
*missing: test adopting for new types Co-authored-by: Henri Froese <[email protected]>
- add functionality for decorator @InputSeries to handle several allowed input types - Add typing decorator/hints to representation.py - add tests for _types DocumentTermDF Co-authored-by: Maximilian Krahn <[email protected]>
Co-authored-by: Maximilian Krahn <[email protected]>
Note: Black (our formatter) just rolled out V20.8b1 3 days ago. This creates errors with our ./tests.sh in preprocessing because of whitespace. Will investigate this further but atm we set the black version in EDIT: found the issue, see the issue opened at Black here |
Merged master branch into this one, so it is ready for review/to be merged 🦂 🐼 |
We have noticed that with topic modelling etc. coming up, we get more and more use out of the clustering functions. It thus makes sense to introduce a
HeroSeries
typeClusterSeries
through this PR.A ClusterSeries has dtype "category" and every entry is a cluster-ID (e.g. 5 or "topic 1"). For example,
pd.Series([0, 3, 0, 1], dtype="category")
is a valid ClusterSeries.NOTE: only so many commits/lines as this builds on #157