Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat: add is_in to DaskExpr and mark is_duplicated and is_unique as not implemented #802

Merged
merged 1 commit into from
Aug 16, 2024

Conversation

benrutter
Copy link
Contributor

What type of PR is this? (check all applicable)

  • πŸ’Ύ Refactor
  • ✨ Feature
  • πŸ› Bug Fix
  • πŸ”§ Optimization
  • πŸ“ Documentation
  • βœ… Test
  • 🐳 Other

Related issues

Checklist

  • Code follows style guide (ruff)
  • Tests added
  • Documented the changes

If you have comments or can explain your changes, please do so below.

Main implementation here is is_in which is fairly simple (I've just used Dask's own isis method, and then removed the xfail from the Dask pytest).

I've also marked in is_duplicated and is_unique as not implemented. I think this is the right call because Dask doesn't have any internal duplicated check (at least yet! Looks like it could be on the cards soon-ish)

Here's the Github issues/PRs relating to Dask duplication checks:

@github-actions github-actions bot added the enhancement New feature or request label Aug 16, 2024
is_unique not implemented

precommit
Copy link
Member

@MarcoGorelli MarcoGorelli left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

thanks @benrutter !

@MarcoGorelli MarcoGorelli merged commit 4ba0ae9 into narwhals-dev:main Aug 16, 2024
20 of 21 checks passed
raise NotImplementedError(msg)

def is_unique(self: Self) -> Self:
msg = "`Expr.is_duplicated` is not support since Dask currently has no native duplicated check"
Copy link
Member

@FBruzzesi FBruzzesi Aug 16, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am late but

-        msg = "`Expr.is_duplicated` is not support since Dask currently has no native duplicated check"
+        msg = "`Expr.is_unique` is not support since Dask currently has no native duplicated check" 

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

aah crap, thanks - fancy bundling this in with another one?

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Sure!

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Doh! Sorry both πŸ€¦β€β™‚οΈ

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

No worries! I tend to say that until a release we are fine to mess up a bit πŸ˜‚

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants