Reduce warnings in pytest output #10168

bdice · 2022-01-31T05:35:50Z

This PR reduces some warnings in pytest output.

The pyarrow warning that I silenced was occurring thousands of times during our tests but should be fixed by #9686, so I marked it with a TODO.

I just tackled a few files for now, aiming for around 100 LOC changed for ideal review size.

…0.1 (rapidsai#9686).

codecov · 2022-01-31T07:12:32Z

Codecov Report

Merging #10168 (a2fbb64) into branch-22.04 (a7d88cd) will increase coverage by 0.05%.
The diff coverage is 0.00%.

@@               Coverage Diff                @@
##           branch-22.04   #10168      +/-   ##
================================================
+ Coverage         10.42%   10.47%   +0.05%     
================================================
  Files               119      122       +3     
  Lines             20603    20487     -116     
================================================
- Hits               2148     2147       -1     
+ Misses            18455    18340     -115

Impacted Files	Coverage Δ
python/cudf/cudf/_fuzz_testing/fuzzer.py	`0.00% <0.00%> (ø)`
python/cudf/cudf/_fuzz_testing/io.py	`0.00% <0.00%> (ø)`
python/cudf/cudf/_fuzz_testing/main.py	`0.00% <0.00%> (ø)`
python/cudf/cudf/_version.py	`0.00% <0.00%> (ø)`
python/cudf/cudf/comm/gpuarrow.py	`0.00% <0.00%> (ø)`
python/cudf/cudf/core/_base_index.py	`0.00% <0.00%> (ø)`
python/cudf/cudf/core/column/categorical.py	`0.00% <0.00%> (ø)`
python/cudf/cudf/core/column/column.py	`0.00% <0.00%> (ø)`
python/cudf/cudf/core/column/datetime.py	`0.00% <ø> (ø)`
python/cudf/cudf/core/column/numerical.py	`0.00% <0.00%> (ø)`
... and 42 more

Continue to review full report at Codecov.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 4e8cb4f...a2fbb64. Read the comment docs.

python/cudf/cudf/core/frame.py

bdice · 2022-01-31T23:48:04Z

python/cudf/cudf/tests/test_column.py

@@ -29,8 +29,37 @@

 @pytest.fixture(params=dtypes, ids=dtypes)
 def pandas_input(request):


This fixture was throwing a lot of warnings about unsafe casting -- in particular, things like creating a Series with dtype int8 from data that could exceed 127 (the max random value was 1000). Instead, I implemented a function that creates random values based on the type.

python/cudf/cudf/tests/test_column.py

…ings

bdice · 2022-02-03T05:52:40Z

pytest output on branch-22.04, gpuCI running commit b2b9232 on Python 3.9:

88729 passed, 2399 skipped, 985 xfailed, 1978 xpassed, 19524 warnings in 2314.01s (0:38:34)

Warnings on this branch (as of 99006ce, results from a local build):

88693 passed, 2401 skipped, 985 xfailed, 1978 xpassed, 3041 warnings in 618.52s (0:10:18)

Looks like this PR cuts the warning count from 19524 warnings to 3041. That's a good start!

vyasr

I left some suggestions for improvement, otherwise LGTM.

python/cudf/cudf/tests/test_column.py

Co-authored-by: Vyas Ramasubramani <[email protected]>

…ings

cwharris · 2022-02-04T16:32:48Z

python/cudf/cudf/tests/test_column.py

+    def random_ints(dtype, size):
+        dtype_min = np.iinfo(dtype).min
+        dtype_max = np.iinfo(dtype).max
+        return rng.integers(dtype_min, dtype_max, size=size, dtype=dtype)
+
+    try:
+        dtype = np.dtype(dtype)
+    except TypeError:
+        if dtype == "category":
+            data = random_ints(np.int64, size)
+        else:
+            raise
+    else:
+        if dtype.kind == "b":
+            data = rng.choice([False, True], size=size)
+        elif dtype.kind in ("m", "M"):
+            # datetime or timedelta
+            data = random_ints(np.int64, size)
+        elif dtype.kind == "U":
+            # Unicode strings of integers like "12345"
+            data = random_ints(np.int64, size).astype(dtype.str)
+        elif dtype.kind == "f":
+            # floats in [0.0, 1.0)
+            data = rng.random(size=size, dtype=dtype)
+        else:
+            data = random_ints(dtype, size)


Seems like this could be more generally useful. Which also makes me wonder... does something like this already exist in cudf, and can we re-use that?

That's definitely a good question. I plan to investigate that further and strive for de-duplication as I work through some more warnings. Lots of the warnings are affected by dtypes!

bdice · 2022-02-04T17:12:48Z

@gpucibot merge

bdice added 3 commits January 30, 2022 22:20

Fix warnings for ceil/floor.

47be09f

Fix warnings in test_unaops.py.

51d4b74

Silence warning from pyarrow 5.0.0. Appears to be fixed in pyarrow 6.…

b6428b2

…0.1 (rapidsai#9686).

bdice self-assigned this Jan 31, 2022

github-actions bot added the Python Affects Python cuDF API. label Jan 31, 2022

bdice added improvement Improvement / enhancement to an existing function non-breaking Non-breaking change tech debt labels Jan 31, 2022

bdice added 3 commits January 31, 2022 09:18

Generate random values appropriate for the given dtype.

dbfa651

Avoid creating GPU objects in parametrize.

47669c2

Update python/cudf/cudf/tests/test_column.py

d2c0c4b

bdice marked this pull request as ready for review January 31, 2022 23:42

bdice requested a review from a team as a code owner January 31, 2022 23:42

bdice requested review from cwharris and skirui-source January 31, 2022 23:42

bdice commented Jan 31, 2022

View reviewed changes

python/cudf/cudf/core/frame.py Show resolved Hide resolved

bdice commented Jan 31, 2022

View reviewed changes

bdice commented Feb 1, 2022

View reviewed changes

python/cudf/cudf/tests/test_column.py Outdated Show resolved Hide resolved

vyasr mentioned this pull request Feb 3, 2022

[REVIEW] Fix typo in Frame.floor deprecation warning #10197

Closed

bdice added 2 commits February 2, 2022 23:27

Merge remote-tracking branch 'upstream/branch-22.04' into reduce-warn…

2c3c87e

…ings

Silence warning from pandas.testing.assert_series_equal.

99006ce

bdice requested a review from vyasr February 3, 2022 05:38

vyasr approved these changes Feb 4, 2022

View reviewed changes

bdice and others added 2 commits February 3, 2022 19:31

Apply suggestions from code review

d8a0f1b

Co-authored-by: Vyas Ramasubramani <[email protected]>

Fix parentheses.

e6a92dc

shwina approved these changes Feb 4, 2022

View reviewed changes

bdice added 2 commits February 4, 2022 08:22

Merge remote-tracking branch 'upstream/branch-22.04' into reduce-warn…

3867c65

…ings

Fix test function.

a2fbb64

cwharris approved these changes Feb 4, 2022

View reviewed changes

rapids-bot bot merged commit f654c4a into rapidsai:branch-22.04 Feb 4, 2022

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Reduce warnings in pytest output #10168

Reduce warnings in pytest output #10168

bdice commented Jan 31, 2022 •

edited

Loading

codecov bot commented Jan 31, 2022 •

edited

Loading

bdice Jan 31, 2022 •

edited

Loading

bdice commented Feb 3, 2022

vyasr left a comment

cwharris Feb 4, 2022

bdice Feb 4, 2022

bdice commented Feb 4, 2022

		@@ -29,8 +29,37 @@

		@pytest.fixture(params=dtypes, ids=dtypes)
		def pandas_input(request):

Reduce warnings in pytest output #10168

Reduce warnings in pytest output #10168

Conversation

bdice commented Jan 31, 2022 • edited Loading

codecov bot commented Jan 31, 2022 • edited Loading

Codecov Report

bdice Jan 31, 2022 • edited Loading

Choose a reason for hiding this comment

bdice commented Feb 3, 2022

vyasr left a comment

Choose a reason for hiding this comment

cwharris Feb 4, 2022

Choose a reason for hiding this comment

bdice Feb 4, 2022

Choose a reason for hiding this comment

bdice commented Feb 4, 2022

bdice commented Jan 31, 2022 •

edited

Loading

codecov bot commented Jan 31, 2022 •

edited

Loading

bdice Jan 31, 2022 •

edited

Loading