Window frame GROUPS mode support #4155

zembunia · 2022-11-09T16:49:57Z

Which issue does this PR close?

This PR provides the support of the GROUPS mode in the window frames, which was a missing item in #3570 enhancement.
The GROUPS mode is implemented regarding the specification in PostgreSQL window function calls.

Rationale for this change

This change is part of an enhancement #361 that is on the roadmap.

What changes are included in this PR?

The common single method calculating the window range (calculate_range) is removed from the window_expr. New structs that can hold any state information for each window frame mode are introduced.
The ROWS mode does not require a state as it is simple row index calculation, thought the state struct is empty apart from the simple calculate_range method specific to ROWS mode.
For the RANGE mode, a stateful calculation can be utilized in the future. For now, the state struct is empty and the specific calculate_range implementation is moved to the state struct.
For the GROUPS mode, a stateful implementation, that keeps track of the moving window range of groups for each consecutive row, is provided.
The frame exclusion is still not supported.

Observations

The implementation for the RANGE mode can also utilize a stateful implementation, instead of calculating the window range for each row from scratch.

Future work

Stateful RANGE mode implementation
A method to find the next group index, utilizing an exponentially growing step size, is implemented in this PR (find_next_group_and_start_index). This method can be improved to choose an approach depending on statistics about previous group sizes. It can either search the next group by advancing one-by-one (for small group sizes) or utilizing the exponentially growing step size, or even setting a base step size when exponentially growing. We can also create a benchmark implementation to get insights about the crossover point.

Are these changes tested?

New unit tests relevant to the added functionality are added in window_frame_state.rs. The tests in windows.rs is extended to cover the GROUPS mode, and a test file is added to the integration test SQLs.

Are there any user-facing changes?

No

…g search algorithm

ozankabak · 2022-11-09T19:23:05Z

@alamb, this already went through our internal review process, so I can say LGTM. Looking forward to getting community feedback.

alamb · 2022-11-09T21:38:30Z

Thank you @ozankabak -- I will put this on my review queue for tomorrow

alamb

@zembunia this is a very nice PR and a pleasure to read. It is well tested, well commented, and well structured. 🏆 Thank you.

I left a few style comments, but nothing that needs to be completed prior to merging from my perspective.

Here is the relevant description of GROUPs for anyone else reviewing this PR

In GROUPS mode, the offset again must yield a non-null, non-negative integer, and the option means that the frame starts or ends the specified number of peer groups before or after the current row's peer group, where a peer group is a set of rows that are equivalent in the ORDER BY ordering. (There must be an ORDER BY clause in the window definition to use GROUPS mode.)

I'll plan to merge this PR tomorrow unless there are any additional comments

datafusion/common/src/bisect.rs

alamb · 2022-11-10T19:58:55Z

datafusion/core/src/physical_plan/planner.rs

@@ -1511,12 +1511,6 @@ pub fn create_window_expr_with_name(
                })
                .collect::<Result<Vec<_>>>()?;
            if let Some(ref window_frame) = window_frame {
-                if window_frame.units == WindowFrameUnits::Groups {


alamb · 2022-11-10T19:59:17Z

datafusion/core/tests/sql/window.rs

+    let err = df.collect().await.unwrap_err();
+    assert_contains!(
+        err.to_string(),
+        "Execution error: GROUPS mode requires an ORDER BY clause".to_owned()


alamb · 2022-11-10T20:01:13Z

datafusion/physical-expr/src/window/built_in.rs

@@ -113,10 +114,10 @@ impl WindowExpr for BuiltInWindowExpr {
                    .iter()
                    .map(|v| v.slice(partition_range.start, length))
                    .collect::<Vec<_>>();
+                let mut window_frame_ctx = WindowFrameContext::new(&window_frame);


This is a very nice encapsulation of the window frame calculation. Thank you

alamb · 2022-11-10T20:05:20Z