In terms of competition / optics of DuckDB vs DataFusion (vs pola.rs) -- I think the best approach is to define the areas each is best at rather than try to "compete" head to head. I would be quite happy to have performance comparable to (not necessarily faster than) DuckDB and pola.rs.
Some thoughts on the benefits of DataFusion where it has clear differentiation:
Target audience is different (developers rather than end users / data scientists)
Designed to be embedded (rather than designed to be a file based sql engine)
Community / ASF (rather than being tightly controlled in Amsterdam)
Rust implementation (all the cool kids want Rust, I hear!)
alamb changed the title from "Clarify DataFusion similarities and differences with duckdb, pola.rs and other similar systems" to "[DISCUSS] Clarify DataFusion similarities and differences with duckdb, pola.rs and other similar systems" on Mar 7, 2023
alamb changed the title from "[DISCUSS] Clarify DataFusion similarities and differences with duckdb, pola.rs and other similar systems" to "Clarify DataFusion similarities and differences with duckdb, pola.rs and other similar systems" on Mar 7, 2023
Please comment if you have any thoughts on these ideas:
I think it would be good to update the text here: https://github.com/apache/arrow-datafusion/blob/main/README.md#comparisons-with-other-projects