Feature scalar regexp match benchmark #13789

Open: wants to merge 28 commits into base: main
ecd4793
bench: scalar regex match benchmark
zhuliquan Dec 5, 2024
c3e0951
refactor: migrate `LinearSearch` to `HashTable` (#13658)
crepererum Dec 6, 2024
1e507ad
Minor: Comment temporary function for documentation migration (#13669)
comphead Dec 6, 2024
61fd077
Minor: Rephrase MSRV policy to be more explanatory (#13668)
comphead Dec 6, 2024
67260a0
fix: repartitioned reads of CSV with custom line terminator (#13677)
korowa Dec 7, 2024
3618cfe
chore: macros crate cleanup (#13685)
findepi Dec 7, 2024
d3e0860
Refactor regexplike signature (#13394)
jiashenC Dec 8, 2024
a960c6d
Performance: enable array allocation reuse (`ScalarFunctionArgs` gets…
alamb Dec 8, 2024
d39852d
Temporary fix for CI (#13689)
jonahgao Dec 8, 2024
98372cc
refactor: use `LazyLock` in the `user_doc` macro (#13684)
jonahgao Dec 8, 2024
e8226f5
Unlock lexical-write-integer version. (#13693)
Alexhuszagh Dec 9, 2024
bd91271
Minor: Use `div_ceil`
akurmustafa Dec 9, 2024
45926ab
Fix hash join with sort push down (#13560)
haohuaijin Dec 9, 2024
16d2ab1
Improve substr() performance by avoiding using owned string (#13688)
richox Dec 9, 2024
d8c9cfb
reinstate down_cast_any_ref (#13705)
andygrove Dec 9, 2024
f8c0efe
Optimize performance of `character_length` function (#13696)
tlm365 Dec 10, 2024
2d8bd42
Update prost-build requirement from =0.13.3 to =0.13.4 (#13698)
dependabot[bot] Dec 10, 2024
5dc6e42
Minor: Output elapsed time for sql logic test (#13718)
comphead Dec 10, 2024
4fb9d2a
refactor: simplify the `make_udf_function` macro (#13712)
jonahgao Dec 11, 2024
fa0440b
refactor: replace `Vec` with `IndexMap` for expression mappings in `P…
Weijun-H Dec 11, 2024
d3c459e
Handle alias when parsing sql(parse_sql_expr) (#12939)
Eason0729 Dec 11, 2024
ddfc9e5
Improve documentation for TableProvider (#13724)
alamb Dec 11, 2024
b494157
Reveal implementing type and return type in simple UDF implementation…
findepi Dec 11, 2024
3b5daa2
minor: Extract tests for `EXTRACT` AND `date_part` to their own file …
alamb Dec 11, 2024
50ce883
Support unparsing `UNNEST` plan to `UNNEST` table factor SQL (#13660)
goldmedal Dec 11, 2024
13b581a
Merge branch 'apache:main' into feature-scalar_regexp_match_benchmark
zhuliquan Dec 12, 2024
065eb47
Merge branch 'apache:main' into feature-scalar_regexp_match_benchmark
zhuliquan Dec 15, 2024
c697bb0
fix: take taplo formatter suggestion
zhuliquan Dec 16, 2024
Improve documentation for TableProvider (#13724)
alamb authored and zhuliquan committed Dec 11, 2024
commit ddfc9e5f95c468d845b65bf4f05a8fcb08366914
14 changes: 13 additions & 1 deletion datafusion/catalog/src/table.rs
@@ -33,7 +33,19 @@ use datafusion_expr::{
};
use datafusion_physical_plan::ExecutionPlan;

-/// Source table
+/// A named table which can be queried.
+///
+/// Please see [`CatalogProvider`] for details of implementing a custom catalog.
+///
+/// [`TableProvider`] represents a source of data which can provide data as
+/// Apache Arrow `RecordBatch`es. Implementations of this trait provide
+/// important information for planning such as:
+///
+/// 1. [`Self::schema`]: The schema (columns and their types) of the table
+/// 2. [`Self::supports_filters_pushdown`]: Whether filters should be pushed into this scan
+/// 3. [`Self::scan`]: An [`ExecutionPlan`] that can read data
+///
+/// [`CatalogProvider`]: super::CatalogProvider
#[async_trait]
pub trait TableProvider: Debug + Sync + Send {
/// Returns the table provider as [`Any`](std::any::Any) so that it can be