Sync #1 #4

Merged
merged 172 commits into aregm:master
Feb 18, 2021

Conversation

aregm
Owner

@aregm aregm commented Feb 18, 2021

What do these changes do?

  • commit message follows format outlined here
  • passes flake8 modin
  • passes black --check modin
  • signed commit with git commit -s
  • Resolves #?
  • tests added and passing

anmyachev and others added 30 commits October 20, 2020 16:36
* FIX-#2195: fix describe error for datasets with datetimes

Signed-off-by: Anatoly Myachev <[email protected]>

* FIX-#2195: add test

Signed-off-by: Anatoly Myachev <[email protected]>

* FIX-#2195: enable fix

Signed-off-by: Anatoly Myachev <[email protected]>

* FIX-#2195: Update modin/pandas/test/dataframe/test_reduction.py

Co-authored-by: Dmitry Chigarev <[email protected]>

Signed-off-by: Anatoly Myachev <[email protected]>
* TEST-#2260: use recommended pandas testing api

Signed-off-by: Anatoly Myachev <[email protected]>

* TEST-#2260: replace getSeriesData with test_data

Signed-off-by: Anatoly Myachev <[email protected]>

* TEST-#2260: remove assert_categories_equal

Signed-off-by: Anatoly Myachev <[email protected]>
…partitioning isn't aligned (#2256)

Signed-off-by: Alexey Prutskov <[email protected]>
* FIX-#2239: Compute row index start using pandas

Signed-off-by: Devin Petersohn <[email protected]>

* FIX-#2239: Documentation

Signed-off-by: Devin Petersohn <[email protected]>

* FIX-#2239: Improve testing for case

Signed-off-by: Devin Petersohn <[email protected]>
* FIX-#0000: Indexing regression

Signed-off-by: Devin Petersohn <[email protected]>

* FIX-#0000: Fix `loc`

Signed-off-by: Devin Petersohn <[email protected]>

* FIX-#0000: Fix DatetimeIndex

Signed-off-by: Devin Petersohn <[email protected]>

* FIX-#0000: Fix Datetime and checks

Signed-off-by: Devin Petersohn <[email protected]>
* DOCS-#2193: update contributing doc

Signed-off-by: Anatoly Myachev <[email protected]>
…2332)

* FIX-#2269: Move `default_to_pandas` logic from API layer to backend

Signed-off-by: Igoshev, Yaroslav <[email protected]>

* FIX-#2269: Added a test which calls _apply_agg_function

Signed-off-by: Gregory Shimansky <[email protected]>

* FIX-#2269: Added required arguments for groupby_agg

Moved wrap_udf_function into backend because omnisci doesn't support
executing lambdas.

Signed-off-by: Gregory Shimansky <[email protected]>

* FIX-#2269: Use correct default_to_pandas for groupby in backend,
refactor default to pandas functions in BaseQC

Signed-off-by: Igoshev, Yaroslav <[email protected]>

* FIX-#2269: Renamed new default_to_pandas_groupby function

into private function of Pandas backend because it is not used anywhere
else.

Signed-off-by: Gregory Shimansky <[email protected]>

* FIX-#2269: Fixed specification of backend

now it is possible to specify --backend=PandasOnDask,
--backend=PandasOnRay or --backend=PandasOnPython, not just
--backend=BaseOnPython.

Signed-off-by: Gregory Shimansky <[email protected]>

* FIX-#2269: Fix BaseOnPython tests

Signed-off-by: Igoshev, Yaroslav <[email protected]>

* FIX-#2269: Remove default_to_pandas_groupby

Signed-off-by: Igoshev, Yaroslav <[email protected]>

* FIX-#2269: logic of dropping 'by' moved back to API level

Signed-off-by: Dmitry Chigarev <[email protected]>

Co-authored-by: Gregory Shimansky <[email protected]>
Co-authored-by: Dmitry Chigarev <[email protected]>
anmyachev and others added 28 commits February 4, 2021 10:36
* TEST-#2670: some updates to improve asv tests stability

Signed-off-by: Anatoly Myachev <[email protected]>

* TEST-#2670: fixes

Signed-off-by: Anatoly Myachev <[email protected]>

* TEST-#2670: data_size -> shape

Signed-off-by: Anatoly Myachev <[email protected]>

* TEST-#2670: use dict approach

Signed-off-by: Anatoly Myachev <[email protected]>

* TEST-#2670: use CpuCount when Npartitions isn't defined

Signed-off-by: Anatoly Myachev <[email protected]>

* TEST-#2670: fix ASV_DATASET_SIZE

Signed-off-by: Anatoly Myachev <[email protected]>

* TEST-#2670: update TimeSortValues

Signed-off-by: Anatoly Myachev <[email protected]>

* TEST-#2670: modify asv tests for using with old modin version

Signed-off-by: Anatoly Myachev <[email protected]>

* TEST-#2670: reply to review comments

Signed-off-by: Anatoly Myachev <[email protected]>

* TEST-#2670: use env variables for default values

Signed-off-by: Anatoly Myachev <[email protected]>
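
The TEST-#2670 commits above mention falling back to the CPU count when the partition count is not defined, and reading default values from environment variables. A minimal sketch of that pattern follows. It is not the benchmark code from this PR: the helper name `resolve_npartitions` is hypothetical, and the variable name `MODIN_NPARTITIONS` is an assumption used only for illustration.

```python
import os


def resolve_npartitions(env=None):
    """Return the partition count from the environment, else the CPU count.

    `env` is any mapping of environment variables; it defaults to os.environ.
    The variable name MODIN_NPARTITIONS is assumed for this sketch.
    """
    env = os.environ if env is None else env
    value = env.get("MODIN_NPARTITIONS")
    if value is not None:
        return int(value)
    # os.cpu_count() can return None on some platforms, so fall back to 1.
    return os.cpu_count() or 1


print(resolve_npartitions({"MODIN_NPARTITIONS": "8"}))  # 8
```

Passing the environment as a mapping keeps the fallback easy to exercise in a test without mutating the real process environment.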
* TEST-#2686: add fillna benchmark

Signed-off-by: Anatoly Myachev <[email protected]>

* TEST-#2686: reply to review comments

Signed-off-by: Anatoly Myachev <[email protected]>

* TEST-#2686: add inplace parameter

Signed-off-by: Anatoly Myachev <[email protected]>
* TEST-#2692: add drop benchmark

Signed-off-by: Anatoly Myachev <[email protected]>

* TEST-#2692: add one column case

Signed-off-by: Anatoly Myachev <[email protected]>
* FIX-#2688: Update ray.ObjectID to ray.ObjectRef for Ray 2.0

Resolves #2688

Signed-off-by: Devin Petersohn <[email protected]>

* FIX-#2688: Address comments

Signed-off-by: Devin Petersohn <[email protected]>
* TEST-#2694: add head benchmark

Signed-off-by: Anatoly Myachev <[email protected]>

* TEST-#2694: add small number for head op

Signed-off-by: Anatoly Myachev <[email protected]>
* TEST-#2705: add 'value_counts' benchmarks

Signed-off-by: Dmitry Chigarev <[email protected]>

* TEST-#2705: apply suggestions from review

Signed-off-by: Dmitry Chigarev <[email protected]>
Co-authored-by: Alexey Prutskov <[email protected]>
Co-authored-by: Devin Petersohn <[email protected]>
Co-authored-by: Dmitry Chigarev <[email protected]>
Co-authored-by: Devin Petersohn <[email protected]>
Signed-off-by: Igoshev, Yaroslav <[email protected]>
* TEST-#2690: add astype benchmark

Signed-off-by: Anatoly Myachev <[email protected]>

* TEST-#2690: add category dtype; use df.types

Signed-off-by: Anatoly Myachev <[email protected]>

* TEST-#2690: add case with one column

Signed-off-by: Anatoly Myachev <[email protected]>
* TEST-#2702: add loc/iloc benchmark

Signed-off-by: Anatoly Myachev <[email protected]>

* TEST-#2702: add multiindex loc bench

Signed-off-by: Anatoly Myachev <[email protected]>

* TEST-#2702: add row_loc check

Signed-off-by: Anatoly Myachev <[email protected]>
* TEST-#2722: add ASV read_csv skiprows benchmark

Co-authored-by: Anatoly Myachev <[email protected]>
Signed-off-by: Alexander Myskov <[email protected]>
…tion (#2743)

* FIX-#2742: changed callable functions to their names in dict aggregation

Signed-off-by: Dmitry Chigarev <[email protected]>

* FIX-#2742: comments added

Signed-off-by: Dmitry Chigarev <[email protected]>
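
The FIX-#2742 commit title describes replacing callables with their names in a dict aggregation. As a rough illustration of that idea only, and not the change actually made in this PR, the sketch below converts callables in a `{column: function}` mapping to their `__name__` strings. The helper `callables_to_names` is hypothetical.

```python
def callables_to_names(agg_dict):
    """Replace callables in a {column: func or [funcs]} mapping with their names.

    Strings are passed through unchanged; lists and tuples are converted
    element by element and returned as lists.
    """

    def to_name(func):
        return func.__name__ if callable(func) else func

    return {
        col: [to_name(f) for f in funcs]
        if isinstance(funcs, (list, tuple))
        else to_name(funcs)
        for col, funcs in agg_dict.items()
    }


converted = callables_to_names({"a": sum, "b": ["mean", max]})
print(converted)  # {'a': 'sum', 'b': ['mean', 'max']}
```

A name-only mapping of this kind is plain data, which is generally easier to pass between processes than function objects; whether that was the motivation for the fix is not stated in the commit messages.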
* Resolves #2584

Signed-off-by: Devin Petersohn <[email protected]>
@aregm
Owner Author

aregm commented Feb 18, 2021

Sync

@aregm aregm merged commit 62bbe39 into aregm:master Feb 18, 2021