Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: Renaming Cursor Functions #895

Merged
merged 25 commits into from
Jun 28, 2023

Conversation

aryan-rajoria
Copy link
Collaborator

@aryan-rajoria aryan-rajoria commented Jun 25, 2023

  • Renaming Cursor Functions to better understand their uses.
  • Fixing create_vector_index() docstring

@aryan-rajoria aryan-rajoria requested a review from gaurav274 June 25, 2023 18:50
This PR removes CREATE MATERIALIZED VIEW statement, a deprecated
feature.

---------

Co-authored-by: xzdandy <[email protected]>
Co-authored-by: jarulraj <[email protected]>
@jarulraj
Copy link
Member

@aryan-rajoria Please update the docs as well, that's very important.

jarulraj and others added 12 commits June 25, 2023 19:12
👋 Thanks for submitting a Pull Request to EvaDB!

🙌 We want to make contributing to EvaDB as easy and transparent as
possible. Here are a few tips to get you started:

- 🔍 Search existing EvaDB
[PRs](https://github.com/georgia-tech-db/eva/pulls) to see if a similar
PR already exists.
- 🔗 Link this PR to a EvaDB
[issue](https://github.com/georgia-tech-db/eva/issues) to help us
understand what bug fix or feature is being implemented.
- 📈 Provide before and after profiling results to help us quantify the
improvement your PR provides (if applicable).

👉 Please see our ✅ [Contributing
Guide](https://evadb.readthedocs.io/en/stable/source/contribute/index.html)
for more details.

---------

Co-authored-by: Chitti Ankith <[email protected]>
This PR presents a sample app that lets users run data analytics with
pandas like a conversation, credits to workflows presented by
[pandas-ai](https://github.com/gventuri/pandas-ai)
Example:


🔮 Welcome to EvaDB! This app lets you to run data analytics on a csv
file like in a conversational manner.
You will only need to supply a path to csv file and an OpenAI API key.

📋 Enter the csv file path (press Enter to use our default csv file):
[enter]
🔑 Enter your OpenAI key: [your openai key]
Could not find image processor class in the image processor config or
the model config. Loading based on pattern matching with the model's
feature extractor configuration.

\===========================================
🪄 Run anything on the csv table like a conversation!
What do you want to do with the dataframe? 
(enter 'exit' to exit): print out the 3 countries with the highest GDPs
⏳ Generating response (may take a while)...
\+--------------------------------------------------+
✅ Script body:
\# First, we need to sort the dataframe by GDP in descending order
sorted_df = df.sort_values(by='gdp', ascending=False)

\# Then, we can select the top 3 countries with the highest GDPs
top_3_countries = sorted_df.head(3)['country']


\# Finally, we can print out the result
print("The 3 countries with the highest GDPs are:")
for country in top_3_countries:
    print(country)
\+--------------------------------------------------+
🪄 Want to run it? (y/n): y
The 3 countries with the highest GDPs are:
United States
China
Japan
✅ Session ended.
\===========================================

---------

Co-authored-by: Yulai Cui <[email protected]>
Co-authored-by: Gaurav <[email protected]>
Co-authored-by: Yulai Cui <[email protected]>
Co-authored-by: jarulraj <[email protected]>
👋 Thanks for submitting a Pull Request to EvaDB!

🙌 We want to make contributing to EvaDB as easy and transparent as
possible. Here are a few tips to get you started:

- 🔍 Search existing EvaDB
[PRs](https://github.com/georgia-tech-db/eva/pulls) to see if a similar
PR already exists.
- 🔗 Link this PR to a EvaDB
[issue](https://github.com/georgia-tech-db/eva/issues) to help us
understand what bug fix or feature is being implemented.
- 📈 Provide before and after profiling results to help us quantify the
improvement your PR provides (if applicable).

👉 Please see our ✅ [Contributing
Guide](https://evadb.readthedocs.io/en/stable/source/contribute/index.html)
for more details.

---------

Co-authored-by: xzdandy <[email protected]>
This PR addresses two issues.

1. We previously only allows parallelism when there is more than two
GPUs available. I think this is too conservative. First, some functional
expressions only run on CPUs and even parallelization of CPU functional
expression can bring performance benefit. Additionally, even if there is
only one GPU available, it still provide performance benefit when
parallelization is enabled because of better GPU utilizations.
Currently, I just hardcode the DOP, but I am thinking to allow users to
configure this themselves through the `eva.yml`. Any feedback on this?
@xzdandy @gaurav274 @jarulraj

2. I realize a bug related to the previous approach. This will happen
for more than 1 GPU case. I will give a concrete example here. Let's
assume we have two GPUs. The GPU id that we get from the `Context` for
the second GPU is `1` because context runs in the main process, which
detects two GPUs on the PyTorch side (e.g., `GPU ID = [0, 1]`. However,
within the Ray process, after we set the environmental variable
`CUDA_VISIBLE_DEVICES=1`, the PyTorch indices is reset. Because it only
accesses one GPU within the Ray process, it only gets GPU ids like `GPU
ID = [0]`. Thus, it causes the no GPU error when it moves to GPU
`to('cuda:1')`. The fix is very easy, that we just simply expose all
GPUs to every Ray process, so we don't need to worry about PyTorch
device id reset. To expose all GPUs, we just find the max GPU id from
the context and expose all GPU ids that are lower than the max in the
environmental variables.

---------

Co-authored-by: xzdandy <[email protected]>
* Validate index scan with descending order works fine
* Add more error msg

---------

Co-authored-by: jarulraj <[email protected]>
👋 Thanks for submitting a Pull Request to EvaDB!

🙌 We want to make contributing to EvaDB as easy and transparent as
possible. Here are a few tips to get you started:

- 🔍 Search existing EvaDB
[PRs](https://github.com/georgia-tech-db/eva/pulls) to see if a similar
PR already exists.
- 🔗 Link this PR to a EvaDB
[issue](https://github.com/georgia-tech-db/eva/issues) to help us
understand what bug fix or feature is being implemented.
- 📈 Provide before and after profiling results to help us quantify the
improvement your PR provides (if applicable).

👉 Please see our ✅ [Contributing
Guide](https://evadb.readthedocs.io/en/stable/source/contribute/index.html)
for more details.
👋 Thanks for submitting a Pull Request to EvaDB!

🙌 We want to make contributing to EvaDB as easy and transparent as
possible. Here are a few tips to get you started:

- 🔍 Search existing EvaDB
[PRs](https://github.com/georgia-tech-db/eva/pulls) to see if a similar
PR already exists.
- 🔗 Link this PR to a EvaDB
[issue](https://github.com/georgia-tech-db/eva/issues) to help us
understand what bug fix or feature is being implemented.
- 📈 Provide before and after profiling results to help us quantify the
improvement your PR provides (if applicable).

👉 Please see our ✅ [Contributing
Guide](https://evadb.readthedocs.io/en/stable/source/contribute/index.html)
for more details.
@aryan-rajoria aryan-rajoria marked this pull request as ready for review June 26, 2023 16:09
@aryan-rajoria aryan-rajoria requested a review from jarulraj June 26, 2023 16:10
@jarulraj
Copy link
Member

jarulraj commented Jun 26, 2023

@aryan-rajoria We need to update the notebooks as well.

@jarulraj jarulraj added Feature Request ✨ New feature or request Minor 💅 Polishing EVA is always nice labels Jun 26, 2023
jarulraj and others added 3 commits June 26, 2023 18:36
👋 Thanks for submitting a Pull Request to EvaDB!

🙌 We want to make contributing to EvaDB as easy and transparent as
possible. Here are a few tips to get you started:

- 🔍 Search existing EvaDB
[PRs](https://github.com/georgia-tech-db/eva/pulls) to see if a similar
PR already exists.
- 🔗 Link this PR to a EvaDB
[issue](https://github.com/georgia-tech-db/eva/issues) to help us
understand what bug fix or feature is being implemented.
- 📈 Provide before and after profiling results to help us quantify the
improvement your PR provides (if applicable).

👉 Please see our ✅ [Contributing
Guide](https://evadb.readthedocs.io/en/stable/source/contribute/index.html)
for more details.

---------

Co-authored-by: Chitti Ankith <[email protected]>
👋 Thanks for submitting a Pull Request to EvaDB!

🙌 We want to make contributing to EvaDB as easy and transparent as
possible. Here are a few tips to get you started:

- 🔍 Search existing EvaDB
[PRs](https://github.com/georgia-tech-db/eva/pulls) to see if a similar
PR already exists.
- 🔗 Link this PR to a EvaDB
[issue](https://github.com/georgia-tech-db/eva/issues) to help us
understand what bug fix or feature is being implemented.
- 📈 Provide before and after profiling results to help us quantify the
improvement your PR provides (if applicable).

👉 Please see our ✅ [Contributing
Guide](https://evadb.readthedocs.io/en/stable/source/contribute/index.html)
for more details.

---------

Co-authored-by: Chitti Ankith <[email protected]>
@review-notebook-app
Copy link

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

@jarulraj jarulraj merged commit b574fd7 into georgia-tech-db:master Jun 28, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Feature Request ✨ New feature or request Minor 💅 Polishing EVA is always nice
Projects
Development

Successfully merging this pull request may close these issues.

4 participants