-
Notifications
You must be signed in to change notification settings - Fork 93
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Proposal for Adding Contributions Space for Experimental Datasets #517
Comments
Add
The route that they took is different, tensorflow 1.0 was growing fast and
|
More questions:
|
Fun fact our 'extras' folder was called 'contrib' in v0.0.1 Two ideas:
|
Also we have some great community contributions in the form of the following:
We don't do a great job regarding discoverability and aren't really 'blessed' as Kedro approved. |
What about leaning into the community side via the website? We could do something like this where the user just needs to provide a class, a sample file and we do the rest? |
We discussed this issue in technical design and some of the points that came up were the following:
|
Pros/cons of proposed experimental contribution model:1. A
|
I would separate one-off costs (like creating new repos) from ongoing costs when weighing pros and cons. Creating a new repo is trivial (even with linting, CI/CD, PyPI publishing etc). "Need to adjust CI/CD and test-coverage to skip any datasets annotated with experimental", on the other hand, sounds like an ongoing pain. And "Harder to discover the experimental datasets" could be an ongoing problem too. |
I am happy with either 1/2, I fear yet another package is going to diverging users from the main one. For 3., we may need to add another RTD project which feels confusing. Actually, I am not sure how RTD will work because it required the package to be installed, so the pre-requisite is installing all the package? For 2. I will add one pro compare to 2, which is when something is not "experimental" anymore, there will be no breaking change because import will stay the same. |
This topic was again discussed in technical design on 7/02/2024. The pros and cons of the various contribution models were discussed and then we voted again for the model, the outcome was as follows:
The majority of votes went to number 1 and thus that will be the model we'll implement. Further decisions will need to be made about the graduation/demotion process of experimental datasets. New issues will be opened to address that. |
Description
Introduce a
contrib
folder within the Kedro datasets repository to accommodate contributions that are more experimental and may not fully adhere to the usual standards, such as being fully tested. This space will allow for the inclusion of datasets that are in the early stages of development or might not meet the criteria for being part of the core Kedro datasets.An example of such datasets are the
langchain
based datasets for which we have an open draft PR: #434Key Points:
contrib
folder is designated for experimental contributions and should not be held to the same maintenance standards as the core Kedro datasets.contrib
folder are owned by their primary authors, and the core Kedro team is not responsible for their active maintenance.contrib
folder may evolve and improve over time. Successful and well-maintained contributions can graduate from thecontrib
folder and move to the regularkedro_datasets
space.Considerations:
contrib
folder to the regularkedro_datasets
space.contrib
folder is clearly communicated as an experimental space, encouraging users to be cautious when relying on datasets from this folder.contrib
and regularkedro_datasets
spaces in the project's documentation.Next steps
contrib
and regularkedro_datasets
spaces.contrib
folder and update the documentation accordingly.Note:
This issue serves as a proposal and discussion point. Further details and decisions will be made in collaboration with the Kedro community and maintainers.
Examples
Projects that have a similar contribution space:
The text was updated successfully, but these errors were encountered: