Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(storage/dataflux): add dataflux interface #10748

Merged
merged 46 commits into from
Sep 19, 2024
Merged

feat(storage/dataflux): add dataflux interface #10748

merged 46 commits into from
Sep 19, 2024

Conversation

akansha1812
Copy link
Contributor

@akansha1812 akansha1812 commented Aug 22, 2024

feat: add dataflux interface and helper functions to storage/dataflux.
Dataflux fast-listing will be used to quickly list objects in a bucket in parallel.

Fixes #10731

@akansha1812 akansha1812 requested review from a team as code owners August 22, 2024 17:32
@akansha1812 akansha1812 changed the title feat(storage/dataflux)Dataflux initial commit feat(storage/dataflux): Dataflux initial commit Aug 22, 2024
@akansha1812 akansha1812 changed the title feat(storage/dataflux): Dataflux initial commit feat(storage/dataflux): add dataflux interface Aug 22, 2024
storage/dataflux/worksteal.go Outdated Show resolved Hide resolved
storage/dataflux/doc.go Show resolved Hide resolved
@product-auto-label product-auto-label bot added the api: storage Issues related to the Cloud Storage API. label Aug 23, 2024
Copy link
Contributor

@tritone tritone left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some initial comments. Overall I'd like to see an MVP that works end-to-end (maybe just sequential list?) before we merge, as well as an example_test.go file and basic integration test. That will help validate that this interface is workable before we release anything. Obviously we can make breaking changes in the subpackage since we have the preview tag, but it's not ideal.

storage/dataflux/doc.go Show resolved Hide resolved
storage/dataflux/doc.go Show resolved Hide resolved
storage/dataflux/fast_list.go Outdated Show resolved Hide resolved
storage/dataflux/fast_list.go Outdated Show resolved Hide resolved
storage/dataflux/fast_list.go Outdated Show resolved Hide resolved
storage/dataflux/fast_list.go Show resolved Hide resolved
storage/dataflux/range_splitter.go Show resolved Hide resolved
storage/dataflux/sequential.go Outdated Show resolved Hide resolved
storage/dataflux/fast_list.go Outdated Show resolved Hide resolved
storage/dataflux/fast_list.go Outdated Show resolved Hide resolved
storage/dataflux/README.md Outdated Show resolved Hide resolved
storage/dataflux/README.md Show resolved Hide resolved
storage/dataflux/README.md Outdated Show resolved Hide resolved
storage/dataflux/README.md Outdated Show resolved Hide resolved
storage/dataflux/README.md Show resolved Hide resolved
storage/dataflux/fast_list.go Show resolved Hide resolved
storage/dataflux/fast_list.go Show resolved Hide resolved
storage/dataflux/fast_list.go Outdated Show resolved Hide resolved
storage/dataflux/worksteal.go Outdated Show resolved Hide resolved
storage/dataflux/sequential.go Outdated Show resolved Hide resolved
gcf-owl-bot bot and others added 4 commits September 11, 2024 21:22
- [ ] Regenerate this pull request now.

PiperOrigin-RevId: 673380763

Source-Link: https://togithub.com/googleapis/googleapis/commit/8ebfd76bd91ba97b86491de9161c9e5e6884a68a

Source-Link: https://togithub.com/googleapis/googleapis-gen/commit/c7314d1edf63123ef0efbe9bc4d996391025fb44
Copy-Tag: eyJwIjoiLmdpdGh1Yi8uT3dsQm90LnlhbWwiLCJoIjoiYzczMTRkMWVkZjYzMTIzZWYwZWZiZTliYzRkOTk2MzkxMDI1ZmI0NCJ9
BEGIN_NESTED_COMMIT
feat(bigtable/admin): Add support for Cloud Bigtable Row Affinity in App Profiles
PiperOrigin-RevId: 673093969

Source-Link: https://togithub.com/googleapis/googleapis/commit/cbf696d38a963c5ab333f85fc9a910b5698ad415

Source-Link: https://togithub.com/googleapis/googleapis-gen/commit/a2f7ec1191813304b3bd0097caa33956bdb3b637
Copy-Tag: eyJwIjoiLmdpdGh1Yi8uT3dsQm90LnlhbWwiLCJoIjoiYTJmN2VjMTE5MTgxMzMwNGIzYmQwMDk3Y2FhMzM5NTZiZGIzYjYzNyJ9
END_NESTED_COMMIT
BEGIN_NESTED_COMMIT
feat(aiplatform): add Pinecone and Vector Search integration for Vertex RAG
PiperOrigin-RevId: 673087899

Source-Link: https://togithub.com/googleapis/googleapis/commit/afb6b3599d50103e022e9c22c5057bf94be9dcf8

Source-Link: https://togithub.com/googleapis/googleapis-gen/commit/00a4515ab465e98d56627075675209631ee51f39
Copy-Tag: eyJwIjoiLmdpdGh1Yi8uT3dsQm90LnlhbWwiLCJoIjoiMDBhNDUxNWFiNDY1ZTk4ZDU2NjI3MDc1Njc1MjA5NjMxZWU1MWYzOSJ9
END_NESTED_COMMIT
* feat(firestore): Adding distance threshold and result field

* refactor(firestore): Renaming method names

* refactor(firestore): Move threshold and result field to options. Rename FindNearestOptions

* refactor(firestore): Rename to FindNearestOptions

* refactor(firestore): Refactoring code
@akansha1812 akansha1812 requested a review from a team as a code owner September 11, 2024 21:22
akansha1812 and others added 11 commits September 11, 2024 21:53
- [ ] Regenerate this pull request now.

PiperOrigin-RevId: 673380763

Source-Link: https://togithub.com/googleapis/googleapis/commit/8ebfd76bd91ba97b86491de9161c9e5e6884a68a

Source-Link: https://togithub.com/googleapis/googleapis-gen/commit/c7314d1edf63123ef0efbe9bc4d996391025fb44
Copy-Tag: eyJwIjoiLmdpdGh1Yi8uT3dsQm90LnlhbWwiLCJoIjoiYzczMTRkMWVkZjYzMTIzZWYwZWZiZTliYzRkOTk2MzkxMDI1ZmI0NCJ9
BEGIN_NESTED_COMMIT
feat(bigtable/admin): Add support for Cloud Bigtable Row Affinity in App Profiles
PiperOrigin-RevId: 673093969

Source-Link: https://togithub.com/googleapis/googleapis/commit/cbf696d38a963c5ab333f85fc9a910b5698ad415

Source-Link: https://togithub.com/googleapis/googleapis-gen/commit/a2f7ec1191813304b3bd0097caa33956bdb3b637
Copy-Tag: eyJwIjoiLmdpdGh1Yi8uT3dsQm90LnlhbWwiLCJoIjoiYTJmN2VjMTE5MTgxMzMwNGIzYmQwMDk3Y2FhMzM5NTZiZGIzYjYzNyJ9
END_NESTED_COMMIT
BEGIN_NESTED_COMMIT
feat(aiplatform): add Pinecone and Vector Search integration for Vertex RAG
PiperOrigin-RevId: 673087899

Source-Link: https://togithub.com/googleapis/googleapis/commit/afb6b3599d50103e022e9c22c5057bf94be9dcf8

Source-Link: https://togithub.com/googleapis/googleapis-gen/commit/00a4515ab465e98d56627075675209631ee51f39
Copy-Tag: eyJwIjoiLmdpdGh1Yi8uT3dsQm90LnlhbWwiLCJoIjoiMDBhNDUxNWFiNDY1ZTk4ZDU2NjI3MDc1Njc1MjA5NjMxZWU1MWYzOSJ9
END_NESTED_COMMIT
- [ ] Regenerate this pull request now.

PiperOrigin-RevId: 673380763

Source-Link: https://togithub.com/googleapis/googleapis/commit/8ebfd76bd91ba97b86491de9161c9e5e6884a68a

Source-Link: https://togithub.com/googleapis/googleapis-gen/commit/c7314d1edf63123ef0efbe9bc4d996391025fb44
Copy-Tag: eyJwIjoiLmdpdGh1Yi8uT3dsQm90LnlhbWwiLCJoIjoiYzczMTRkMWVkZjYzMTIzZWYwZWZiZTliYzRkOTk2MzkxMDI1ZmI0NCJ9
BEGIN_NESTED_COMMIT
feat(bigtable/admin): Add support for Cloud Bigtable Row Affinity in App Profiles
PiperOrigin-RevId: 673093969

Source-Link: https://togithub.com/googleapis/googleapis/commit/cbf696d38a963c5ab333f85fc9a910b5698ad415

Source-Link: https://togithub.com/googleapis/googleapis-gen/commit/a2f7ec1191813304b3bd0097caa33956bdb3b637
Copy-Tag: eyJwIjoiLmdpdGh1Yi8uT3dsQm90LnlhbWwiLCJoIjoiYTJmN2VjMTE5MTgxMzMwNGIzYmQwMDk3Y2FhMzM5NTZiZGIzYjYzNyJ9
END_NESTED_COMMIT
BEGIN_NESTED_COMMIT
feat(aiplatform): add Pinecone and Vector Search integration for Vertex RAG
PiperOrigin-RevId: 673087899

Source-Link: https://togithub.com/googleapis/googleapis/commit/afb6b3599d50103e022e9c22c5057bf94be9dcf8

Source-Link: https://togithub.com/googleapis/googleapis-gen/commit/00a4515ab465e98d56627075675209631ee51f39
Copy-Tag: eyJwIjoiLmdpdGh1Yi8uT3dsQm90LnlhbWwiLCJoIjoiMDBhNDUxNWFiNDY1ZTk4ZDU2NjI3MDc1Njc1MjA5NjMxZWU1MWYzOSJ9
END_NESTED_COMMIT
- [ ] Regenerate this pull request now.

PiperOrigin-RevId: 673380763

Source-Link: https://togithub.com/googleapis/googleapis/commit/8ebfd76bd91ba97b86491de9161c9e5e6884a68a

Source-Link: https://togithub.com/googleapis/googleapis-gen/commit/c7314d1edf63123ef0efbe9bc4d996391025fb44
Copy-Tag: eyJwIjoiLmdpdGh1Yi8uT3dsQm90LnlhbWwiLCJoIjoiYzczMTRkMWVkZjYzMTIzZWYwZWZiZTliYzRkOTk2MzkxMDI1ZmI0NCJ9
BEGIN_NESTED_COMMIT
feat(bigtable/admin): Add support for Cloud Bigtable Row Affinity in App Profiles
PiperOrigin-RevId: 673093969

Source-Link: https://togithub.com/googleapis/googleapis/commit/cbf696d38a963c5ab333f85fc9a910b5698ad415

Source-Link: https://togithub.com/googleapis/googleapis-gen/commit/a2f7ec1191813304b3bd0097caa33956bdb3b637
Copy-Tag: eyJwIjoiLmdpdGh1Yi8uT3dsQm90LnlhbWwiLCJoIjoiYTJmN2VjMTE5MTgxMzMwNGIzYmQwMDk3Y2FhMzM5NTZiZGIzYjYzNyJ9
END_NESTED_COMMIT
BEGIN_NESTED_COMMIT
feat(aiplatform): add Pinecone and Vector Search integration for Vertex RAG
PiperOrigin-RevId: 673087899

Source-Link: https://togithub.com/googleapis/googleapis/commit/afb6b3599d50103e022e9c22c5057bf94be9dcf8

Source-Link: https://togithub.com/googleapis/googleapis-gen/commit/00a4515ab465e98d56627075675209631ee51f39
Copy-Tag: eyJwIjoiLmdpdGh1Yi8uT3dsQm90LnlhbWwiLCJoIjoiMDBhNDUxNWFiNDY1ZTk4ZDU2NjI3MDc1Njc1MjA5NjMxZWU1MWYzOSJ9
END_NESTED_COMMIT
aiplatform/apiv1beta1/aiplatformpb/vertex_rag_data.pb.go Outdated Show resolved Hide resolved
storage/dataflux/integration_test.go Show resolved Hide resolved
storage/dataflux/sequential.go Outdated Show resolved Hide resolved
storage/dataflux/sequential.go Outdated Show resolved Hide resolved
storage/dataflux/sequential.go Show resolved Hide resolved
Copy link
Contributor

@tritone tritone left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Two more small comments, otherwise LGTM -- I will approve now and let you fix.

storage/dataflux/sequential.go Show resolved Hide resolved
storage/dataflux/worksteal.go Outdated Show resolved Hide resolved
@tritone tritone added the kokoro:force-run Add this label to force Kokoro to re-run the tests. label Sep 19, 2024
@kokoro-team kokoro-team removed the kokoro:force-run Add this label to force Kokoro to re-run the tests. label Sep 19, 2024
@tritone tritone merged commit cb7b0a1 into googleapis:main Sep 19, 2024
10 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
api: storage Issues related to the Cloud Storage API.
Projects
None yet
Development

Successfully merging this pull request may close these issues.

storage: implement dataflux fast listing
5 participants