-
Notifications
You must be signed in to change notification settings - Fork 1k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat: Apply cache to load proto registry for performance #3702
Merged
Merged
Changes from all commits
Commits
File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -1,4 +1,5 @@ | ||
import uuid | ||
from functools import wraps | ||
from typing import List, Optional | ||
|
||
from feast import usage | ||
|
@@ -23,6 +24,26 @@ | |
from feast.stream_feature_view import StreamFeatureView | ||
|
||
|
||
def registry_proto_cache(func): | ||
cache_key = None | ||
cache_value = None | ||
|
||
@wraps(func) | ||
def wrapper(registry_proto: RegistryProto, project: str): | ||
nonlocal cache_key, cache_value | ||
|
||
key = tuple([id(registry_proto), registry_proto.version_id, project]) | ||
|
||
if key == cache_key: | ||
return cache_value | ||
else: | ||
cache_value = func(registry_proto, project) | ||
Comment on lines
+39
to
+40
There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. Don't you need to update There was a problem hiding this comment. Choose a reason for hiding this commentThe reason will be displayed to describe this comment to others. Learn more. |
||
cache_key = key | ||
return cache_value | ||
|
||
return wrapper | ||
|
||
|
||
def init_project_metadata(cached_registry_proto: RegistryProto, project: str): | ||
new_project_uuid = f"{uuid.uuid4()}" | ||
usage.set_current_project_uuid(new_project_uuid) | ||
|
@@ -137,8 +158,9 @@ def get_validation_reference( | |
raise ValidationReferenceNotFound(name, project=project) | ||
|
||
|
||
@registry_proto_cache | ||
def list_feature_services( | ||
registry_proto: RegistryProto, project: str, allow_cache: bool = False | ||
registry_proto: RegistryProto, project: str | ||
) -> List[FeatureService]: | ||
feature_services = [] | ||
for feature_service_proto in registry_proto.feature_services: | ||
|
@@ -147,6 +169,7 @@ def list_feature_services( | |
return feature_services | ||
|
||
|
||
@registry_proto_cache | ||
def list_feature_views( | ||
registry_proto: RegistryProto, project: str | ||
) -> List[FeatureView]: | ||
|
@@ -157,6 +180,7 @@ def list_feature_views( | |
return feature_views | ||
|
||
|
||
@registry_proto_cache | ||
def list_request_feature_views( | ||
registry_proto: RegistryProto, project: str | ||
) -> List[RequestFeatureView]: | ||
|
@@ -169,6 +193,7 @@ def list_request_feature_views( | |
return feature_views | ||
|
||
|
||
@registry_proto_cache | ||
def list_stream_feature_views( | ||
registry_proto: RegistryProto, project: str | ||
) -> List[StreamFeatureView]: | ||
|
@@ -181,6 +206,7 @@ def list_stream_feature_views( | |
return stream_feature_views | ||
|
||
|
||
@registry_proto_cache | ||
def list_on_demand_feature_views( | ||
registry_proto: RegistryProto, project: str | ||
) -> List[OnDemandFeatureView]: | ||
|
@@ -193,6 +219,7 @@ def list_on_demand_feature_views( | |
return on_demand_feature_views | ||
|
||
|
||
@registry_proto_cache | ||
def list_entities(registry_proto: RegistryProto, project: str) -> List[Entity]: | ||
entities = [] | ||
for entity_proto in registry_proto.entities: | ||
|
@@ -201,6 +228,7 @@ def list_entities(registry_proto: RegistryProto, project: str) -> List[Entity]: | |
return entities | ||
|
||
|
||
@registry_proto_cache | ||
def list_data_sources(registry_proto: RegistryProto, project: str) -> List[DataSource]: | ||
data_sources = [] | ||
for data_source_proto in registry_proto.data_sources: | ||
|
@@ -209,6 +237,7 @@ def list_data_sources(registry_proto: RegistryProto, project: str) -> List[DataS | |
return data_sources | ||
|
||
|
||
@registry_proto_cache | ||
def list_saved_datasets( | ||
registry_proto: RegistryProto, project: str | ||
) -> List[SavedDataset]: | ||
|
@@ -219,6 +248,7 @@ def list_saved_datasets( | |
return saved_datasets | ||
|
||
|
||
@registry_proto_cache | ||
def list_validation_references( | ||
registry_proto: RegistryProto, project: str | ||
) -> List[ValidationReference]: | ||
|
@@ -231,6 +261,7 @@ def list_validation_references( | |
return validation_references | ||
|
||
|
||
@registry_proto_cache | ||
def list_project_metadata( | ||
registry_proto: RegistryProto, project: str | ||
) -> List[ProjectMetadata]: | ||
|
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I'm not sure I understand this.. If you use the same annotation for multiple methods then the cache value needs have different shapes, such as
List[FeatureService]
orList[FeatureView]
right? Is that going to work as expected here? Can we add tests?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
The context of the wrapper decorator is created whenever different argument functions (e.g list_feature_views, list_feature_services..) are called.
Therefore, when calling list_feature_views and when calling list_feature_services, different results are returned.
The operation of this cache is similar to Python's functools.lru_cache, but since the registryProto object is not hashable, it is implemented to hash using the object's id.