#144: Extracted load_models into seperate class #161

MarleneKress79789 · 2023-12-01T12:49:42Z

All Submissions:

Is the title of the Pull Request correct?
Is the title of the corresponding issue correct?
Have you updated the changelog?
Have you checked to ensure there aren't other open Pull Requests for the same update/change?
Are you mentioning the issue which this PullRequest fixes ("Fixes...")
Before you merge don't forget to run tests in AWS CodeBuild, by adding [CodeBuild] to the commit message

Fixes #144

… changed test setup accordingly

ahsimb · 2023-12-01T15:15:54Z

exasol_transformers_extension/utils/load_model.py

+    def load_models(self, model_name: str,
+                    current_model_key,
+                    cache_dir,
+                    token_conn_obj) -> None:


Can we provide type annotations for all parameters?

regarding the documentation strings and stuff: this whole class will be replaced with the one that will be created in #145. so i would rather spend the time to do proper docu for the new class and leave this one as is.

I would say the released code should be in a state of completeness. If the class morphs into something else so will the documentation. Plus it doesn't take long to get the docstrings sorted.

ahsimb · 2023-12-01T15:17:05Z

exasol_transformers_extension/utils/load_model.py

+                    cache_dir,
+                    token_conn_obj) -> None:
+        """
+        Load model and tokenizer model from the cached location in bucketfs.


Maybe "Load the language model and tokenizer model" or "Load the model and tokenizer"

ahsimb · 2023-12-01T15:17:51Z

exasol_transformers_extension/utils/load_model.py

+        https://github.com/exasol/transformers-extension/issues/43.
+
+        :param model_name: The model name to be loaded
+        """


"The model name to be loaded" => "The name of the model to be loaded".

Other parameters' description?

The description of the function doesn't reflect everything that the function is doing. For example, it doesn't say it will create and return a pipeline.

ahsimb · 2023-12-01T15:20:57Z

exasol_transformers_extension/utils/load_model.py

+        :param model_name: The model name to be loaded
+        """
+        token = False
+        if token_conn_obj:


Maybe
if token_con_obj is not None:
The object may implement some boolean conversion rules that evaluate to False.

new bug ticket added here: #163

ahsimb · 2023-12-01T15:26:32Z

exasol_transformers_extension/utils/load_model.py

+                 tokenizer,
+                 task_name,
+                 device
+                 ):


Adding type annotations and/or parameter descriptions would be helpful.

ahsimb · 2023-12-01T16:37:39Z

exasol_transformers_extension/udfs/models/base_model_udf.py

-        self.last_loaded_model = None
-        self.last_loaded_tokenizer = None
+        self.model_loader.last_loaded_model = None
+        self.model_loader.last_loaded_tokenizer = None


I would add a clear method to the ModelLoader setting these two variables to None.
If last_laded_model and last_loaded_tokenizer still need to be visible I would make them properties.

BTW, if the last_created_pipeline keeps references to these objects they won't be garbage-collected. So you probably need to it None too.

will move the clear method from base udf also

ahsimb · 2023-12-01T16:51:28Z

tests/unit_tests/udfs/base_model_dummy_implementation.py

+            model_name, cache_dir=cache_dir, use_auth_token=token)
+        return None
+
+


Why can't you use the actual ModelLoader with a pipeline-like function that does nothing and returns None?

because the model loader is only initialized at run time of the udf because it gets input about the device thats only known then. i dont know of a way to change the functioncall at that point

If all you need is to make a LoadModel without a pipeline you could do something like this:

class DummyLoadModel(LoadModel): def __init__(self, base_model, tokenizer, task_name, device): super().__init__(self, lambda task_name, model, tokenizer, device, framework: None, base_model, tokenizer, task_name, device)

But I am not sure you need it.

You are completely right, i misunderstood. changed

MarleneKress79789 added 4 commits November 30, 2023 14:46

start

360297c

Moved model loader to seperate class, injected into basemodel udf and…

987a1ef

… changed test setup accordingly

moved last_created_pipeline back

bbf3505

[CodeBuild]

aa871fe

ahsimb reviewed Dec 1, 2023

View reviewed changes

[CodeBuild] changes from code review, security update

3e751bc

ahsimb approved these changes Dec 8, 2023

View reviewed changes

MarleneKress79789 merged commit 4629152 into main Dec 8, 2023
3 checks passed

MarleneKress79789 deleted the refactoring/#144_extract_load_models_into_class branch December 8, 2023 11:14

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

#144: Extracted load_models into seperate class #161

#144: Extracted load_models into seperate class #161

MarleneKress79789 commented Dec 1, 2023 •

edited

Loading

ahsimb Dec 1, 2023

MarleneKress79789 Dec 4, 2023

ahsimb Dec 5, 2023

ahsimb Dec 1, 2023

ahsimb Dec 1, 2023

ahsimb Dec 1, 2023

MarleneKress79789 Dec 8, 2023

ahsimb Dec 1, 2023

ahsimb Dec 1, 2023

MarleneKress79789 Dec 4, 2023

ahsimb Dec 1, 2023

MarleneKress79789 Dec 4, 2023

ahsimb Dec 4, 2023

MarleneKress79789 Dec 5, 2023

		model_name, cache_dir=cache_dir, use_auth_token=token)
		return None

#144: Extracted load_models into seperate class #161

#144: Extracted load_models into seperate class #161

Conversation

MarleneKress79789 commented Dec 1, 2023 • edited Loading

All Submissions:

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

MarleneKress79789 commented Dec 1, 2023 •

edited

Loading