Using Mlflow logger #11197
Replies: 5 comments 6 replies
-
I'm having the same issue. The problem is that the the autolog function patches the fit function and tries to start a run on its own. https://docs.databricks.com/applications/mlflow/databricks-autologging.html#disable-databricks-autologging can help as well, if you are using databricks |
Beta Was this translation helpful? Give feedback.
-
I faced the same problem. For my scenario the solution was- |
Beta Was this translation helpful? Give feedback.
-
I was able to share the same MLflow run while using both
So this yields only one run in MLflow UI with all functionality (autologged model params, manually logged params via |
Beta Was this translation helpful? Give feedback.
-
For Databricks users try setting the default_root_dir, as in |
Beta Was this translation helpful? Give feedback.
-
I had a very similar problem and I was able to solve it as follows:
Basically the solution is to initialize MLFlowLogger inside the start_run() context and specify the Hope it helps someone, cheers |
Beta Was this translation helpful? Give feedback.
-
I have been trying to use Mlflow to track my experiments. I wanted to record my runs under the experiment name "Training"
However I seem unable to have a run record the run name, all of the parameters (from argparser and the PL callbacks), log the loss and the final model and have the symbol saying that it finished correctly or not, all in one line.
If I only use the mlflow logger without autolog, then I am unable to log the model. If I then log the model manually using
mlflow.pytorch.log_model(model, "model")
then another run will be created just to log this model, while the original one still doesnt have the model.If I use both mlflow logger and autolog two runs are created and one will log the model and not the parameters and the other the opposite. Only the run logging the model will have the finished symbol.
If I use only autolog then I won't have metrics for training or the full set of parameters as the argparser ones arent saved.
Using
with mlflow.start_run() as run:
also doesnt seem to help.Here's the basic idea of my code. This is one combination I tried.
Here is how the two runs issue then appears on mlflow UI
Beta Was this translation helpful? Give feedback.
All reactions