
[Feature Request] Consistent user experience for finetuning and init-model #3747

Closed
zjgemi opened this issue May 6, 2024 · 0 comments · Fixed by #3803
zjgemi commented May 6, 2024

Summary

Consistent user experience for finetuning and init-model

Detailed Description

Currently, fine-tuning uses the model structure defined in the pretrained model, ignoring the one in input.json. In contrast, training with init-model uses the model structure defined in input.json, and any inconsistency with the init model raises an exception. I suggest a more consistent user experience for the two modes.
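For illustration, here is a minimal sketch of the two modes as `dp train` invocations (the backend selector and checkpoint filenames are assumptions for the example):

```sh
# Fine-tuning: the model section in input.json is currently ignored;
# the structure stored in the pretrained model is used instead.
dp --pt train input.json --finetune pretrained.pt

# Init-model: the model section in input.json is authoritative;
# any mismatch with the checkpoint raises an exception.
dp --pt train input.json --init-model model.ckpt
```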

Further Information, Files, and Links

No response

@iProzd iProzd self-assigned this May 6, 2024
@iProzd iProzd linked a pull request May 22, 2024 that will close this issue
github-merge-queue bot pushed a commit that referenced this issue Jun 13, 2024
Fix #3747. Fix #3455. 

- Consistent fine-tuning with init-model: in PT, fine-tuning now consists of
three steps:
1. Change model params (for multitask fine-tuning, random fitting and
type-related params),
2. Init-model,
3. Change bias.

- By default, fine-tuning uses the user's input script instead of
overwriting it with the one stored in the pre-trained model. With the
`--use-pretrain-script` flag, the user can opt into the script from the
pre-trained model (see the sketch after this list).

- `type_map` now uses the value from the user input instead of being
overwritten by the one in the pre-trained model.
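
A hedged sketch of the resulting CLI behavior (the backend selector and filenames are placeholders for the example):

```sh
# Default: the model section from the user's input.json is respected.
dp --pt train input.json --finetune pretrained.pt

# Opt in to the model section stored in the pre-trained model:
dp --pt train input.json --finetune pretrained.pt --use-pretrain-script
```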

Note:
1. After discussion with @wanghan-iapcm, **the fine-tuning behavior in TF
is kept as before**. If needed in the future, it can be implemented then.
2. Fine-tuning using DOSModel in PT needs to be fixed (an issue will be
opened; it may be fixed in another PR, cc @anyangml).

## Summary by CodeRabbit

- **New Features**
- Added support for using model parameters from a pretrained model
script.
- Introduced new methods to handle type-related parameters and
fine-tuning configurations.

- **Documentation**
- Updated documentation to clarify the model section requirements and
the new `--use-pretrain-script` option for fine-tuning.

- **Refactor**
- Simplified and improved the readability of key functions related to
model training and fine-tuning.

- **Tests**
- Added new test methods and utility functions to ensure consistency of
type mapping and parameter updates.

---------

Signed-off-by: Duo <[email protected]>
Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com>
Co-authored-by: Han Wang <[email protected]>
Co-authored-by: coderabbitai[bot] <136622811+coderabbitai[bot]@users.noreply.github.com>
@iProzd iProzd closed this as completed Jun 13, 2024
@github-project-automation github-project-automation bot moved this from Backlog to Done in DeePMD-3.0.0 beta release Jun 13, 2024
mtaillefumier pushed a commit to mtaillefumier/deepmd-kit that referenced this issue Sep 18, 2024