Added ability to load local models, added early stopping, remove vocab check, fixed GPTJ model conversion #38
This PR is built on top of the pip package PR, so that PR should be merged first; if it changes, those changes will need to be merged into this branch as well.
In this PR I added the ability to load a model from a local path.
I also added early stopping, which lets generation stop once a given token is produced. Related to this, I changed the default behavior of waiting for the subprocess to finish. Using these two together, I was able to achieve a large speedup for my desired task.
I removed the check for vocab size. When finetuning, the vocab size can change, especially for GPTJ. Figuring out that this check was causing issues was a source of headaches, and removing it is needed for finetuned models.
I also made it so that if a local path is given, we use the vocab and config files from that folder instead of the ones from Hugging Face, again a common need when finetuning.
This repo was the only one I could get working for GPTJ; other repos use a different GGML format. However, those other repos (such as pyllamacpp) keep the model in memory via Python bindings. If that feature were added here, this repo would be great, as suggested in #36.