Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Fix bug in tools/ckpts/convert_neox_to_hf.py for setting intermediate…
…_size (#1209) In tools/ckpts/convert_neox_to_hf.py, for neox architecture the 'intermediate_size' argument is not explicitly set, so it defaults to 24576 from: https://github.com/huggingface/transformers/blob/9fe3f585bb4ea29f209dc705d269fbe292e1128f/src/transformers/models/gpt_neox/configuration_gpt_neox.py#L48 Proposed solution: set intermediate-size to 4 * hidden-size
- Loading branch information