- you should put configurations on generate_input_path.py
- directory path and output text file name
- maximum number is 1) train = 40,000 2) validation = 8,000 3) test = 12,000
- train.txt path
- valid.txt path
- test.txt path
- GPU environment
- save point directory
- use python3 Model/byteT5.py --mode train
- use python3 Model/byteT5.py --mode test --model_file "your savepoint path"