BART fine-tuning #17

Open
5 tasks done
minji-o-j opened this issue Jul 29, 2023 · 2 comments

minji-o-j commented Jul 29, 2023

Experiments use the best checkpoint (the epoch where training stopped, minus 3).
The batch size has to be adjusted again for **bart finetuning**.

QKV-trained version (tag: QKVo_#num)

  • No. 1: server 5, GPUs 0,1
accelerate launch run_textbox.py --model=PTG --dataset=pc --model_path=facebook/bart-large --gpu_id=0,1 --find_unused_parameters=true --source_task=cross_task1 --training_option=BART-finetuning --QKV_training=True --learning_rate=3e-5 --attention_path=PTG-pc-2023-Jul-26_05-16-58_QKVo_1/checkpoint_epoch-11 --train_batch_size=4 --accumulation_steps=24

  • No. 5: server 5, GPUs 2,3
accelerate launch run_textbox.py --model=PTG --dataset=pc --model_path=facebook/bart-large --gpu_id=0,1 --find_unused_parameters=true --source_task=cross_dataset2 --training_option=BART-finetuning --QKV_training=True --learning_rate=3e-5 --attention_path=PTG-pc-2023-Jul-26_04-56-03_QKVo_5/checkpoint_epoch-3 --train_batch_size=4 --accumulation_steps=24

  • No. 9: server 5
accelerate launch run_textbox.py --model=PTG --dataset=pc --model_path=facebook/bart-large --gpu_id=0,1 --find_unused_parameters=true --source_task=cross_dataset3_paper --training_option=BART-finetuning --QKV_training=True --learning_rate=3e-5 --attention_path=PTG-pc-2023-Jul-26_04-59-25_QKVo_9/checkpoint_epoch-5 --train_batch_size=4 --accumulation_steps=24

  • No. 6: server 3, GPUs 2,3
accelerate launch run_textbox.py --model=PTG --dataset=dd --model_path=facebook/bart-large --gpu_id=0,1 --find_unused_parameters=true --source_task=cross_dataset2 --training_option=BART-finetuning --QKV_training=True --learning_rate=3e-5 --attention_path=PTG-dd-2023-Jul-26_04-59-17_QKVo_6/checkpoint_epoch-4 --train_batch_size=2 --accumulation_steps=48

  • No. 10: server 3, GPUs 2,3
accelerate launch run_textbox.py --model=PTG --dataset=dd --model_path=facebook/bart-large --gpu_id=0,1 --find_unused_parameters=true --source_task=cross_dataset4_paper --training_option=BART-finetuning --QKV_training=True --learning_rate=3e-5 --attention_path=PTG-dd-2023-Jul-26_12-31-16_QKVo_10/checkpoint_epoch-1 --train_batch_size=2 --accumulation_steps=48
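
The pc runs pair --train_batch_size=4 with --accumulation_steps=24, while the dd runs pair --train_batch_size=2 with --accumulation_steps=48. A minimal sketch of the arithmetic behind that adjustment, assuming standard gradient accumulation (samples per optimizer step = batch size × accumulation steps × number of data-parallel GPUs). The function name and the `num_gpus` parameter are illustrative, and whether run_textbox.py treats --train_batch_size as per-GPU or global is not stated in this issue:

```python
def effective_batch_size(train_batch_size: int,
                         accumulation_steps: int,
                         num_gpus: int = 1) -> int:
    """Samples contributing to one optimizer step.

    Assumption: train_batch_size is per GPU and gradients are
    accumulated for accumulation_steps micro-batches before each update.
    """
    return train_batch_size * accumulation_steps * num_gpus


# Values taken from the commands in this issue.
pc_per_gpu = effective_batch_size(4, 24)   # pc runs
dd_per_gpu = effective_batch_size(2, 48)   # dd runs

# Halving the micro-batch while doubling the accumulation steps
# keeps the per-GPU effective batch size unchanged.
print(pc_per_gpu, dd_per_gpu)
```

Under that assumption, both settings give the same per-GPU effective batch size, so halving the micro-batch for dd changes memory use, not the optimization batch.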

minji-o-j commented Jul 29, 2023

Version without QKV training (tag: QKVx_#num)

  • No. 1: server 5
accelerate launch run_textbox.py --model=PTG --dataset=pc --model_path=facebook/bart-large --gpu_id=0,1 --find_unused_parameters=true --source_task=cross_task1 --training_option=BART-finetuning --QKV_training=False --learning_rate=3e-5 --attention_path=PTG-pc-2023-Jul-28_14-55-31_QKVx_1/checkpoint_epoch-3 --train_batch_size=4 --accumulation_steps=24

  • No. 5: server 5, GPUs 0,1
accelerate launch run_textbox.py --model=PTG --dataset=pc --model_path=facebook/bart-large --gpu_id=0,1 --find_unused_parameters=true --source_task=cross_dataset2 --training_option=BART-finetuning --QKV_training=False --learning_rate=3e-5 --attention_path=PTG-pc-2023-Jul-26_18-38-20_QKVx_5/checkpoint_epoch-2 --train_batch_size=4 --accumulation_steps=24

  • No. 9: server 5, GPUs 2,3
accelerate launch run_textbox.py --model=PTG --dataset=pc --model_path=facebook/bart-large --gpu_id=0,1 --find_unused_parameters=true --source_task=cross_dataset3_paper --training_option=BART-finetuning --QKV_training=False --learning_rate=3e-5 --attention_path=PTG-pc-2023-Jul-28_08-42-47_QKVx_9/checkpoint_epoch-13 --train_batch_size=4 --accumulation_steps=24

  • No. 6: server 5, GPUs 5,6
accelerate launch run_textbox.py --model=PTG --dataset=dd --model_path=facebook/bart-large --gpu_id=0,1 --find_unused_parameters=true --source_task=cross_dataset2 --training_option=BART-finetuning --QKV_training=False --learning_rate=3e-5 --attention_path=PTG-dd-2023-Jul-27_10-17-18_QKVx_6/checkpoint_epoch-10 --train_batch_size=2 --accumulation_steps=48

  • No. 10: server 5, GPUs 2,3
accelerate launch run_textbox.py --model=PTG --dataset=dd --model_path=facebook/bart-large --gpu_id=0,1 --find_unused_parameters=true --source_task=cross_dataset4_paper --training_option=BART-finetuning --QKV_training=False --learning_rate=3e-5 --attention_path=PTG-dd-2023-Jul-29_13-34-25_QKVx_10/checkpoint_epoch-1 --train_batch_size=2 --accumulation_steps=48
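
The commands in this issue differ only in a handful of values (dataset, source task, QKV flag, attention path, batch settings). A small sketch of generating them from those values, to reduce copy-paste mistakes. `build_command` is a hypothetical helper, not part of the repository; it only uses flags that appear in the commands above:

```python
import shlex
from typing import List, Optional


def build_command(dataset: str,
                  source_task: str,
                  qkv_training: bool,
                  attention_path: str,
                  train_batch_size: int,
                  accumulation_steps: int,
                  gpu_id: str = "0,1",
                  eval_batch_size: Optional[int] = None) -> List[str]:
    """Assemble one launch command as an argument list (hypothetical helper)."""
    args = [
        "accelerate", "launch", "run_textbox.py",
        "--model=PTG",
        f"--dataset={dataset}",
        "--model_path=facebook/bart-large",
        f"--gpu_id={gpu_id}",
        "--find_unused_parameters=true",
        f"--source_task={source_task}",
        "--training_option=BART-finetuning",
        f"--QKV_training={qkv_training}",  # renders as True / False
        "--learning_rate=3e-5",
        f"--attention_path={attention_path}",
        f"--train_batch_size={train_batch_size}",
        f"--accumulation_steps={accumulation_steps}",
    ]
    # Only the xsum commands in this issue pass an eval batch size.
    if eval_batch_size is not None:
        args.append(f"--eval_batch_size={eval_batch_size}")
    return args


# Reproduces the No. 1 command of the QKVx list.
cmd = build_command(
    dataset="pc",
    source_task="cross_task1",
    qkv_training=False,
    attention_path="PTG-pc-2023-Jul-28_14-55-31_QKVx_1/checkpoint_epoch-3",
    train_batch_size=4,
    accumulation_steps=24,
)
print(shlex.join(cmd))
```

The argument list can be passed to `subprocess.run(cmd, check=True)` instead of being printed, which avoids shell quoting altogether.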

minji-o-j reopened this Aug 14, 2023

minji-o-j commented Aug 14, 2023

  • No. 4: server 5, GPUs 0,1,2,3 (in progress)
accelerate launch run_textbox.py --model=PTG --dataset=xsum --model_path=facebook/bart-large --gpu_id=0,1,2,3 --find_unused_parameters=true --source_task=cross_dataset1 --training_option=BART-finetuning --QKV_training=True --learning_rate=3e-5 --attention_path=PTG-xsum-2023-Aug-14_03-53-23_QKVo_4/checkpoint_epoch-8 --train_batch_size=2 --accumulation_steps=48 --eval_batch_size=8
  • No. 8: server 5, GPUs 4,5,6,7
accelerate launch run_textbox.py --model=PTG --dataset=xsum --model_path=facebook/bart-large --gpu_id=0,1,2,3 --find_unused_parameters=true --source_task=cross_dataset2_paper --training_option=BART-finetuning --QKV_training=True --learning_rate=3e-5 --attention_path=PTG-xsum-2023-Aug-14_03-53-29_QKVo_8/checkpoint_epoch-1 --train_batch_size=2 --accumulation_steps=48 --eval_batch_size=8

No. 8, the run other than prompt1:
accelerate launch run_textbox.py --model=PTG --dataset=xsum --model_path=facebook/bart-large --gpu_id=0,1,2,3 --find_unused_parameters=true --source_task=cross_dataset2_paper --training_option=BART-finetuning --QKV_training=True --learning_rate=3e-5 --attention_path=PTG-xsum-2023-Aug-14_03-53-29_QKVo_8/checkpoint_epoch-4 --train_batch_size=2 --accumulation_steps=48 --eval_batch_size=8
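
The two No. 8 commands differ only in the checkpoint epoch inside --attention_path (epoch-1 versus epoch-4). A sketch of parsing such a path into its parts so runs can be compared programmatically. The naming pattern is inferred from the paths in this issue only, and `parse_attention_path` and `AttentionCheckpoint` are hypothetical names:

```python
import re
from typing import NamedTuple


class AttentionCheckpoint(NamedTuple):
    dataset: str
    timestamp: str
    qkv_trained: bool   # True for tag QKVo, False for tag QKVx
    run_number: int
    epoch: int


# Pattern inferred from paths such as
# PTG-xsum-2023-Aug-14_03-53-29_QKVo_8/checkpoint_epoch-4
_PATH_RE = re.compile(
    r"^PTG-(?P<dataset>[^-]+)-"
    r"(?P<timestamp>\d{4}-[A-Za-z]{3}-\d{2}_\d{2}-\d{2}-\d{2})_"
    r"QKV(?P<flag>[ox])_(?P<run>\d+)/"
    r"checkpoint_epoch-(?P<epoch>\d+)$"
)


def parse_attention_path(path: str) -> AttentionCheckpoint:
    """Split an attention checkpoint path into its components."""
    match = _PATH_RE.match(path)
    if match is None:
        raise ValueError(f"unrecognized attention path: {path!r}")
    return AttentionCheckpoint(
        dataset=match["dataset"],
        timestamp=match["timestamp"],
        qkv_trained=match["flag"] == "o",
        run_number=int(match["run"]),
        epoch=int(match["epoch"]),
    )


first = parse_attention_path(
    "PTG-xsum-2023-Aug-14_03-53-29_QKVo_8/checkpoint_epoch-1")
second = parse_attention_path(
    "PTG-xsum-2023-Aug-14_03-53-29_QKVo_8/checkpoint_epoch-4")

# Same run, different epoch.
print(first.run_number, first.epoch, second.epoch)
```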
