SciGen

SciGen is a text generation model based on GPT2 and trained on scientific articles. The code is based heavily on HuggingFace's GPT2 transformers examples. For more information, see our paper, Explaining Relationships Between Scientific Documents.

Downloading Trained Models

SciGEN SciGPT2 SciGPT2_Clean

We note that SciGPT2_Clean was trained on a reduced set of papers to prevent leakage in our experiments and is released for reproducibility. In general, we recommend using the full version of SciGPT2.

Running our Scripts

Data Processing

Please follow the steps in the data_processing directory.

Training

python ft.py --model_type=gpt2 --do_eval --max_eval_steps 100000 --num_train_epochs=1 --save_steps=5000 --eval_all_checkpoints --tokenizer_path=$MODEL_PATH --output_dir=$OUTPUT_PATH --eval_data_file=$EVAL_FILE --model_name_or_path=$MODEL_PATH
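
For scripted runs, the command above can also be assembled from Python. The following is a minimal sketch and not part of the repository: the helper names `build_training_command` and `run_training` are hypothetical, and the flags are copied verbatim from the command above. It assumes `MODEL_PATH`, `OUTPUT_PATH`, and `EVAL_FILE` are set in the environment, as the shell command does.

```python
import os
import subprocess
import sys


def build_training_command(model_path, output_path, eval_file):
    """Return the argv list for the ft.py command shown above.

    Hypothetical helper: it only mirrors the flags from the README command.
    """
    return [
        sys.executable,  # the current interpreter, standing in for `python`
        "ft.py",
        "--model_type=gpt2",
        "--do_eval",
        "--max_eval_steps", "100000",
        "--num_train_epochs=1",
        "--save_steps=5000",
        "--eval_all_checkpoints",
        f"--tokenizer_path={model_path}",
        f"--output_dir={output_path}",
        f"--eval_data_file={eval_file}",
        f"--model_name_or_path={model_path}",
    ]


def run_training():
    """Read the three paths from the environment and launch ft.py."""
    cmd = build_training_command(
        os.environ["MODEL_PATH"],
        os.environ["OUTPUT_PATH"],
        os.environ["EVAL_FILE"],
    )
    # check=True raises CalledProcessError if ft.py exits with a non-zero status.
    subprocess.run(cmd, check=True)
```

Calling `run_training()` from the repository root would launch the same run as the shell command. Passing the arguments as a list avoids shell quoting issues when a path contains spaces.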

Generation

python val_generation.py --model_type=gpt2 --length 50 --stop_token='. ' --tokenizer_path=$TOKENPATH --prompt=$TEST_FILE --output_file $OUTPUT_FILE --model_name_or_path=$MODEL_PATH
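
The `--stop_token='. '` flag suggests that each generation is cut at the first occurrence of that string, so the output ends after one sentence. A minimal sketch of such a post-processing step follows, assuming `val_generation.py` follows the convention of HuggingFace's `run_generation.py` example; this is an assumption, and the script itself remains the authority. The function name `truncate_at_stop_token` is hypothetical.

```python
def truncate_at_stop_token(text, stop_token=". "):
    """Cut `text` just before the first occurrence of `stop_token`.

    Returns the text unchanged when the stop token is empty or absent.
    """
    if not stop_token:
        return text
    index = text.find(stop_token)
    if index == -1:
        return text
    return text[:index]


if __name__ == "__main__":
    sample = "We extend their model to citation text. Then we evaluate"
    print(truncate_at_stop_token(sample))
```

Note that in this sketch the stop token itself is dropped, so the truncated sentence loses its final period.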
