# A Sober Look at LLMs for Material Discovery

Official experiment repo for the paper "A Sober Look at LLMs for Material Discovery: Are They Actually Good for Bayesian Optimization Over Molecules?" (ICML 2024).
> [!TIP]
> If you just want to use the method as a library, check out the sister repo: https://github.com/wiseodd/lapeft-bayesopt.
## Setup

> [!IMPORTANT]
> Install the dependencies in the order below.
1. Install PyTorch (with CUDA): https://pytorch.org/get-started/locally/
2. Install the Hugging Face libraries and other dependencies:

   ```bash
   pip install transformers datasets peft tqdm
   ```

3. Install laplace-torch:

   ```bash
   pip install laplace-torch
   ```
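To confirm that everything resolved, a quick import check (not part of the repo, just a sanity check) should run without errors:

```python
# Sanity check: all of these imports must succeed before running the experiments.
import torch
import transformers
import datasets
import peft
import laplace  # provided by the laplace-torch package

print(f"PyTorch {torch.__version__}, CUDA available: {torch.cuda.is_available()}")
print(f"transformers {transformers.__version__}, peft {peft.__version__}")
```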
## Fixed-Feature Experiments

First, cache the LLM features of the molecules:
```bash
python cache_features.py --feature_type {FEATURE_TYPE} --problem {PROBLEM} --prompt_type {PROMPT_TYPE}
```
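Conceptually, this caching step embeds each molecule's textual prompt with a frozen LLM and saves the resulting feature vectors. Below is a minimal sketch of that idea; the model name, prompt template, and output path are illustrative assumptions, not the script's actual options:

```python
import torch
from transformers import AutoTokenizer, AutoModel

# Illustrative choices; the real script selects these via --feature_type and --prompt_type.
model_name = "sentence-transformers/all-MiniLM-L6-v2"
smiles_list = ["CCO", "c1ccccc1", "CC(=O)O"]  # toy molecules

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModel.from_pretrained(model_name).eval()

feats = []
with torch.no_grad():
    for smiles in smiles_list:
        prompt = f"The SMILES of the molecule is {smiles}."  # hypothetical prompt template
        inputs = tokenizer(prompt, return_tensors="pt")
        hidden = model(**inputs).last_hidden_state  # (1, seq_len, dim)
        feats.append(hidden.mean(dim=1).squeeze(0))  # mean-pool over tokens

torch.save(torch.stack(feats), "cached_feats.pt")  # hypothetical output path
```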
Then, do BO:

```bash
python run_fixed_features.py --feature_type {FEATURE_TYPE} --method {METHOD} --randseed {RANDSEED} --problem {PROBLEM}
```
The multiobjective experiments follow the same pattern; use `cache_features_multiobj.py` and `run_multiobj.py` instead.
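Under the hood, the fixed-feature BO loop fits an uncertainty-aware surrogate on the cached features of the labeled molecules and queries the candidate picked by an acquisition function. Here is a rough sketch of one such loop using laplace-torch with a Thompson-style acquisition; the surrogate architecture, training schedule, and synthetic data are simplified assumptions, not the repo's exact implementation:

```python
import torch
from torch import nn
from torch.utils.data import DataLoader, TensorDataset
from laplace import Laplace

torch.manual_seed(0)
X = torch.randn(100, 16)                      # stand-in for cached LLM features
y = (X[:, 0] - 0.5 * X[:, 1]).unsqueeze(-1)   # stand-in objective (e.g., a redox potential)

idx = torch.randperm(len(X))
train_idx, cand_idx = idx[:10].tolist(), idx[10:].tolist()

for step in range(5):
    # Small MLP surrogate on top of the fixed features.
    net = nn.Sequential(nn.Linear(16, 32), nn.ReLU(), nn.Linear(32, 1))
    loader = DataLoader(TensorDataset(X[train_idx], y[train_idx]), batch_size=10)
    opt = torch.optim.Adam(net.parameters(), lr=1e-2)
    for _ in range(200):
        for xb, yb in loader:
            opt.zero_grad()
            nn.functional.mse_loss(net(xb), yb).backward()
            opt.step()

    # Laplace approximation over the trained weights yields predictive uncertainty.
    la = Laplace(net, "regression", subset_of_weights="all", hessian_structure="kron")
    la.fit(loader)
    la.optimize_prior_precision()

    # Thompson-style acquisition: draw from the marginal predictive variances
    # (a common simplification of exact joint Thompson sampling) and take the argmax.
    f_mu, f_var = la(X[cand_idx])
    sample = f_mu + f_var.squeeze(-1).sqrt() * torch.randn_like(f_mu)
    best = cand_idx[sample.argmax().item()]
    train_idx.append(best)
    cand_idx.remove(best)
    print(f"step {step}: queried idx {best}, value {y[best].item():.3f}")
```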
## Finetuning Experiments

Simply run the following:
```bash
python run_finetuning.py --foundation_model {FOUNDATION_MODEL} --randseed {RANDSEED} --problem {PROBLEM}
```
See the script for the full list of arguments.
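Since `peft` is among the dependencies, parameter-efficient finetuning such as LoRA is the natural way to update the LLM surrogate cheaply. Below is a minimal sketch of attaching LoRA adapters to a model with a scalar regression head; the base model and hyperparameters are illustrative assumptions, not the paper's exact configuration:

```python
from transformers import AutoModelForSequenceClassification, AutoTokenizer
from peft import LoraConfig, get_peft_model

# Illustrative base model; the actual choice comes from --foundation_model.
base = "roberta-base"
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForSequenceClassification.from_pretrained(base, num_labels=1)  # scalar regression head

lora_cfg = LoraConfig(
    task_type="SEQ_CLS",
    r=8,                                # rank of the low-rank weight update
    lora_alpha=16,
    lora_dropout=0.1,
    target_modules=["query", "value"],  # attach adapters to the attention projections
)
model = get_peft_model(model, lora_cfg)
model.print_trainable_parameters()  # only the adapters (and head) are trainable
```

Only the low-rank adapter weights and the task head are updated, which keeps the memory footprint of finetuning manageable even for large foundation models.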
## BO-LIFT Baseline

The script is in `baselines/run_bolift.py`; it accepts options similar to the fixed-feature script.
## Citation

```bibtex
@inproceedings{kristiadi2024sober,
  title={A Sober Look at {LLMs} for Material Discovery: {A}re They Actually Good for {B}ayesian Optimization Over Molecules?},
  author={Kristiadi, Agustinus and Strieth-Kalthoff, Felix and Skreta, Marta and Poupart, Pascal and Aspuru-Guzik, Al\'{a}n and Pleiss, Geoff},
  booktitle={ICML},
  year={2024}
}
```