This is the code for the framework of the paper Learning the syntax of plant assemblages, to be submitted to Nature Plants.
If you use this code for your work and wish to credit the authors, you can cite the paper (it will be submitted to arXiv very soon):
@article{leblanc2025learning,
title = {Learning the syntax of plant assemblages},
author = {Leblanc, César and Bonnet, Pierre and Servajean, Maximilien and Thuiller, Wilfried and Chytrý, Milan and Joly, Alexis},
journal = {arXiv preprint arXiv:XXXX.XXXXX},
year = {2025},
}
This framework aims to leverage large language models to learn the "syntax" of plant species co-occurrence patterns. In particular, because Pl@ntBERT captures latent dependencies between species across diverse ecosystems, the framework can be used to identify the habitats of vegetation plots.
- Prerequisites
- Data
- Installation
- Examples
- Libraries
- Roadmap
- Unlicense
- Contributing
- Troubleshooting
- Team
- Structure
Python version 3.8 or higher, pip, Git, CUDA, and Git LFS are required.
On many systems Python comes pre-installed. You can try running the following command to check and see if a correct version is already installed:
python --version
If Python is not already installed, or if the installed version is 3.7 or lower, you will need to install a functional version of Python on your system by following the official documentation, which contains a detailed guide on how to set up Python.
Since Pl@ntBERT requires Python 3.8 or higher, pip should already be included by default (it ships with every Python version from 3.4 onwards). To make sure you have it, you can type:
pip --version
If pip is not installed, you can install it by following the instructions here.
To check whether git is already installed or not, you can run:
git --version
If git is not installed, please install it by following the official instructions here.
To check whether CUDA is already installed or not on your system, you can try running the following command:
nvcc --version
If it is not, make sure to follow the instructions here.
To check whether Git LFS is already installed or not on your system, you can try running the following command:
git-lfs --version
If Git LFS is not installed, please install it by following the official instructions here.
The framework is optimized for data files from the European Vegetation Archive (EVA). These files contain all the information required for the proper functioning of the framework, i.e., for each vegetation plot the full list of vascular plant species, the estimates of cover abundance of each species, the location, and the EUNIS classification. Once the database is downloaded (more information here), make sure you rename the species and header data files as species.csv and header.csv, respectively. Not all columns from the files are needed, but if you decide to remove some of them to save space on your computer, make sure that the values are comma-separated and that you keep at least:
- the columns PlotObservationID, Species and Cover from the species file (vegetation-plot data)
- the columns PlotObservationID, Habitat, Longitude and Latitude from the header file (plot attributes)
You can have other columns, but they will be ignored. Two examples of how your files should look are provided within the Data folder (species_example.csv and header_example.csv).
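To give a concrete idea of the expected layout, minimal species and header files could look like the two snippets below; the plot identifiers, species, cover values, habitat codes, and coordinates are made up and only illustrate the format:

PlotObservationID,Species,Cover
1,Vaccinium myrtillus,80
1,Picea abies,15
2,Quercus robur,60
2,Corylus avellana,25

PlotObservationID,Habitat,Longitude,Latitude
1,T3,10.48,46.52
2,T1,2.35,48.85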
Firstly, Pl@ntBERT can be installed via repository cloning:
git clone https://github.com/cesar-leblanc/plantbert.git Pl@ntBERT
cd Pl@ntBERT
Secondly, make sure that the dependencies listed in the environment.yml and requirements.txt files are installed. One way to do so is to use venv:
python -m venv ~/environments/pl@ntbert
source ~/environments/pl@ntbert/bin/activate
pip install -r requirements.txt
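Alternatively, since the repository also ships an environment.yml file, a Conda environment can be created from it. The environment name used below is only an assumption, so activate whichever name is actually defined in the file:

conda env create -f environment.yml
conda activate plantbert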
Thirdly, make sure you download the pre-trained and fine-tuned models:
git lfs install
git clone https://huggingface.co/CesarLeblanc/bert-base-uncased Models/bert-base-uncased
git clone https://huggingface.co/CesarLeblanc/bert-large-uncased Models/bert-large-uncased
git clone https://huggingface.co/CesarLeblanc/plantbert_fill_mask_model Models/plantbert_fill_mask_model
git clone https://huggingface.co/CesarLeblanc/plantbert_text_classification_model Models/plantbert_text_classification_model
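As a possible alternative to cloning with Git LFS, the same weights can be fetched programmatically with the huggingface_hub package. This is a minimal sketch, assuming the package is installed and the repository identifiers above are unchanged; the pre-trained backbones can be downloaded the same way:

from huggingface_hub import snapshot_download

# Download the fine-tuned models into the Models folder
snapshot_download(repo_id="CesarLeblanc/plantbert_fill_mask_model", local_dir="Models/plantbert_fill_mask_model")
snapshot_download(repo_id="CesarLeblanc/plantbert_text_classification_model", local_dir="Models/plantbert_text_classification_model")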
Starting from this point, all commands have to be launched within the Scripts folder:
cd Scripts
Then, to check that the installation went well, use the following command:
python main.py --pipeline check
If the framework was properly installed, it should output:
Files are all present.
Dependencies are correctly installed.
Environment is properly configured.
Make sure to place your species and header data files inside the Data folder before going further.
To pre-process the data from the European Vegetation Archive and create the fill-mask and text classification datasets:
python main.py --pipeline curation
Some changes can be made from this command to create another dataset. Here is an example to create a dataset with 5 different splits, with blocks of 30 arc-minutes (i.e., 0.5 degrees), and while considering that species and habitat types appearing fewer than 5 times are rare:
python main.py --pipeline curation --k_folds 5 --spacing 0.5 --occurrences 5
To train and evaluate a masked language model on the datasets previously obtained using cross validation, run the following command:
python main.py --pipeline masking
Some changes can be made from this command to evaluate other parameters. Here is an example to train the model with a batch size of 4 and a learning rate of 1e-5 for 10 epochs:
python main.py --pipeline masking --batch_size 4 --learning_rate 1e-5 --epochs 10
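Once a fill-mask model is available (for instance the one published on the Hugging Face Hub), it can also be queried directly with the Transformers library to suggest species that are likely to co-occur with the ones observed. This is only an illustrative sketch: the [MASK] token and the comma-separated, lowercase formatting of species names are assumptions, and the framework's own pipelines remain the reference for pre-processing:

from transformers import pipeline

# Load the published fill-mask model (a path to a locally trained model works as well)
fill_mask = pipeline("fill-mask", model="CesarLeblanc/plantbert_fill_mask_model")

# Ask the model which species are most likely to complete this assemblage
for prediction in fill_mask("vaccinium myrtillus, picea abies, [MASK]"):
    print(prediction["token_str"], prediction["score"])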
To train a habitat type classifier from the labeled dataset previously obtained and save its weights, run the following command:
python main.py --pipeline classification
Some changes can be made from this command to train another classifier. Here is an example to train a large model on a pair of (train, validation) sets while sorting the species in a random order:
python main.py --pipeline classification --model large --method random --folds 2
Before making predictions, make sure you include a new file that describes the vegetation data of your choice in the Datasets folder: vegetation_plots.csv. The file, tab-separated, should contain only one column (if there are other columns they will be ignored):
- Observations (strings): a list of comma-separated names of species, ranked (if possible) in order of abundance
An example of how your file should look is provided within the Datasets folder (vegetation_plots_example.csv).
To predict the missing species and habitat classes of the new samples using previously trained models, make sure the weights of the desired models are stored in the Models folder. You can also use the models already provided (i.e., the first fold of a base model trained on dominance-ordered species sequences with a batch size of 2 and a learning rate of 2e-5 that encodes binomial names as one token) and then run the following command:
python main.py --pipeline inference
Some changes can be made from this command to predict differently. Here is an example to predict the 3 most likely habitat types using the first fold of an already trained base model with a batch size of 2 and a learning rate of 1e-05 on randomly-ordered species sequences:
python main.py --pipeline inference --model_habitat plantbert_text_classification_model_base_random_1_1e-05_0 --predict_species False --k_habitat 3
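If you only need habitat predictions for a handful of plots, the published text classification model can also be queried directly with the Transformers library. This is a hedged sketch rather than the reference implementation: the input formatting and the label names returned are assumptions, and the top_k argument requires a reasonably recent version of Transformers:

from transformers import pipeline

# Load the published habitat classifier (a path to a locally fine-tuned model works as well)
classifier = pipeline("text-classification", model="CesarLeblanc/plantbert_text_classification_model")

# Predict the three most likely habitat types for an illustrative species assemblage
print(classifier("vaccinium myrtillus, picea abies, abies alba", top_k=3))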
To run the full pipeline and perform all tasks at once (i.e., checking if the framework is correctly installed, pre-processing data to create curated datasets, training and evaluating a masked language model, using it to fine-tune a habitat type classifier, and predicting missing species and habitat types of vegetation plots), run the following command:
python main.py --pipeline check curation masking classification inference
This section lists the major frameworks and libraries used to create the models included in the project:
- PyTorch - for tensor computation with strong GPU acceleration
- Scikit-learn - for quantifying the quality of the predictions
- Transformers - for pretrained models to perform training tasks
- Pandas - for fast, flexible, and expressive data structures
- Verde - for processing spatial data and interpolating it
This roadmap outlines the planned features and milestones for the project. Please note that the roadmap is subject to change and may be updated as the project progresses.
- Implement multilingual user support
- English
- French
- Integrate new popular LLMs
- BERT
- RoBERTa
- DistilBERT
- ALBERT
- BioBERT
- Add more habitat typologies
- EUNIS
- NPMS
- Include other data aggregators
- EVA
- TAVA
- Offer several powerful frameworks
- PyTorch
- TensorFlow
- JAX
- Allow data parallel training
- Multithreading
- Multiprocessing
- Supply different classification strategies
- Top-k classification
- Average-k classification
This framework is distributed under the Unlicense, meaning that it is dedicated to the public domain. See UNLICENSE.txt for more information.
If you plan to contribute new features, please first open an issue and discuss the feature with us. See CONTRIBUTING.md for more information.
- an internet connection is necessary for the check task (for GitHub access) and for the curation and inference tasks (for GBIF normalization).
- before using a model for inference, make sure you have trained this exact model (with the required set of parameters) on the required task.
- before curating a dataset, make sure it contains enough vegetation data (the more the better, both for vegetation plots and observations).
Pl@ntBERT is a community-driven project with several skillful engineers and researchers contributing to it.
Pl@ntBERT is currently maintained by César Leblanc with major contributions coming from Alexis Joly, Pierre Bonnet, Maximilien Servajean, and the amazing people from the Pl@ntNet Team in various forms and means.
.
├── .github -> GitHub-specific files
│ ├── ISSUE_TEMPLATE -> Templates for issues
│ │ ├── bug_report.md -> Bug report template
│ │ └── feature_request.md -> Feature request template
│ └── pull_request_template.md -> Pull request template
├── CODE_OF_CONDUCT.md -> Community guidelines
├── CONTRIBUTING.md -> Contribution instructions
├── Data -> Data files
│ ├── eunis_habitats.xlsx -> EUNIS habitat data
│ ├── header_example.csv -> Header example file
│ └── species_example.csv -> Species example data
├── Datasets -> Vegetation datasets
│ └── vegetation_plots_example.csv -> Vegetation plots example data
├── Images -> Image assets
│ └── logo.png -> Project logo
├── Models -> Pre-trained and fine-tuned models
├── README.md -> Project overview
├── SECURITY.md -> Security policy
├── Scripts -> Code scripts for the project
│ ├── __init__.py -> Package initialization
│ ├── cli.py -> Command-line interface
│ ├── data -> Data-related scripts
│ │ ├── __init__.py -> Package initialization
│ │ ├── load_data.py -> Load data scripts
│ │ ├── preprocess_data.py -> Preprocess data scripts
│ │ ├── save_data.py -> Save data scripts
│ │ └── utils_data.py -> Data utilities
│ ├── epoch -> Training and testing scripts
│ │ ├── __init__.py -> Package initialization
│ │ ├── test_epoch.py -> Test models per epoch
│ │ ├── train_epoch.py -> Train models per epoch
│ │ └── utils_epoch.py -> Epoch-related utilities
│ ├── main.py -> Main entry point
│ ├── metrics -> Metric computation
│ │ ├── __init__.py -> Package initialization
│ │ ├── accuracy.py -> Accuracy calculation
│ │ ├── f1.py -> F1-score calculation
│ │ ├── precision.py -> Precision calculation
│ │ └── recall.py -> Recall calculation
│ ├── modeling -> Model-related scripts
│ │ ├── __init__.py -> Package initialization
│ │ ├── load_modeling.py -> Load model scripts
│ │ ├── preprocess_modeling.py -> Preprocess input for models
│ │ ├── save_modeling.py -> Save trained models
│ │ └── utils_modeling.py -> Model utilities
│ ├── pipelines -> Task-specific pipelines
│ │ ├── __init__.py -> Package initialization
│ │ ├── check.py -> Debug pipelines
│ │ ├── classification.py -> Text classification pipeline
│ │ ├── curation.py -> Dataset curation pipeline
│ │ ├── inference.py -> Inference pipeline
│ │ └── masking.py -> Fill-mask pipeline
│ ├── submission_script.sh -> Job submission script
│ └── utils.py -> General utilities
├── UNLICENSE.txt -> Public domain license
├── environment.yml -> Conda environment file
└── requirements.txt -> Python dependencies