The current species can be seen in args.json
in the species
object. To add a new species do the following:
- Add new database to mrcdata.
- Add new species to
args.json
in beebop_py with properties.
The app assigns clusters with quality control (qc) on. This is to enable the --run-qc
flag as per here.
The arguments along with --qc-run
can be found at args.json
in the qc_dict
json object. These are species dependent and can be changed as per the species requirements.
git clone [email protected]:bacpop/beebop_py.git
You will need species databases, please download and extract it into /storage
. You can download all databases with the following command:
./scripts/download_databases
or just the reference databases with:
./scripts/download_databases --refs
You will need Python installed, as well as Poetry, which you can get on Linux with
curl -sSL https://install.python-poetry.org | python3 -
To install PopPUNK, follow these steps:
First, create a new conda environment: conda create --name beebop_py python=3.9
and activate it with conda activate beebop_pyonda activate beebop_py
Then install PopPUNK to your computer - either the latest version:
pip3 install git+https://github.com/bacpop/PopPUNK#egg=PopPUNK
..or a specific version:
pip3 install git+https://github.com/bacpop/[email protected]#egg=PopPUNK # for PopPUNK v2.6.7. replace with desired version
If there are problems installing PopPUNK, you may need to install one or more of the following packages with sudo apt get install
:
- cmake
- libhdf5-dev
- libeigen3-dev
- libopenblas-dev
Please install the dependencies for PopPUNK with:
conda install requests pandas networkx pp-sketchlib scikit-learn hdbscan biopython tqdm treeswift rapidnj
The graph-tool
and mandrake
packages must be installed from conda-forge
:
conda install -c conda-forge graph-tool mandrake
To install all other required dependencies go into the beebop_py project folder and run (having the conda environment 'beebop_py' activated)
poetry install
Start the flask app with
./scripts/run_app
In a second terminal, you can now query the flask endpoints, e.g.:
- /version
curl http://127.0.0.1:5000/version
Before testing, Redis and rqworker must be running. From the root of beebop_py, run (with 'beebop_py' env activated)
./scripts/run_test_dependencies
Testing can be done in a second terminal (make sure to activate 'beebop_py') by running
TESTING=True poetry run pytest
To use a specific version, commit or branch of PopPUNK in a beebop_py deployment, you can update POPPUNK_VERSION
in common
.
The new dev images built with /docker/build --with-dev
will have a -dev postfix.
You can build the image with /docker/build --with-dev
, this new image can now be used by Beebop.
A pull request can be created so GHA pushes the images to the docker hub. Add --with-dev
to the build & push commands pipeline.yaml
.
Ensure to remove the --with-dev
flag before merging the PR.
Then on the beebop-deploy
the api image can be updated with the new dev image.