Docker containers to pre-process data for visualization in the portal.
The subdirectories in this repo all have the same structure:

- `context/`: A Docker context, including a `Dockerfile` and typically `main.py`, `requirements.txt`, and `requirements-freeze.txt`.
- `test-input/`, `test-output-actual/`, `test-output-expected/`: Test fixtures.
- `VERSION`: Contains a semantic version number.
- and a `README.md`.
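For example, a single container directory might look like this (the directory name and exact file list are illustrative; check the actual subdirectories for specifics):

```
ome-tiff-offsets/            # hypothetical example container
├── context/
│   ├── Dockerfile
│   ├── main.py
│   ├── requirements.txt
│   └── requirements-freeze.txt
├── test-input/
├── test-output-actual/
├── test-output-expected/
├── VERSION
└── README.md
```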
Images are named by the containing directory.
Running `test.sh` will build (and test!) all the images.
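A typical check after cloning might look like this (assuming Docker is installed and running):

```sh
# Build and test every image in the repo:
./test.sh

# The freshly built images should now show up locally:
docker images
```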
You can then define `$INPUT_DIR`, `$OUTPUT_DIR`, and `$IMAGE` to run an image with your own data:

```sh
docker run \
  --mount type=bind,source=$INPUT_DIR,target=/input \
  --mount type=bind,source=$OUTPUT_DIR,target=/output \
  $IMAGE
```
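For a quick smoke test you could point an image at its own test fixtures; the paths and image name below are illustrative:

```sh
# Illustrative values; substitute your own paths and image name.
INPUT_DIR=$PWD/ome-tiff-offsets/test-input
OUTPUT_DIR=$PWD/my-output
IMAGE=ome-tiff-offsets
mkdir -p "$OUTPUT_DIR"
# ...then run the `docker run` command above.
```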
To push the latest versions to Docker Hub, just run:

```sh
test_docker.sh push
```
This repo is included as a submodule in `ingest-pipeline`.
When there are changes here that you want to run in production:

- bump the `VERSION` file in the corresponding container's subdirectory (see the sketch after this list)
- update the version referenced in the corresponding `.cwl` file in the root directory
- run `test_docker.sh push`
- make a PR in `ingest-pipeline` to update that submodule to the latest code here, and make Joel a reviewer on the PR.
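As a rough sketch of the first two steps (the container name and version numbers are hypothetical):

```sh
# Bump the container's semantic version (hypothetical name and numbers):
echo '0.2.0' > ome-tiff-offsets/VERSION

# Find where the old tag is referenced in the corresponding CWL file, then edit it to match:
grep -n '0.1.0' ome-tiff-offsets.cwl
```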
Depending on the rate of change, it might be good to have a weekly routine of making PRs to `ingest-pipeline`. TBD.
In addition, each workflow must have a corresponding `-manifest.json` file conforming to this schema, which has a `pattern`, `description`, and `edam_ontology_term` entry for each output file. (See here for information about EDAM.)
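A minimal sketch of such a manifest is below; the field names come from the paragraph above, but the file name, top-level shape, and values (including the EDAM term) are illustrative guesses, so consult the schema for the authoritative structure:

```sh
# Hypothetical example; values are illustrative only.
cat > ome-tiff-offsets-manifest.json << 'EOF'
[
  {
    "pattern": "*.offsets.json",
    "description": "Byte offsets into the OME-TIFF, used for efficient visualization",
    "edam_ontology_term": "EDAM_1.24.format_3464"
  }
]
EOF
```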
The submodule update in `ingest-pipeline` looks like this:

```sh
# In ingest-pipeline:
git checkout devel
git pull
git submodule update --init --recursive # This fails right now because we're not using plain URLs in .gitmodules.
git checkout -b username/update-portal-containers
cd src/ingest-pipeline/airflow/dags/cwl/portal-containers/
git checkout master
git pull
cd -
git commit -am 'Update portal-containers'
git push origin
# And then make a PR at: https://github.com/hubmapconsortium/ingest-pipeline
```
Here is a template for the PR into `ingest-pipeline`, if there is a new pipeline in `portal-containers` that needs to be run. This helps us communicate what the pipeline's input, output, and purpose are:
# --NAME OF THE PIPELINE--
## Input Pipeline/Original Dataset:
## Output Pipeline (Optional):
## Description:
For example:
# [ome-tiff-tiler](https://github.com/hubmapconsortium/portal-containers/blob/master/ome-tiff-tiler.cwl)
## Input Pipeline/Original Dataset + Files:
- High resolution imaging from Vanderbilt data (OME-TIFF) files, such as those in `/hive/hubmap/lz/Vanderbilt TMC/e6e9bb7c01d3cb9cdb31a8da857f8832/processedMicroscopy/`
## Output Pipeline:
- [ome-tiff-offsets](https://github.com/hubmapconsortium/portal-containers/blob/master/ome-tiff-offsets.cwl)
## Description:
This pipeline takes as input Vanderbilt's processed microscopy data and outputs an image pyramid for visualization. In addition, the `ome-tiff-offsets` pipeline needs to be run on the output of `ome-tiff-tiler` so that images with z-stacks/large numbers of channels can be efficiently visualized.
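Outside of `ingest-pipeline`, that chaining could be sketched with plain `docker run` calls like the ones earlier in this README (the image names assume images are named after their directories; the host-path variables are placeholders):

```sh
# 1. Build the image pyramid from the raw OME-TIFFs:
docker run \
  --mount type=bind,source=$RAW_DIR,target=/input \
  --mount type=bind,source=$PYRAMID_DIR,target=/output \
  ome-tiff-tiler

# 2. Compute byte offsets for the pyramid so it can be visualized efficiently:
docker run \
  --mount type=bind,source=$PYRAMID_DIR,target=/input \
  --mount type=bind,source=$OFFSETS_DIR,target=/output \
  ome-tiff-offsets
```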