Gathering requirements #710
hammer
started this conversation in
Discourse import
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
(Posted by @alimanfoo)
Hi folks,
Following up on our conversations around gathering requirements in the form of some analysis stories, I made a new github repo at https://github.com/scikit-allel/requirements.
I added a first simple analysis story which describes a population structure PCA, just to test out the approach and explore a good way to document requirements. I tried to be fairly specific, but haven't bottomed out every detail.
I also added some pages describing some existing public datasets that could be used for testing etc. Not sure this is the best place for this type of info ultimately, but seemed useful to have this together with requirements.
Here's some info about the Ag1000G phase 2 dataset.
Here's some info about the (human) 1000 genomes phase 3 dataset, which I've converted to zarr and is currently in transit up to GCS.
I know neither of these are biobank scale, but useful for testing. It would be great to add a simulated biobank-scale dataset.
Any feedback very welcome on whether this is a useful approach.
Cheers,
Alistair
Beta Was this translation helpful? Give feedback.
All reactions