Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Vcf gz datatype #4254

Merged
merged 13 commits into from
Aug 10, 2017
Merged

Vcf gz datatype #4254

merged 13 commits into from
Aug 10, 2017

Conversation

ffinfo
Copy link
Contributor

@ffinfo ffinfo commented Jun 28, 2017

No description provided.

@mvdbeek
Copy link
Member

mvdbeek commented Jun 28, 2017

@galaxybot test this

f.seek(offset)
ck_data = f.read(ck_size or trans.app.config.display_chunk_size)
if ck_data and ck_data[-1] != '\n':
f = compression_utils.get_fileobj(dataset.file_name)
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think we can do with compression_utils.get_fileobj(dataset.file_name) as f:.

@mvdbeek
Copy link
Member

mvdbeek commented Jul 6, 2017

@ffinfo let me know if you want to continue working on this (I think we're almost there), otherwise I can do the last bit of polishing.

@ffinfo
Copy link
Contributor Author

ffinfo commented Jul 6, 2017

@mvdbeek Ye was still planning to continue on this. I think I still have some uncommitted code for a test tool. Was a bit of a chaos when I did came back to Leiden. You know how that goes with todo's ;)

<param name="input" format="vcf_bgzip" type="data" label="Source file"/>
</inputs>
<outputs>
<data format="data" ftype="vcf_bgzip" name="output" />
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This should be format="vcf_bgzip". ftype is (confusingly ...) only used in the test section.

@ffinfo ffinfo changed the title WIP: Vcf gz datatype Vcf gz datatype Jul 6, 2017
@mvdbeek
Copy link
Member

mvdbeek commented Jul 7, 2017

@galaxybot test this

mvdbeek and others added 12 commits July 7, 2017 18:29
This adds support for uploading VcfGz files in the upload box. When a
user does this, the file will not be uncompressed. Also adds
uncompressed preview.

TODO:
  - Move Vcf class to BaseVcf
  - Vcf and VcfGz should both inherit from BaseVcf
  - Add a metadata element for VcfGz indexes (vzf.gz.tbi)
  - Create index (in set_meta method of VcfGz class)
  - Write test tools

This is a kcikstart to @ffinfo GCC hack project.
@mvdbeek mvdbeek force-pushed the vcf_gz_datatype branch from 2b83183 to 5cc71d2 Compare July 7, 2017 15:30
@mvdbeek
Copy link
Member

mvdbeek commented Jul 7, 2017

@galaxybot test this

@mvdbeek
Copy link
Member

mvdbeek commented Aug 4, 2017

@ffinfo this is looking good to me, If you're happy with this I can merge it. bctools generates csi indexes by default (i.e without the -t flag), are these widely used ? Should we also implement them ?

@ffinfo
Copy link
Contributor Author

ffinfo commented Aug 4, 2017

Well tools like Picard and GATK are using by default '.tbi' extensions. The csi is more used for bcf files and not for vcf files. That why I think it's better to keep it with the '-t' flag

@mvdbeek
Copy link
Member

mvdbeek commented Aug 7, 2017

Alright, we can always add this if there is demand.

@mvdbeek
Copy link
Member

mvdbeek commented Aug 7, 2017

@galaxybot test this

@mvdbeek
Copy link
Member

mvdbeek commented Aug 7, 2017

@galaxybot test this

@jmchilton
Copy link
Member

Awesome - thanks for the contribution @ffinfo and @mvdbeek !

@jmchilton jmchilton merged commit bcdf2c8 into galaxyproject:dev Aug 10, 2017
nsoranzo added a commit to nsoranzo/galaxy that referenced this pull request Aug 22, 2017
In galaxyproject#4254 `vcf_bgzip` was made
a proper datatype with the introduction of the `VcfGz` class, but was
still described as a subclass in `datatypes_conf.xml` .

Also fix `file_ext` attribute in `BaseVcf` and subclasses.
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants