Remove empty file from a dataset collection #5090

alimatai · 2017-11-29T15:57:00Z

Hello,

I'm working on a tool which consists in two scripts following each others; the second script takes as input the output of the first one.

The point is that with some datafiles, the first script fails and there is no output file. I have implemented a proper exit in the second script to prevent a crash of the tool, but when I use a dataset collection, an empty output file remains (it's green and empty, not red).

Is it possible to delete it or not make it appear at all ? If I'm correct there is a tool which remove failed datafiles within a dataset collection, could it work in that case ?

All the best,

Mataivic.

jmchilton · 2018-01-25T04:14:36Z

Thanks for the issue - I don't have any progress to report but I wanted to let you know I saw this and that I think it is a really good idea.

alimatai · 2018-03-01T08:25:11Z

@jmchilton I'm looking at the class FilterFailedDatasetsTool (here). I assume an element of a collection is returned in the filtered collection when valid == True ?

Is there an element attribute which indicate that an element is empty ? If yes, a supplemental condition would prevent empty files to be returned in the filtered collection, right ? For example, use an empty state/attribute on the datafile, or check if the size of the file equals 0 ?

I'm trying to find if there is an element class or an empty attribute, but I don't know if there is any (it's my first time in galaxy source code :) ). The only things I found are here :

line 1272, in the class History : dict_element_visible_keys = ['id', 'name', 'genome_build', 'deleted', 'purged', 'update_time', 'published', 'importable', 'slug', 'empty']
line 1777 if the class DataSet : ```
states = Bunch(NEW='new',UPLOAD='upload',QUEUED='queued',RUNNING='running',OK='ok',EMPTY='empty', ....)

Could it be possible to use the 'empty' element of these lists ?

alimatai · 2018-03-03T10:12:40Z

I found a way to do it : write if element.is_ok and element.has_data(): instead of only if element.is_ok():. It seems to work fine on my galaxy instance. I can make a PR on monday.

hexylena · 2019-08-14T13:30:24Z

I think we have a filter failed now

nsoranzo · 2019-08-14T15:12:01Z

@erasche This was for filtering empty not failed datasets in a collection.

hexylena · 2019-08-14T15:13:23Z

the first script fails and there is no output file. I have implemented a proper exit in the second script to prevent a crash of the tool

so they should revert this proper exit, and then it's solved :)

mvdbeek · 2019-08-14T15:15:21Z

But we got this now as well, @Mataivic implemented this in #5640

alimatai mentioned this issue Mar 1, 2018

Improvements for Collection Operations #2496

Open

8 tasks

jennaj mentioned this issue Mar 2, 2018

Testing and tool/doc updates galaxyproject/usegalaxy-playbook#86

Closed

57 tasks

hexylena closed this as completed Aug 14, 2019

nsoranzo reopened this Aug 14, 2019

mvdbeek closed this as completed Aug 14, 2019

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove empty file from a dataset collection #5090

Remove empty file from a dataset collection #5090

alimatai commented Nov 29, 2017

jmchilton commented Jan 25, 2018

alimatai commented Mar 1, 2018 •

edited

Loading

alimatai commented Mar 3, 2018

hexylena commented Aug 14, 2019

nsoranzo commented Aug 14, 2019

hexylena commented Aug 14, 2019 •

edited

Loading

mvdbeek commented Aug 14, 2019 •

edited by hexylena

Loading

Remove empty file from a dataset collection #5090

Remove empty file from a dataset collection #5090

Comments

alimatai commented Nov 29, 2017

jmchilton commented Jan 25, 2018

alimatai commented Mar 1, 2018 • edited Loading

alimatai commented Mar 3, 2018

hexylena commented Aug 14, 2019

nsoranzo commented Aug 14, 2019

hexylena commented Aug 14, 2019 • edited Loading

mvdbeek commented Aug 14, 2019 • edited by hexylena Loading

alimatai commented Mar 1, 2018 •

edited

Loading

hexylena commented Aug 14, 2019 •

edited

Loading

mvdbeek commented Aug 14, 2019 •

edited by hexylena

Loading