completed flag check may erroneously stop on regex matches #470

nsheff · 2024-02-17T13:51:48Z

I'm trying to submit 6 jobs with looper. I've never submitted any before, it's a brand new project. I noticed one of them says:

Found existing status: completed. Skipping sample.

This is bizarre because it's a brand new project! It has never been submitted before.

I realize this sample shares a prefix with another sample: one is named pairs_swap_maintain_coords, which the pipeline runs, and then the next sample is named pair_swap -- which the pipeline incorrectly says is already completed.

I'm guessing there's a regex that's looking for {sample_name}*_completed.flag -- if that's the case, it would actually register the first one as completed for the second one, and then never submit that job.

The text was updated successfully, but these errors were encountered:

nsheff · 2024-02-17T13:54:41Z

This may actually be a bug with pipestat, rather than looper. I'm guessing this coincides with a switch to using pipestat for status checks.

donaldcampbelljr · 2024-02-21T20:18:30Z

I can't reproduce this using a modified hello_looper example for both the basic and the pipestat approaches (which look for their flags in slightly different ways).

donaldcampbelljr · 2024-02-21T20:22:56Z

However, looking at the basic, non-pipestat example, I can see where the function fetch_sample_flags might have issues if you had a flag from a different sample in the results folder, because of this logic:

looper/looper/utils.py

Lines 93 to 98 in 1468956

    
           folder_contents = [os.path.join(sfolder, f) for f in os.listdir(sfolder)] 
        
           return [ 
        
               x 
        
               for x in folder_contents 
        
               if os.path.splitext(x)[1] == ".flag" and os.path.basename(x).startswith(pl_name) 
        
           ]

Appears it is only concerned with .flag and the pipeline_name. The sample name doesn't matter.

nsheff · 2024-03-12T14:07:39Z

was this fixed by the pipestat update referenced above?

donaldcampbelljr · 2024-03-12T19:14:21Z

I don't believe so. The pipestat code above was broken for filebackend and is not used for getting sample statuses.

donaldcampbelljr · 2024-03-14T17:21:42Z

Should be solved with the above commit.

nsheff added the bug label Feb 17, 2024

github-project-automation bot added this to PEP and PEP Vision Feb 17, 2024

nsheff mentioned this issue Feb 21, 2024

pipestat retrieval breaks with common prefix pepkit/pipestat#159

Closed

nsheff added this to the v1.8.0 milestone Mar 12, 2024

donaldcampbelljr added a commit that referenced this issue Mar 13, 2024

fix for #470

c089082

donaldcampbelljr added the likely-solved label Mar 13, 2024

donaldcampbelljr closed this as completed Jun 6, 2024

github-project-automation bot moved this to Done in PEP Jun 6, 2024

github-project-automation bot moved this to Done in PEP Vision Jun 6, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

completed flag check may erroneously stop on regex matches #470

completed flag check may erroneously stop on regex matches #470

nsheff commented Feb 17, 2024

nsheff commented Feb 17, 2024

donaldcampbelljr commented Feb 21, 2024

donaldcampbelljr commented Feb 21, 2024

nsheff commented Mar 12, 2024 •

edited

Loading

donaldcampbelljr commented Mar 12, 2024

donaldcampbelljr commented Mar 14, 2024

completed flag check may erroneously stop on regex matches #470

completed flag check may erroneously stop on regex matches #470

Comments

nsheff commented Feb 17, 2024

nsheff commented Feb 17, 2024

donaldcampbelljr commented Feb 21, 2024

donaldcampbelljr commented Feb 21, 2024

nsheff commented Mar 12, 2024 • edited Loading

donaldcampbelljr commented Mar 12, 2024

donaldcampbelljr commented Mar 14, 2024

nsheff commented Mar 12, 2024 •

edited

Loading