Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve lsof file checking #6069

Open
3 tasks
masaball opened this issue Oct 7, 2024 · 2 comments
Open
3 tasks

Improve lsof file checking #6069

masaball opened this issue Oct 7, 2024 · 2 comments

Comments

@masaball
Copy link
Contributor

masaball commented Oct 7, 2024

Description

In our batch process we call lsof to make sure the files are fully uploaded to the dropbox before being ingested. This method is also utilized in the regular file upload flow so that only files that are fully processed will be shown in the dropbox listing.

This accounts for as much as 30-40% of the load time when rendering the file-upload-step page. Improving the efficiency of the check, moving the check to happen when opening dropbox instead of at initial page load, or removing the file check entirely would net us a significantly lower load time.

This will require a change since the contents of the storage source are probably being written into the DOM at page load. A moderate amount of work will need to be done in Avalon with Browse Everything to make this change. Is this worth it, especially in the context of moving away from Browse Everything in the near future?

Done Looks Like

  • Remove check on page load when clicking on Manage Files
  • Move the method to be called when "Open Dropbox" is selected
  • Can the lsof call be removed completely? Investigate and create new issue
@joncameron
Copy link
Contributor

Could there be a check against modification times on files, and then run lsof only on those? This could be a way to speed things up.

@elynema
Copy link
Contributor

elynema commented Oct 11, 2024

If we remove the lsof check on batch ingest in general, files could be truncated when ingested or could error on encoding. If there is an error on encoding, then that will go out in a report to the user post-ingest. If the file is truncated, there's no error to report. However, we think the moving the original file to ArchiverSpool to get sent to SDA happens post-encoding, so by that time it is much more likely that the entire file has uploaded and the full version is sent to tape for safe storage.

@joncameron joncameron changed the title Improve or remove lsof file checking Improve lsof file checking Nov 20, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

3 participants