-
Notifications
You must be signed in to change notification settings - Fork 84
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Support bagging to a destination other than the source #35
Comments
bump |
@finoradin Thanks – I'll work on a pull request for this. Strong +1 on checksumming the source files — the way I typically do things like that would be streaming reads at some large block size since that's also good for performance on network filesystems and it's a natural extension to feed each block into the hashers as well. |
I'm willing to take this on. Would it be agreeable to add a I would, of course, checksum the files as described above. |
@runderwood +1 – that should cover the most common use-cases |
@acdha Clarification: With multiple sources, should:
|
I would vote for choice 3, one bag with each directory for each source |
@johnscancella I can see that being useful, but I also find it counter-intuitive, since directories in EDIT: I also should mention that merging directories seems consistent w/ some of the use cases that have come up for me. But that's also a bit weird, I suppose -- though maybe no weirder than in-place bagging. |
FYI, I have a branch where bagging to a destination (with one source only) seems to work alright. |
I would be proposing that given the source directories
|
OK. Sounds good to me. I'll give it a shot. |
@johnscancella Sorry, but does it make more sense to you to have the behavior remain the same with just one source, such that even when providing only one
...or should, in that case, the contents of |
I would go with
becomes
|
Currently the module only allows one to do what the LOC Java library calls "bag in place". It would be very useful to have built-in the ability to specify one or more payloads as the "source" and to then specify a "destination" where the bag containing the payloads will be created.
Minor but important note – the hashes in the manifest should be generated from the source payloads, not the copied files in the bag.
The text was updated successfully, but these errors were encountered: