Skip to content
This repository has been archived by the owner on Apr 16, 2020. It is now read-only.

DPLA (Digital Public Library of America) Dataset on IPFS #68

Open
flyingzumwalt opened this issue Aug 4, 2016 · 4 comments
Open

DPLA (Digital Public Library of America) Dataset on IPFS #68

flyingzumwalt opened this issue Aug 4, 2016 · 4 comments

Comments

@flyingzumwalt
Copy link
Contributor

The Digital Public Library of America (DPLA) provides open and coherent access to our society’s digitized cultural heritage by aggregating info about all the digital materials held at many of the universities, public libraries, and other public-spirited organizations in the USA. It's a huge trove of metadata with pointers to a massive amount of digital materials that don't get enough attention.

Anyone interested in putting the DPLA dataset on IPFS? @cmh2166 @anarchivist @dchud @mjgiarlo @edsu @bibliotechy

It would also be possible to put the whole DPLA metadata processing pipeline onto IPFS. @chadfennell ?

@danfowler
Copy link

@flyingzumwalt did you get anywhere here?

@flyingzumwalt
Copy link
Contributor Author

@danfowler I haven't gotten any nibbles on this one, but I do know that @mdellabitta has recently done great work converting DPLA's internal ETL workflows to use Apache Spark. This makes me suspect that it would be very easy to pipe a copy of their dataset into IPFS.

This might also be a good point to experiment with using IPFS to track derivative datasets that people produce based on the complete DPLA set. Likewise it might be a good time to explore using IPFS in the aggregation flow from DPLA hubs to the national aggregator.

@cmharlow
Copy link

cmharlow commented Apr 14, 2017 via email

@flyingzumwalt
Copy link
Contributor Author

I definitely want to give these discussions a home that works. Redirecting that discussion to ipfs/community#224

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants