-
Notifications
You must be signed in to change notification settings - Fork 113
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
data.json harvesting fails after project-open-data.cio.gov is redirected #1895
Comments
Debugging shows that it only affects dataset that has 3+ level of organizations, as in this one:
On the 3rd level of |
POD schema files have been moved to new location at: https://resources.data.gov/schemas/dcat-us/v1.1/schema/catalog.json In a mock harvesting environment it shows that if we can update the |
@FuhuXia @adborden |
While we work out the longer-term fix with pod.c.g, I think there is a short term fix is to create a complete jsonschema based on the v1.1 schema and use that locally within the ckan extension. Right now, we have a partial implementation of this. Our embedded schema in ckanext-datajson uses Ultimately, we want to resolve the issue with p.c.g but we don't want to rush to an incomplete or short-sighted solution, so for this issue let's aim for a local fix. |
Good point to stick to localized schema file. As a matter of fact we are using localized an all-in-one dataset schema file. This pod.c.g redirecting issue made us aware that there is bug in the
|
GitHub issue created on jsonschema repo for this reading-external-url-with-local-definition issue. |
project-open-data.cio.gov has been restored, so this is no longer an urgent issue, but still an issue. |
@adborden @FuhuXia Maybe this PR python-jsonschema/jsonschema#717 could solve the problem. |
After https://project-open-data.cio.gov is redirected to https://resources.data.gov, data.json harvesting start to fail. It only affects some datasets in some data.json sources. Debugging shows that jsonschema validator tries to reach url https://project-open-data.cio.gov/v1.1/schema/organization.json and raised an exception when it is redirected to resources's homepage.
How to reproduce
harvest GSA's data.json source https://open.gsa.gov/data.json
Expected behavior
harvest should complete
Actual behavior
harvest is stuck. errors on fetch log.
The text was updated successfully, but these errors were encountered: