-
Notifications
You must be signed in to change notification settings - Fork 26
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Search Engine- What formal metadata would look like #40
Comments
An update regarding this issue. I have chosen at the moment to expose the opensearch mode of the pycsw module to implement a search engine into our instance of the EDR-API. Under the hood explanation With these metadata artifacts (let's call it collections.json files), each json file containing each collection ID and metadata, we can power a search engine. Using the pycsw module, I have implemented a search engine using opensearch. To implement the search engine, metadata is extracted from the collections.json files for each dataset and translated into csw:records in xml format. With a directory full of these xml csw:records, each record being a specific collection, and each record containing metadata that the search engine will search from, a pycsw process is used to ingest the records into a sqlite3 database. With this, pycsw is implemented with our metadata records. In Practice
Further work
|
Nice work @ShaneMill1. We could think about a collection, here, being modeled as a collection of variables at a certain xyzt. We could also associate this with the common view of OGC API - Records of discovery metadata. So a discovery metadata record (say WMO Core Metadata profile) can have a link relation for a real-time data search of a given dataset. This could link to an OGC API - Records implementation of this finer level of granularity (per this issue).
To be clear, this doesn't mean that WIS records are the way into a real-time search, but one way that is interoperable. We can use the OGC API - Record draft record model and extend it for MetOcean needs. |
As a result of the EDR-API Sprint, I worked on a basic search engine implementation that simply allows a user to search from the landing page of the NWS EDR implementation, and a dictionary of links to the collections containing the keywords is returned. This allows the users to be taken directly to the link where they can immediately make a query, rather than navigating the structure of the API to find whether or not that collection contains the data they are looking for.
This is a nice simple implementation, but we need to work on how formal metadata would look when returned by the search engine.
I have been looking at the pycsw module which offers several different implementations of relevant OGC services including Opensearch geo and time extensions as well as CSW. Through the EDR, do we offer some of these services? All of the services? Is it up to the data provider?
@tomkralidis had this comment on the Sprint Issue, and this is an important thing to discuss so this issue serves that purpose:
The text was updated successfully, but these errors were encountered: