-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
GREI 4: Task 6 - Assemble and provide metrics about HDV data collection for therapeutic areas #229
Comments
Status: May 2024
|
@sbarbosadataverse, I searched for datasets whose metadata contains:
Could you review the datasets from that search to see if many of these datasets seem irrelevant? This will help me evaluate the way I'm finding these datasets, before we consider using more search terms like we spoke about, such as the "therapeutic areas" at https://www.cdisc.org/standards/therapeutic-areas/disease-area and the names of NIH centers and institutes. We also spoke about looking at the keywords and topic classifications in the metadata of datasets from NIH-funded research in the Harvard Dataverse Repository (#217), and using those as search terms, too. I put those keywords and topic classifications in tabs in the spreadsheet at https://docs.google.com/spreadsheets/d/1OAQiSkgyeb_YdM4rFhl439FUeNadvmg5R5r0d4PN4us. Could you take a look? My impression is that someone with domain knowledge would need to review these before we can use them for searching. Feels like many of the keywords and especially the topic classifications wouldn't be that helpful, but I'm not sure. Maybe we could use only the keywords when we see that it comes from a relevant vocabulary, like MeSH, SNOMED-CT, and NCIT. |
Status: June 2024
|
@sbarbosadataverse and @cmbz, I'm going to close this GitHub issue. I'm curious how these counts will be used and during the GREI-Monthly CWG Meeting on July 10 I plan to ask about them (unless someone else brings them up). |
Re-opening this issue. @sbarbosadataverse asked that I include counts of datasets that include the term "covid19"in their metadata. I'm getting that count now and will update the Harvard Dataverse tab of the the Top Biomedical Research Categories in GREI Repository Holdings spreadsheet today. |
I updated the "Harvard Dataverse" tab and the "Aggregate" tab of the Top Biomedical Research Categories in GREI Repository Holdings spreadsheet. |
Closed the last remaining checkbox as we consider in a new issue how to make use of the information we collected for this ask from the GREI planning unit |
Overview
Cardiology
,Systems Neuroscience
,Quality of Life
,Bioinformatics Software
, andReal-Time Polymerase Chain Reaction
Tasks
Resources
The text was updated successfully, but these errors were encountered: