- Look at article for notes and references
Steps to generate the article
pwd # make sure you are in the equivalent of the path below
/projects/VaidhyaMegha/vaidhyamegha-knowledge-graphs/docs/open_knowledge_graph_on_clinical_trials
rm -f out.*
docker run --rm -v "$(pwd):/data" -u "$(id -u)" pandocscholar/alpine
xdg-open out.pdf
- What are knowledge graphs
Below is a brief specification
- Inputs
- Outputs
- Clinical Trials RDF with the constituent IDs below and their relationships
- MeSH, Clinical Trial, PubMed Article, Symptom/Phenotype, Genotype (from Human Genome)
- Additionally, clinical trial -> clinical trial links across trial registries will also be discovered and added.
- Clinical Trials RDF with the constituent IDs below and their relationships
- Input : one or more of these IDs.
- Output : { input : { id_type : xxx, key : key1, value : value1 }, output : [ { id_type : xxx, key : key1, value : value1 } ] }
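
The input/output shape above can be sketched in Java as follows. This is a minimal illustration only, not the project's API: the class name `LinkedIdResponse`, the `id_type` and `key` values, and the identifier values are placeholders chosen for the example, and the JSON is assembled by hand on the assumption that values contain no characters needing escaping.

```java
import java.util.List;
import java.util.stream.Collectors;

// Hypothetical sketch of the { input : {...}, output : [ {...} ] } shape from the spec.
class LinkedIdResponse {

    /** One identifier; field names mirror the spec (id_type, key, value). */
    static final class Id {
        final String idType;
        final String key;
        final String value;

        Id(String idType, String key, String value) {
            this.idType = idType;
            this.key = key;
            this.value = value;
        }

        // Assumes the values need no JSON escaping; a real service would use a JSON library.
        String toJson() {
            return String.format(
                "{\"id_type\":\"%s\",\"key\":\"%s\",\"value\":\"%s\"}", idType, key, value);
        }
    }

    /** Renders one input id together with the list of ids linked to it. */
    static String toJson(Id input, List<Id> output) {
        String items = output.stream().map(Id::toJson).collect(Collectors.joining(","));
        return "{\"input\":" + input.toJson() + ",\"output\":[" + items + "]}";
    }

    public static void main(String[] args) {
        // Placeholder identifiers, not real records.
        Id trial = new Id("clinical_trial", "trial_id", "TRIAL-PLACEHOLDER");
        List<Id> linked = List.of(
            new Id("mesh", "mesh_id", "MESH-PLACEHOLDER"),
            new Id("pubmed_article", "pubmed_id", "PMID-PLACEHOLDER"));
        System.out.println(toJson(trial, linked));
    }
}
```

Keeping `input` and every `output` entry in the same three-field form means a caller can feed any returned entry back in as the next input.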
- ACMG
- HSDN supplementary data files "Combined-Input.tsv", "Symptom-Occurence-Input.tsv", and "Disease-Occurence-Input.tsv" were taken as input, and new output files that also include the MeSH IDs were created.
- Analysing the bipartite symptom-to-disease network: instead of the supplementary data files from HSDN, files retrieved from LeoBman/HSDN (above) were used. MeSH diseases were then mapped to the Disease Ontology diseases used in Hetionet v1.0. Ultimately, no data from HSDN was used in Hetionet; instead, symptom–disease relationships re-extracted from MRCOC (MEDLINE topic co-occurrence) were used.
- Comparison of hetio/medline to MRCOC: MEDLINE produces co-occurrence files under the codename MRCOC. More information is available in the 2016 report Building an Updated MEDLINE Co-Occurrences (MRCOC) File. These files might be a viable alternative to the analyses in this repository for certain applications. However, they don't appear to contain topics for supplemental concept records (for example MeSH term C000591739). Feel free to open an issue with additional insights on or comparisons to MRCOC.
- Look at References section in Article.md. Additional references are below.
- Introduction to knowledge graphs
- What is PheGenI
- PheGenI: The Phenotype-Genotype Integrator demo
- Downstream analysis of PheGenI results demo
- Download
- Read RDF files
- Write RDF files
- Querying a model
- SPARQL
- Index RDF files
- Query RDF files using indexes
- Count triples query
- Execute SPARQL query programmatically
- Configure Java heap size and, if needed, stack size
- Invoke Entrez API using Spring webclient
- Handling XML response
- JAXB implementation in Java 11
- JAXB marshalling objects in lists
- Multiple AutoCloseable resources in a Java try-with-resources statement
- Escape pipe character in grep
- Other older relevant links on Java and Entrez
- download-pubmed-abstracts-in-java
- GitHub repo : pubmed_ws_client
- Read file into a string
- PostgreSQL array columns
- JDBC insert into array columns
- PostgreSQL date column with default value
- PostgreSQL upsert statement
- PostgreSQL - pg_restore - restore only one selected schema
- PostgreSQL - Array functions
- Execute query on PostgreSQL using psql non-interactively
- Save psql inline query output to a file
- In PostgreSQL formulate a query to get all items in an array column
- Tutorial for using GraphQL with Postman
- GraphQL queries from Postman to Neo4j.
- Discussion on SPARQL for Neo4j.
- Suggests it's feasible to execute a SPARQL 'get' query from Jena to Neo4j using the n10s plugin.
- GraphQL's integration into Apache Superset.
- Subsequent fork with commits from the above commenter - graphadvantage - to address this need.
- REST API of Superset with a comment on GraphQL.
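
Two of the Java topics in the list above, try-with-resources with more than one resource and reading a file into a string, can be combined in one short sketch. It uses only the standard library and assumes Java 11 or later; the class name `LineCopy`, the method `copyLines`, and the temporary files are hypothetical names for illustration and are not taken from this repository.

```java
import java.io.BufferedReader;
import java.io.BufferedWriter;
import java.io.IOException;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;

// Hypothetical example class; not part of the project code.
class LineCopy {

    /** Copies source to target line by line and returns the number of lines copied. */
    static int copyLines(Path source, Path target) throws IOException {
        int count = 0;
        // Both resources are declared in one try statement, separated by a semicolon.
        // They are closed automatically in reverse order: writer first, then reader.
        try (BufferedReader reader = Files.newBufferedReader(source, StandardCharsets.UTF_8);
             BufferedWriter writer = Files.newBufferedWriter(target, StandardCharsets.UTF_8)) {
            String line;
            while ((line = reader.readLine()) != null) {
                writer.write(line);
                writer.newLine();
                count++;
            }
        }
        return count;
    }

    public static void main(String[] args) throws IOException {
        Path source = Files.createTempFile("ids", ".txt");
        Path target = Files.createTempFile("ids-copy", ".txt");
        Files.writeString(source, "first line\nsecond line\n");

        int copied = copyLines(source, target);

        // Files.readString (Java 11+) loads the whole file into a String.
        String content = Files.readString(target);
        System.out.println(copied + " lines copied:");
        System.out.print(content);
    }
}
```

The same pattern applies to any pair of `AutoCloseable` resources, for example a JDBC `Connection` and `PreparedStatement`.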