Skip to content

Commit

Permalink
Merge pull request #86 from Sage-Bionetworks/AG-1196
Browse files Browse the repository at this point in the history
AG-1196: Sort gene_info.biodomains to generate consistent ordering
  • Loading branch information
JessterB authored Aug 18, 2023
2 parents dbf3475 + e99eded commit 046f451
Showing 1 changed file with 3 additions and 1 deletion.
4 changes: 3 additions & 1 deletion src/agoradatatools/etl/transform/gene_info.py
Original file line number Diff line number Diff line change
Expand Up @@ -80,6 +80,9 @@ def transform_gene_info(
.rename(columns={"biodomain": "biodomains"})
)

# sort biodomains list alphabetically
biodomains['biodomains'] = biodomains['biodomains'].apply(sorted)

# For genes with either is_adi or is_tep set to True, create a resource URL that opens
# the portal page to the specific gene. This must be done using the hgnc_symbol from the
# tep_info file and not the symbol in gene_info, because there are some mismatches
Expand All @@ -95,7 +98,6 @@ def transform_gene_info(
)

# Merge all the datasets

gene_info = gene_metadata

for dataset in [
Expand Down

0 comments on commit 046f451

Please sign in to comment.