AG-1395: Bump harmonized targets source version to pick up delimiter fix #152

JessterB · 2024-10-28T18:38:00Z

No description provided.

jaclynbeck-sage

Actually I take back my approval until CI has re-run. I see this may have caused an issue with gene_info but want to let CI run again to make sure.

jaclynbeck-sage

Yeah ok, it looks like something about the new file did something bad to gene_info processing. What is different about the new file?

JessterB · 2024-10-28T19:23:41Z

Yeah ok, it looks like something about the new file did something bad to gene_info processing. What is different about the new file?

@jaclynbeck-sage I converted some semi-colon delimiters to commas, and forgot to escape them. I'll update the source and verify locally before I push up a new commit...sorry, I should have tested this locally first.

jaclynbeck-sage · 2024-10-28T20:06:44Z

Thanks. It looks like something is... off with this file (as well as its previous versions 48 and 47 which is as far back as I looked). The raw file has 1362 lines in it, but if you read it in as a CSV either programmatically or with Excel it only has 1162 lines (versions 48-50). For v47 it as 1334 lines / 1134 lines, so a 200-line discrepancy in both versions.

Update: It looks like starting line 320, The Emory data has line breaks in the Data_used_to_support_target_selection column, which are escaped, so it does get read in successfully as one field. I guess that's ok? The raw file looks like:

320  ...,"Discovery quantitative proteomics of FrCx 
321    WPCNA of multiple and consensus cohorts
322    ANOVA",...

and the read-in string looks like:
"Discovery quantitative proteomics of FrCx \n WPCNA of multiple and consensus cohorts\n ANOVA"

Is this something we're aware of/handle on the front end for display?

JessterB · 2024-10-28T20:17:56Z

Thanks. It looks like something is... off with this file (as well as its previous versions 48 and 47 which is as far back as I looked). The raw file has 1362 lines in it, but if you read it in as a CSV either programmatically or with Excel it only has 1162 lines (versions 48-50). For v47 it as 1334 lines / 1134 lines, so a 200-line discrepancy in both versions.

Update: It looks like starting line 320, The Emory data has line breaks in the Data_used_to_support_target_selection column, which are escaped, so it does get read in successfully as one field. I guess that's ok? The raw file looks like:
320  ...,"Discovery quantitative proteomics of FrCx 
321    WPCNA of multiple and consensus cohorts
322    ANOVA",...
and the read-in string looks like: "Discovery quantitative proteomics of FrCx \n WPCNA of multiple and consensus cohorts\n ANOVA"

Is this something we're aware of/handle on the front end for display?

@jaclynbeck-sage This seems to look ok in Agora in prod today (for the 2 genes I checked, RPH3A & STX1A): mv68 -> gene_info v56 -> harmonized targets v48

...but I could remove those /n since they are confusing and useless.

jaclynbeck-sage · 2024-10-28T20:27:18Z

...but I could remove those /n since they are confusing and useless.

Maybe? I'm guessing the line breaks are to indicate these are 3 separate items, not one single discovery method, so the way it's displaying in Agora right now isn't right because it looks like it's all mashed together in one sentence. I think probably the \n should be replaced with commas but also there's extra spaces around the \n that need to be dealt with in order for commas to look normal.

…8 Emory nominations

sonarcloud · 2024-10-28T23:23:23Z

Quality Gate passed

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarCloud

JessterB · 2024-10-28T23:30:30Z

@jaclynbeck-sage Those newlines go back to v1 of the source file from 2018 and no one has complained yet, but I went ahead and fixed it (in v51) by replacing the newlines with commas, and cleaning up extra spaces.

jaclynbeck-sage

Looks good, thank you!!

AG-1395: Bump harmonized targets source version to pick up delimiter fix

8ccc724

JessterB requested review from jaclynbeck-sage and beatrizsaldana October 28, 2024 18:38

jaclynbeck-sage approved these changes Oct 28, 2024

View reviewed changes

jaclynbeck-sage reviewed Oct 28, 2024

View reviewed changes

jaclynbeck-sage requested changes Oct 28, 2024

View reviewed changes

AG-1395: Bump to source version with quoted strings

c4f5204

AG-1395: Bump source file version to pick up additional commas in 201…

76d171c

…8 Emory nominations

jaclynbeck-sage approved these changes Oct 29, 2024

View reviewed changes

JessterB merged commit 3382947 into dev Oct 29, 2024
9 checks passed

JessterB deleted the AG-1395 branch October 29, 2024 16:45

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

AG-1395: Bump harmonized targets source version to pick up delimiter fix #152

AG-1395: Bump harmonized targets source version to pick up delimiter fix #152

JessterB commented Oct 28, 2024

jaclynbeck-sage left a comment

jaclynbeck-sage left a comment

JessterB commented Oct 28, 2024

jaclynbeck-sage commented Oct 28, 2024

JessterB commented Oct 28, 2024 •

edited

Loading

jaclynbeck-sage commented Oct 28, 2024 •

edited

Loading

sonarcloud bot commented Oct 28, 2024

JessterB commented Oct 28, 2024 •

edited

Loading

jaclynbeck-sage left a comment

AG-1395: Bump harmonized targets source version to pick up delimiter fix #152

AG-1395: Bump harmonized targets source version to pick up delimiter fix #152

Conversation

JessterB commented Oct 28, 2024

jaclynbeck-sage left a comment

Choose a reason for hiding this comment

jaclynbeck-sage left a comment

Choose a reason for hiding this comment

JessterB commented Oct 28, 2024

jaclynbeck-sage commented Oct 28, 2024

JessterB commented Oct 28, 2024 • edited Loading

jaclynbeck-sage commented Oct 28, 2024 • edited Loading

sonarcloud bot commented Oct 28, 2024

Quality Gate passed

JessterB commented Oct 28, 2024 • edited Loading

jaclynbeck-sage left a comment

Choose a reason for hiding this comment

JessterB commented Oct 28, 2024 •

edited

Loading

jaclynbeck-sage commented Oct 28, 2024 •

edited

Loading

JessterB commented Oct 28, 2024 •

edited

Loading