Using DIA-NN results #11

Maithy15 · 2023-03-24T12:56:58Z

Hi,

Thanks for the nice tool. It would be great if py_diaid could use DIA-NN output as well.

Thanks
Maithy

Cajun-data · 2024-11-05T00:34:51Z

This appears to have been added in the latest version of pyDIAID, however it isn't clear to me which file pyDIAID is expecting from DIA-NN. Furthermore, I am not certain pYDIAID is compatible with DIA-NN 1.9.2 output at the moment. It would be great if a developer could chime in on some of these questions related to using DIA-NN output.

Cajun-data · 2024-11-05T03:57:13Z

There are some clues in the loader_proteomics_library.py function. Specifically:

dataframe (pd.DataFrame): imported library file from the analysis software
"DIANN". File format: .csv, required columns:
'PrecursorMz',
'IonMobility',
'PrecursorCharge',
'ProteinName',
'ModifiedPeptide'.

Therefore, in R (my preferred language) I can convert a .parquet DIA library to these specifications.

###Convert parquet to csv/tsv

library(arrow)
library(tidyverse)

# Load the Parquet file
df <- read_parquet("DIA_Library.parquet")

#
#"DIANN". File format: .csv, required columns: 
#  'PrecursorMz',
#'IonMobility',
#'PrecursorCharge',
#'ProteinName',
#'ModifiedPeptide'.

df <- df %>%
  rename(IonMobility = IM,
         PrecursorMz = Precursor.Mz,
         PrecursorCharge = Precursor.Charge,
         ProteinName = Protein.Names,
         ModifiedPeptide = Modified.Sequence) %>%
  select(PrecursorMz,IonMobility,PrecursorCharge,
         ProteinName, ModifiedPeptide) %>%
  distinct(ModifiedPeptide, PrecursorCharge, .keep_all = T)

# Save as CSV
write.csv(df, "DIA_Library.csv", row.names = FALSE)

Maithy15 · 2024-11-05T07:02:04Z

Did the converted format work for you?

Cajun-data · 2024-11-05T16:24:52Z

Did the converted format work for you?

Yep - the above code works for me. Just make sure to use the right parquet file since there are a few that typically show up in the output. I used the one that corresponds to an experimentally-derived spectral library.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Using DIA-NN results #11

Using DIA-NN results #11

Maithy15 commented Mar 24, 2023

Cajun-data commented Nov 5, 2024

Cajun-data commented Nov 5, 2024

Maithy15 commented Nov 5, 2024

Cajun-data commented Nov 5, 2024

Using DIA-NN results #11

Using DIA-NN results #11

Comments

Maithy15 commented Mar 24, 2023

Cajun-data commented Nov 5, 2024

Cajun-data commented Nov 5, 2024

Maithy15 commented Nov 5, 2024

Cajun-data commented Nov 5, 2024