Preprocessed units for the segments #4

mutiann · 2023-08-28T21:12:48Z

Hello!

I'm recently doing some experiments on NMSQA, and the code for DUAL provided here are really helpful! While I encountered some difficulty building the units using scripts provided to reproduce the results. Particularly, I'm trying to extract the units for each segment of context, while the preprocessed ones currently provided in the repo are already concatenated for each article following the standard QA scheme (using the merge_passage.py, I guess). May I know if the preprocessed units for each segment could be provided?

Thank you!

P.S. Just seen you and had some chat on Interspeech at the poster. The work was really impressive and useful for us :)

mutiann · 2023-08-28T21:44:40Z

(Or do you have any number for the performance of DUAL with the view only on each segment?)

DanielLin94144 · 2023-08-29T05:40:48Z

Hi @mutiann ! Thanks for the question. Here is the segment hubert units from hubert large 22-th layer with 128 number of clusters.
google drive link

mutiann · 2023-08-31T09:17:09Z

Thank you very much! Let me have a look at them.

mutiann · 2023-08-31T13:08:31Z

Thank you very much! Actually I am looking for the HuBERT units for each segment (for, e.g., context-0_0_1, context-0_0_2, ...), while it seems that the provided units above and in the README are for each paragraph as in standard SQuAD (e.g. context-0_0, context-0_1, context-0_2, ...) that are merged from those from each segment. May I know if those for the each segment could be provided?

BTW, out of curiousity, have you tried to have any experiment that works on each segment instead of having a view on the paragraph?

Thanks in advance!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Preprocessed units for the segments #4

Preprocessed units for the segments #4

mutiann commented Aug 28, 2023

mutiann commented Aug 28, 2023

DanielLin94144 commented Aug 29, 2023 •

edited

Loading

mutiann commented Aug 31, 2023

mutiann commented Aug 31, 2023

Preprocessed units for the segments #4

Preprocessed units for the segments #4

Comments

mutiann commented Aug 28, 2023

mutiann commented Aug 28, 2023

DanielLin94144 commented Aug 29, 2023 • edited Loading

mutiann commented Aug 31, 2023

mutiann commented Aug 31, 2023

DanielLin94144 commented Aug 29, 2023 •

edited

Loading