Name | task_name |
jiant |
Downloader | jiant_task_name |
Misc |
---|---|---|---|---|---|
Argument Reasoning Comprehension | arct | ✅ | ✅ | arct | Github |
Abductive NLI | abductive_nli | ✅ | ✅ | abductive_nli | |
SuperGLUE Winogender Diagnostic | superglue_axg | ✅ | ✅ | superglue_axg | SuperGLUE |
Acceptability Definiteness | acceptability_definiteness | ✅ | ✅ | acceptability_definiteness | Function Words |
Acceptability Coord | acceptability_coord | ✅ | ✅ | acceptability_coord | Function Words |
Acceptability EOS | acceptability_eos | ✅ | ✅ | acceptability_eos | Function Words |
Acceptability WH Words | acceptability_whwords | ✅ | ✅ | acceptability_whwords | Function Words |
Adversarial NLI | adversarial_nli_{round} |
✅ | ✅ | adversarial_nli | 3 rounds |
ARC ("easy" version) | arc_easy | ✅ | ✅ | arc_easy | site |
ARC ("challenge" version) | arc_challenge | ✅ | ✅ | arc_challenge | site |
BoolQ | boolq | ✅ | ✅ | boolq | SuperGLUE |
BUCC2018 | bucc2018_{lang} |
✅ | ✅ | bucc2018 | XTREME, multi-lang |
CommitmentBank | cb | ✅ | ✅ | cb | SuperGLUE |
CCG | ccg | ✅ | ccg | ||
CoLA | cola | ✅ | ✅ | cola | GLUE |
CommonsenseQA | commonsenseqa | ✅ | ✅ | commonsenseqa | |
EP-Const | nonterminal | ✅ | nonterminal | Edge-Probing | |
COPA | copa | ✅ | ✅ | copa | SuperGLUE |
EP-Coref | coref | ✅ | coref | Edge-Probing | |
Cosmos QA | cosmosqa | ✅ | ✅ | cosmosqa | |
EP-UD | dep | ✅ | dep | Edge-Probing | |
EP-DPR | dpr | ✅ | dpr | Edge-Probing | |
Fever NLI | fever_nli | ✅ | ✅ | fever_nli | |
GLUE Diagnostic | glue_diagnostics | ✅ | ✅ | glue_diagnostics | GLUE |
HellaSwag | hellaswag | ✅ | ✅ | hellaswag | |
MCScript2.0 | mcscript | ✅ | mcscript | data | |
MCTACO | mctaco | ✅ | ✅ | mctaco | |
MCTest | mctest160 or mctest500 | ✅ | ✅ | mctest160 or mctest600 | data |
MLM | * | ✅ | * | mlm_simple | See task-specific notes. |
MLQA | mlqa_{lang1}_{lang2} |
✅ | ✅ | mlqa | XTREME, multi-lang |
MNLI | mnli | ✅ | ✅ | mnli | GLUE, MNLI-matched |
MNLI-mismatched | mnli_mismatched | ✅ | ✅ | mnli_mismatched | GLUE |
MRPC | mrpc | ✅ | ✅ | mrpc | GLUE |
MultiRC | multirc | ✅ | ✅ | multirc | SuperGLUE |
Mutual (standard version) | mutual | ✅ | ✅ | mutual | site |
Mutual ("challenge" version) | mutual_plus | ✅ | ✅ | mutual_plus | site |
Natural Questions | mrqa_natural_questions | ✅ | ✅ | mrqa_natural_questions | MRQA version of task |
NewsQA | newsqa | ✅ | ✅ | newsqa | |
PIQA | piqa | ✅ | ✅ | piqa | PIQA |
QAMR | qamr | ✅ | ✅ | qamr | |
QA-SRL | qasrl | ✅ | ✅ | qasrl | |
QuAIL | quail | ✅ | ✅ | quail | site |
Quoref | quoref | ✅ | ✅ | quoref | |
EP-NER | ner | ✅ | ner | Edge-Probing | |
PAWS-X | pawsx_{lang} |
✅ | ✅ | pawsx | XTREME, multi-lang |
WikiAnn | panx_{lang} |
✅ | ✅ | panx | XTREME, multi-lang |
EP-POS | pos | ✅ | pos | Edge-Probing | |
QNLI | qnli | ✅ | ✅ | qnli | GLUE |
QQP | qqp | ✅ | ✅ | qqp | GLUE |
ROPES | ropes | ✅ | ✅ | ropes | |
RACE | race | ✅ | ✅ | race | race , race_middle , race_high |
ReCord | record | ✅ | ✅ | record | SuperGLUE |
RTE | rte | ✅ | ✅ | rte | GLUE, SuperGLUE |
SciTail | scitail | ✅ | ✅ | scitail | |
SentEval: Bigram Shift | senteval_bigram_shift | ✅ | ✅ | senteval_bigram_shift | SentEval |
SentEval: Coord Inversion | senteval_coordination_inversion | ✅ | ✅ | senteval_coordination_inversion | SentEval |
SentEval: Obj number | senteval_obj_number | ✅ | ✅ | senteval_obj_number | SentEval |
SentEval: Odd Man Out | senteval_odd_man_out | ✅ | ✅ | senteval_odd_man_out | SentEval |
SentEval: Past-Present | senteval_past_present | ✅ | ✅ | senteval_past_present | SentEval |
SentEval: Sentence Length | senteval_sentence_length | ✅ | ✅ | senteval_sentence_length | SentEval |
SentEval: Subj Number | senteval_subj_number | ✅ | ✅ | senteval_subj_number | SentEval |
SentEval: Top Constituents | senteval_top_constituents | ✅ | ✅ | senteval_top_constituents | SentEval |
SentEval: Tree Depth | senteval_tree_depth | ✅ | ✅ | senteval_tree_depth | SentEval |
SentEval: Word Content | senteval_word_content | ✅ | ✅ | senteval_word_content | SentEval |
EP-Rel | semeval | ✅ | semeval | Edge-Probing | |
SNLI | snli | ✅ | ✅ | snli | |
SocialIQA | socialiqa | ✅ | ✅ | socialiqa | |
EP-SPR1 | spr1 | ✅ | spr1 | Edge-Probing | |
EP-SPR2 | spr2 | ✅ | spr2 | Edge-Probing | |
SQuAD 1.1 | squad_v1 | ✅ | ✅ | squad | |
SQuAD 2.0 | squad_v2 | ✅ | ✅ | squad | |
EP-SRL | srl | ✅ | srl | Edge-Probing | |
SST-2 | sst | ✅ | ✅ | sst | GLUE |
STS-B | stsb | ✅ | ✅ | stsb | GLUE |
SuperGLUE Broad Coverage Diagnostic | superglue_axb | ✅ | ✅ | superglue_axb | SuperGLUE |
SWAG | swag | ✅ | ✅ | swag | |
Tatoeba | tatoeba_{lang} |
✅ | ✅ | tatoeba | XTREME, multi-lang |
TyDiQA | tydiqa_{lang} |
✅ | ✅ | tydiqa | XTREME, multi-lang |
UDPOS | udpos_{lang} |
✅ | ✅ | udpos | XTREME, multi-lang |
WiC | wic | ✅ | ✅ | wic | SuperGLUE |
Winogrande | winogrande | ✅ | ✅ | winogrande | |
WNLI | wnli | ✅ | ✅ | wnli | GLUE |
WSC | wsc | ✅ | ✅ | wsc | SuperGLUE |
XNLI | xnli_{lang} |
✅ | ✅ | xnli | XTREME, multi-lang |
XQuAD | xquad_{lang} |
✅ | ✅ | xquad | XTREME, multi-lang |
task_name
: Name-by-convention, used by downloader, and used inJiantModel
to map from task names to task-models. You can change this as long as your settings are internally consistent.jiant
: Whether it's supported injiant
(i.e. you can train/eval on it)- Downloader: Whether you can download using the downloader.
jiant_task_name
: Used to determine the programmatic behavior for the task (how to tokenize, what kind of task-model is compatible). Is tied directly to the code. See:jiant.tasks.retrieval
.