Code for 2024 EMNLP Findings
SynTQA: Synergistic Table-based Question Answering via Mixture of Text-to-SQL and E2E TQA
Under Construction ...
run.py
is the main code to finetune and inference pretrained T5 and Omnitab models. classifier.py
train the random forest classier to select more correct answer from predictions (i.e., csv files in predict
folder) from end-to-end table QA model and Text-to-SQL model. A data
folder should be prepared in the root dir which clones from Squall
and WikiSQL
projects.
For any issues or questions, kindly email us at: Siyue Zhang ([email protected]).
@inproceedings{zhang-etal-2024-syntqa,
title = "{S}yn{TQA}: Synergistic Table-based Question Answering via Mixture of Text-to-{SQL} and {E}2{E} {TQA}",
author = "Zhang, Siyue and
Luu, Anh Tuan and
Zhao, Chen",
editor = "Al-Onaizan, Yaser and
Bansal, Mohit and
Chen, Yun-Nung",
booktitle = "Findings of the Association for Computational Linguistics: EMNLP 2024",
month = nov,
year = "2024",
address = "Miami, Florida, USA",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2024.findings-emnlp.131",
pages = "2352--2364",
abstract = "Text-to-SQL parsing and end-to-end question answering (E2E TQA) are two main approaches for Table-based Question Answering task. Despite success on multiple benchmarks, they have yet to be compared and their synergy remains unexplored. In this paper, we identify different strengths and weaknesses through evaluating state-of-the-art models on benchmark datasets: Text-to-SQL demonstrates superiority in handling questions involving arithmetic operations and long tables; E2E TQA excels in addressing ambiguous questions, non-standard table schema, and complex table contents. To combine both strengths, we propose a Synergistic Table-based Question Answering approach that integrate different models via answer selection, which is agnostic to any model types. Further experiments validate that ensembling models by either feature-based or LLM-based answer selector significantly improves the performance over individual models.",
}