
[FEATURE] Trace Question Answering models to TorchScript and Onnx format #304

Open

dhrubo-os opened this issue Sep 27, 2023 · 5 comments

Labels: enhancement (New feature or request), good first issue (Good for newcomers)

dhrubo-os (Collaborator) commented Sep 27, 2023

We are planning to add support for more models in ML-Commons: opensearch-project/ml-commons#1164

The target of this issue is to trace three popular pre-trained question answering models to TorchScript and ONNX format. In this repo we have already traced pre-trained sentence embedding models into TorchScript and ONNX.

We need to build a similar method to trace question answering models. Primarily we can target these models:

  1. distilbert-base-cased-distilled-squad
  2. distilbert-base-uncased-distilled-squad
  3. bert-large-uncased-whole-word-masking-finetuned-squad
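As a minimal sketch of the tracing step, the snippet below runs `torch.jit.trace` on a tiny stand-in module shaped like a QA head (start/end logits per token). In the real flow you would instead load one of the models above via `transformers.AutoModelForQuestionAnswering.from_pretrained(...)` and trace it the same way; the stand-in just keeps the example self-contained.

```python
import torch
import torch.nn as nn

# Stand-in for a Hugging Face QA model: real code would load e.g.
# AutoModelForQuestionAnswering.from_pretrained("distilbert-base-cased-distilled-squad")
# and trace it the same way (QA models emit start/end logits per token).
class TinyQAModel(nn.Module):
    def __init__(self, vocab_size=100, hidden=16):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, hidden)
        self.qa_outputs = nn.Linear(hidden, 2)  # start/end logits

    def forward(self, input_ids, attention_mask):
        logits = self.qa_outputs(self.embed(input_ids))
        start_logits, end_logits = logits.split(1, dim=-1)
        # Push padding positions to -inf-ish so they can't be the answer span.
        mask = (1.0 - attention_mask.float()).unsqueeze(-1) * -1e4
        return (start_logits + mask).squeeze(-1), (end_logits + mask).squeeze(-1)

model = TinyQAModel().eval()
input_ids = torch.randint(0, 100, (1, 12))
attention_mask = torch.ones(1, 12, dtype=torch.long)

# torch.jit.trace records the ops executed for the example inputs.
traced = torch.jit.trace(model, (input_ids, attention_mask))
traced.save("qa_model.pt")

reloaded = torch.jit.load("qa_model.pt")
start, end = reloaded(input_ids, attention_mask)
```

Note that tracing fixes the control flow seen for the example inputs; for real Hugging Face models, passing `return_dict=False` (or tracing with `strict=False`) is typically needed because their default dict outputs are not traceable.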

I created a feature branch: feature/summarization_model_conversation/. All development for this issue should be done in that branch.

@dhrubo-os added the enhancement (New feature or request) and good first issue (Good for newcomers) labels and removed the untriaged label on Sep 27, 2023
faradawn commented Oct 12, 2023

[2023-10-11] Hi, I would like to tackle this issue.

I plan to create a script for a DistilBERT model, similar to the existing Sentence Transformer one:

opensearch-py-ml/opensearch_py_ml/ml_models/sentencetransformermodel.py (existing)
opensearch-py-ml/opensearch_py_ml/ml_models/distilbertmodel.py (new)

Specifically, I plan to:

  1. Download a DistilBERT model.
  2. Create save_to_pt and to_onnx functions utilizing torch.jit.trace.
  3. In the future, create a train function if needed.

Should I push directly to the feature/summarization_model_conversation/ branch, given that the assignee of issue #303 is also pushing to it?

If there is anything I can do, please let me know.

[2023-10-12] I would like to select the "cased" model.

Among the three models, only the "cased" model answered my questions correctly.

Test 1

Context: I like hot drinks. Tea is hot. Coke is cold.
Question: Which one will I pick?

distilbert-base-cased-distilled-squad -> Tea.
distilbert-base-uncased-distilled-squad -> Coke.
bert-large-uncased-whole-word-masking-finetuned-squad -> Coke.

Test 2

Context: I live in Beijing, China. People in China speak Chinese. People in U.S. speak English. 
Question: What language do I speak?

distilbert-base-cased-distilled-squad -> Chinese.
distilbert-base-uncased-distilled-squad -> english.
bert-large-uncased-whole-word-masking-finetuned-squad -> english.

dhrubo-os (Collaborator, Author) commented

Sure, assigning it to you.

dhrubo-os (Collaborator, Author) commented

Maybe we can create a question answering model class and work in there? distilbertmodel is one type of model, so I don't think we should create a separate class just for distilbertmodel.

dhrubo-os (Collaborator, Author) commented

Yeah, you can raise the PR against the feature/summarization_model_conversation/ branch too.

faradawn commented

Hi @dhrubo-os,

Got it -- makes sense! I will 1) create a question_answering.py model class, and 2) raise the PR in that branch!

Thanks,
Faradawn
