Some plan #83
The new tokenizer API (using TextEncodeBase) is basically finished and included in the 0.1.16 release, though the GPT part is ignored for now. For the next step, I will be fixing the huggingface download issue with HuggingFaceApi.jl. Rewriting the attention layer might be breaking, so that will probably be the last one to do. Some other issues that might also need to be tracked: …

@chengchingwen
@MNLubov Are you looking for a specific model from HuggingFace? I'm trying to fix the huggingface module this month, so if everything goes well, it should be workable again before August. Just to clarify: even if the huggingface module is fixed, it's still possible that we don't have an implementation for that model type (by model type, I mean something like …
@chengchingwen Thanks for the clarification. Currently I am testing different sentence-transformers models from HuggingFace; as a temporary solution, I use PyCall to find the one most suitable for my purposes.
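That temporary PyCall route could look roughly like the sketch below. This is only an illustration, assuming PyCall.jl is configured against a Python environment that already has the `sentence-transformers` package installed; the model id is just one example.

```julia
# Hedged sketch: drive the Python sentence-transformers package from Julia
# via PyCall. Requires a Python env with `sentence-transformers` installed.
using PyCall

st = pyimport("sentence_transformers")
model = st.SentenceTransformer("sentence-transformers/all-MiniLM-L12-v2")

# `encode` returns one embedding per input sentence (one row per sentence)
emb = model.encode(["first candidate sentence", "second candidate sentence"])
```

From here the embeddings can be compared with cosine similarity on the Julia side to pick the best model.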
@MNLubov Yes. I haven't investigated the sentence-transformers implementation, but it seems it can also be done with the normal huggingface interface. Take this one, https://huggingface.co/sentence-transformers/all-MiniLM-L12-v2: it's a …
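For reference, the extra step such sentence-transformers models add on top of a plain BERT encoder is essentially attention-mask-aware mean pooling of the last hidden states. A self-contained sketch in plain Julia (illustrative names only, not Transformers.jl API):

```julia
# Mean pooling over token hidden states, ignoring padding positions.
# `hidden` is hidden_dim × seq_len; `mask` is 1.0 for real tokens, 0.0 for padding.
function mean_pool(hidden::AbstractMatrix, mask::AbstractVector)
    summed = hidden * mask                       # sum of the non-padded token vectors
    summed ./ max(sum(mask), one(eltype(mask)))  # divide by the real-token count
end

hidden = [1.0 3.0 100.0;
          2.0 4.0 100.0]   # 2-dim states for 3 tokens; the 3rd token is padding
mask = [1.0, 1.0, 0.0]
mean_pool(hidden, mask)    # → [2.0, 3.0]
```

The padding column contributes nothing to the sentence embedding because its mask entry zeroes it out before the average is taken.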
Here is some stuff I'm going to rewrite for the new release:

- Replace `Basic.Vocabulary` with `TextEncodeBase.Vocab`.
- Use StructWalk.jl to transform the `state_dict`.
- Remove the `Pretrain` submodule and use the huggingface one.

Feel free to add comments.
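The StructWalk.jl part of that plan might look roughly like this. It is only a sketch of the idea (walking a loaded `state_dict` and rewriting its array leaves, e.g. transposing 2-D weights into Julia's column-major convention); I'm assuming a `postwalk(f, x)` entry point, and the exact walk style StructWalk.jl expects may differ between versions.

```julia
# Hedged sketch: walk a state_dict and rewrite its array leaves.
# Assumes StructWalk.jl provides `postwalk(f, x)`; details may vary by version.
using StructWalk

state_dict = Dict(
    "encoder.layer.0.weight" => rand(Float32, 3, 4),
    "encoder.layer.0.bias"   => rand(Float32, 3),
)

converted = postwalk(state_dict) do x
    x isa AbstractMatrix ? permutedims(x) : x   # only touch 2-D weight arrays
end
```

The appeal of a structural walk here is that the same transformation applies uniformly however deeply the tensors are nested, without hand-writing per-model conversion code.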