BERT implementation #43

jbarrow · 2023-12-08T10:16:00Z

Added and documented the BERT implementation. It currently uses the HF tokenizer, because that handles a lot of nice things (like padding, masking, and token_type_ids).

It's tested against the HuggingFace implementation at: batch sizes >= 1, different models (bert-base-uncased, bert-base-cased, bert-large-uncased, bert-large-cased), etc.

awni

This is really nicely done!! I love it. I left some mostly minor comments.

Please address and then we can merge it into the examples.

Also could you add a requirements.txt for dependenies?

bert/README.md

bert/model.py

bert/convert.py

jbarrow · 2023-12-09T15:53:24Z

Outside of anything that requires a change to MLX core, I think I've made the changes.

I'll put off removing the MultiHeadAttention class until I've made that change, it's been pulled into MLX, and there's been a released version. 😄

jbarrow · 2023-12-09T16:50:07Z

Submitted and tested a PR for the bias change in mlx. I can submit a new PR here once it's merged/released.

awni

Ok, just two more comments, then we can merge this and follow up after the change to core multiheadedattention.

bert/README.md

bert/model.py

…and fixing code type

awni

Brilliant, thank you for putting this together!

BERT implementation

jbarrow added 2 commits December 8, 2023 05:14

BERT implementation

4e5b8ce

Update README for mlx-examples repo

e05ee57

awni self-requested a review December 8, 2023 15:24

awni reviewed Dec 9, 2023

View reviewed changes

bert/model.py Show resolved Hide resolved

awni reviewed Dec 9, 2023

View reviewed changes

bert/convert.py Outdated Show resolved Hide resolved

jbarrow added 3 commits December 9, 2023 10:41

Cleaning implementation for merge

7320456

Updating README

45ca4ed

Requirements for running BERT

20d920a

awni reviewed Dec 9, 2023

View reviewed changes

bert/README.md Outdated Show resolved Hide resolved

bert/model.py Outdated Show resolved Hide resolved

Updating README for current example, making python>=3.8 compatibile, …

d873e10

…and fixing code type

awni approved these changes Dec 9, 2023

View reviewed changes

awni merged commit 46c6bbe into ml-explore:main Dec 9, 2023

Blaizzy pushed a commit to Blaizzy/mlx-examples that referenced this pull request Mar 13, 2024

Merge pull request ml-explore#43 from jbarrow/main

07cdcef

BERT implementation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

BERT implementation #43

BERT implementation #43

jbarrow commented Dec 8, 2023

awni left a comment

jbarrow commented Dec 9, 2023

jbarrow commented Dec 9, 2023

awni left a comment

awni left a comment

BERT implementation #43

BERT implementation #43

Conversation

jbarrow commented Dec 8, 2023

awni left a comment

Choose a reason for hiding this comment

jbarrow commented Dec 9, 2023

jbarrow commented Dec 9, 2023

awni left a comment

Choose a reason for hiding this comment

awni left a comment

Choose a reason for hiding this comment