Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

handle character variants when loading data #7

Open
thatbudakguy opened this issue Jan 22, 2022 · 1 comment
Open

handle character variants when loading data #7

thatbudakguy opened this issue Jan 22, 2022 · 1 comment
Labels
enhancement New feature or request

Comments

@thatbudakguy
Copy link
Member

see https://github.com/direct-phonology/core/blob/6a800a3201de43c039a6f7f096aef3a65a843922/core/bin/gentable.py#L84-L134

@thatbudakguy
Copy link
Member Author

thatbudakguy commented Feb 10, 2022

see also spacy's own docs on data augmentation which lets you swap variants to get a more robust training process. our existing variant file could be converted into this format pretty easily, and then we could use the orth_variants augmenter or define our own.

@thatbudakguy thatbudakguy changed the title handle character variants when annotating handle character variants when loading data Feb 20, 2022
@thatbudakguy thatbudakguy transferred this issue from another repository May 12, 2022
@thatbudakguy thatbudakguy added the enhancement New feature or request label May 12, 2022
thatbudakguy added a commit that referenced this issue Aug 2, 2022
thatbudakguy added a commit that referenced this issue Aug 2, 2022
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant