Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Enforcing POS comes first #5

Open
yuvalpinter opened this issue Feb 6, 2022 · 1 comment
Open

Enforcing POS comes first #5

yuvalpinter opened this issue Feb 6, 2022 · 1 comment

Comments

@yuvalpinter
Copy link

Hi!

As far as I can tell, the UniMorph convention for attribute ordering is that the POS is the first entry before the first ;. Nothing in the code I could see enforces this, and indeed when operating on a conllu file a lot of the POSs appear in random locations of the string (when it appears at all), making the tag difficult to process downstream. Can this be fixed?

Sample Input (en_gum treebank, l.152):
9 years year NOUN NNS Number=Plur 7 nmod 7:nmod:to Entity=23)|SpaceAfter=No

Output:
9 years year NOUN NNS PL;N 7 nmod 7:nmod:to Entity=23)|SpaceAfter=No

Thanks,

  • Yuval
@kylebgorman
Copy link

This would be a post-processing step but this should help with that. (I wonder how up to date it is, but it worked when I wrote it.)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants