-
Notifications
You must be signed in to change notification settings - Fork 41
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
implement Polarity=Neg #526
Comments
GUM appears to implement this by rule. Breakdown of results:
Some false positives:
|
Do we like this feature with more Polarity in the treebanks? PUD has |
@AngledLuffa Sorry I don't understand the question |
are we adding Polarity to more words in more treebanks? in PUD, for example, should I add Polarity=Neg to the examples I listed? (which are all Neg in GUM) |
I guess it would be nice, since the guidelines for Polarity do mention that it can apply in a language to a mixture of bound affixes and function words. I wonder if "without" should also be included (as the negation of "with"). |
from GUM
|
unfortunately? unborn? uncovering_VERB? not this one? unassuming? unearthed? i vote not this one either undisputed? this looks weird
unemployment? probably not underground? could break it up into un-der-ground |
double check the linked change please? |
As I suggested above (but waiting for @amir-zeldes's input), I would not use Polarity on any VERBs, and would omit the under- ones. I don't have a problem with unborn, unfortunately, unassuming, undisputed, unemployment as their meaning is more or less 'not'/'no' + stem. |
My proposed change to PUD doesn't have any under- or any VERB |
Thanks for catching, will fix.
I'm open to this, but would like to understand the reasoning better. Etymologically, verbal "un" is indeed distinct, but in the context of synchronic English, I don't see a big difference between "uncovered" as a participle and "uncover" as a verb, which means 'make not covered'. Are we sure VERBs can't be negative in that sense? Do we remove the feature on uncovered/VBN/VERB but keep it if we have the same structure on "unbaked", because there is no lemma "unbake" and we are forced to tag JJ/ADJ?
Nice catch! 🤣 |
For a dynamic event, I would expect "uncovered" as an adjective can mean 'not covered' (negation of adjective), or a participle derived from 'uncover' can mean 'uncovering has happened'. It's a structural ambiguity of the morphology: un[[cover]ed] vs. [un[cover]]ed. So I would not assume the two uses of "un-", one of which takes a verb stem and one of which takes an adjective stem, are the same in terms of morphological features. |
I'm not sure about this from a typological perspective...
Negations can have all sorts of aspects - "not" and "never" both negate verbs and are Polarity=Neg, but "never" means something more than just "X doesn't happen". Similarly if you have a modal, you get various scope readings on the verbal complex (must [not happen] vs. [must not] happen - the default reading is flipped for German <> English, which can lead to hilarious misunderstandings). I thought "Neg" just means "some kind of negation", which means different things for "no one", "nowhere", "not"...
It's not unusual for negative morphology to be unproductive or limited. In some languages it's limited just to a special form of the copula (e.g. Church Slavonic), or just to existence predicates (Arabic/Hebrew/Coptic lexical negative existentials). I don't think that's a criterion for what carries this feature.
Yes, I agree it's ambiguous and not the same, but I'm not sure that means that one of them is not Polarity=Neg. I guess the question is how broad we want Polarity to be? I could certainly imagine someone would be interested to know which verbs do this, and would discover for example that "rain" doesn't do it in English. But without the annotation, that's not exposed to the user. Maybe something to discuss with the group? I'm not 100% for including it on these verbs, just feeling a bit uncertain about throwing something out which seems non-arbitrary/meaningful. |
Limited to certain grammatical constructions is one thing. I'm expressing skepticism about the ones limited based on the lexical semantics. To put it another way: if you asked me to negate the verb "covering", I would say "not covering". "Uncovering" is contrary to covering in the same way that "opening" is contrary to "closing". That's not regular negation, even if it can be expressed morphologically for certain verbs. "Uncovering" can be negated too: "not uncovering". Do we know of other languages using Polarity=Neg in a very broad way? If you think more input is necessary I suggest opening an issue in the docs repo. |
I would suggest a perhaps clearer example of modal verb: may [not happen] vs [may not] happen. And the next time I listen to M. Jackson I'll think about unrain vs unbreak my heart. This work definitely needs a lot more investigation, and discussion.
|
The Core Group discussed and concluded that Maybe we should implement |
Per https://universaldependencies.org/u/feat/Polarity.html, also implemented the feature for interjections "yes" and "no" |
Hm, I think this makes this annotation rather useless for English, but I'll respect and implement the decision. I take it this does not apply to "never" or other items which can stand in a paradigm with "not"? I'm moving the old GUM polarity to a misc attribute |
"never" will be |
OK, I've applied Polarity=Neg based on this spec, let me know if we're adding PronTypes as well or if you want to wait for more discussion. |
…tives trigger the feature (UniversalDependencies/UD_English-EWT#526)
documented at https://universaldependencies.org/en/feat/Polarity.html |
I agree For example, English lemma not and no have Thus, if we decided that e.g. the English prefix un in adjectives (or another prefix in another language) is "the most grammatical form of negation", we should have e.g. form=unnecessary, lemma=necessary, Polarity=Neg. 15 years ago, I did an English lemmatizer that tried to identify negative adjectives, adverbs and nouns (but not verbs), but I admit there is a "canny valley" of non easy cases making me feel uneasy. Thus I don't suggest we go this way in UD. *) Personally, I would prefer |
@martinpopel I made an English-specific documentation page: https://universaldependencies.org/en/feat/Polarity.html If you have input for the universal page please post that to https://github.com/UniversalDependencies/docs |
The Pronouns dataset already has |
PUD has a couple oddities for its Most of them already have
The other weirdness is that quite a few have the UPOS tag
Presumably those should all be changed to PART? In general
... side note, extra |
…iated spellings (UniversalDependencies/docs#517 - also fix neaten.py cause of false negative in #532); some typos (including "develope", #526)
https://universaldependencies.org/u/feat/Polarity.html (no English-specific documentation though)
Looking at the comparison of English treebanks, GUM/GENTLE/GUMReddit cover both prefixes and function words, while some of the others only cover function words.
The text was updated successfully, but these errors were encountered: