Release version for 20181111 / UD v. 2.3

@flammie flammie released this 18 Nov 18:23

This is a scheduled update release that follows the UD release. It also includes a lot of new lexical data, an API for Python, support for parsing a sentence at a time instead of a word at a time (necessary for most future work), and a downloader (beta) for limited systems and for people who don't want to compile the language models.

Significant changes in 20181111

  • Universal Dependencies version 2.3 is the reference for recall tests
  • At least 18,380 new words: 340,931 insertions(+), 322,551 deletions(-)
    • Imported enwikt data on top of re-importing fiwikt, joukahainen
  • New CG based on UD tags
  • Some Universal Dependencies relations are guessed (analysers using
    dependency guessing are slower and process sentences instead of words)
  • Default processing mode for many analysers is now sentence-based
  • Slightly extended Python API (modelled somewhat on spaCy, but not quite)
  • Ability to download compiled FST models from release instead of
    self-compiling (beta)
  • UniMorph used as the new recall / precision reference gold test set
  • Probably some fixes to recasing
  • Gradle support for the Java code
  • Renamed origin unihu → finer
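
For readers unfamiliar with the recall / precision tests against a gold set mentioned in the list above, here is a minimal sketch of how such scores can be computed. It is not the test code shipped with this release: the function name `recall_precision`, the dictionary layout and the sample analyses are all made up for illustration.

```python
from typing import Dict, Set, Tuple


def recall_precision(
    gold: Dict[str, Set[str]], predicted: Dict[str, Set[str]]
) -> Tuple[float, float]:
    """Score predicted analyses against gold analyses, per word form.

    Hypothetical helper: both arguments map a word form to a set of
    analysis strings. This layout is an assumption, not the release's format.
    """
    true_pos = false_pos = false_neg = 0
    for form, gold_analyses in gold.items():
        found = predicted.get(form, set())
        true_pos += len(gold_analyses & found)   # gold analyses the analyser produced
        false_neg += len(gold_analyses - found)  # gold analyses the analyser missed
        false_pos += len(found - gold_analyses)  # analyses not present in the gold set
    recall = true_pos / (true_pos + false_neg) if true_pos + false_neg else 0.0
    precision = true_pos / (true_pos + false_pos) if true_pos + false_pos else 0.0
    return recall, precision


# Illustrative data only, not taken from any real test set.
gold = {"talossa": {"N;IN+ESS;SG"}, "kissat": {"N;NOM;PL"}}
predicted = {"talossa": {"N;IN+ESS;SG", "N;NOM;SG"}, "kissat": set()}
recall, precision = recall_precision(gold, predicted)
print(f"recall={recall:.2f} precision={precision:.2f}")
```

In this sketch, recall drops when a gold analysis is missing from the analyser output, and precision drops when the analyser produces an analysis the gold set does not contain.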