This repository contains a Java library that defines a set of Apache Spark ML components specialized for NLP (Natural Language Processing) tasks. NLP covers the technologies used for text mining and opinion mining through the analysis of textual datasets.
IMPORTANT NOTE: The library is currently in PRE-ALPHA status, and a lot of work remains before a first usable version of the software can be released.