SDK/API for machine learning and information extraction, primarily on text data. A range of algorithms are included and is integrated tightly with the visualization tools for manual and automatic annotation of text. Java classes for storing text, annotating text, and learning to extract entities and categorize text.