Annobase
Introduction
Annobase is a tool, implemented in Java, that provides a lightweight but comprehensive representation for text annotation. Its main goal is to offer a human-readable representation that is easy to parse and extend, together with a predefined set of linguistically motivated annotations and operations on them. The following example shows how to use Annobase.
    String text = "This is a sentence. This is another sentence";
    AnnotationBase annBase = new AnnotationBase(text);

    // Run your sentence segmenter and tokenizer, and set sentences and tokens to
    // the annotation base instance.

    List<Sentence> sents = annBase.getSentences();
    Sentence firstSent = sents.get(0);
    Token firstToken = firstSent.getFirstToken();
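The example above leaves sentence segmentation and tokenization to the user. As a rough illustration of what such components might produce, here is a minimal, self-contained sketch that computes character-offset spans for sentences and tokens using only the JDK (`java.text.BreakIterator` for sentences, a simple regular expression for tokens). The names `StandoffSketch`, `Span`, `sentenceSpans`, and `tokenSpans` are hypothetical and are not part of Annobase's API; how the resulting spans are set on an `AnnotationBase` instance is not shown here, since that depends on Annobase's own classes.

```java
import java.text.BreakIterator;
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

/** Hypothetical stand-off annotation sketch; this is NOT Annobase's API. */
final class StandoffSketch {

    /** A half-open character span [begin, end) into the source text. */
    static final class Span {
        final int begin;
        final int end;

        Span(int begin, int end) {
            this.begin = begin;
            this.end = end;
        }

        /** Returns the substring of the text that this span covers. */
        String coveredText(String text) {
            return text.substring(begin, end);
        }
    }

    // Assumption: a token is a run of word characters or one punctuation mark.
    private static final Pattern TOKEN = Pattern.compile("\\w+|[^\\w\\s]");

    /** Segments the text into sentence spans, trimming surrounding whitespace. */
    static List<Span> sentenceSpans(String text) {
        List<Span> spans = new ArrayList<>();
        BreakIterator it = BreakIterator.getSentenceInstance(Locale.US);
        it.setText(text);
        int begin = it.first();
        for (int end = it.next(); end != BreakIterator.DONE; begin = end, end = it.next()) {
            int b = begin;
            int e = end;
            while (b < e && Character.isWhitespace(text.charAt(b))) b++;
            while (e > b && Character.isWhitespace(text.charAt(e - 1))) e--;
            if (b < e) spans.add(new Span(b, e));
        }
        return spans;
    }

    /** Tokenizes one sentence; token offsets stay relative to the full text. */
    static List<Span> tokenSpans(String text, Span sentence) {
        List<Span> spans = new ArrayList<>();
        Matcher m = TOKEN.matcher(text).region(sentence.begin, sentence.end);
        while (m.find()) {
            spans.add(new Span(m.start(), m.end()));
        }
        return spans;
    }

    public static void main(String[] args) {
        String text = "This is a sentence. This is another sentence";
        List<Span> sentences = sentenceSpans(text);
        Span firstSentence = sentences.get(0);
        Span firstToken = tokenSpans(text, firstSentence).get(0);
        System.out.println(firstSentence.coveredText(text)); // This is a sentence.
        System.out.println(firstToken.coveredText(text));    // This
    }
}
```

Keeping every span as offsets into the original text, rather than copying substrings, is one common way to keep annotations lightweight: the text is stored once, and each annotation layer only records where it applies.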
Requirements
You need the following software to run Annobase.
- Java 8
- Apache Commons IO 2.4
Download
The latest version is 1.0.1. Annobase is available for download and is licensed under the GNU General Public License (version 2 or later).
Documentation
Citation
If you are using Annobase, please cite it as follows.
Jun Araki. 2015. Annobase: A Light-weight Representation for Text Annotation. http://junaraki.net/software/annobase.
Here is the corresponding BibTeX entry for the citation:
    @misc{Araki2015Annobase,
      author = {Jun Araki},
      title = {{Annobase}: {A} Light-weight Representation for Text Annotation},
      howpublished = {\url{http://junaraki.net/software/annobase}},
      year = {2015},
    }
Change log
Annobase 1.0.1 (2015-05-13):
- Several minor enhancements.
Annobase 1.0.0 (2015-05-11):
- Initial release.