Sherlock Holmes Texts

Here are the texts of two stories, tokenized, pos-tagged and annotated with extended wordnet senses. This release is prepared for the shared annotation task of the Events and Stories in the News workshop at ACL 2017. The texts are part of the NTU Multilingual Corpus, only the English texts are given here.

Texts are tokenized, pos-tagged and tagged with wordnet senses. They are released in a modified version of the NLP annotation format (NAF).

Sense tags are:

The stories are public domain, the annotations are released under the Creative Commons Attribution 4.0 International License Creative Commons License.


Francis Bond <bond@ieee.org>
Division of Linguistics and Multilingual Studies
Nanyang Technological University
Level 3, Room 55, 14 Nanyang Drive, Singapore 637332
Tel: (+65) 6592 1568; Fax: (+65) 6794 6303