Detection of Errors and Correction
in Corpus Annotation

Increasing the Recall of Corpus Annotation Error Detection

Adriane Boyd, Markus Dickinson, and Detmar Meurers

Proceedings of the Sixth Workshop on Treebanks and Linguistic Theories (TLT 2007). Bergen, Norway.

While error detection approaches have been developed for various types of corpus annotation, so far only limited attention has been paid to the recall of those methods. We show how the recall of the so-called variation $n$-gram method can be increased by examining comparable part-of-speech tag sequences instead of the recurring strings themselves. To guide the search for erroneous annotation and to distinguish errors with high precision, we also develop new context reliability indicators.
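As a rough illustration of the idea in the abstract, the Python sketch below groups each token with one neighbour on either side and reports the groups in which the middle token carries more than one tag. It is not the authors' implementation: the function name `variation_contexts`, the one-token window, the `use_tags` switch, and the toy corpus are all assumptions made for this example, and comparing neighbours by tag is only one possible reading of "comparable part-of-speech tag sequences".

```python
from collections import defaultdict

def variation_contexts(tagged_sents, use_tags=False):
    """Group each token by itself plus one neighbour on each side.

    use_tags=False: neighbours are compared by word form (recurring strings).
    use_tags=True:  neighbours are compared by POS tag (illustrative reading only).
    Returns only the groups whose middle token carries more than one tag.
    """
    groups = defaultdict(list)
    side = 1 if use_tags else 0  # index into each (word, tag) pair
    for s, sent in enumerate(tagged_sents):
        for i in range(1, len(sent) - 1):
            word, tag = sent[i]
            key = (sent[i - 1][side], word, sent[i + 1][side])
            groups[key].append((s, i, tag))
    # Keep only groups where the middle token's tag varies.
    return {k: v for k, v in groups.items() if len({t for _, _, t in v}) > 1}

# Hypothetical toy corpus: "can" is tagged inconsistently in the same context.
corpus = [
    [("the", "DT"), ("can", "NN"), ("fell", "VBD")],
    [("the", "DT"), ("can", "MD"), ("fell", "VBD")],
    [("a", "DT"), ("can", "NN"), ("broke", "VBD")],
]

by_words = variation_contexts(corpus)
by_tags = variation_contexts(corpus, use_tags=True)
```

On this toy corpus, the word-based grouping covers only the two occurrences of "the can fell", while the tag-based grouping also pulls in the third sentence, since its neighbours carry the same tags. Every flagged group is only a candidate for inspection, not a confirmed error.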


BibTeX entry:

@InProceedings{boyd-et-al:07a,
  author =       {Adriane Boyd and Markus Dickinson and Detmar Meurers},
  title =        {Increasing the Recall of Corpus Annotation
                  Error Detection},
  booktitle =    {Proceedings of the Sixth Workshop on Treebanks 
                  and Linguistic Theories (TLT 2007)},
  address =      {Bergen, Norway},
  url =          {\url{http://decca.osu.edu/publications/boyd-et-al-07b.html}},
  year =         {2007}
}