Annotation Guidelines for Compound Analysis

This technical report introduces three sets of annotation guidelines for the analysis of compounds in Afrikaans and Dutch. The first protocol serves the annotation of compound boundaries when creating a dataset to use for compound segmentation. The second and third protocol serve the semantic annotation of the relation between the constituents of compounds. Where the second protocol only focuses on noun-noun (NN) compounds, the third protocol deals with other two-part nominal (XN) compounds.

 

The report further contains a terminology list with definitions of concepts and abbreviations relevant to the analysis of compounds and an overview of the AuCoPro project in the context of which these guidelines were developed.

Issue #: 
005
Author(s): 

Ben Verhoeven
Gerhard van Huyssteen
Menno van Zaanen
Walter Daelemans

ISSN: 
2033-3544
Published: 
31/01/2014
AttachmentSize
PDF455.03 KB
ctrs-005-front.jpg