Annotation of Multiword Expressions in the Prague Dependency Treebank
Anotace víceslovných výrazů v Pražském závislostním korpusu
dissertation thesis (DEFENDED)

View/ Open
Permanent link
http://hdl.handle.net/20.500.11956/35012Identifiers
Study Information System: 39820
Collections
- Kvalifikační práce [11322]
Author
Advisor
Referee
Pala, Karel
Pecina, Pavel
Faculty / Institute
Faculty of Mathematics and Physics
Discipline
Mathematical Linguistics
Department
Institute of Formal and Applied Linguistics
Date of defense
23. 9. 2010
Publisher
Univerzita Karlova, Matematicko-fyzikální fakultaLanguage
English
Grade
Pass
This thesis explores annotation of multiword expressions in the Prague Dependency Treebank 2.0. We explain, what we understand as multiword expressions (MWEs), review the state of PDT 2.0 with respect to MWEs and present our annotation. We describe the data format developed for the annotation, the annotation tool, and other soware developed to allow for visualisation and searching of the data. We also present the annotation lexicon SemLex and analysis of the annotation.