Bulgarian-English Treebank: Design and Implementation

Kiril Simov, Petya Osenova


The paper describes the construction of a Bulgarian-English treebank aligned on the word and semantic levels. We consider manual word-level alignment easier and more reliable than manual alignment on the syntactic and semantic levels. Thus, after manual word-level alignment, we apply an automatic procedure to construct the semantic-level alignments. We present the main steps of this automatic procedure, which exploits the syntactic analyses of both sentences, their morphosyntactic annotation, and the manual word-level alignment to produce the semantic annotation of the sentences and the semantic alignment. Last but not least, we discuss a method for identifying potential errors, which compares the automatically constructed semantic analyses of the Bulgarian sentences with the semantic representations of the English sentences.


alignment; English; Bulgarian

