Building and Evaluating a Distributional Memory for Croatian
Jan Snajder, Sebastian Pado and Zeljko Agic
The 51st Annual Meeting of the Association for Computational Linguistics - Short Papers (ACL Short Papers 2013)
Sofia, Bulgaria, August 4-9, 2013
We report on the first structured distributional semantic model for Croatian, dm.hr. It is constructed after the model of the English Distributional Memory (Baroni and Lenci, 2010), from a dependency-parsed Croatian web corpus, and covers around 2M lemmas. We give details on the linguistic processing and the design principles. An evaluation shows state-of-the-art performance on a semantic similarity task with particularly good performance on nouns. The resource is freely available.
Conference Manager (V2.61.0 - Rev. 2792M)