LX-LR4DistSemEval: a collection of language resources for the evaluation of distributional semantic models of Portuguese

  • Andreia Querido, Faculty of Sciences of the University of Lisbon
  • Rita de Carvalho, Faculty of Sciences of the University of Lisbon
  • João Rodrigues, Faculty of Sciences of the University of Lisbon
  • Marcos Garcia, Faculty of Philology, University of Coruña
  • João Silva, Faculty of Sciences of the University of Lisbon
  • Catarina Correia, Faculty of Sciences of the University of Lisbon
  • Nuno Rendeiro, Faculty of Sciences of the University of Lisbon
  • Rita Pereira, Faculty of Sciences of the University of Lisbon
  • Marisa Campos, Faculty of Sciences of the University of Lisbon
  • António Branco, Faculty of Sciences of the University of Lisbon

Abstract

In this paper we describe a collection of publicly available data sets for Portuguese that are suitable for evaluating distributional semantics models in lexical similarity tasks and in conceptual categorization tasks. These data sets were adapted from English gold-standard test sets, allowing any Portuguese distributional semantics model to be evaluated and also to be compared to mainstream results that have been obtained for this language. We also present an online service that showcases some functionalities of the distributional semantics models.
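As an illustration of what a lexical similarity evaluation typically involves, the sketch below scores a model by the Spearman rank correlation between its cosine similarities and human gold-standard ratings. This is a common protocol for such test sets, not necessarily the exact procedure used in the paper. The vectors, word pairs, ratings, and all function names are made up for the example; they are not taken from the LX-LR4DistSemEval data sets.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)

def ranks(values):
    """1-based ranks; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    result = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        average_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            result[order[k]] = average_rank
        i = j + 1
    return result

def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sd_x = math.sqrt(sum((a - mean_x) ** 2 for a in x))
    sd_y = math.sqrt(sum((b - mean_y) ** 2 for b in y))
    return cov / (sd_x * sd_y)

def spearman(x, y):
    """Spearman correlation: Pearson correlation of the ranks."""
    return pearson(ranks(x), ranks(y))

def evaluate_similarity(vectors, gold_pairs):
    """Return (Spearman rho, number of pairs covered by the model).

    Pairs with a word missing from the model's vocabulary are skipped.
    """
    model_scores, gold_scores = [], []
    for word1, word2, rating in gold_pairs:
        if word1 in vectors and word2 in vectors:
            model_scores.append(cosine(vectors[word1], vectors[word2]))
            gold_scores.append(rating)
    return spearman(model_scores, gold_scores), len(gold_scores)

# Hypothetical toy model and toy gold standard (illustrative values only).
toy_vectors = {
    "gato": [1.0, 0.0],
    "cão": [0.9, 0.1],
    "carro": [0.0, 1.0],
    "automóvel": [0.05, 0.95],
}
toy_gold = [
    ("gato", "cão", 8.0),
    ("carro", "automóvel", 9.5),
    ("gato", "carro", 1.0),
    ("gato", "bicicleta", 2.0),  # "bicicleta" is out of vocabulary, so skipped
]

rho, covered = evaluate_similarity(toy_vectors, toy_gold)
print(f"Spearman rho = {rho:.3f} over {covered} pairs")
```

Reporting the number of covered pairs alongside the correlation matters when comparing models, since a model with a small vocabulary is otherwise scored on an easier subset of the test set.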

Published
2017-09-23
How to Cite
QUERIDO, Andreia et al. LX-LR4DistSemEval: a collection of language resources for the evaluation of distributional semantic models of Portuguese. Revista da Associação Portuguesa de Linguística, [S.l.], n. 3, p. 265-283, sep. 2017. ISSN 2183-9077. Available at: <http://ojs.apl.pt/index.php/rapl/article/view/15>. Date accessed: 17 aug. 2018. doi: https://doi.org/10.26334/2183-9077/rapln3ano2017a15.
Section
Articles