LX-LR4DistSemEval: a collection of language resources for the evaluation of distributional semantic models of Portuguese

  • Andreia Querido
  • Rita Carvalho
  • João Rodrigues
  • Marcos Garcia
  • João Silva
  • Catarina Correia
  • Nuno Rendeiro
  • Rita Valadas Pereira
  • Marisa Campos
  • António Branco
Palavras-chave: distributional semantics, data sets, evaluation, Portuguese, semântica distribucional, conjuntos de dados, avaliação, português

Resumo

In this paper we describe a collection of publicly available data sets for Portuguese that are suitable for the
evaluation of distributional semantics models in lexical similarity tasks and in conceptual categorization tasks.
These data sets were adapted from English gold-standard test sets, allowing any Portuguese distributional
semantics model to be evaluated and also to be compared to mainstream results that have been obtained for this
language. We also present an online service that showcases some functionalities of the distributional semantics
models.

Publicado
2017-09-29
Como Citar
Querido, A., Carvalho, R., Rodrigues, J., Garcia, M., Silva, J., Correia, C., Rendeiro, N., Valadas Pereira, R., Campos, M., & Branco, A. (2017). LX-LR4DistSemEval: a collection of language resources for the evaluation of distributional semantic models of Portuguese. Revista Da Associação Portuguesa De Linguística, (3), 265-283. https://doi.org/10.26334/2183-9077/rapln3ano2017a15