CORPORART - a corpus of Public Art for lexicon extraction: representativity and comparability in specialized corpora
DOI:
https://doi.org/10.26334/2183-9077/rapln5ano2019a4Keywords:
specialized corpora, specialized lexicon, corpus organization, representativity, public artAbstract
This paper introduces the CORPORART, a bilingual corpus of Public Art. CORPORART intends to gather, in a single collection of bilingual data, representative samples of specialized language in European Portuguese and Italian. The compilation of this corpus is part of an ongoing doctoral project, which aims to integrate specialized lexical units into a pre-existing common language resource, WordNet.PT (Marrafa et al., 2005), in the perspective of contributing to streamline communication between heterogeneous interlocutors (Amaro & Mendes, 2012). Assuming that the structure of the corpus heavily depends on the goals of the investigation, this paper presents the linguistic and extralinguistic parameters adopted for the construction and organization of the corpus, as well as the criteria for text processing. In particular, we will deepen the notion of representativity and comparability considering the specificity of this case study, outlining a work practice proposal oriented to guarantee these two flexible dimensions within the specialized languages context.
Downloads
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2019 Chiara Barbero

This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Authors retain copyright and concede to the journal the right of first publication. The articles are simultaneously licensed under the Creative Commons Attribution License, which allows sharing of the work with an acknowledgement of authorship and initial publication in this journal.
The authors have permission to make the version of the text published in RAPL available in institutional repositories or other platforms for the distribution of academic papers (e.g., ResearchGate).