The 'Digital Edition of the Vocabularies of the Academy of Sciences' project: VOLP-1940

Authors

  • Ana Salgado Academia das Ciências de Lisboa, Instituto de Lexicologia e Lexicografia da Língua Portuguesa
  • Rute Costa Centro de Linguística da Universidade NOVA de Lisboa / FCSH-NOVA

DOI:

https://doi.org/10.26334/2183-9077/rapln7ano2020a17

Keywords:

lexicography, vocabularies, Text Encoding Initiative (TEI), linguistic annotation, Digital Humanities

Abstract

This paper presents the Digital Edition of the Vocabularies of the Academy of Sciences project, which aims to digitise the spelling vocabularies of the Lisbon Academy of Sciences (ACL) in order to create a digital lexicographic corpus bringing together the printed versions of all these lexicographical reference works – the 1940, 1947, 1970, and finally the 2012 editions. The first stage started with the Vocabulário Ortográfico da Língua Portuguesa [Orthographic Vocabulary of the Portuguese Language] (VOLP-1940), our case study. After digitising this vocabulary, the work described here focuses on the linguistic annotation of VOLP-1940 using eXtensible Markup Language (XML), an annotation metalanguage, and following the annotation directives of the Text Encoding Initiative (TEI), more specifically the application of TEI Lex-0, a new TEI sub-format. We aim to highlight the need for rigorous linguistic data processing in the creation of new lexical resources to increase the quality of their description and applicability.

Downloads

Download data is not yet available.

Published

2020-11-30

How to Cite

Salgado, A., & Costa, R. (2020). The ’Digital Edition of the Vocabularies of the Academy of Sciences’ project: VOLP-1940. Journal of the Portuguese Linguistics Association, (7), 275–294. https://doi.org/10.26334/2183-9077/rapln7ano2020a17