Speech technologies and Russian pronunciation variation in the context of VoiceInteraction

Authors

  • Anna Havras, Universidade de Lisboa / VoiceInteraction
  • Carlos Mendes, VoiceInteraction – Tecnologias de Processamento de Fala
  • Gueorgui Hristovsky, Universidade de Lisboa
  • Sérgio Paulo, VoiceInteraction – Tecnologias de Processamento de Fala
  • Helena Moniz, Universidade de Lisboa / INESC-ID, https://orcid.org/0000-0003-0900-6938

DOI:

https://doi.org/10.26334/2183-9077/rapln10ano2023a8

Keywords:

automatic speech recognition, phonetics, Russian language, filled pauses, varieties

Abstract

This article describes work conducted at VoiceInteraction, a company specialized in speech processing solutions, with a particular focus on automatic transcription using a hybrid Automatic Speech Recognizer (ASR). The primary objective was to study the phonetic characteristics of the Russian language through four main tasks: describing the phonetic-phonological inventory, validating news transcriptions, validating a previously created lexicon, and integrating filled pauses into the ASR. By transcribing Russian-language data, this work contributed to the Artificial Intelligence and Advanced Data Analysis for Authority Agencies (AIDA) project, funded by the European Commission under the Horizon 2020 program.

Published

2023-10-22

How to Cite

Havras, A., Mendes, C., Hristovsky, G., Paulo, S., & Moniz, H. (2023). Speech technologies and Russian pronunciation variation in the context of VoiceInteraction. Journal of the Portuguese Linguistics Association, (10), 138–161. https://doi.org/10.26334/2183-9077/rapln10ano2023a8