Estratégias Lexicométricas para Detetar Especificidades Textuais

TítuloEstratégias Lexicométricas para Detetar Especificidades Textuais
AutoresÁlvaro Iriarte Sanromán, Pablo Gamallo, Alberto Simões
TipoArtículo de revista
Fonte Linguamática, Universidade do Minho and Universidade de Vigo, Vol. 10, No. 1, pp. 19-26 , 2018.
RankRanked Q1 in Linguistics and Language by CiteScore
DOI10.21814/lm.10.1.263
AbstractIn this article we propose to to define and develop an automatic strategy to search for lexical specificities within sets of texts using simple lexical units and multiword expressions (MWE). We propose a methodology for calculating the divergence of lemma and MWE distributions that will automatically find differences and similarities between unlabeled texts. This methodology can be used to subsequently identify groups of texts to which quantitative and qualitative analyzes will be applied (semiautomatically and/or with human intervention). In a first test, we used two specialized texts (from the area of Paediatrics) and a literary text, assuming that the texts of specialty should present greater divergences with respect to the literary text than among themselves. As the tests that were done showed the expected trend, we decided to apply the same methodology to a second set of texts (three sets of interviews done to visitors in the city of Santiago de Compostela). end{abstract}
Palabras chavedivergencia de Kullback-Leibler, divergˆencia lexical, lexicometria