6533b7d6fe1ef96bd1267327

RESEARCH PRODUCT

O pewnej możliwości ewaluacji frazeologii na przykładzie danych z portalu Грамота.ру i z Narodowego Korpusu Języka Rosyjskiego

subject

lexicographycorpus linguisticsphraseologydata miningphraseography

description

The author of the article has run an experiment based on extracting a portion of phraseology from an online Russian language dictionary for further corpus-driven study. On the basis of the list of 100 most common Russian nouns the author has constructed queries to the Грамота.ру web portal that led to extracting over 600 idioms. These were subsequently used to perform another search in the Russian National Corpus. The main goal of this article is to construct a small dictionary of phraseological units extracted from Грамота.ру, as well as discuss the problem of evaluation of phraseology with the use of corpus-extracted data. The author argues that this kind of aproach can provide a considerable amount of valuable information, regardless of obvious differences in how phraseology works in dictionaries and in texts.