Formation of a database of phraseological units based on the corpus of the Uzbek language
Abstract
This article discusses how to form a database of phraseological units from the Uzbek language corpus. A linguistic database is a structured collection of information that stores words, phrases, grammatical forms, idioms, and other linguistic elements. Such databases are used in lexicography to compile dictionaries and reference books; in natural language processing to support machine translation, speech recognition, and text generation; in linguistic research to analyze the structure of a language and how its units change and are used; and in language education to create teaching materials and tools for language learning.
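As a rough illustration of what a structured store for phraseological units might look like, the sketch below uses Python's standard sqlite3 module. The table names, column names, and helper functions are assumptions made for this example and are not taken from the article; the sample idiom and the example sentence are likewise illustrative entries, not data from the Uzbek corpus.

```python
import sqlite3


def build_demo_db():
    """Create an in-memory database with a hypothetical two-table schema."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE phraseological_unit (
            id             INTEGER PRIMARY KEY,
            canonical_form TEXT NOT NULL UNIQUE,  -- dictionary (citation) form
            meaning        TEXT NOT NULL          -- short gloss
        );
        CREATE TABLE corpus_occurrence (
            id       INTEGER PRIMARY KEY,
            unit_id  INTEGER NOT NULL REFERENCES phraseological_unit(id),
            sentence TEXT NOT NULL,               -- sentence where the unit occurs
            source   TEXT                         -- optional text identifier
        );
        """
    )
    return conn


def add_unit(conn, canonical_form, meaning):
    """Insert one phraseological unit and return its row id."""
    cur = conn.execute(
        "INSERT INTO phraseological_unit (canonical_form, meaning) VALUES (?, ?)",
        (canonical_form, meaning),
    )
    return cur.lastrowid


def add_occurrence(conn, unit_id, sentence, source=None):
    """Attach one corpus sentence to an existing unit."""
    conn.execute(
        "INSERT INTO corpus_occurrence (unit_id, sentence, source) VALUES (?, ?, ?)",
        (unit_id, sentence, source),
    )


def search_units(conn, query):
    """Return (canonical_form, meaning) pairs whose form contains the query."""
    return conn.execute(
        "SELECT canonical_form, meaning FROM phraseological_unit "
        "WHERE canonical_form LIKE ? ORDER BY canonical_form",
        (f"%{query}%",),
    ).fetchall()


def count_occurrences(conn, unit_id):
    """Count how many corpus sentences are linked to a unit."""
    row = conn.execute(
        "SELECT COUNT(*) FROM corpus_occurrence WHERE unit_id = ?", (unit_id,)
    ).fetchone()
    return row[0]


if __name__ == "__main__":
    conn = build_demo_db()
    # Illustrative entry and invented example sentence (not corpus data).
    uid = add_unit(conn, "boshi osmonga yetmoq", "to be overjoyed")
    add_occurrence(conn, uid, "U xushxabarni eshitib, boshi osmonga yetdi.", "demo")
    print(search_units(conn, "osmon"))
    print(count_occurrences(conn, uid))
```

Keeping units and their corpus occurrences in separate tables is one possible design: it lets a single dictionary entry link to any number of attested sentences, which is the kind of lookup a corpus-based search would need.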
Keywords
Linguistic corpus, phraseological unit, search
License
Copyright (c) 2025 Yodgorov Umidjon Saydilla oglu

This work is licensed under a Creative Commons Attribution 4.0 International License.