Establishment of a national corpus the uzbek language is a requirement of a new era
The article is scientifically substantiated by the need to create a national corpus of the Uzbek language. Suggestions are given on the structure of the corpus, the program interface, the algorithm of the program, the technology for obtaining the results. Based on the experience of world scientists, the requirements for the data encoding format for the national corpus of the Uzbek language are described. The article says that the interface of the national corpus and the author's corpus has a different design, structure, its improvement is the responsibility of the author of the corpus, and the interface should evoke the first impression of the corpus, an attractive appearance. The interface should take into account decorations that reflect the national color, as well as symbols reflecting the classics or modernity, the interface should reflect the life and work of the artist, the works created by him should open in separate windows, partly in photo galleries. The types of internal and external interface are also discussed. The article analyzes the linguistic module and algorithm and its types from independent components of linguistic programs. The need for an algorithm for phonological, morphological and spelling rules for the formation of the lexical and grammatical code is scientifically substantiated. The importance of language modules, such as phonology, morphology and spelling, in the formation of the linguistic base of the national corpus of the Uzbek language is emphasized.