PRINCIPLES OF DATA COLLECTION IN CREATING A DIACHRONIC CORPUS
Abstract
A corpus is a set of selected linguistic texts (oral or written), sufficient in number, that is compiled and classified according to strict principles determined by the pragmatic purpose of the corpus compiler. It meets the criteria of representativeness and is processed, tagged, and fully automated by computer. The result is a database whose size and consistency make it convenient for systematic search, for obtaining reference results, and for empirical analysis.












