语料库

yǔliàokù
Meaning: linguistic corpus

📚 Word Explanation

语料库 (yǔliàokù)

A yǔliàokù (linguistic corpus) is a large, structured collection of authentic texts or spoken language recordings, used for linguistic research, language teaching, and natural language processing. The word combines three characters: (language), liào (material, substance), and (warehouse, repository). Literally, it means 'a warehouse of language material' — emphasizing its function as a carefully organized, searchable archive of real-world language use.

Linguists build corpora to study grammar patterns, vocabulary frequency, dialect variation, or historical language change. Modern corpora are usually digital and annotated with part-of-speech tags, syntactic trees, or semantic labels. They range from general-purpose collections (e.g., the Chinese National Corpus) to specialized ones — such as medical Chinese, business negotiations, or children’s speech — making them indispensable tools in computational linguistics and evidence-based language education.

💬 Example Sentences

Related Words

💬 Comments 0 comments
Loading...