Corpora of Vietnamese Texts (CVT)
Research and Clinical Applications
The CVT has many potential research and clinical applications
in the areas of child language acquisition, second language
acquisition, and foreign language learning.
Examples of clinical applications include:
· The Vietnamese children’s literature corpus word list may be used to identify
highly frequent words that students may learn as sight words for reading.
· The word list based on the entire CVT may be used to identify the most
commonly occurring consonant sounds, vowel sounds, and tones (see Related
Publications) for the purpose of foreign accent enhancement and speech
remediation.
· The CVT, as with corpora of other languages, may be used in the design of
dictionaries, thesauri, and other language reference materials.
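As a rough illustration of the first two applications above, the following Python sketch builds a frequency list and a tone count from raw text. It is not the CVT's own method: the function names and the sample sentence are hypothetical, tokens are split on whitespace (so they are orthographic syllables, not segmented words), and tones are inferred from the combining diacritics that remain after Unicode decomposition. Consonant and vowel counts are not covered.

```python
import unicodedata
from collections import Counter

# Combining marks that signal tone in Vietnamese orthography.
# Labels are ASCII renderings of the tone names (assumption for this sketch).
TONE_MARKS = {
    "\u0300": "huyen",  # grave
    "\u0301": "sac",    # acute
    "\u0309": "hoi",    # hook above
    "\u0303": "nga",    # tilde
    "\u0323": "nang",   # dot below
}

def syllable_tone(syllable):
    """Return the tone label of one written syllable ('ngang' if unmarked)."""
    for ch in unicodedata.normalize("NFD", syllable):
        if ch in TONE_MARKS:
            return TONE_MARKS[ch]
    return "ngang"

def frequency_lists(text):
    """Count whitespace-separated syllables and their tones."""
    tokens = [t.strip(".,;:!?\"'()").lower() for t in text.split()]
    tokens = [unicodedata.normalize("NFC", t) for t in tokens if t]
    return Counter(tokens), Counter(syllable_tone(t) for t in tokens)

# Hypothetical sample text, not taken from the CVT.
sample = "Con mèo và con chó. Con mèo ngủ."
words, tones = frequency_lists(sample)
print(words.most_common(3))
print(tones.most_common())
```

Because many Vietnamese words span more than one syllable, a real word list would need a word segmentation step before counting. Non-word tokens such as numerals would also be counted as unmarked ("ngang") here and should be filtered first.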
Examples of research applications include using corpus data
to empirically test assumptions and hypotheses related to linguistics and
language development, such as:
· Assumptions about the structure and form of the Vietnamese language
· Assumptions about theoretical issues in linguistics such as universal and
language-specific features, markedness, phonological saliency, etc.
· Hypotheses on discourse
· Hypotheses on gender roles, particularly in children’s literature
· Description of the differences between translated and original texts
· Description of language change (i.e., how Vietnamese language use differs in
Viet Nam and abroad).
This project is a work in progress. Please feel free to contact
me with any suggestions, comments, or questions.