An Exploration of Vocabulary Knowledge in English Short Talks- A Corpus-Driven Approach

Full Text: PDF &nbsp;
DOI: 10.5539/ijel.v2n4p33

Yu-Chia Wang

doi:10.5539/ijel.v2n4p33

An Exploration of Vocabulary Knowledge in English Short Talks- A Corpus-Driven Approach

Yu-Chia Wang

Abstract

Adopting a corpus-driven approach, the study aimed to explore the vocabulary knowledge in English short talks including word patterns, features, and usages that are most likely to be encountered by language users in the real context. A specific corpus TED was conducted through a collection of English talks that are less than 20 minutes from the website TED Talks. In addition, the existed corpus BASE (British Academic Spoken English) was included in the study as a sample of talks longer than 20 minutes. Applying three corpus tools, AntConc (Anthony, 2003), RANGE (Nation & Heartkey, 2002), and KfNgram (Fletcher, 2007), the researcher was able to compile frequency-ordered word lists, concordance lines, vocabulary coverage, and lists of lexical bundles. The results showed that although the most frequently-used words in TED corpus and BASE corpus were similar grammatical items, the order was quite different. Moreover, the chi-square test showed a significant difference among four pronouns I, You, We, They between the two corpora and also in different parts of the TED corpus. Finally, the results of concordance lines and lexical bundles presented the “typical” and “frequent” word usages in the beginning, middle, and ending part of English short talks. It is suggested that teachers can build their own corpus to meet specific teaching purposes or learner’s needs, and to generate the corpus results into classroom materials while teaching English short talks.