Abstract
The list is derived from a corpus of 6,192,659 words drawn from 12 American TV shows. The shows were selected to be balanced and representative of spoken American English.
The top 5,062 lemmas in the corpus account for 5,796,570 words. Place names (toponyms) and character names have been excluded from the list for practical reasons, but they are easy to understand and account for a further 177,952 words. Bottom line: (5,796,570 + 177,952) / 6,192,659 ≈ 96% coverage.
In other words, knowing the roughly 5,000 words in this list will let you, in practice, understand about 96% of the words in the corpus and, by extension, about 96% of spoken American English.
(FYI, the 12 shows used for the corpus are: Friends, How I Met Your Mother, Sex and the City, South Park, Community, The Office, Modern Family, Family Guy, The Simpsons, The Big Bang Theory, Curb Your Enthusiasm, Seinfeld.)
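
The coverage figure quoted above can be checked with a few lines of Python. This is only a sketch of the arithmetic: the three counts are taken from the description, while the constant names and the `coverage` helper are illustrative and not part of the book.

```python
# Word counts quoted in the description (names here are illustrative).
CORPUS_WORDS = 6_192_659        # total words in the 12-show corpus
TOP_LEMMA_WORDS = 5_796_570     # words accounted for by the top 5,062 lemmas
NAME_AND_PLACE_WORDS = 177_952  # excluded toponyms and character names


def coverage(covered: int, total: int) -> float:
    """Return the fraction of the corpus accounted for by `covered` words."""
    return covered / total


# Coverage when names and places are counted as understood.
with_names = coverage(TOP_LEMMA_WORDS + NAME_AND_PLACE_WORDS, CORPUS_WORDS)
# Coverage from the listed lemmas alone.
lemmas_only = coverage(TOP_LEMMA_WORDS, CORPUS_WORDS)

print(f"lemmas plus names and places: {with_names:.1%}")
print(f"lemmas only: {lemmas_only:.1%}")
```

Counting the lemmas alone gives a somewhat lower figure than the headline number, so the 96% claim depends on treating names and places as already understood.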








