Frequency Lists

The following are a collection of frequency lists for various languages. I will only list those that present their sources, and are complete frequency lists (no words intentionally omitted). These are grouped by vocabulary list size and by language.

1,000+ words:

Arabic - Part II
Chinese - Part II, Part III, Part IV (Patrick Hassel Zein)
Chinese - Part II, Part III (Subtlex-CH)
German
Greek
Khmer (English translations incomplete)
Maori
Thai
Turkish

5,000+ words:

Arabic
English
French
German
Korean - Part II, Part III, Part IV, Part V
Norwegian - Part II
Portuguese
Russian - Part II, Part III, Part IV, Part V, Part VI
Spanish

10,000+ words:

Russian - Part II

1 Like

For Chinese, there are these courses based upon the subtlex-ch: http://www.memrise.com/courses/english/?q=subtlex My courses in that list are a bit random, but @BenWhately’s and @hengfun’s courses should be good.

Thanks; when there are multiples for each set (as in Chinese here) I will specify the sources after the links.

I’ve made one myself for Indonesian here:
http://lusentoj.neocities.org/languages/indonesian/INDO-vortoft.html

As it says in the link, all words with less than 200 hits were removed, because there was too much interference from words that aren’t actually in Indonesian. And the words were only taken from non-professional fiction writing, nothing like newspapers or Twitter.

EDIT: Oops, didn’t realize you meant “courses created from word frequency lists” and not “actual word frequency lists”. Oh well.

That is a good idea for a next project, actually.