first20hours/google-10000-english
This repo contains a list of the 10,000 most common English words in order of frequency, as determined by n-gram frequency analysis of the Google's Trillion Word Corpus.
NOASSERTION
Issues
- 1
'voyuer' is not a real word
#31 opened by vkalantar - 1
there is no word 'adept' ?
#38 opened by pencilCool - 0
Several single character words are not words
#43 opened by barrybriggs - 0
Hangman
#42 opened by 27-04-2009 - 0
divx is not an english word
#41 opened by frankh - 0
- 4
Clearer copyright
#21 opened by HubKing - 2
- 0
Could we get a SFW 20k word list?
#32 opened by Anonymus1 - 0
Various brand names are included
#30 opened by vkalantar - 0
- 0
"profileprofile" word (duplicated)
#28 opened by Idan503 - 1
missing very common words
#14 opened by mdtr - 1
How do you get it to work?
#27 opened by jw4wellness - 0
Masturbating?
#20 opened - 0
Contractions?
#23 opened by brandonchinn178 - 4
Is there a Spanish version?
#15 opened by BayInternetGroup - 0
All words are lowercased, even Proper Nouns
#18 opened by giorgio79 - 1
Some bad words not filtered from clean versions
#16 opened by Elizafox - 0
- 2
top 10k english words that are words?
#10 opened by tedder - 1
Unclear license
#11 opened by l0b0 - 5
Why are there ~1500 duplicate words here?
#6 opened by farzher - 2
- 1
Frequency Fail
#1 opened by trans