Each line of the file contains a word, followed by the number of occurrences of that word in the 20120701 and 20200217 Google Books Ngrams datasets, in that order. The entire list is sorted in decreasing order of frequency in the 20200217 dataset.
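As a minimal sketch of reading this layout, the following assumes that each line holds three whitespace-separated fields (word, 20120701 count, 20200217 count). The delimiter is an assumption, since it is not stated here, and the names WordCount, parse_lines, and is_sorted_by_2020 are hypothetical helpers introduced only for illustration.

```python
from typing import Iterable, Iterator, NamedTuple


class WordCount(NamedTuple):
    word: str
    count_2012: int  # occurrences in the 20120701 dataset
    count_2020: int  # occurrences in the 20200217 dataset


def parse_lines(lines: Iterable[str]) -> Iterator[WordCount]:
    """Yield one WordCount per line that has exactly three fields.

    Assumption: fields are separated by whitespace, in the order
    word, 20120701 count, 20200217 count.
    """
    for line in lines:
        fields = line.split()
        if len(fields) != 3:
            continue  # skip blank lines or lines with another layout
        word, c2012, c2020 = fields
        yield WordCount(word, int(c2012), int(c2020))


def is_sorted_by_2020(entries: Iterable[WordCount]) -> bool:
    """Check the documented ordering: non-increasing 20200217 counts."""
    counts = [e.count_2020 for e in entries]
    return all(a >= b for a, b in zip(counts, counts[1:]))


if __name__ == "__main__":
    # Made-up sample lines, not real counts from the dataset.
    sample = ["alpha 10 30", "beta 25 20", "gamma 5 5"]
    entries = list(parse_lines(sample))
    print(entries[0].word, is_sorted_by_2020(entries))
```

To read the actual file, pass an open file object to parse_lines in place of the sample list; if the file turns out to use a different delimiter, only the split call needs to change.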
References:
- https://possiblywrong.wordpress.com/2021/02/27/an-updated-google-books-word-frequency-list/
- Google Books Ngram Viewer Exports, English versions 20120701 and 20200217
- Atkinson, K., Spell Checker Oriented Word List (SCOWL)
- WordNet Lexical Database for English version 3.1
The included SCOWL words were generated with the following configuration and accompanying license information:
Custom wordlist generated from http://app.aspell.net/create using SCOWL with parameters:
  diacritic: strip
  max_size: 95
  max_variant: 3
  special:
  spelling: US
Using Git Commit From: Mon Dec 7 20:14:35 2020 -0500 [5ef55f9]
Copyright 2000-2019 by Kevin Atkinson
Permission to use, copy, modify, distribute and sell these word lists, the associated scripts, the output created from the scripts, and its documentation for any purpose is hereby granted without fee, provided that the above copyright notice appears in all copies and that both that copyright notice and this permission notice appear in supporting documentation. Kevin Atkinson makes no representations about the suitability of this array for any purpose. It is provided "as is" without express or implied warranty.
The included WordNet words were retrieved from the WordNet database with the following license information:
WordNet Release 3.1
This software and database is being provided to you, the LICENSEE, by Princeton University under the following license. By obtaining, using and/or copying this software and database, you agree that you have read, understood, and will comply with these terms and conditions.:
Permission to use, copy, modify and distribute this software and database and its documentation for any purpose and without fee or royalty is hereby granted, provided that you agree to comply with the following copyright notice and statements, including the disclaimer, and that the same appear on ALL copies of the software, database and documentation, including modifications that you make for internal use or for distribution.
WordNet 3.1 Copyright 2011 by Princeton University. All rights reserved.
THIS SOFTWARE AND DATABASE IS PROVIDED "AS IS" AND PRINCETON UNIVERSITY MAKES NO REPRESENTATIONS OR WARRANTIES, EXPRESS OR IMPLIED. BY WAY OF EXAMPLE, BUT NOT LIMITATION, PRINCETON UNIVERSITY MAKES NO REPRESENTATIONS OR WARRANTIES OF MERCHANTABILITY OR FITNESS FOR ANY PARTICULAR PURPOSE OR THAT THE USE OF THE LICENSED SOFTWARE, DATABASE OR DOCUMENTATION WILL NOT INFRINGE ANY THIRD PARTY PATENTS, COPYRIGHTS, TRADEMARKS OR OTHER RIGHTS.
The name of Princeton University or Princeton may not be used in advertising or publicity pertaining to distribution of the software and/or database. Title to copyright in this software, database and any associated documentation shall at all times remain with Princeton University and LICENSEE agrees to preserve same.