SFGram-dataset

SFGram (Science-Fiction Gram) is a dataset of public-domain science-fiction novels, books, and movie covers. It is designed for researchers who want to study the evolution of science-fiction literature over time and to test machine learning algorithms on authorship attribution and document classification tasks. All the documents are now in the public domain and were obtained from Project Gutenberg or the archive.org website.

Primary language: OpenEdge ABL
License: GNU General Public License v3.0 (GPL-3.0)
