Open-source scripts for the Allen Institute Kaggle challenge: https://www.kaggle.com/c/the-allen-ai-science-challenge
The script extracts "facts" from the "Lesson Summary" and "Key Concepts" ebooks available at http://www.ck12.org/
To run:
- Download ebooks from ck12.org (e.g., http://www.ck12.org/earth-science/#view_books; each subject has its own link)
- Put them in the "books" sub-directory
- Run: python ck12-fact-extractor.py
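
For a rough idea of what this kind of extraction can look like, here is a minimal sketch. It is not the code in ck12-fact-extractor.py. It assumes the ebook text is already available as plain text, that a section starts with a line equal to its title, and that each fact is a bullet line beginning with "-" or "*". The function name extract_facts and the sample text are hypothetical.

```python
import re

# Section titles taken from the description above.
SECTION_TITLES = ("Lesson Summary", "Key Concepts")


def extract_facts(text, titles=SECTION_TITLES):
    """Return the bullet lines found under any of the given section titles.

    Assumption: a section starts with a line equal to its title and ends
    at the first non-blank line that is not a bullet.
    """
    facts = []
    in_section = False
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if line in titles:
            in_section = True
            continue
        if not in_section or not line:
            continue  # skip text outside sections and blank lines
        match = re.match(r"^[-*]\s+(.*)$", line)
        if match:
            facts.append(match.group(1))
        else:
            in_section = False  # any other non-blank line ends the section
    return facts


# Hypothetical sample text, made up for illustration only.
sample = (
    "Intro\n"
    "Lesson Summary\n"
    "- Rocks are made of minerals.\n"
    "\n"
    "* Magma cools to form igneous rock.\n"
    "Review Questions\n"
    "- What is a rock?\n"
    "Key Concepts\n"
    "- Erosion moves sediment.\n"
)

facts = extract_facts(sample)
for fact in facts:
    print(fact)
```

Bullets under "Review Questions" are skipped because that title is not in SECTION_TITLES. Real ebook files will likely need format-specific parsing before a step like this applies.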