Dictionary search using Python
petermr opened this issue · 3 comments
petermr commented
Python code to read fulltext.xml
or the section
s and match words using one or more dictionaries.
petermr commented
Approximate:
- glob.glob(“/*_methods//*.xml”)
- elementTree to get the text of the XML files
- Search text using Python for stem/lowercase/stopwords and return wikiData ID and context
AmbrineH commented
ShweataNHegde commented
Peter had written a Jupyter Notebook (probably a month ago) which did something similar. Here is the link to the notebook. https://github.com/petermr/ami3/blob/master/src/ipynb/text.ipynb