
Topic model of policy papers on artificial intelligence

Primary language: Jupyter Notebook. License: Creative Commons Zero v1.0 Universal (CC0-1.0).

AI Policy Topic Model

This repository contains my work from a 2022 internship at the UCD Centre for Digital Policy.

Pre-processing

Visualization

Dependencies

  • stopwordsiso - for stopword list
  • NLTK - for lemmatizer
  • scikit-learn - for the topic model (Latent Dirichlet Allocation, LDA)
  • pyLDAvis - for visualization
  • Apache PDFBox 3 - for text extraction from PDF