/WebScrapping-PoetryFoundation

Extracting all the poems from Poetry Foundation, using Selenium, ,Beautiful Soup and Multiprocessing

Primary LanguagePythonMIT LicenseMIT

WebScrapping-PoetryFoundation

Extracting all the poems from Poetry Foundation, using Selenium, Beautiful Soup and Multiprocessing in Python.

The dataset extracted contains the:

  • Poem
  • Poem's Title
  • Poet
  • Tags


  • The dataset was created with intention for Artificial Poem Generation. It could be used for various other NLP tasks like classification, and semantic analysis. I hope, that dataset is helpful!

    The prominent tags featured in this dataset are highlighted by this word cloud:


    WordCloud Tags