Exploring different NLP operations on 20 newsgroup dataset.
The data contains approximately 20,000 across 20 online newsgroups
The 20 different newsgroups are:
- alt.atheism
- comp.graphics
- comp.os.ms-windows.misc
- comp.sys.ibm.pc.hardware
- comp.sys.mac.hardware
- comp.windows.x
- misc.forsale
- rec.autos
- rec.motorcycles
- rec.sport.baseball
- rec.sport.hockey
- sci.crypt
- sci.electronics
- sci.med
- sci.space
- soc.religion.christian
- talk.politics.guns
- talk.politics.mideast
- talk.politics.misc
- talk.religion.misc
- numpy
- matplotlib
- sklearn
- seaborn
- nltk