Notes -
- The following project was completed for the course CSC 434 Artificial Intelligence and Machine Learning, offered at The College of Brockport.
- Citations are referred to in brackets [#].
Date Submitted: 12/2021, Fall Semester
Project Target: The goal of this project is to build a Natural Language Processing (NLP) model that uses multiclass classification to perform sentiment analysis on a text dataset.
Built with -
- TensorFlow
Associated Packages -
- Keras
- nltk
- pandas
Dataset Source -
- Kaggle
Dataset Used -
- State of the Union Corpus (1790-2018) [5]
The code draws heavily on the textbook "Deep Learning with Python" by François Chollet [1].
Overview: Building the NLP model proceeds in consecutive steps: importing the dataset, preprocessing it (tokenization, encoding, and vectorization), creating the model, and finally validating the built model against the project target. sentence_df.py applies the transformations to the original dataset. SOU.py vectorizes the preprocessed data into tensors, then builds, plots, and validates the model.
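The preprocessing steps named in the overview can be sketched in a few lines of Python. This is a minimal, standard-library-only illustration and not the code in sentence_df.py or SOU.py: the function names, the sample text, and the choice of multi-hot encoding are all assumptions made for the example, and the project itself relies on pandas, nltk, and Keras for these steps.

```python
import re
import string

# NOTE: every name below is hypothetical and the sample text is made up;
# this is a stand-in for the pandas/nltk/Keras code used by the project.

def split_sentences(speech):
    """Split a speech into sentences at '.', '!' or '?' followed by whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", speech.strip())
    return [p for p in parts if p]

def tokenize(sentence):
    """Lowercase the sentence, drop ASCII punctuation, and split on whitespace."""
    table = str.maketrans("", "", string.punctuation)
    return sentence.lower().translate(table).split()

def build_word_index(token_lists):
    """Encode each distinct token as an integer, in order of first appearance."""
    index = {}
    for tokens in token_lists:
        for tok in tokens:
            if tok not in index:
                index[tok] = len(index)
    return index

def multi_hot(tokens, word_index):
    """Vectorize one sentence: 1.0 at the position of every token it contains."""
    vec = [0.0] * len(word_index)
    for tok in tokens:
        if tok in word_index:
            vec[word_index[tok]] = 1.0
    return vec

speech = ("The state of the Union is strong. "
          "We face great challenges! Will we meet them?")

sentences = split_sentences(speech)                          # one row per sentence
token_lists = [tokenize(s) for s in sentences]               # tokenization
word_index = build_word_index(token_lists)                   # encoding
vectors = [multi_hot(t, word_index) for t in token_lists]    # vectorization

print(len(sentences), "sentences,", len(word_index), "distinct tokens")
```

Each resulting vector has one slot per vocabulary word, so every sentence becomes a fixed-length input regardless of its original length. In a Keras workflow these lists would typically be converted to tensors before being passed to the model.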
REFERENCES:
[1] F. Chollet, “Getting started with neural networks: Classification and regression,” in Deep Learning with Python, S.l., CA: O'Reilly Media, 2021.
[2] M. Mogyorosi, “Sentiment analysis: First steps with Python's NLTK library,” Real Python, 24-Sep-2021. [Online]. Available: https://realpython.com/python-nltk-sentiment-analysis/. [Accessed: 19-Nov-2021].
[3] R. F. Baumeister and K. D. Vohs, “Content Analysis,” Encyclopedia of Social Psychology. SAGE, London, 2007.
[4] NobelNobel, cs95, jpp, Heraknos, BENY, and c z, “Split cell into multiple rows in pandas dataframe,” Stack Overflow, 01-Jul-1966. [Online]. Available: https://stackoverflow.com/questions/50731229/split-cell-into-multiple-rows-in-pandas-dataframe. [Accessed: 17-Nov-2021].
[5] R. Tatman, “State of the Union Corpus (1790 - 2018),” Kaggle, 19-Oct-2018. [Online]. Available: https://www.kaggle.com/rtatman/state-of-the-union-corpus-1989-2017. [Accessed: 11-Oct-2021].
[6] Sulphix, D. Dotterel, and S. Scarab, “Remove punctuation in dataframe column code example,” remove punctuation in dataframe column Code Example. [Online]. Available: https://www.codegrepper.com/code-examples/python/remove+punctuation+in+dataframe+column. [Accessed: 17-Nov-2021].
CREDITS: coleman3616, dhodzic1, kyle-knopp, mwarren585