/NLP

Primary LanguageJupyter Notebook

CSCI 6040 Spring 2022

Student Email
Amy Wightman dunphya20@students.ecu.edu
James Rosebaugh-Nordan rosebaughnordanj20@students.ecu.edu
Connor Bullard bullardc21@students.ecu.edu

Authors' Repository


https://github.com/webis-de/emnlp21-same-sentiment

Our Implementation


https://github.com/ymadh/NLP/blob/main/Final%20Project/OurImplementation_v2.ipynb

The datasets referenced in this notebook use a portion of the kaggle yelp data files above which were created using head() in the shell.
To use the full dataset, run the kaggle test notebook which will download the files to the server you are using.
Dataset can also be downloaded directly from yelp using this link.

Failed Attempts & Previous Attempts


From the author's code with some minor changes to prepare for execution:

https://github.com/ymadh/NLP/blob/shared/D2_samesentiment_yelp_create_pairs.ipynb - This was successful
https://github.com/ymadh/NLP/blob/shared/D2_samesentiment_yelp_base.ipynb - Times out on the last cell. This requires you to run MLFlow in the backround (per the authors' README)
https://github.com/ymadh/NLP/tree/main/Final%20Project/Reproduction%20Attempt - Other files we were unsuccessful in modifying but attempted to use (some were Jupyter notebooks converted to Python scripts)

Optional


Login to kaggle and get API credentials. Update kaggle.json.
Run kaggle_test.ipynb notebook to get all the data files.