[ ] Read and understand the task. Link to task
[ ] We will be working in Google Colab.
[ ] Download the data. Link to Data
[ ] Perform exploratory data analysis to understand the data. Write a short summary of the data analysis.
[ ] Understand the data and data source. What characteristics do r/OffMyChest and r/CasualConversations have? How are they different from other subreddits?
[ ] Understand the task labels. What do the disclosure/supportive characteristics mean? Is there any literature on how they are represented in language? The literature review could be NLP focused or psychology focused.
[ ] Perform Task 1: Predict Disclosure labels for sentences using the training data.
[ ] Construct a neural network that takes the training sentences as input and their labels as the target output.
- Reference: Machine Learning Classification Models
[ ] Test the neural network.
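
The construct-and-test steps above could be sketched roughly as follows. This is only a minimal illustration, assuming a binary Disclosure label and bag-of-words features; it is not the task's prescribed model. The sentences are made up for the example (they are not from the task data), and all function names (`build_vocab`, `featurize`, `train`, `accuracy`) are hypothetical. The "network" here is the smallest possible one, a single sigmoid unit trained with gradient descent, written with the standard library so it runs in a fresh Colab cell.

```python
import math
import random
from collections import Counter

# Toy, made-up sentences for illustration only; NOT drawn from the task data.
# Assumed labels: 1 = contains disclosure, 0 = does not.
TRAIN = [
    ("i feel so alone since my dad died", 1),
    ("i am scared i will lose my job", 1),
    ("i have never told anyone i feel this sad", 1),
    ("my anxiety makes me feel worthless", 1),
    ("what is your favorite pizza topping", 0),
    ("the weather is nice today", 0),
    ("has anyone seen the new movie", 0),
    ("what time does the store open", 0),
]
TEST = [
    ("i feel sad and scared", 1),
    ("what is the weather today", 0),
]


def build_vocab(pairs):
    """Map each word seen in the training sentences to a feature index."""
    words = sorted({w for text, _ in pairs for w in text.split()})
    return {w: i for i, w in enumerate(words)}


def featurize(text, vocab):
    """Bag-of-words counts as a sparse dict {index: count}; unknown words are dropped."""
    counts = Counter(w for w in text.split() if w in vocab)
    return {vocab[w]: c for w, c in counts.items()}


def predict_proba(x, weights, bias):
    """Single sigmoid unit: estimated P(label = 1 | sentence)."""
    z = bias + sum(weights[i] * v for i, v in x.items())
    z = max(-30.0, min(30.0, z))  # clamp to keep exp() numerically safe
    return 1.0 / (1.0 + math.exp(-z))


def train(pairs, vocab, epochs=200, lr=0.5, seed=0):
    """Fit weights with stochastic gradient descent on the log loss."""
    rng = random.Random(seed)
    weights = [0.0] * len(vocab)
    bias = 0.0
    data = [(featurize(t, vocab), y) for t, y in pairs]
    for _ in range(epochs):
        rng.shuffle(data)
        for x, y in data:
            err = predict_proba(x, weights, bias) - y  # d(log loss)/dz
            for i, v in x.items():
                weights[i] -= lr * err * v
            bias -= lr * err
    return weights, bias


def accuracy(pairs, vocab, weights, bias):
    """Fraction of sentences whose thresholded prediction matches the label."""
    hits = sum(
        (predict_proba(featurize(t, vocab), weights, bias) >= 0.5) == bool(y)
        for t, y in pairs
    )
    return hits / len(pairs)


vocab = build_vocab(TRAIN)
weights, bias = train(TRAIN, vocab)
print("train accuracy:", accuracy(TRAIN, vocab, weights, bias))
print("test accuracy:", accuracy(TEST, vocab, weights, bias))
```

For the real task, the toy lists would be replaced by the downloaded training data with a held-out split for testing, and the single unit would likely give way to a deeper model from a library available in Colab.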