/LanguageGenerationCapstoneProject

Language Generation Capstone Project

Primary LanguageJupyter Notebook

LanguageGenerationCapstoneProject

Language Generation Capstone Project

Data

The data used by this project is Reddit comment data crawled using Reddit API. The data contains information of a large set of dimension fields listing as following.

Fields
body
author
subreddit
_replies
_submission
awarders
total_awards_received
approved_at_utc
link_title
mod_reason_by
banned_by
author_flair_type
removal_reason
link_id
author_flair_template_id
likes user_reports
saved id
banned_at_utc
mod_reason_title
gilded
archived
no_follow
num_comments
can_mod_post
created_utc
send_replies
parent_id score
author_fullname
over_18
approved_by
mod_note
all_awardings
subreddit_id
edited
author_flair_css_class
name
author_patreon_flair
downs author_flair_richtext
is_submitter
body_html
gildings
collapsed_reason
distinguished
associated_award
stickied
author_premium
can_gild
author_flair_text_color
score_hidden
permalink
num_reports
link_permalink
report_reasons
link_author
author_flair_text
link_url
created
collapsed
subreddit_name_prefixed
controversiality
locked
author_flair_background_color
collapsed_because_crowd_control
mod_reports
quarantine
subreddit_type
ups _fetched