rimonim's Stars
stanfordnlp/GloVe
Software in C and data files for the popular GloVe model for distributed word representations, a.k.a. word vectors or embeddings
bbc/bbplot
R package that helps create and export ggplot2 charts in the style used by the BBC News data team
quanteda/quanteda
An R package for the Quantitative Analysis of Textual Data
umbrae/reddit-top-2.5-million
This is a dataset of the all-time top 1,000 posts, from the top 2,500 subreddits by subscribers, pulled from reddit between August 15–20, 2013.
bohanli/BERT-flow
TensorFlow implementation of On the Sentence Embeddings from Pre-trained Language Models (EMNLP 2020)
infer-actively/pymdp
A Python implementation of active inference for Markov Decision Processes
yrosseel/lavaan
An R package for structural equation modeling and more
trinker/sentimentr
Dictionary-based sentiment analysis that considers valence shifters
mjockers/syuzhet
An R package for the extraction of sentiment and sentiment-based plot arcs from text
aesuli/SentiWordNet
The SentiWordNet sentiment lexicon
cjbarrie/academictwitteR
Repo for the academictwitteR package, which queries the Twitter Academic Research Product Track v2 API endpoint.
JULIELab/EmoBank
This repository contains EmoBank, a large-scale text corpus manually annotated with emotion according to the psychological Valence-Arousal-Dominance scheme.
marcoguerini/DepecheMood
High-coverage and high-precision lexica of terms annotated with emotion scores for English and Italian.
trinker/lexicon
A data package containing lexicons and dictionaries for text analysis
ivan-rivera/RedditExtractor
A minimalistic R wrapper for the Reddit API
antndlcrx/oxford-llms-workshop
Workshop on Learning and Applying Large Language Models for Social Science Research
bcongelio/nfl-analytics-with-r-book
The repo for Introduction to NFL Analytics with R (published with CRC Press)
randel/MixRF
A random-forest-based approach for imputing clustered incomplete data
nicolarighetti/CooRTweet
CooRTweet: Coordinated Networks Detection on Social Media | Detects a variety of coordinated actions on social media and outputs the network of coordinated users along with related information.
kasperwelbers/corpustools
An R corpus class for tokenized texts
davidycliao/flaiR
flaiR: Bring Amazing Flair NLP to R
JULIELab/MEmoLon
Repository for our ACL 2020 paper "Learning and Evaluating Emotion Lexicons for 91 Languages"
hauselin/domain-quality-ratings
Comprehensive database of ratings for 11k news domains
abresler/bertopic
R port of BERTopic
Isabellevdv/grievancedictionary
van der Vegt, I., Mozes, M., Kleinberg, B. & Gill, P. (2021). The Grievance Dictionary: Understanding Threatening Language Use. Behavior Research Methods.
georgepar/simple-good-turing
An implementation of the simple Good Turing smoothing algorithm in Python using NumPy
mattansb/brms.exgaussian
SenticNet/personality-recognition
Experiments for automated personality detection using Language Models and psycholinguistic features on various famous personality datasets including the Essays dataset (Big-Five)
jewbee2000/HouseMatch
Given an image of the front of a house and a general location/radius, this tool uses the Street View API and OpenCV to find the exact coordinates of the house.
yoniabrams/Table_Scraper
Command-line web scraper that extracts data from any table on any Wikipedia page, so you can run quick data analytics and visualizations on it.