Pinned Repositories
Ad-Library-API-Script-Repository
GitHub repository of commonly used Python scripts that allow anyone to pull data via the Ad Library API
Ad_Library_API
Python package for scraping Facebook Ad Library data
aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
aeneas-vagrant
aeneas-vagrant automates the creation of a Vagrant box to run aeneas
anomalize
Tidy anomaly detection
asr-data
Data and code for a small project on meta-information from the American Sociological Review
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
predicting-poverty-replication
A Python 3 and PyTorch replication of Jean et al. (2016). Original paper's GitHub repository: https://github.com/nealjean/predicting-poverty
joshzyj's Repositories
joshzyj/Pre-trained-Models
A survey of pre-trained language models
joshzyj/Scraper_Seeking_Alpha
Scrape earnings calls
joshzyj/human-centered-machine-learning
joshzyj/intro_spatial_abm
Intro to creating spatial agent-based models (ABM) with NetLogo and QGIS
joshzyj/Engagement-and-Stock-Price-Analysis-of-CEOs-on-Twitter
Project on engagement and stock price analysis of CEOs on Twitter. Extracted data from the Twitter API and Yahoo Finance, then implemented a sentiment analyzer, topic modeling (LDA), stock price regression, and engagement analysis to determine the factors that make a CEO influential.
joshzyj/Keras-for-computer-vision
Introduction to using Keras for computer vision tasks, covering data exploration, error analysis, and improving results.
joshzyj/Ad_Library_API
Python package for scraping Facebook Ad Library data
joshzyj/talkdown
Dataset and pre-trained model for the EMNLP-IJCNLP 2019 paper "TalkDown: A Corpus for Condescension Detection in Context."
joshzyj/webinars
Code and slides for RStudio webinars
joshzyj/Facebook-Ad-Scraper
A Python scraper for the Facebook Ad Library, using the official Facebook Ad Library API.
joshzyj/ethnicolr
Predict Race and Ethnicity Based on the Sequence of Characters in a Name
joshzyj/Ad-Library-API-Script-Repository
GitHub repository of commonly used Python scripts that allow anyone to pull data via the Ad Library API
joshzyj/facebook-ad-library-scraper
A Python scraper for the Facebook Ad Library, using the official Facebook Ad Library API.
joshzyj/DataPipelines_Earnings_Calls_Transcripts
Docker data pipeline for data collection, transformation, and storage in MongoDB
joshzyj/seekingalpha_transcript_extractor
Extracts the text of earnings call transcripts hosted on SeekingAlpha, parses individual transcripts to obtain various metrics about the speakers on each call, and writes the desired information to a final spreadsheet.
joshzyj/xtdcce2
Estimating Dynamic Common Correlated Effects Models in Stata
joshzyj/PraatScripts
These are Praat scripts I use in my research, implemented with Parselmouth for Python for use in Binder
joshzyj/dime_race
Using ethnicolr to predict DIME
joshzyj/EarningsCall_Dataset
The earnings conference call dataset of S&P 500 companies
joshzyj/US_County_Level_Election_Results_08-16
United States General Election Presidential Results by County from 2008 to 2016
joshzyj/flair
A very simple framework for state-of-the-art Natural Language Processing (NLP)
joshzyj/DS-Take-Home
My solutions to the book "A Collection of Data Science Take-Home Challenges"
joshzyj/shiny-examples
joshzyj/textract
Extract text from any document. No muss. No fuss.
joshzyj/caffe
Caffe: a fast open framework for deep learning.
joshzyj/pdf-to-csv-table-extactor
Extract tables from scanned PDF documents into a CSV file using OCR and image processing
joshzyj/new.crimenmexico
Website for https://elcri.men
joshzyj/lectures
Lecture notes for EC 607
joshzyj/EmbeddingDynamicStereotypes
joshzyj/incarceration_trends
Incarceration Trends Dataset and Documentation