Pinned Repositories
Ad-Library-API-Script-Repository
GitHub repository of commonly used Python scripts that allow anyone to pull data via the Ad Library API
Ad_Library_API
Python package for scraping Facebook Ad Library data
aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
aeneas-vagrant
aeneas-vagrant automates the creation of a Vagrant box to run aeneas
anomalize
Tidy anomaly detection
asr-data
Data and code for a small project on meta-information from the American Sociological Review
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
predicting-poverty-replication
A Python 3 and PyTorch replication of Jean et al. (2016). Original paper's GitHub repository: https://github.com/nealjean/predicting-poverty
joshzyj's Repositories
joshzyj/Web-Scraping-using-Selenium-Python
joshzyj/places365
The Places365-CNNs for Scene Classification
joshzyj/intersectional-bias-in-ml
Intersectional bias in hate speech and abusive language datasets
joshzyj/leetcode-1
LeetCode Solutions: A Record of My Problem Solving Journey.
joshzyj/essentia
C++ library for audio and music analysis, description and synthesis, including Python bindings
joshzyj/stanza
Official Stanford NLP Python Library for Many Human Languages
joshzyj/CoreNLP
Stanford CoreNLP: A Java suite of core NLP tools.
joshzyj/pyAudioAnalysis
Python Audio Analysis Library: Feature Extraction, Classification, Segmentation and Applications
joshzyj/ML.family.edu.AddHealth
Documentation of programming scripts for the cross-study synthesis on adolescents' family experiences as predictors of young adult educational attainment with machine learning, based on Add Health data
joshzyj/facenet
Face recognition using TensorFlow
joshzyj/EconomicTracker
Download data from the Opportunity Insights Economic Tracker — https://tracktherecovery.org/
joshzyj/aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
joshzyj/esper-tv
Esper instance for TV news analysis
joshzyj/dowhy
DoWhy is a Python library for causal inference that supports explicit modeling and testing of causal assumptions. DoWhy is based on a unified language for causal inference, combining causal graphical models and potential outcomes frameworks.
joshzyj/gcloud
Google Cloud tutorial and setup
joshzyj/puppeteer
Headless Chrome Node.js API
joshzyj/covid19_twitter
Covid-19 Twitter dataset for non-commercial research use and pre-processing scripts - under active development
joshzyj/r-web-scraping-cheat-sheet
Guide, reference and cheatsheet on web scraping using rvest, httr and RSelenium.
joshzyj/LeetCode
This repository contains the solutions and explanations to the algorithm problems on LeetCode. Only medium or above are included. All are written in C++/Python and implemented by myself. The problems attempted multiple times are labelled with hyperlinks.
joshzyj/covid19policytrackers
This is a collection of COVID-19 policy trackers and data. It covers cross-country research in the areas of non-pharmaceutical interventions, economic and social policy responses, public attitudes, politics and media coverage.
joshzyj/phub_scrape_public
joshzyj/cloudflare-scrape
A Python module to bypass Cloudflare's anti-bot page.
joshzyj/headless-chrome-crawler
Distributed crawler powered by Headless Chrome
joshzyj/keras-1
R Interface to Keras
joshzyj/corpwatchapi
The CorpWatch API uses automated parsers to extract the subsidiary relationship information from Exhibit 21 of companies' 10-K filings with the SEC and provides a free, well-structured interface for programs to query and process the data.
joshzyj/earnings-calls
Earnings calls of all S&P500 companies from 1995 to 2015
joshzyj/keras
Deep Learning for humans
joshzyj/COVID-19_US_County-level_Summaries
Attempt to find correlations between a region's demographic/economic factors and its ability to manage disease spread
joshzyj/scipy-lecture-notes
Tutorial material on the scientific Python ecosystem
joshzyj/covid-19-data
An ongoing repository of data on coronavirus cases and deaths in the U.S.