Pinned Repositories
Ad-Library-API-Script-Repository
GitHub repository of commonly used Python scripts that allow everyone to pull data via the Ad Library API
Ad_Library_API
Python code package to scrape the Facebook Ad Library data
aeneas
aeneas is a Python/C library and a set of tools to automagically synchronize audio and text (aka forced alignment)
aeneas-vagrant
aeneas-vagrant automates the creation of a Vagrant box to run aeneas
anomalize
Tidy anomaly detection
asr-data
Data and code for a small project on meta-information from the American Sociological Review
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
BERTopic
Leveraging BERT and c-TF-IDF to create easily interpretable topics.
predicting-poverty-replication
A Python 3 and PyTorch replication of Jean et al. (2016). Original paper's GitHub repository: https://github.com/nealjean/predicting-poverty
joshzyj's Repositories
joshzyj/tiktok-scraper
TikTok Scraper. Download video posts, collect user/trend/hashtag/music feed metadata, sign URLs, etc.
joshzyj/nyc-taxi-data
Import public NYC taxi and for-hire vehicle (Uber, Lyft, etc.) trip data into a PostgreSQL database
joshzyj/DeepLearning4HumanMobility
joshzyj/fastText
Library for fast text representation and classification.
joshzyj/blockgroupvoting
Projecting election results from state based voting precincts onto census block group geographies.
joshzyj/test
A modern, highly customizable, responsive Jekyll template for course websites.
joshzyj/social-dimensions
Data and code accompanying the paper "Quantifying social organization and political polarization in online platforms"
joshzyj/Neural-Media-Bias-Detection-Using-Distant-Supervision-With-BABE
joshzyj/PlotNeuralNet
LaTeX code for making neural network diagrams
joshzyj/ConvNeXt
Code release for ConvNeXt model
joshzyj/ERNIE
Official implementations for various pre-training models of ERNIE-family, covering topics of Language Understanding & Generation, Multimodal Understanding & Generation, and beyond.
joshzyj/TextPruner
A PyTorch-based model pruning toolkit for pre-trained language models
joshzyj/MacBERT
Revisiting Pre-trained Models for Chinese Natural Language Processing (Findings of EMNLP 2020)
joshzyj/site
Course materials for the Automating GIS processes course, University of Helsinki, Finland
joshzyj/us-polling-places
Standardized data on historical general election polling places in the United States.
joshzyj/r5
Developed to power a web-based interface for scenario planning and land-use/transport accessibility analysis, R5 is Conveyal's routing engine for multimodal (transit/bike/walk/car) networks with a particular focus on public transit
joshzyj/POIR613
Course materials: POIR 613 - Computational Social Science - USC Fall 2021
joshzyj/r5r
joshzyj/opentripplanner-1
An R package to set up and use OpenTripPlanner (OTP) as a local or remote multimodal trip planner.
joshzyj/Chinese-BERT-wwm
Pre-Training with Whole Word Masking for Chinese BERT (Chinese BERT-wwm model series)
joshzyj/youtube-8m
Starter code for working with the YouTube-8M dataset.
joshzyj/zipcodeR
An R package that makes working with U.S. ZIP codes painless.
joshzyj/scales_human_mobility
joshzyj/CHECKED
joshzyj/graphhopper
Open source routing engine for OpenStreetMap. Use it as Java library or server.
joshzyj/OpenTripPlanner
An open source multi-modal trip planner
joshzyj/COVID19USFlows
Multiscale Dynamic Human Mobility Flow Data in the U.S. during the COVID-19 epidemic
joshzyj/roberta_zh
RoBERTa pre-trained models for Chinese
joshzyj/bnk_afi_si
joshzyj/wru
Who Are You? Bayesian Prediction of Racial Category Using Surname and Geolocation