j2kao
Data Scientist, Software Engineer, Language Nerd. Currently: Computational Journalist @ProPublica.
SF Bay Area
Pinned Repositories
cc-index-table
Index Common Crawl archives in tabular format
data-science-tutorials
Data science tutorials - working examples
dsp
Metis Data Science Bootcamp - Official Prework Repository
fcc_nn_research
(somewhat) cleaned-up notebooks used in researching public comments for FCC Proceeding 17-108 (Net Neutrality Repeal)
juriscraper
An API to scrape American court websites for metadata.
keras2_crf
parler-parse
WIP: Parse archived parler pages into structured html
ResearchKit
ResearchKit is an open source software framework that makes it easy to create apps for medical research or for other research projects.
ThinkStats2
Text and supporting code for Think Stats, 2nd Edition
twitter-photos
Simple, fast command-line tool to get photos from Twitter accounts
j2kao's Repositories
j2kao/fcc_nn_research
(somewhat) cleaned-up notebooks used in researching public comments for FCC Proceeding 17-108 (Net Neutrality Repeal)
j2kao/cc-index-table
Index Common Crawl archives in tabular format
j2kao/data-science-tutorials
Data science tutorials - working examples
j2kao/dsp
Metis Data Science Bootcamp - Official Prework Repository
j2kao/juriscraper
An API to scrape American court websites for metadata.
j2kao/keras2_crf
j2kao/parler-parse
WIP: Parse archived parler pages into structured html
j2kao/ResearchKit
ResearchKit is an open source software framework that makes it easy to create apps for medical research or for other research projects.
j2kao/ThinkStats2
Text and supporting code for Think Stats, 2nd Edition
j2kao/twitter-photos
Simple, fast command-line tool to get photos from Twitter accounts
j2kao/twitter_createtime
A notebook for looking at the sequence of created_at timestamps of twitter followers
j2kao/wicked_pdf
PDF generator (from HTML) plugin for Ruby on Rails