Pinned Repositories
ftools
Fast Stata commands for large datasets
reghdfe
Linear and IV Regressions With Many Fixed Effects (in Stata)
Algorithms
Data Structures and Algorithms in Python
dirbot
Scrapy project to scrape public web directories (educational)
distribute_crawler
使用scrapy,redis, mongodb,graphite实现的一个分布式网络爬虫,底层存储mongodb集群,分布式使用redis实现,爬虫状态显示使用graphite实现
dryscrape
A lightweight Python library that uses Webkit to enable easy scraping of dynamic, Javascript-heavy web pages
FixedEffectModels.jl
Linear and IV models with high dimensional categorical variables
go_spider
[爬虫框架 (golang)] An awesome Go concurrent Crawler(spider) framework. The crawler is flexible and modular. It can be expanded to an Individualized crawler easily or you can use the default crawl components only.
grab
Web Scraping Framework
gscholar
Query Google Scholar with Python
kolexiang's Repositories
kolexiang/mostly-harmless-replication
Replication of tables and figures from "Mostly Harmless Econometrics" in Stata, R, Python and Julia.
kolexiang/ftools
Fast Stata commands for large datasets
kolexiang/FixedEffectModels.jl
Linear and IV models with high dimensional categorical variables
kolexiang/parallel
PARALLEL: Stata module for parallel computing
kolexiang/SJTUThesis
A XeLaTeX template for Shanghai Jiao Tong University (SJTU) thesis.
kolexiang/mtheme
A modern LaTeX Beamer theme
kolexiang/Spider
kolexiang/seaborn
Statistical data visualization using matplotlib
kolexiang/reghdfe
Linear and IV Regressions With Many Fixed Effects (in Stata)
kolexiang/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
kolexiang/tushare
TuShare is a utility for crawling historical data of China stocks
kolexiang/Algorithms
Data Structures and Algorithms in Python
kolexiang/scholar.py
A parser for Google Scholar, written in Python
kolexiang/robobrowser
kolexiang/grab
Web Scraping Framework
kolexiang/zhihu-python
获取知乎内容信息,包括问题,答案,用户,收藏夹信息
kolexiang/pyspider
A Powerful Spider(Web Crawler) System in Python.
kolexiang/go_spider
[爬虫框架 (golang)] An awesome Go concurrent Crawler(spider) framework. The crawler is flexible and modular. It can be expanded to an Individualized crawler easily or you can use the default crawl components only.
kolexiang/zhihu-spider
A web spider for zhihu.com
kolexiang/MySQLdb1
MySQL database connector for Python (legacy version)
kolexiang/spider_python
爬爬爬
kolexiang/pattern
Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
kolexiang/gscholar
Query Google Scholar with Python
kolexiang/dryscrape
A lightweight Python library that uses Webkit to enable easy scraping of dynamic, Javascript-heavy web pages
kolexiang/dirbot
Scrapy project to scrape public web directories (educational)
kolexiang/per-shipment-costs-replication
Replication code and data for Hornok and Koren, 2014. "Per-Shipment Costs and the Lumpiness of International Trade." Review of Economics and Statistics.
kolexiang/tigerspider
tigerspider: a fast high-level screen scraping and web crawling framework for Python.
kolexiang/RStata
An R package which translates Stata syntax into R
kolexiang/distribute_crawler
使用scrapy,redis, mongodb,graphite实现的一个分布式网络爬虫,底层存储mongodb集群,分布式使用redis实现,爬虫状态显示使用graphite实现