Pinned Repositories
Algorithms
Data Structures and Algorithms in Python
dirbot
Scrapy project to scrape public web directories (educational)
distribute_crawler
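A distributed web crawler built with Scrapy, Redis, MongoDB, and Graphite: a MongoDB cluster provides the underlying storage, Redis handles distribution, and Graphite displays crawler status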
dryscrape
A lightweight Python library that uses WebKit to enable easy scraping of dynamic, JavaScript-heavy web pages
FixedEffectModels.jl
Linear and IV models with high dimensional categorical variables
ftools
Fast Stata commands for large datasets
go_spider
[Crawler framework (Golang)] An awesome concurrent crawler (spider) framework in Go. The crawler is flexible and modular. It can easily be extended into a customized crawler, or you can use only the default crawl components.
grab
Web Scraping Framework
gscholar
Query Google Scholar with Python
reghdfe
Linear and IV Regressions With Many Fixed Effects (in Stata)
kolexiang's Repositories
kolexiang/ftools
Fast Stata commands for large datasets
kolexiang/reghdfe
Linear and IV Regressions With Many Fixed Effects (in Stata)
kolexiang/Algorithms
Data Structures and Algorithms in Python
kolexiang/dirbot
Scrapy project to scrape public web directories (educational)
kolexiang/distribute_crawler
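A distributed web crawler built with Scrapy, Redis, MongoDB, and Graphite: a MongoDB cluster provides the underlying storage, Redis handles distribution, and Graphite displays crawler status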
kolexiang/dryscrape
A lightweight Python library that uses WebKit to enable easy scraping of dynamic, JavaScript-heavy web pages
kolexiang/FixedEffectModels.jl
Linear and IV models with high dimensional categorical variables
kolexiang/go_spider
[Crawler framework (Golang)] An awesome concurrent crawler (spider) framework in Go. The crawler is flexible and modular. It can easily be extended into a customized crawler, or you can use only the default crawl components.
kolexiang/grab
Web Scraping Framework
kolexiang/gscholar
Query Google Scholar with Python
kolexiang/mostly-harmless-replication
Replication of tables and figures from "Mostly Harmless Econometrics" in Stata, R, Python and Julia.
kolexiang/mtheme
A modern LaTeX Beamer theme
kolexiang/MySQLdb1
MySQL database connector for Python (legacy version)
kolexiang/parallel
PARALLEL: Stata module for parallel computing
kolexiang/pattern
Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
kolexiang/per-shipment-costs-replication
Replication code and data for Hornok and Koren, 2014. "Per-Shipment Costs and the Lumpiness of International Trade." Review of Economics and Statistics.
kolexiang/pyspider
A powerful spider (web crawler) system in Python.
kolexiang/robobrowser
kolexiang/RStata
An R package which translates Stata syntax into R
kolexiang/scholar.py
A parser for Google Scholar, written in Python
kolexiang/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
kolexiang/seaborn
Statistical data visualization using matplotlib
kolexiang/SJTUThesis
A XeLaTeX template for Shanghai Jiao Tong University (SJTU) thesis.
kolexiang/Spider
kolexiang/spider_python
Crawl, crawl, crawl
kolexiang/tigerspider
tigerspider: a fast high-level screen scraping and web crawling framework for Python.
kolexiang/tushare
TuShare is a utility for crawling historical data of China stocks
kolexiang/zhihu-python
Fetches Zhihu content, including questions, answers, users, and collections
kolexiang/zhihu-spider
A web spider for zhihu.com