kolexiang

Pinned Repositories

Algorithms
Data Structures and Algorithms in Python
Language:Python00
dirbot
Scrapy project to scrape public web directories (educational)
Language:Python00
distribute_crawler
使用scrapy,redis, mongodb,graphite实现的一个分布式网络爬虫,底层存储mongodb集群,分布式使用redis实现,爬虫状态显示使用graphite实现
Language:Python00
dryscrape
A lightweight Python library that uses Webkit to enable easy scraping of dynamic, Javascript-heavy web pages
Language:Python00
FixedEffectModels.jl
Linear and IV models with high dimensional categorical variables
Language:Julia00
ftools
Fast Stata commands for large datasets
Language:Stata10
go_spider
[爬虫框架 (golang)] An awesome Go concurrent Crawler(spider) framework. The crawler is flexible and modular. It can be expanded to an Individualized crawler easily or you can use the default crawl components only.
Language:Go00
grab
Web Scraping Framework
Language:Python00
gscholar
Query Google Scholar with Python
Language:Python00
reghdfe
Linear and IV Regressions With Many Fixed Effects (in Stata)
Language:Stata10

kolexiang's Repositories

kolexiang/ftools
Fast Stata commands for large datasets
Language:Stata10
kolexiang/reghdfe
Linear and IV Regressions With Many Fixed Effects (in Stata)
Language:Stata10
kolexiang/Algorithms
Data Structures and Algorithms in Python
Language:Python00
kolexiang/dirbot
Scrapy project to scrape public web directories (educational)
Language:Python00
kolexiang/distribute_crawler
使用scrapy,redis, mongodb,graphite实现的一个分布式网络爬虫,底层存储mongodb集群,分布式使用redis实现,爬虫状态显示使用graphite实现
Language:Python00
kolexiang/dryscrape
A lightweight Python library that uses Webkit to enable easy scraping of dynamic, Javascript-heavy web pages
Language:Python00
kolexiang/FixedEffectModels.jl
Linear and IV models with high dimensional categorical variables
Language:Julia00
kolexiang/go_spider
[爬虫框架 (golang)] An awesome Go concurrent Crawler(spider) framework. The crawler is flexible and modular. It can be expanded to an Individualized crawler easily or you can use the default crawl components only.
Language:Go00
kolexiang/grab
Web Scraping Framework
Language:Python00
kolexiang/gscholar
Query Google Scholar with Python
Language:Python00
kolexiang/mostly-harmless-replication
Replication of tables and figures from "Mostly Harmless Econometrics" in Stata, R, Python and Julia.
Language:Stata
kolexiang/mtheme
A modern LaTeX Beamer theme
Language:TeX
kolexiang/MySQLdb1
MySQL database connector for Python (legacy version)
Language:Python
kolexiang/parallel
PARALLEL: Stata module for parallel computing
Language:Stata
kolexiang/pattern
Web mining module for Python, with tools for scraping, natural language processing, machine learning, network analysis and visualization.
Language:Python
kolexiang/per-shipment-costs-replication
Replication code and data for Hornok and Koren, 2014. "Per-Shipment Costs and the Lumpiness of International Trade." Review of Economics and Statistics.
Language:HTML
kolexiang/pyspider
A Powerful Spider(Web Crawler) System in Python.
Language:Python
kolexiang/robobrowser
Language:Python
kolexiang/RStata
An R package which translates Stata syntax into R
kolexiang/scholar.py
A parser for Google Scholar, written in Python
Language:Python
kolexiang/scrapy
Scrapy, a fast high-level web crawling & scraping framework for Python.
Language:Python2 0
kolexiang/seaborn
Statistical data visualization using matplotlib
Language:Python
kolexiang/SJTUThesis
A XeLaTeX template for Shanghai Jiao Tong University (SJTU) thesis.
Language:Perl
kolexiang/Spider
kolexiang/spider_python
爬爬爬
Language:Python
kolexiang/tigerspider
tigerspider: a fast high-level screen scraping and web crawling framework for Python.
Language:Python
kolexiang/tushare
TuShare is a utility for crawling historical data of China stocks
Language:Python
kolexiang/zhihu-python
获取知乎内容信息，包括问题，答案，用户，收藏夹信息
Language:Python
kolexiang/zhihu-spider
A web spider for zhihu.com
Language:Python