Pinned Repositories
couponbook
Test App with Dropbox API
DoubanAnnotationCrawler
Grab Douban Annotations 豆瓣读书笔记备份下载脚本
getWikipediaMetaData
clean a wikipedia dump, index with lucene, search for a query and return its wiki title and category
HTML2PlainText
Strip HTML tags, escape HTML symbols but keep the format of new lines and new paragraphs.
lemmatizationWithNLTK
Do lemmatization for each file in a folder using nltk in python
Sina-Blog-Expo
stemmingWithNLTK
stemming using different stemmers with NLTK
XiaoxiaoLi's Repositories
XiaoxiaoLi/DoubanAnnotationCrawler
Grab Douban Annotations 豆瓣读书笔记备份下载脚本
XiaoxiaoLi/lemmatizationWithNLTK
Do lemmatization for each file in a folder using nltk in python
XiaoxiaoLi/getWikipediaMetaData
clean a wikipedia dump, index with lucene, search for a query and return its wiki title and category
XiaoxiaoLi/stemmingWithNLTK
stemming using different stemmers with NLTK
XiaoxiaoLi/HTML2PlainText
Strip HTML tags, escape HTML symbols but keep the format of new lines and new paragraphs.
XiaoxiaoLi/Sina-Blog-Expo
XiaoxiaoLi/couponbook
Test App with Dropbox API
XiaoxiaoLi/data-design
DATA+DESIGN中文版
XiaoxiaoLi/douban-exporter
An online service to export 豆瓣 (douban) data to Excel files.
XiaoxiaoLi/GuessTheNumber
XiaoxiaoLi/kaggle-instacart
XiaoxiaoLi/kaggle-quora-question-pair
XiaoxiaoLi/kaggle-toxic-comments
XiaoxiaoLi/kaggle-wsdm-music-recommendation
XiaoxiaoLi/Linear-Algebra-Refresher
Quiz answer for Udacity Course Linear Algebra Refresher https://www.udacity.com/course/linear-algebra-refresher-course--ud953
XiaoxiaoLi/lucene-wikipedia
Automatically exported from code.google.com/p/lucene-wikipedia
XiaoxiaoLi/MeerkatScripts
Trivial scripts related to Meerkat, mainly for data format transformation.
XiaoxiaoLi/sinaQingExport
Export script for Sina Qing blog. So sad the service is being shut down.
XiaoxiaoLi/stanford-tensorflow-tutorials
This repository contains code examples for the Stanford's course: TensorFlow for Deep Learning Research.
XiaoxiaoLi/ThinkStats2
Text and supporting code for Think Stats, 2nd Edition
XiaoxiaoLi/ud120-projects
Starter project code for students taking Udacity ud120
XiaoxiaoLi/word2vec-nlp-tutorial
https://www.kaggle.com/c/word2vec-nlp-tutorial/
XiaoxiaoLi/work-relax-timer