Pinned Repositories
2014
Official content for the Fall 2014 Harvard CS109 Data Science course
2014_data
Data directory for the CS109 Data Science course
5th_place_solution_facebook_check_ins
My solution rank 5th/1212 in Facebook check ins prediction competition at Kaggle
abcnn-keras
Keras implementation of ABCNN by Yin & Schütze (WIP)
AdversarialNetsPapers
The classical papers and codes about generative adversarial nets
Agriculture_KnowledgeGraph
农业知识图谱(KG):农业领域的信息检索,命名实体识别,关系抽取,分类树构建,数据挖掘
Algorithm_Interview_Notes-Chinese
2018/2019/校招/春招/秋招/算法/机器学习(Machine Learning)/深度学习(Deep Learning)/自然语言处理(NLP)/C/C++/Python/面试笔记
algorithms-data-structures
Continually updated Python Notebooks containing coding problems and solutions (algorithms and data structures).
mlstudy
mlstudy
laisun's Repositories
laisun/2014_data
Data directory for the CS109 Data Science course
laisun/AlwaysRemember
Zipfian capstone project - Dan Morris
laisun/darts-clone
A clone of Darts (Double-ARray Trie System)
laisun/datrie
Fast, efficiently stored Trie for Python. Uses libdatrie.
laisun/deep-recurrent
Deep bidirectional recurrent net for opinion mining (code for EMNLP14 paper)
laisun/DeepLearningMovies
Kaggle's competition for using Google's word2vec package for sentiment analysis
laisun/dictionarylearning
Online learning of sparse dictionaries
laisun/doc-ml-ufldl
Unsupervised Feature Learning and Deep Learning Tutorial
laisun/EnsembleSVM
A Library for Ensemble Learning Using Support Vector Machines
laisun/FactorizationMachine
laisun/koan
laisun/MachineLearning-C---code
using c++ code to show the example of machine learning
laisun/maxent
Maximum Entropy Modeling Toolkit for Python and C++
laisun/neuraltalk
NeuralTalk is a Python+numpy project for learning Multimodal Recurrent Neural Networks that describe images with sentences.
laisun/new-words-discovery
新词发现
laisun/nips14-ssl
Code for reproducing results of NIPS 2014 paper "Semi-Supervised Learning with Deep Generative Models"
laisun/plda
Automatically exported from code.google.com/p/plda
laisun/Predict-click-through-rates-on-display-ads
Display advertising is a billion dollar effort and one of the central uses of machine learning on the Internet. However, its data and methods are usually kept under lock and key. In this research competition, CriteoLabs is sharing a week’s worth of data for you to develop models predicting ad click-through rate (CTR). Given a user and the page he is visiting, what is the probability that he will click on a given ad? The goal of this challenge is to benchmark the most accurate ML algorithms for CTR estimation. All winning models will be released under an open source license. As a participant, you are given a chance to access the traffic logs from Criteo that include various undisclosed features along with the click labels.
laisun/pu-learning
Positive and unlabeled learning wrappers for scikit-learn
laisun/pydata-book
Materials and IPython notebooks for "Python for Data Analysis" by Wes McKinney, published by O'Reilly Media
laisun/semi-supervised-lda
Automatically exported from code.google.com/p/semi-supervised-lda
laisun/sentence2vec
Tools for mapping a sentence with arbitrary length to vector space
laisun/simhash_server
laisun/SVDRecommenderSystem
将SVD应用于推荐系统中的评分预测问题
laisun/test
laisun/textrank
TextRank implementation in Python.
laisun/tgrocery-docs
TextGrocery的文档,使用sphinx
laisun/Theano-Tutorials
Bare bones introduction to machine learning from linear regression to convolutional neural networks using Theano.
laisun/theano_exercises
Exercises for my tutorials on Theano
laisun/xgboost
eXtreme Gradient Boosting (GBDT, GBRT or GBM) Library for large-scale and distributed machine learning, on single node, hadoop yarn and more.