Pinned Repositories
angel
A Flexible and Powerful Parameter Server for large-scale machine learning
ansj_seg
ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人名识别,词性标注,用户自定义词典
Argo
58.com轻量级web框架
btcplex
BTCplex is an open source Bitcoin block chain browser written in Go, it allows you to search and navigate the block chain.
camus
LinkedIn's Kafka to HDFS pipeline.
canal
阿里巴巴mysql数据库binlog的增量订阅&消费组件
cat
Central Application Tracking
flume-kafka
A kafka source & sink for flume
spark-ml-source-analysis
spark ml 算法原理剖析以及具体的源码实现分析
xgboost
Large-scale and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, on single node, hadoop yarn and more.
yunguangwang891017's Repositories
yunguangwang891017/cat
Central Application Tracking
yunguangwang891017/spark-ml-source-analysis
spark ml 算法原理剖析以及具体的源码实现分析
yunguangwang891017/xgboost
Large-scale and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, on single node, hadoop yarn and more.
yunguangwang891017/angel
A Flexible and Powerful Parameter Server for large-scale machine learning
yunguangwang891017/ansj_seg
ansj分词.ict的真正java实现.分词效果速度都超过开源版的ict. 中文分词,人名识别,词性标注,用户自定义词典
yunguangwang891017/camus
LinkedIn's Kafka to HDFS pipeline.
yunguangwang891017/canal
阿里巴巴mysql数据库binlog的增量订阅&消费组件
yunguangwang891017/cws_evaluation
Java开源项目cws_evaluation:中文分词器分词效果评估对比
yunguangwang891017/deeplearningbook-chinese
Deep Learning Book Chinese Translation
yunguangwang891017/disconf
Distributed Configuration Management Platform(分布式配置管理平台)
yunguangwang891017/elasticsearch
Open Source, Distributed, RESTful Search Engine
yunguangwang891017/faiss
A library for efficient similarity search and clustering of dense vectors.
yunguangwang891017/Familia
A Toolkit for Chinese Topic Modeling
yunguangwang891017/FM_FTRL
Hashed Factorization Machine with Follow The Regularized Leader for Kaggle Avazu Click-Through Rate Competition
yunguangwang891017/fnlp
中文自然语言处理工具包 Toolkit for Chinese natural language processing
yunguangwang891017/gobblin
Universal data ingestion framework for Hadoop.
yunguangwang891017/HanLP
汉语言处理包 中文分词 词性标注 命名实体识别 依存句法分析 关键词提取 自动摘要 短语提取 拼音 简繁转换
yunguangwang891017/incubator-airflow
Apache Airflow (Incubating)
yunguangwang891017/jieba
结巴中文分词
yunguangwang891017/jstorm
Java Storm
yunguangwang891017/kafka-manager
A tool for managing Apache Kafka.
yunguangwang891017/KafkaOffsetMonitor
A little app to monitor the progress of kafka consumers and their lag wrt the queue.
yunguangwang891017/learning-spark
Example code from Learning Spark book
yunguangwang891017/liblinear
yunguangwang891017/liblinear-java
Java version of LIBLINEAR
yunguangwang891017/ltp
Language Technology Platform
yunguangwang891017/Online-Random-Bit-Regression-FTRL
Online Random Bit Regression with FTRL-Proximal in Python
yunguangwang891017/scikit-learn
scikit-learn: machine learning in Python
yunguangwang891017/snownlp
Python library for processing Chinese text
yunguangwang891017/ssdb
SSDB - A fast NoSQL database, an alternative to Redis