ysykzheng's Stars
gradle/gradle
Adaptable, fast automation for all
apache/joshua
Apache Joshua
matomo-org/referrer-spam-list
Community-contributed list of referrer spammers. Comment +1 in any issue or Pull request and the spammer will be added to the list!
benoitc/http-parser
HTTP request/response parser for python in C
gabrie-allaigre/avatar-generator
Avatar generator in Java
markedjs/marked
A markdown parser and compiler. Built for speed.
apache/camel
Apache Camel is an open source integration framework that empowers you to quickly and easily integrate various systems consuming or producing data.
apache/hadoop
Apache Hadoop
h2oai/h2o-3
H2O is an Open Source, Distributed, Fast & Scalable Machine Learning Platform: Deep Learning, Gradient Boosting (GBM) & XGBoost, Random Forest, Generalized Linear Modeling (GLM with Elastic Net), K-Means, PCA, Generalized Additive Models (GAM), RuleFit, Support Vector Machine (SVM), Stacked Ensembles, Automatic Machine Learning (AutoML), etc.
google/leveldb
LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.
Alluxio/alluxio
Alluxio, data orchestration for analytics and machine learning in the cloud
redis/jedis
Redis Java client
apache/commons-vfs
Apache Commons VFS
tesseract-ocr/tesseract
Tesseract Open Source OCR Engine (main repository)
apache/zookeeper
Apache ZooKeeper
mimno/Mallet
MALLET is a Java-based package for statistical natural language processing, document classification, clustering, topic modeling, information extraction, and other machine learning applications to text.
Embedding/Chinese-Word-Vectors
100+ Chinese Word Vectors 上百种预训练中文词向量
liuhuanyong/ChineseEmbedding
Chinese Embedding collection incling token ,postag ,pinyin,dependency,word embedding.中文自然语言处理向量合集,包括字向量,拼音向量,词向量,词性向量,依存关系向量.共5种类型的向量
commonsense/conceptnet5
Code for building ConceptNet from raw data.
commonsense/conceptnet-numberbatch
yago-naga/yago3
YAGO is a large semantic knowledge base, derived from Wikipedia, WordNet, WikiData, GeoNames, and other data sources
alperakcan/fuse-ext2
Fuse-ext2 is a multi OS FUSE module to mount ext2, ext3 and ext4 file system devices and/or images with read write support.
studyzy/imewlconverter
”深蓝词库转换“ 一款开源免费的输入法词库转换程序
vinhkhuc/JFastText
Java interface for fastText
sparql-generate/sparql-generate
SPARQL-Generate implementation over Apache Jena
dbpedia/extraction-framework
The software used to extract structured data from Wikipedia
extjwnl/extjwnl
extJWNL (Extended Java WordNet Library) is a Java API for creating, reading and updating dictionaries in WordNet format.
usc-isi-i2/Web-Karma
Information Integration Tool
apache/jena
Apache Jena, A free and open source Java framework for building Semantic Web and Linked Data applications.
aurbroszniowski/os-platform-finder
Utility java class to return the current OS Platform