hrxx's Stars
goto456/stopwords
中文常用停用词表(哈工大停用词表、百度停用词表等)
liuhuanyong/HyponymyExtraction
HyponymyExtraction and Graph based on KB Schema, Baike-kb and online text extract, 基于知识概念体系,百科知识库,以及在线搜索结构化方式的词语上下位抽取与可视化展示
facebookresearch/faiss
A library for efficient similarity search and clustering of dense vectors.
Embedding/Chinese-Word-Vectors
100+ Chinese Word Vectors 上百种预训练中文词向量
yuanguangxin/LeetCode
LeetCode刷题记录与面试整理
soimort/you-get
:arrow_double_down: Dumb downloader that scrapes the web
pythonstock/stock
stock,股票系统。使用python进行开发。
zvtvz/zvt
modular quant framework.
zhansliu/writemdict
A library for writing dictionary files in the MDict (.mdx) format
optuna/optuna
A hyperparameter optimization framework
iptv-org/iptv
Collection of publicly available IPTV channels from all over the world
notanewbie/LegalStream
An m3u8 playlist featuring many LEGALLY FREE IPTV streams. For use with VLC.
KeithGalli/pandas
Data & Code for my video on the Pandas library of Python
explosion/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
pbatard/rufus
The Reliable USB Formatting Utility
github/gitignore
A collection of useful .gitignore templates
snorkel-team/snorkel
A system for quickly generating training data with weak supervision
Lynten/stanford-corenlp
Python wrapper for Stanford CoreNLP.
stanfordnlp/CoreNLP
CoreNLP: A Java suite of core NLP tools for tokenization, sentence segmentation, NER, parsing, coreference, sentiment analysis, etc.
nltk/nltk
NLTK Source
emilwallner/Screenshot-to-code
A neural network that transforms a design mock-up into a static website.
cts2/twinkql
Twinkql is a SPARQL to Object Mapping Framework that lets you specify your SPARQL queries in XML, instead of in your code, and map the results to Java Beans.
Angel-ML/angel
A Flexible and Powerful Parameter Server for large-scale machine learning
saffsd/langid.py
Stand-alone language identification system
optimaize/language-detector
Language Detection Library for Java
openlink/virtuoso-opensource
Virtuoso is a high-performance and scalable Multi-Model RDBMS, Data Integration Middleware, Linked Data Deployment, and HTTP Application Server Platform
jexp/cy2neo
Cy2Neo - Tiny Neo4j Cypher Workbench with D3 Visualization
google/seq2seq
A general-purpose encoder-decoder framework for Tensorflow
cayleygraph/cayley
An open-source graph database
infinilabs/analysis-ik
🚌 The IK Analysis plugin integrates Lucene IK analyzer into Elasticsearch and OpenSearch, support customized dictionary.