Pinned Repositories
awesome-chinese-nlp
A curated list of resources for NLP (Natural Language Processing) for Chinese 中文自然语言处理相关资料
awesome-gulp-cn
Gulp资料大全:入门、插件、包等,已完结。
ChatScript
Natural Language tool/dialog manager
insurance-clause-pdf-format
保险条款pdf数据结构化
KeywordExtractor
使用python实现了一个简单的trie树结构,可增加/查找/删除关键词,用于中文的关键词匹配。
myflashtext
快速的中文字符串匹配小工具
rasa-nlu-trainer
GUI for editing rasa-nlu training data
rasa_core
machine learning based dialogue engine for conversational software
rasa_nlu
turn natural language into structured data
spark-ml-source-analysis
spark ml 算法原理剖析以及具体的源码实现分析
wuxiaobo's Repositories
wuxiaobo/Chinese_models_for_SpaCy
SpaCy 中文模型 | Models for SpaCy that support Chinese
wuxiaobo/analytics-zoo
Distributed Tensorflow, Keras and BigDL on Apache Spark
wuxiaobo/apistellar
web框架apistar增强版,轻松构建企业级web项目
wuxiaobo/ASR_Theory
中文语音识别理论,包括研一与研二期间部分所学,论文和PPT
wuxiaobo/AutoCrawler
Google, Naver multiprocess image web crawler (Selenium)
wuxiaobo/awaresome-neural-models-for-semantic-match
A curated list of papers dedicated to neural text (semantic) matching.
wuxiaobo/Awesome-System-for-Machine-Learning
A curated list of research in machine learning system. I also summarize some papers if I think they are really interesting.
wuxiaobo/Customer-Chatbot
中文智能客服机器人demo,包含闲聊和专业问答2个部分,支持自定义组件(Chinese intelligent customer chatbot Demo, including the gossip and the professional Q&A(FAQ) , support for custom components!)
wuxiaobo/fast-bert
Super easy library for BERT based NLP models
wuxiaobo/faster-CTPN
very fast CTPN
wuxiaobo/flashtext
Extract Keywords from sentence or Replace keywords in sentences.
wuxiaobo/flask
The Python micro framework for building web applications.
wuxiaobo/funNLP
中英文敏感词、语言检测、中外手机/电话归属地/运营商查询、名字推断性别、手机号抽取、身份证抽取、邮箱抽取、中日文人名库、中文缩写库、拆字词典、词汇情感值、停用词、反动词表、暴恐词表、繁简体转换、英文模拟中文发音、汪峰歌词生成器、职业名称词库、同义词库、反义词库、否定词库、汽车品牌&零件词库、时间抽取、连续英文切割、中文词向量大全、公司名字大全、古诗词库、IT词库、财经词库、成语词库、地名词库、历史名人词库、诗词词库、医学词库、饮食词库、法律词库、汽车词库、动物词库、中文聊天语料。
wuxiaobo/gnes
GNES is Generic Neural Elastic Search, a cloud-native semantic search system based on deep neural network.
wuxiaobo/gStore
gStore - a graph based RDF triple store.
wuxiaobo/KnowledgeGraph
知识图谱,本项目是一个开放的知识图谱项目,融合了两千五百多万的实体,拥有亿级别的实体属性关系。
wuxiaobo/learn_python3_spider
python3爬虫相关示例汇总:爬取当当网 Top 500 本五星好评书籍;爬取豆瓣最受欢迎的250部电影慢慢看;爬取b站上的NBA形象大使蔡徐坤和他的球友们;用多线程秒爬那些万恶的妹纸们,纸巾呢?;自动识别b站滑动验证码;搞事情了,用 Appium 爬取你的微信朋友圈
wuxiaobo/matchzoo-doc-zh
wuxiaobo/milvus
Milvus -- the world's fastest vector search engine.
wuxiaobo/MiningZhiDaoQACorpus
ZhiDaoChatCorpus, zhidao QA pairs crawled from Baidu zhidao which contains more than 5,800,000 question and 9,830,000 answers with certain tags。百度知道问答语料库,包括超过580万的问题,938万的答案,5800个分类标签。基于该问答语料库,可支持多种应用,如闲聊问答,逻辑挖掘。
wuxiaobo/navicat-keygen
A keygen for Navicat
wuxiaobo/nlp-learning-tutorials
wuxiaobo/nlp_chinese_corpus
大规模中文自然语言处理语料 Large Scale Chinese Corpus for NLP
wuxiaobo/pycorrector
pycorrector is a toolkit for text error correction. It was developed to facilitate the designing, comparing, and sharing of deep text error correction models.
wuxiaobo/pypeln
Concurrent data pipelines made easy
wuxiaobo/QASystemOnMedicalKG
disease centered Medical knowledge graph and qa system。知识图谱构建,自动问答,基于kg的自动问答。以疾病为中心的一定规模医药领域知识图谱,并以该知识图谱完成自动问答与分析服务。
wuxiaobo/spark-sklearn
Scikit-learn integration package for Apache Spark
wuxiaobo/SpiderKeeper
admin ui for scrapy/open source scrapinghub
wuxiaobo/Synonyms
中文近义词工具包
wuxiaobo/xgboost
Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or GBM) Library, for Python, R, Java, Scala, C++ and more. Runs on single machine, Hadoop, Spark, Flink and DataFlow