Pinned Repositories
elasticsearch-ansj-analysis-plugin
ansj analysis elasticsearch plugin
HanLP
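Natural language processing: Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, new word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, pinyin conversion, Simplified/Traditional Chinese conversion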
rust-raft
totoro
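This project is now history. I am keeping it as an archive and do not plan to delete it. The new segmenter, ansj, was rewritten from scratch, and its accuracy and speed are both much higher than this one's. Please fork that one instead. This project has reached the end of its life.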
tree_split
Tree-split has moved to a new home. Sincere apologies for any inconvenience this causes.
WebCollector
WebCollector is an open source web crawler framework based on Java. It provides some simple interfaces for crawling the Web; you can set up a multi-threaded web crawler in less than 5 minutes.
word2vec
Automatically exported from code.google.com/p/word2vec
YuanXiaoSpider
Java web crawler, Lantern Festival edition
ansjsun's Repositories
ansjsun/HanLP
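Natural language processing: Chinese word segmentation, part-of-speech tagging, named entity recognition, dependency parsing, new word discovery, keyword and phrase extraction, automatic summarization, text classification and clustering, pinyin conversion, Simplified/Traditional Chinese conversion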
ansjsun/rust-raft
ansjsun/char_trie
Text segmentation based on a trie; high performance, with support for custom dictionaries
ansjsun/dmz_locker
ansjsun/mem_btree
A B-tree data structure implemented in Rust, with snapshot support. Does not use any unsafe libraries.
ansjsun/rust-faiss
ansjsun/anda
answers from data
ansjsun/awadb
AI Native database for embedding vectors
ansjsun/bleve
A modern text indexing library for Go
ansjsun/blog
My blog
ansjsun/chubaodb
ansjsun/chubaodb-1
a structured data system on top of ChubaoFS
ansjsun/chubaofs
A distributed file system and object store for cloud native applications
ansjsun/components-contrib
Community driven, reusable components for distributed apps
ansjsun/docker
Mirror of https://gitlab.com/openwrt/docker. Please use merge requests and issues at GitLab rather than here.
ansjsun/docs-zh
the Chinese document
ansjsun/ElasticHD
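Elasticsearch visualization dashboard, supporting ES monitoring, real-time search, quick replacement and modification of index templates, viewing index list information, SQL-to-DSL conversion, and more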
ansjsun/Enterprise-Registration-Data-of-Chinese-Mainland
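This repository is a dataset of over 10,000,000 enterprise registration records from 31 provinces in mainland China, covering 1978 to 2019. It includes details such as company name, registered address, unified social credit code, region, registration date, business scope, legal representative, registered capital, and company type. Tags: business registration big data, enterprise information, enterprise registration data.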
ansjsun/gnet
🚀 gnet is a high-performance, lightweight, non-blocking, event-driven networking framework written in pure Go.
ansjsun/libvirttime
libvirttime provides transparent time virtualization, all in userspace.
ansjsun/m3u8_downer
ansjsun/openwrt
ansjsun/ribsnetwork
ansjsun/rpcx
A faster multi-language bidirectional RPC framework in Go, like Alibaba Dubbo and Weibo Motan in Java, but with more features and easy scaling.
ansjsun/rust-rocksdb
Rust wrapper for RocksDB
ansjsun/tantivy
Tantivy is a full-text search engine library inspired by Apache Lucene and written in Rust
ansjsun/tonic
A native gRPC client & server implementation with async/await support.
ansjsun/tptool
golang tp print tools
ansjsun/version_macro
Get the git version and build time when building a Rust binary
ansjsun/wintun
Rust bindings to the wintun C library: https://www.wintun.net/