Pinned Repositories
aircompressor
A port of Snappy, LZO, LZ4, and Zstandard to Java
ansj_seg
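Ansj word segmentation: a true Java implementation of ICT. Both segmentation quality and speed exceed the open-source version of ICT. Chinese word segmentation, person-name recognition, part-of-speech tagging, user-defined dictionaries.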
APIJSON
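🚀 A zero-code, hot-updating, fully automatic ORM library: back-end APIs and docs with zero code, while the front end (client) customizes the data and structure of the returned JSON. 🚀 A JSON Transmission Protocol and an ORM Library for automatically providing APIs and Docs.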
Automatic-Corpus-Generation
This repository is for the paper "A Hybrid Approach to Automatic Corpus Generation for Chinese Spelling Check"
bert
TensorFlow code and pre-trained models for BERT
bert4keras
Keras implementation of Transformers for humans
BertBasedCorrectionModels
PyTorch implementations of BERT-based spelling error correction (text correction) models.
bfe-book
"In-Depth Understanding of BFE" (《深入理解BFE》), a book for the open-source project BFE, written in Chinese
bfs
The Baidu File System.
OI-wiki
🌟 Wiki of OI / ICPC for everyone. (An online strategy guide for a certain massive game, with dazzling arithmetic magic inside)
since1886's Repositories
since1886/Forward
A library for high performance deep learning inference on NVIDIA GPUs.
since1886/ccs-pykerberos
Python Kerberos library used by CalendarServer
since1886/transformers
🤗 Transformers: State-of-the-art Natural Language Processing for PyTorch, TensorFlow, and JAX.
since1886/textract
extract text from any document. no muss. no fuss.
since1886/notebooks
Notebooks using the Hugging Face libraries 🤗
since1886/braft
An industrial-grade C++ implementation of the RAFT consensus algorithm based on brpc, widely used inside Baidu to build highly available distributed systems.
since1886/tokenizers
💥 Fast State-of-the-Art Tokenizers optimized for Research and Production
since1886/sentencepiece
Unsupervised text tokenizer for Neural Network-based text generation.
since1886/pycorrector
pycorrector is a toolkit for text error correction. It implements models such as Kenlm, Seq2Seq_Attention, BERT, MacBERT, ELECTRA, ERNIE, and Transformer, ready to use out of the box.
since1886/libco
libco is a coroutine library that is widely used in WeChat back-end services. It has been running on tens of thousands of machines since 2013.
since1886/aircompressor
A port of Snappy, LZO, LZ4, and Zstandard to Java
since1886/bert4keras
Keras implementation of Transformers for humans
since1886/word-discovery
Faster and more effective Chinese new-word discovery
since1886/friso
High-performance Chinese tokenizer with both GBK and UTF-8 charset support, based on the MMSEG algorithm and written in ANSI C. Its fully modular implementation can be easily embedded in other programs such as MySQL, PostgreSQL, and PHP.
since1886/mod_auth_gssapi
GSSAPI Negotiate module for Apache
since1886/OI-wiki
🌟 Wiki of OI / ICPC for everyone. (An online strategy guide for a certain massive game, with dazzling arithmetic magic inside)
since1886/or-tools
Google's Operations Research tools
since1886/guava
Google core libraries for Java
since1886/leveldb
LevelDB is a fast key-value storage library written at Google that provides an ordered mapping from string keys to string values.
since1886/vmtouch
Portable file system cache diagnostics and control
since1886/ansj_seg
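Ansj word segmentation: a true Java implementation of ICT. Both segmentation quality and speed exceed the open-source version of ICT. Chinese word segmentation, person-name recognition, part-of-speech tagging, user-defined dictionaries.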
since1886/bert
TensorFlow code and pre-trained models for BERT
since1886/bfe-book
"In-Depth Understanding of BFE" (《深入理解BFE》), a book for the open-source project BFE, written in Chinese
since1886/roformer
Rotary Transformer
since1886/DDParser
Baidu's open-source dependency parsing system
since1886/YoungCorrector
A rule-based text error correction system.
since1886/BertBasedCorrectionModels
PyTorch implementations of BERT-based spelling error correction (text correction) models.
since1886/Familia
A Toolkit for Industrial Topic Modeling
since1886/keras-bert
Implementation of BERT that could load official pre-trained models for feature extraction and prediction
since1886/KBQA-for-Diagnosis
Knowledge Graph, Question Answering System: a medical-diagnosis question answering system based on a knowledge graph and vector retrieval